Building on HF

jiakai PRO

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

upvoted a paper 5 days ago

Code as Agent Harness

upvoted a paper 12 days ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

upvoted a paper 16 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

View all activity

Organizations

upvoted a paper 5 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 7 days ago • 199

upvoted a paper 12 days ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published 16 days ago • 78

upvoted a paper 16 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published 22 days ago • 113

upvoted a paper 19 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 22 days ago • 162

upvoted a paper 25 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 27 days ago • 273

upvoted an article 30 days ago

Article

DeepSeek-V4: a million-token context that agents can actually use

burtenshaw

•

about 1 month ago

• 47

upvoted a collection about 1 month ago

DeepSeek-V4

Collection

4 items • Updated about 1 month ago • 656

upvoted an article about 1 month ago

Article

Meet HoloTab by HCompany. Your AI browser companion.

Hcompany

•

Apr 15

• 24

upvoted 7 papers about 1 month ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published Apr 14 • 101

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 291

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published Apr 8 • 72

upvoted 4 papers about 2 months ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 121

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 123

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published Apr 2 • 151

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

upvoted an article about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 899

jiakai PRO

AI & ML interests

Recent Activity

Organizations

real-jiakai's activity

DeepSeek-V4: a million-token context that agents can actually use

Meet HoloTab by HCompany. Your AI browser companion.

Welcome Gemma 4: Frontier multimodal intelligence on device