peterlee6706's Collections
WeekDaily
updated
rStar2-Agent: Agentic Reasoning Technical Report
Paper • 2508.20722 • Published • 116
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper • 2508.16153 • Published • 160
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Paper • 2508.14029 • Published • 118
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 211
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper • 2508.20404 • Published • 38
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling
Paper • 2508.16745 • Published • 29
Persuasion Dynamics in LLMs: Investigating Robustness and Adaptability in Knowledge and Safety with DuET-PD
Paper • 2508.17450 • Published • 9
Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts
Paper • 2508.10390 • Published • 1
InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles
Paper • 2508.16072 • Published • 4