OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30, 2025 • 36
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20, 2025 • 15
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published Dec 1, 2025 • 36
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published about 1 month ago • 38
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published 8 days ago • 20
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published about 1 month ago • 38
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction? Paper • 2305.01555 • Published May 2, 2023
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities Paper • 2305.13168 • Published May 22, 2023
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents Paper • 2403.03101 • Published Mar 5, 2024 • 1
Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey Paper • 2505.03418 • Published May 6, 2025 • 9
Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study Paper • 2506.19794 • Published Jun 24, 2025 • 8