Mohamed EL harchaoui
medelharchaoui
AI & ML interests
None yet
Recent Activity
updated
a collection
6 days ago
Agentic
updated
a collection
23 days ago
Diffusion Language
liked
a model
25 days ago
nvidia/canary-180m-flash
Organizations
Diffusion Language
RL-LLMs
-
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Paper • 2509.09674 • Published • 80 -
A Survey of Reinforcement Learning for Large Reasoning Models
Paper • 2509.08827 • Published • 190 -
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
Paper • 2510.11696 • Published • 177
Interessting papers
-
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Paper • 2508.21104 • Published • 35 -
FNet: Mixing Tokens with Fourier Transforms
Paper • 2105.03824 • Published • 1 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 83 -
RL + Transformer = A General-Purpose Problem Solver
Paper • 2501.14176 • Published • 28
Agentic
LLM+Search
-
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Paper • 2510.03632 • Published • 41 -
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Paper • 2309.17179 • Published • 2 -
First Finish Search: Efficient Test-Time Scaling in Large Language Models
Paper • 2505.18149 • Published • 1
Robotics
-
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 77 -
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
Paper • 2509.09372 • Published • 243 -
Robot Learning: A Tutorial
Paper • 2510.12403 • Published • 120
RAG
Agentic
Diffusion Language
LLM+Search
-
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Paper • 2510.03632 • Published • 41 -
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Paper • 2309.17179 • Published • 2 -
First Finish Search: Efficient Test-Time Scaling in Large Language Models
Paper • 2505.18149 • Published • 1
RL-LLMs
-
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Paper • 2509.09674 • Published • 80 -
A Survey of Reinforcement Learning for Large Reasoning Models
Paper • 2509.08827 • Published • 190 -
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
Paper • 2510.11696 • Published • 177
Robotics
-
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 77 -
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
Paper • 2509.09372 • Published • 243 -
Robot Learning: A Tutorial
Paper • 2510.12403 • Published • 120
Interessting papers
-
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Paper • 2508.21104 • Published • 35 -
FNet: Mixing Tokens with Fourier Transforms
Paper • 2105.03824 • Published • 1 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 83 -
RL + Transformer = A General-Purpose Problem Solver
Paper • 2501.14176 • Published • 28