Mallard74 (Marc-Antoine Allard)

upvoted 2 articles 4 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+9

Sep 22, 2025

•

125

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4, 2025

•

1.32k

upvoted 4 papers 4 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks

Paper • 2503.09572 • Published Mar 12, 2025 • 2

Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example

Paper • 2408.06318 • Published Aug 12, 2024 • 1

OdysseyBench: Evaluating LLM Agents on Long-Horizon Complex Office Application Workflows

Paper • 2508.09124 • Published Aug 12, 2025 • 3

upvoted a collection 6 months ago

Speech Evals

Collection

Synthesized speech evals generated by MistralAI from popular text evaluation datasets to evaluate spoken-language reasoning capabilities of Audio LLMs • 3 items • Updated Nov 28, 2025 • 12

upvoted 4 articles 6 months ago

Article

5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub

Jul 15, 2025

•

24

Article

Introducing ColQwen-Omni: Retrieve in every modality

Jul 17, 2025

•

76

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025

•

766

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

749

upvoted an article 7 months ago

Article

Qwen2-VL-OCR-2B-Instruct and VisionOCR-3B-061125 for precise recognition of [messy] handwriting.

Jun 17, 2025

•

11

upvoted a paper 7 months ago

YODAS: Youtube-Oriented Dataset for Audio and Speech

Paper • 2406.00899 • Published Jun 2, 2024 • 4

upvoted an article 8 months ago

Article

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

Jun 2, 2025

•

27

upvoted a collection 9 months ago

Qwen3

Collection

84 items • Updated 20 days ago • 1.57k

upvoted an article 9 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25, 2025

•

305

upvoted an article 10 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face

+5

Apr 5, 2025

•

146

upvoted a collection 10 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 685

upvoted 2 articles 10 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIM

Jul 29, 2024

•

34

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

Mar 21, 2025

•

37

Marc-Antoine Allard

AI & ML interests

Organizations

Gaia2 and ARE: Empowering the community to study agents

Open-source DeepResearch – Freeing our search agents

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks

Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example

OdysseyBench: Evaluating LLM Agents on Long-Horizon Complex Office Application Workflows

Speech Evals

5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub

Introducing ColQwen-Omni: Retrieve in every modality

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

SmolLM3: smol, multilingual, long-context reasoner

Qwen2-VL-OCR-2B-Instruct and VisionOCR-3B-061125 for precise recognition of [messy] handwriting.

YODAS: Youtube-Oriented Dataset for Audio and Speech

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

Qwen3

Tiny Agents: an MCP-powered agent in 50 lines of code

Welcome Llama 4 Maverick & Scout on Hugging Face

Llama 4

Serverless Inference with Hugging Face and NVIDIA NIM

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

Marc-Antoine Allard

AI & ML interests

Organizations

Mallard74's activity

Gaia2 and ARE: Empowering the community to study agents

Open-source DeepResearch – Freeing our search agents

5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub

Introducing ColQwen-Omni: Retrieve in every modality

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

SmolLM3: smol, multilingual, long-context reasoner

Qwen2-VL-OCR-2B-Instruct and VisionOCR-3B-061125 for precise recognition of [messy] handwriting.

*Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings

Tiny Agents: an MCP-powered agent in 50 lines of code

Welcome Llama 4 Maverick & Scout on Hugging Face

Serverless Inference with Hugging Face and NVIDIA NIM

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings