byteprobe (忍者)

upvoted a changelog 7 months ago

Changelog

Organization and User profiles now include repository listing pages

Jun 20, 2025

• 131

upvoted 8 papers 7 months ago

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12, 2025 • 73

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11, 2025 • 101

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

Magistral

Paper • 2506.10910 • Published Jun 12, 2025 • 66

upvoted 4 changelogs 7 months ago

Changelog

Add MCP-Compatible Spaces to Your Tools

Jun 17, 2025

• 85

Changelog

New Model Filtering Options on the Hub

Jun 16, 2025

• 75

Changelog

New Inference Providers Dashboard

Jun 5, 2025

• 65

Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6, 2025

• 111

upvoted 7 papers 7 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28, 2025 • 54

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Paper • 2505.11594 • Published May 16, 2025 • 75

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20, 2025 • 76

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23, 2025 • 81

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 92

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

Scaling Test-time Compute for LLM Agents

Essential-Web v1.0: 24T tokens of organized web data

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
