S.F.'s picture

S.F.

search-facility

·

ipv6

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models

upvoted a paper 5 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

upvoted a paper 7 days ago

MiMo-V2-Flash Technical Report

View all activity

Organizations

None yet

upvoted 2 papers 5 days ago

The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models

Paper • 2601.03425 • Published 8 days ago • 15

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 6 days ago • 179

upvoted 4 papers 7 days ago

MiMo-V2-Flash Technical Report

Paper • 2601.02780 • Published 9 days ago • 31

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 9 days ago • 112

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 11 days ago • 52

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published 8 days ago • 95

upvoted 2 papers 15 days ago

SpotEdit: Selective Region Editing in Diffusion Transformers

Paper • 2512.22323 • Published 20 days ago • 38

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 17 days ago • 94

upvoted 3 papers 16 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 20 days ago • 24

ProEdit: Inversion-based Editing From Prompts Done Right

Paper • 2512.22118 • Published 19 days ago • 17

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published 27 days ago • 96

upvoted a paper 19 days ago

How Much 3D Do Video Foundation Models Encode?

Paper • 2512.19949 • Published 23 days ago • 9

upvoted 4 papers 21 days ago

QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 24 days ago • 11

Scaling Laws for Code: Every Programming Language Matters

Paper • 2512.13472 • Published about 1 month ago • 10

INTELLECT-3: Technical Report

Paper • 2512.16144 • Published 28 days ago • 18

FaithLens: Detecting and Explaining Faithfulness Hallucination

Paper • 2512.20182 • Published 23 days ago • 8

upvoted a paper 22 days ago

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published 23 days ago • 29

upvoted a paper 23 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 27 days ago • 48

upvoted 2 papers 27 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 29 days ago • 60

Step-GUI Technical Report

Paper • 2512.15431 • Published 29 days ago • 129