Nvar Char
zombieofCrypto
AI & ML interests
machine learning to become more zombie-like
Organizations
llm_improvement_research
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 430
- LightThinker: Thinking Step-by-Step Compression
  Paper • 2502.15589 • Published • 31
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
  Paper • 2405.04434 • Published • 24
- Model Compression and Efficient Inference for Large Language Models: A Survey
  Paper • 2402.09748 • Published • 2
reinforcement_learning_research
datasets
0
None public yet