Towfiq Ahmed's picture

4

Towfiq Ahmed

RafiBD

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted a paper 18 days ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

upvoted an article 5 months ago

Diffusers welcomes Stable Diffusion 3.5 Large

View all activity

Organizations

None yet

upvoted 2 papers 18 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 435

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 51

upvoted an article 5 months ago

Article

Diffusers welcomes Stable Diffusion 3.5 Large

+6

Oct 22, 2024

•

55

upvoted an article 6 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

271