Suraj

ghishadow

AI & ML interests

None yet

Recent Activity

liked a model 21 days ago

LiquidAI/LFM2-2.6B-Exp

liked a model about 1 month ago

Qwen/Qwen3-VL-2B-Thinking

liked a model about 1 month ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

View all activity

Organizations

liked a model 21 days ago

LiquidAI/LFM2-2.6B-Exp

Text Generation • 3B • Updated 12 days ago • 37.3k • 325

liked 2 models about 1 month ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20, 2025 • 29.6k • 99

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 31.4k • 526

upvoted a collection about 2 months ago

Ministral 3

Collection

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 24 days ago • 27

liked a model about 2 months ago

litert-community/Gemma3-1B-IT

Text Generation • Updated 8 days ago • 20.3k • • 471

liked a model 2 months ago

maya-research/maya1

Text-to-Speech • 3B • Updated Nov 12, 2025 • 33.4k • 845

upvoted a paper 3 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

liked 2 models 3 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 426k • 1.19k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 6.78M • • 4.22k

upvoted an article 5 months ago

Article

The Hacker's Guide to Building an AI Supercluster

Aug 31, 2025

•

liked a Space 5 months ago

The Ultra-Scale Playbook

🌌

3.65k

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 5 months ago

Gemma 3-270m

Collection

Collection of models for Gemma 3-270m • 4 items • Updated Dec 16, 2025 • 21

liked a Space 5 months ago

Wllama

🦙

Run GGUF directly on your browser!

liked a model 5 months ago

google/gemma-3-270m

Text Generation • 0.3B • Updated Aug 14, 2025 • 46.3k • 959

liked a Space 5 months ago

chat-ui

🔥

1.21k

Redirect to HuggingChat for conversations

liked a model 5 months ago

microsoft/Phi-3.5-mini-instruct

Text Generation • 4B • Updated Dec 10, 2025 • 343k • 947

upvoted a paper 6 months ago

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Paper • 2507.14111 • Published Jul 18, 2025 • 23

liked 2 models 6 months ago

tencent/HunyuanWorld-1

Image-to-3D • Updated Oct 20, 2025 • 2.93k • 478

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated Sep 10, 2025 • 58.7k • • 876

liked a model 7 months ago

apple/DiffuCoder-7B-cpGRPO

8B • Updated Dec 8, 2025 • 2.44k • 316

Suraj

AI & ML interests

Recent Activity

Organizations

ghishadow's activity

The Hacker's Guide to Building an AI Supercluster

The Ultra-Scale Playbook

Wllama

chat-ui