Sachin Murali's picture

In a Training Loop 🔄

Sachin Murali

sachin6624

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Language Models are Few-Shot Learners

liked a model 2 days ago

Qwen/Qwen3.5-9B

liked a dataset 7 days ago

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 20

upvoted a collection 9 days ago

Qwen3.5

21 items • Updated 4 days ago • 1.15k

upvoted a paper 28 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 338

upvoted a paper 29 days ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 216

upvoted a paper about 2 months ago

Recursive Language Models

Paper • 2512.24601 • Published Dec 31, 2025 • 91

upvoted a collection about 2 months ago

Papers

The goldmine of AI • 4 items • Updated Jan 26 • 1

upvoted 3 papers about 2 months ago

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37

Moshi: a speech-text foundation model for real-time dialogue

Paper • 2410.00037 • Published Sep 17, 2024 • 13

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

upvoted a collection 3 months ago

SLM

1 item • Updated Dec 19, 2025 • 1

upvoted 3 articles 3 months ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

+1

Nov 21, 2025

•

27

Article

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

+2

Nov 21, 2025

•

26

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

608

upvoted a collection 3 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 11 days ago • 211

upvoted 2 articles 4 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

+8

Oct 23, 2025

•

150

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

244

upvoted an article 5 months ago

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Feb 11, 2025

•

106

upvoted an article 6 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

+1

Jul 18, 2024

•

62

upvoted an article 7 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

•

77

upvoted a paper 7 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 261