Emil Zakirov

Emil-Zakirov

AI & ML interests

None yet

Recent Activity

liked a model 11 days ago

zai-org/GLM-4.7

liked a dataset 3 months ago

masint/gpt-oss-deflate-general

upvoted an article 4 months ago

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Organizations

None yet

liked a model 11 days ago

zai-org/GLM-4.7

Text Generation • 358B • Updated 12 days ago • 31.5k • 1.43k

liked a dataset 3 months ago

masint/gpt-oss-deflate-general

Updated Sep 20, 2025 • 10 • 5

upvoted an article 4 months ago

Article

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Sep 11, 2025

upvoted a paper 8 months ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 92

New activity in marcelbinz/Llama-3.1-Centaur-70B-adapter about 1 year ago

How can this be accessed for research without GPUs on hand?

#7 opened about 1 year ago by BiasedByBytes

liked a model about 1 year ago

marcelbinz/Llama-3.1-Centaur-70B-adapter

Updated Jul 1, 2025 • 164

upvoted a paper over 1 year ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 45

upvoted a collection over 1 year ago

MatMulfree LM

Collection

Pre-trained models for Matmulfree LM. • 4 items • Updated Jun 10, 2024 • 26

upvoted 3 papers over 1 year ago

upvoted 3 papers almost 2 years ago

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 56

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12, 2024 • 77

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Paper • 2401.18058 • Published Jan 31, 2024 • 22

liked a model about 2 years ago

dphn/dolphin-2.5-mixtral-8x7b

Text Generation • 47B • Updated May 21, 2024 • 1.66k • 1.24k

upvoted a paper about 2 years ago

AppAgent: Multimodal Agents as Smartphone Users

Paper • 2312.13771 • Published Dec 21, 2023 • 54

liked a model over 2 years ago

mistralai/Mistral-7B-v0.1

Text Generation • 7B • Updated Jul 24, 2025 • 346k • 4.02k

upvoted 2 papers over 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 81

LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models

Paper • 2308.16137 • Published Aug 30, 2023 • 40

commented a paper over 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 81
