eepos's picture

eepos

eepos

·

AI & ML interests

None yet

Recent Activity

liked a model 37 minutes ago

BitPoet/Ideogram4-Inpaint-LoRA

liked a model 7 days ago

bartowski/command-a-plus-05-2026-GGUF

liked a model 9 days ago

multimodalart/tarot-ideogram-4

View all activity

Organizations

None yet

upvoted an article 12 days ago

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs

•

12 days ago

• 71

upvoted an article 17 days ago

Article

Fine-tune FLUX.2 [klein] with a LoRA under 60 minutes

black-forest-labs

•

17 days ago

• 24

upvoted a paper about 1 month ago

Asymmetric Flow Models

Paper • 2605.12964 • Published May 13 • 22

upvoted a collection about 2 months ago

Gemma 4

15 items • Updated 11 days ago • 980

upvoted 2 collections 2 months ago

Qwen3.6

4 items • Updated Apr 22 • 414

MiniMax-M2

https://arxiv.org/abs/2605.26494 • 4 items • Updated 26 days ago • 29

upvoted a collection 4 months ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 6 days ago • 161

upvoted an article 4 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

ggerganov, ngxson, allozaur, lysandre, victor, julien-c

•

Feb 20

• 507

upvoted a paper 4 months ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 59

upvoted 2 collections 4 months ago

Hibiki-Zero

Streaming speech translation without the need for word-level alignments • 4 items • Updated May 9 • 4

Qwen3.5

21 items • Updated Mar 9 • 1.69k

upvoted 2 collections 5 months ago

TranslateGemma

3 items • Updated Mar 12 • 244

Text-To-Speech

https://kyutai.org/next/tts • 6 items • Updated Mar 2 • 27

upvoted 2 collections 6 months ago

FLUX.2

Our second generation of FLUX • 21 items • Updated Apr 6 • 244

CASA

CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs • 6 items • Updated Mar 9 • 8

upvoted an article 6 months ago

Article

New in llama.cpp: Model Management

ggml-org

•

Dec 11, 2025

• 137

upvoted 2 collections 7 months ago

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 100

Qwen-Image

14 items • Updated Dec 31, 2025 • 114

upvoted a paper 9 months ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

upvoted a paper 12 months ago

Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24, 2025 • 43