On Vacation 🏝️

Skier8402

https://nyab.notion.site

Shuyib

AI & ML interests

Explainable Computer Vision with Mechanistic Interpretability, Optimization, NLP, and multimodal system implementation.

Recent Activity

upvoted a collection 3 days ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

Collection

A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated 13 days ago • 16

updated a collection 3 days ago

Interpretability tools

Collection

Opening the hood of computer vision models (for example ResNets, ConvNeXt, and DETR), multimodal models, and NLP models such as BERT and GPTs. • 7 items • Updated 3 days ago • 2

liked a Space 3 days ago

dlouapre/eiffel-tower-llama

The Eiffel Tower Llama

Explore the Eiffel Tower Llama experiment with open-source models

updated a collection 5 days ago

biomedical

Collection

6 items • Updated 5 days ago

liked a model 5 days ago

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 32.2k • 390

liked a dataset 5 days ago

nsk7153/MedCalc-Bench-Verified

Viewer • Updated 3 days ago • 11.7k • 141 • 3

liked 2 datasets 12 days ago

mistralai/mmlu_speech

Viewer • Updated Jul 15 • 14.3k • 517 • 14

mistralai/gsm8k_speech

Viewer • Updated Jul 15 • 1.32k • 121 • 6

upvoted a collection 12 days ago

Speech Evals

Collection

Synthesized speech evals generated by MistralAI from popular text evaluation datasets to evaluate spoken-language reasoning capabilities of Audio LLMs • 3 items • Updated about 1 month ago • 12

upvoted a paper 16 days ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 174

updated 2 collections 16 days ago

Interpretability tools

Collection

Opening the hood of computer vision models (for example ResNets, ConvNeXt, and DETR), multimodal models, and NLP models such as BERT and GPTs. • 7 items • Updated 3 days ago • 2

Diffusion model tools

Collection

A couple of ControlNets to improve various aspects of an image • 8 items • Updated 16 days ago

updated a collection 17 days ago

Datasets

Collection

Interesting datasets to help train LLMs and beyond • 45 items • Updated 17 days ago

liked a dataset 17 days ago

OpenMed/Medical-Reasoning-SFT-GPT-OSS-120B

Viewer • Updated 17 days ago • 200k • 3.41k • 226

liked a dataset 17 days ago

Nadhari/Swahili-Thinking

Viewer • Updated Nov 23 • 166 • 81 • 8

updated a collection 17 days ago

Swahili models

Collection

5 items • Updated 17 days ago

liked a model 17 days ago

Nadhari/swa-csm-1b

Text-to-Speech • Updated 17 days ago • 124 • 3

upvoted an article 24 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • 26 days ago • 544
