Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
41
Juii Kim
watchstep
Follow
0 followers
·
2 following
watchstep
AI & ML interests
None yet
Recent Activity
reacted
to
Kseniase
's
post
with 👍
11 days ago
12 Foundational AI Model Types Let’s refresh some fundamentals today to stay fluent in the what we all work with. Here are some of the most popular model types that shape the vast world of AI (with examples in the brackets): 1. LLM - Large Language Model (GPT, LLaMA) -> https://huggingface.co/papers/2402.06196 + history of LLMs: https://www.turingpost.com/t/The%20History%20of%20LLMs It's trained on massive text datasets to understand and generate human language. They are mostly build on Transformer architecture, predicting the next token. LLMs scale by increasing overall parameter count across all components (layers, attention heads, MLPs, etc.) 2. SLM - Small Language Model (TinyLLaMA, Phi models, SmolLM) https://huggingface.co/papers/2410.20011 Lightweight LM optimized for efficiency, low memory use, fast inference, and edge use. SLMs work using the same principles as LLMs 3. VLM - Vision-Language Model (CLIP, Flamingo) -> https://huggingface.co/papers/2405.17247 Processes and understands both images and text. VLMs map images and text into a shared embedding space or generate captions/descriptions from both 4. MLLM - Multimodal Large Language Model (Gemini) -> https://huggingface.co/papers/2306.13549 A large-scale model that can understand and process multiple types of data (modalities) — usually text + other formats, like images, videos, audio, structured data, 3D or spatial inputs. MLLMs can be LLMs extended with modality adapters or trained jointly across vision, text, audio, etc. 5. LAM - Large Action Model (InstructDiffusion, RT-2) -> https://huggingface.co/papers/2412.10047 Understands and generates action sequences by predicting action tokens (discrete/continuous instructions) that guide agents. Trained on behavior datasets, LAMs generalize across tasks, environments, and modalities - video, sensor data, etc. Read about LRM, MoE, SSM, RNN, CNN, SAM and LNN below👇 Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe
liked
a model
11 days ago
allenai/Bolmo-7B
updated
a dataset
3 months ago
watchstep/ko-en-code-mixing-sts
View all activity
Organizations
watchstep
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
11 days ago
allenai/Bolmo-7B
Text Generation
•
8B
•
Updated
5 days ago
•
618
•
43
liked
3 datasets
4 months ago
Jangyeong/Koglish_STS
Preview
•
Updated
Aug 20, 2024
•
54
•
1
Jangyeong/Koglish_GLUE
Preview
•
Updated
Aug 20, 2024
•
162
•
1
nlpai-lab/ko_commongen_v2
Viewer
•
Updated
Aug 11, 2024
•
852
•
77
•
4
liked
2 models
4 months ago
nlpai-lab/KURE-v1
Feature Extraction
•
0.6B
•
Updated
Dec 23, 2024
•
185k
•
•
73
Qwen/Qwen3-Embedding-8B
Feature Extraction
•
8B
•
Updated
Jul 7
•
977k
•
•
500
liked
2 datasets
4 months ago
HAERAE-HUB/Korean-Human-Judgements
Viewer
•
Updated
Jun 30, 2024
•
694
•
98
•
38
taeminlee/Ko-StrategyQA
Viewer
•
Updated
May 7
•
41.8k
•
15.5k
•
19
liked
a model
4 months ago
microsoft/Phi-4-mini-instruct
Text Generation
•
4B
•
Updated
16 days ago
•
241k
•
647
liked
a Space
8 months ago
Paused
Featured
981
Computer Agent
🖥
981
Interact with an AI agent to perform web tasks
liked
a model
9 months ago
GSAI-ML/LLaDA-8B-Instruct
Text Generation
•
8B
•
Updated
Oct 21
•
197k
•
338
liked
a model
10 months ago
intfloat/multilingual-e5-large
Feature Extraction
•
0.6B
•
Updated
Feb 17
•
3.1M
•
•
1.11k
liked
a Space
10 months ago
Restarting
on
CPU Upgrade
6.85k
MTEB Leaderboard
🥇
6.85k
Embedding Leaderboard
liked
2 models
10 months ago
intfloat/multilingual-e5-base
Sentence Similarity
•
0.3B
•
Updated
Feb 17
•
1.77M
•
•
318
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
•
2B
•
Updated
Apr 9
•
53k
•
577
liked
4 models
11 months ago
BAAI/bge-base-en
Feature Extraction
•
0.1B
•
Updated
Apr 17, 2024
•
526k
•
•
61
CohereLabs/c4ai-command-r-plus
Text Generation
•
104B
•
Updated
Apr 16
•
2.84k
•
1.76k
facebook/rag-token-nq
Updated
Nov 13, 2023
•
2.78k
•
175
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation
•
33B
•
Updated
Jan 12
•
140k
•
•
1.96k
liked
a model
12 months ago
NovaSky-AI/Sky-T1-32B-Preview
Text Generation
•
33B
•
Updated
Jan 13
•
155
•
•
550
Load more