Juii Kim

watchstep

watchstep

AI & ML interests

None yet

Recent Activity

reacted to Kseniase's post with 👍 11 days ago

12 Foundational AI Model Types Let’s refresh some fundamentals today to stay fluent in the what we all work with. Here are some of the most popular model types that shape the vast world of AI (with examples in the brackets): 1. LLM - Large Language Model (GPT, LLaMA) -> https://huggingface.co/papers/2402.06196 + history of LLMs: https://www.turingpost.com/t/The%20History%20of%20LLMs It's trained on massive text datasets to understand and generate human language. They are mostly build on Transformer architecture, predicting the next token. LLMs scale by increasing overall parameter count across all components (layers, attention heads, MLPs, etc.) 2. SLM - Small Language Model (TinyLLaMA, Phi models, SmolLM) https://huggingface.co/papers/2410.20011 Lightweight LM optimized for efficiency, low memory use, fast inference, and edge use. SLMs work using the same principles as LLMs 3. VLM - Vision-Language Model (CLIP, Flamingo) -> https://huggingface.co/papers/2405.17247 Processes and understands both images and text. VLMs map images and text into a shared embedding space or generate captions/descriptions from both 4. MLLM - Multimodal Large Language Model (Gemini) -> https://huggingface.co/papers/2306.13549 A large-scale model that can understand and process multiple types of data (modalities) — usually text + other formats, like images, videos, audio, structured data, 3D or spatial inputs. MLLMs can be LLMs extended with modality adapters or trained jointly across vision, text, audio, etc. 5. LAM - Large Action Model (InstructDiffusion, RT-2) -> https://huggingface.co/papers/2412.10047 Understands and generates action sequences by predicting action tokens (discrete/continuous instructions) that guide agents. Trained on behavior datasets, LAMs generalize across tasks, environments, and modalities - video, sensor data, etc. Read about LRM, MoE, SSM, RNN, CNN, SAM and LNN below👇 Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe

liked a model 11 days ago

allenai/Bolmo-7B

updated a dataset 3 months ago

watchstep/ko-en-code-mixing-sts

View all activity

Organizations

liked a model 11 days ago

allenai/Bolmo-7B

Text Generation • 8B • Updated 5 days ago • 618 • 43

liked 3 datasets 4 months ago

liked 2 models 4 months ago

nlpai-lab/KURE-v1

Feature Extraction • 0.6B • Updated Dec 23, 2024 • 185k • • 73

Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7 • 977k • • 500

liked 2 datasets 4 months ago

HAERAE-HUB/Korean-Human-Judgements

Viewer • Updated Jun 30, 2024 • 694 • 98 • 38

taeminlee/Ko-StrategyQA

Viewer • Updated May 7 • 41.8k • 15.5k • 19

liked a model 4 months ago

microsoft/Phi-4-mini-instruct

Text Generation • 4B • Updated 16 days ago • 241k • 647

liked a Space 8 months ago

Computer Agent

🖥

981

Interact with an AI agent to perform web tasks

liked a model 9 months ago

GSAI-ML/LLaDA-8B-Instruct

Text Generation • 8B • Updated Oct 21 • 197k • 338

liked a model 10 months ago

intfloat/multilingual-e5-large

Feature Extraction • 0.6B • Updated Feb 17 • 3.1M • • 1.11k

liked a Space 10 months ago

MTEB Leaderboard

🥇

6.85k

Embedding Leaderboard

liked 2 models 10 months ago

intfloat/multilingual-e5-base

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • 2B • Updated Apr 9 • 53k • 577

liked 4 models 11 months ago

BAAI/bge-base-en

Feature Extraction • 0.1B • Updated Apr 17, 2024 • 526k • • 61

CohereLabs/c4ai-command-r-plus

Text Generation • 104B • Updated Apr 16 • 2.84k • 1.76k

facebook/rag-token-nq

Updated Nov 13, 2023 • 2.78k • 175

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12 • 140k • • 1.96k

liked a model 12 months ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • 33B • Updated Jan 13 • 155 • • 550