My (Chiffon) Nguyen's picture

20 14

My (Chiffon) Nguyen

chiffonng

·

https://mychiffonn.com/

AI & ML interests

Mulitlingual AI, AI Safety, human-AI interaction

Recent Activity

updated a collection about 2 months ago

Scaling Behavior of COT Monitoring

updated a dataset about 2 months ago

chiffonng/hmmt_2025

published a dataset about 2 months ago

chiffonng/hmmt_2025

View all activity

Organizations

updated a collection about 2 months ago

Scaling Behavior of COT Monitoring

4 items • Updated Nov 11

updated a dataset about 2 months ago

chiffonng/hmmt_2025

Viewer • Updated Nov 11 • 30 • 3

published a dataset about 2 months ago

chiffonng/hmmt_2025

Viewer • Updated Nov 11 • 30 • 3

updated a collection about 2 months ago

Scaling Behavior of COT Monitoring

4 items • Updated Nov 11

liked a dataset about 2 months ago

FlagEval/HMMT_2025

Viewer • Updated May 6 • 30 • 674 • 1

upvoted an article 3 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

Dec 9, 2022

•

385

liked a model 3 months ago

mistralai/Magistral-Small-2509

24B • Updated 25 days ago • 20.8k • 278

updated a collection 3 months ago

LINKS: English-English Mnemonics

Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 7 items • Updated Sep 16 • 1

upvoted a paper 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

updated a collection 8 months ago

LINKS: English-English Mnemonics

Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 7 items • Updated Sep 16 • 1

upvoted a paper 8 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

liked a model 8 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 2.71M • • 1.48k

upvoted a collection 8 months ago

QwQ

Qwen with Questions • 6 items • Updated Jul 21 • 101

liked 2 models 8 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 164k • 1.83k

google/electra-base-discriminator

Updated Feb 29, 2024 • 59M • 71

upvoted a collection 8 months ago

ELECTRA release

This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Jul 10 • 12

upvoted an article 9 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28

•

887