Mingyu Derek Ma's picture

5 4

Mingyu Derek Ma

derekma

·

https://derek.ma

AI & ML interests

Generative Language Model, Scientific LM, Clinical LM, Decoding

Recent Activity

liked a model about 24 hours ago

karina-zadorozhny/ume

upvoted an article about 24 hours ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

liked a model 12 months ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

upvoted an article about 24 hours ago

Article

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

2 days ago

•

1

upvoted 4 papers over 1 year ago

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Paper • 2406.11839 • Published Jun 17, 2024 • 40

MIRAI: Evaluating LLM Agents for Event Forecasting

Paper • 2407.01231 • Published Jul 1, 2024 • 18

CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

Paper • 2406.09923 • Published Jun 14, 2024 • 1

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 19