Mingyu Derek Ma

derekma
https://derek.ma
  • mingyu_ma
  • derekmma

AI & ML interests

Generative Language Model, Scientific LM, Clinical LM, Decoding

Recent Activity

liked a model about 24 hours ago
karina-zadorozhny/ume
upvoted an article about 24 hours ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond
liked a model 12 months ago
deepseek-ai/DeepSeek-R1

Organizations

rmimg

upvoted 4 papers over 1 year ago

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Paper • 2406.11839 • Published Jun 17, 2024 • 40

MIRAI: Evaluating LLM Agents for Event Forecasting

Paper • 2407.01231 • Published Jul 1, 2024 • 18

CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

Paper • 2406.09923 • Published Jun 14, 2024 • 1

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Paper • 2406.09411 • Published Jun 13, 2024 • 19