view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 385
LINKS: English-English Mnemonics Collection Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 7 items • Updated Sep 16 • 1
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17 • 49
LINKS: English-English Mnemonics Collection Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 7 items • Updated Sep 16 • 1
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Jul 10 • 12