David Stanojevic
david-stan
AI & ML interests
None yet
Recent Activity
updated a model about 2 months ago
JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt
published a model about 2 months ago
JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt
upvoted a paper about 2 months ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation