pinned Runtime error 10 Stark Leaderboard 🥇 leaderboard of Semi-structured Retrieval Benchmark (STaRK)
Running on CPU Upgrade 4 SKB Explorer 🏢 Explore semi-structured knowledge bases with interactive graphs
snap-stanford/gpt-5-mini-2025-08-07_persona_values_post_dist Viewer • Updated 7 days ago • 3.37k • 21
snap-stanford/gpt-5-mini-2025-08-07_persona_interests_post_dist Viewer • Updated 7 days ago • 3.37k • 17
snap-stanford/gpt-5-mini-2025-08-07_persona_communication_post_dist Viewer • Updated 7 days ago • 3.37k • 16
snap-stanford/rl_response_only_gpt5_judge_train_disable_thinking_medium_global_step_125_post_dist Viewer • Updated 8 days ago • 5.4k • 6
snap-stanford/rl_response_only_gpt5_judge_train_disable_thinking_youtube_global_step_650_post_dist Viewer • Updated 8 days ago • 4k • 5
snap-stanford/gpt5_judge_weighed_multihier_no_thinking_ssr_reddit_global_step_100_post_dist Viewer • Updated 8 days ago • 3.37k • 4
snap-stanford/gpt5_judge_weighed_multihier_no_thinking_tr_reddit_global_step_150_post_dist Viewer • Updated 8 days ago • 3.37k • 5
snap-stanford/humanlm_batched_reward_gpt5_judge_weighmultihier_ssr_global_step_100_post_dist Viewer • Updated 8 days ago • 3.37k • 8
snap-stanford/outputs_final_rl_r_only_gpt5_judge_train_225_persona_values_post_dist Viewer • Updated 10 days ago • 3.37k • 9
snap-stanford/outputs_final_rl_r_only_gpt5_judge_train_225_persona_interests_post_dist Viewer • Updated 10 days ago • 3.37k • 10