seedbench
yj12869741/SeedBench • Viewer • Updated Jul 1, 2025 • 8.67k • 152
SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science • Paper • 2505.13220 • Published May 19, 2025 • 4