Directly Optimizing Knowledge Graph Construction for RAG using Reinforcement Learning
-
gzone0111/AutoGraphR1-musique_hotpotqa_train-qwen2.5-7b-graph-retriever-grpo
8B • Updated • 3 -
gzone0111/AutoGraphR1-musique_hotpotqa_train-qwen2.5-7b-text-retriever-grpo
8B • Updated • 3 -
gzone0111/AutoGraphR1-musique_hotpotqa_train-qwen2.5-3b-graph-retriever-grpo
3B • Updated • 4 -
gzone0111/AutoGraphR1-musique_hotpotqa_train-qwen2.5-3b-text-retriever-grpo
3B • Updated • 4