Designing LLM Profile for Routing
ulab
university
PersonalizedRouter: Personalized LLM Routing via Graph-based User Preference Modeling
- ulab-ai/ResearchArcade-openreview-paragraphs
  Viewer • Updated • 1.35M • 16
- ulab-ai/ResearchArcade-openreview-papers-revisions
  Viewer • Updated • 54.5k • 19
- ulab-ai/ResearchArcade-openreview-reviews
  Viewer • Updated • 823k • 218 • 1
- ulab-ai/ResearchArcade-openreview-arxiv
  Viewer • Updated • 28.6k • 15
Sotopia-RL: Reward Design for Social Intelligence
IRanker: Towards Ranking Foundation Model
Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation).
UniRec: Unified Multimodal Encoding for LLM-Based Recommendations
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
  Paper • 2506.09033 • Published • 7
- ulab-ai/Router-R1-Qwen2.5-3B-Instruct
  3B • Updated • 16.7k • 3
- ulab-ai/Router-R1-Qwen2.5-3B-Instruct-Alpha0.9
  3B • Updated • 1
- ulab-ai/Router-R1-Llama-3.2-3B-Instruct
  4B • Updated • 31