GtZeng PRO

chaoscodes

AI & ML interests

None yet

Organizations

updated a model 3 days ago

AgentCPT/Qwen3-4B_thinking_agent_sft_nemotron_tool_calling_v2_lr1e-5_epoch_1_ctx_16384_bs_256

4B • Updated 3 days ago • 8

published a model 3 days ago

AgentCPT/Qwen3-4B_thinking_agent_sft_nemotron_tool_calling_v2_lr1e-5_epoch_1_ctx_16384_bs_256

4B • Updated 3 days ago • 8

updated 2 models 6 days ago

AgentCPT/qwen-8b-agent-sft

8B • Updated 6 days ago • 6

AgentCPT/qwen-4b-agent-sft

4B • Updated 6 days ago • 5

published 2 models 6 days ago

AgentCPT/qwen-8b-agent-sft

8B • Updated 6 days ago • 6

AgentCPT/qwen-4b-agent-sft

4B • Updated 6 days ago • 5

updated a model 15 days ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

2B • Updated 15 days ago • 7

published a model 15 days ago

FuxiAISGLab/nonhis_game_behavior_clone_model_qwen-VL-2B

2B • Updated 15 days ago • 7

updated a model 15 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

5B • Updated 15 days ago • 8

published a model 15 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-4B

5B • Updated 15 days ago • 8

updated a model 15 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-2B

2B • Updated 15 days ago • 9

published a model 15 days ago

FuxiAISGLab/game_behavior_clone_model_qwen-VL-2B

2B • Updated 15 days ago • 9

updated a dataset 15 days ago

chaoscodes/game_behavior_cloning

Viewer • Updated 15 days ago • 318 • 18

published a dataset 16 days ago

chaoscodes/game_behavior_cloning

Viewer • Updated 15 days ago • 318 • 18

upvoted 2 papers about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 247

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

updated a dataset 6 months ago

chaoscodes/filter_swe_smith

Viewer • Updated Jul 19, 2025 • 10.8k • 5

published a dataset 6 months ago

chaoscodes/filter_swe_smith

Viewer • Updated Jul 19, 2025 • 10.8k • 5

upvoted 2 papers 7 months ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 56

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263
