Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
1
Zeliang Zhang
zeliang0426
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
updated
a model
about 1 month ago
zeliang0426/DS_Qwen25-7-full-lora-3k
updated
a model
about 1 month ago
zeliang0426/DS_Qwen25-7-cache-lora-3k
View all activity
Organizations
None yet
zeliang0426
's models
68
Sort: Recently updated
zeliang0426/Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
Aug 15, 2025
•
3
zeliang0426/Distill_Llama_Darpo-cache-lora-3k
Updated
Aug 15, 2025
zeliang0426/8k_Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
Aug 15, 2025
•
1
zeliang0426/SuperLong_Distill_Llama_Darpo-cache-lora-3k
Updated
Aug 14, 2025
zeliang0426/SuperLong_Distill_Llama_Darpo-cache-adapter-3k
Updated
Aug 14, 2025
zeliang0426/QKV_Qwen25-7-full-lora-3k
Updated
Aug 13, 2025
zeliang0426/QKV_Qwen25-7-cache-lora-3k
Updated
Aug 13, 2025
zeliang0426/qwen25_code_r1_grpo_cache
Updated
Aug 13, 2025
zeliang0426/qwen25_code_r1_grpo_think
Text Generation
•
3B
•
Updated
Aug 13, 2025
•
4
zeliang0426/qwen25_code_r1_grpo_full
Updated
Aug 13, 2025
zeliang0426/Gemma3-Darpo-full-lora-3k
Updated
Aug 11, 2025
zeliang0426/Limited_Base-Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
Aug 11, 2025
•
3
zeliang0426/Limted_Base-Qwen25-7-cache-lora-3k
Updated
Aug 11, 2025
zeliang0426/Gemma3-Darpo-cache-adapter-3k
Text Generation
•
4B
•
Updated
Aug 11, 2025
•
3
zeliang0426/Base-Qwen25-7-full-lora-3k
Updated
Aug 10, 2025
zeliang0426/Base-Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
Aug 9, 2025
•
3
zeliang0426/Llama_Darpo-full-lora-3k
Updated
Aug 9, 2025
zeliang0426/Base-Qwen25-7-cache-lora-3k
Updated
Aug 9, 2025
zeliang0426/Llama_Darpo-cache-adapter-3k
Text Generation
•
3B
•
Updated
Aug 8, 2025
•
5
zeliang0426/Gemma3-Darpo-cache-lora-3k
Updated
Aug 8, 2025
zeliang0426/Llama_Darpo-cache-lora-3k
Updated
Aug 8, 2025
zeliang0426/1e-6-Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
Aug 8, 2025
•
5
zeliang0426/Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
Aug 8, 2025
•
5
zeliang0426/1e-6-Qwen25-7-full-lora-3k
Updated
Aug 8, 2025
zeliang0426/Qwen25-7-full-lora-3k
Updated
Aug 7, 2025
zeliang0426/1e-6-Qwen25-7-cache-lora-3k
Updated
Aug 7, 2025
zeliang0426/Qwen25-7-cache-lora-3k
Updated
Aug 7, 2025
zeliang0426/ddp-GSM8K-CacheTraining-LORA
Updated
Aug 3, 2025
zeliang0426/tofu_qwen-2.5-3b
Updated
Aug 3, 2025
zeliang0426/Fix-Strict_Darpo-cache-adapter-3k
Text Generation
•
3B
•
Updated
Jul 31, 2025
•
3
Previous
1
2
3
Next