·
AI & ML interests
None yet
Organizations
u-10bei/qwen3-32b-sft-merged
Text Generation
•
Updated
•
9
u-10bei/qwen3-14b-sft-merged
Text Generation
•
Updated
•
6
u-10bei/qwen3-8b-sft-merged
Text Generation
•
Updated
•
7
u-10bei/qwen3-4b-sft-merged
Text Generation
•
Updated
•
5
u-10bei/qwen3-1.7b-sft-merged
Text Generation
•
Updated
•
6
u-10bei/qwen3-0.6b-sft-merged
Text Generation
•
Updated
•
6
u-10bei/llm-jp-3-13b-instruct2-chat-sft2-grpo2_3-merged
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-chat-GSM8K-math2.0-cot2-grpo2-merged
Text Generation
•
Updated
•
12
u-10bei/llm-jp-3-13b-instruct2-chat-sft2-grpo3-merged
Text Generation
•
Updated
•
10
u-10bei/llm-jp-3-13b-instruct2-chat-sft2-grpo2_2-merged
Text Generation
•
Updated
•
9
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_4-grpo2_2-merged
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-2000-merged
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-1500-merged
Text Generation
•
Updated
•
7
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-1000-merged
Text Generation
•
Updated
•
7
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_2-500-merged
Text Generation
•
Updated
•
6
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_2-grpo2_1-merged
Text Generation
•
Updated
•
10
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_1-grpo2_1-merged
Text Generation
•
Updated
•
6
u-10bei/llm-jp-3-13b-instruct2-chat-grpo1_2-merged
Text Generation
•
Updated
•
5
u-10bei/llm-jp-3-13b-instruct2-chat-grpo1_1-merged
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-chat-sft1_1-grpo2-merged
Text Generation
•
Updated
•
5
u-10bei/llm-jp-3-13b-instruct2-chat-sft1-grpo2-merged
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-chat-grpo500-merged
Text Generation
•
Updated
•
7
u-10bei/llm-jp-3-13b-instruct2-chat-sft2-grpo500-merged
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-chat-sft2-merged
Text Generation
•
Updated
•
7
u-10bei/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_step1000_lora-sft
Text Generation
•
Updated
•
14
u-10bei/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000
Text Generation
•
Updated
•
8
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja5000
Text Generation
•
Updated
•
7
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja2000
Text Generation
•
Updated
•
10
u-10bei/llm-jp-3-13b-lora-orca-ichikara2_Tengentoppa
Updated