·
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas/preference_dataset_mixture2_and_safe_pku30k_and_argilla_math_and_ultra_code_for_preference_model
Viewer
•
Updated
•
606k
•
12
weqweasdas/preference_dataset_mixture2_and_safe_pku30k_for_preference_model
Viewer
•
Updated
•
554k
•
2
weqweasdas/ultra_feedback_binarized_for_preference_no_chat_all
Viewer
•
Updated
•
60.9k
weqweasdas/ultra_feedback_binarized_for_preference_no_chat_40k
Viewer
•
Updated
•
40k
•
2
weqweasdas/gemma_ultra_feedback_binarized_for_preference_15k
Viewer
•
Updated
•
15k
weqweasdas/zephyr_ultra_feedback_binarized_for_preference
Viewer
•
Updated
•
60.9k
•
2
weqweasdas/ultra_feedback_binarized_for_preference_no_chat
Viewer
•
Updated
•
60.9k
•
1
weqweasdas/ultra_feedback_binarized_for_preference
Viewer
•
Updated
•
60.9k
•
1
weqweasdas/zephyr_ultra_feedback_model1
Viewer
•
Updated
•
7.5k
weqweasdas/zephyr_ultra_feedback_n32
Viewer
•
Updated
•
15k
•
1
weqweasdas/open_chat_0106_ultra_feedback_n32
Viewer
•
Updated
•
60k
•
3
weqweasdas/openchat_model0_data_with_rewards
Viewer
•
Updated
•
1
•
3
weqweasdas/rsf_pi0_iter1_with_len
Viewer
•
Updated
•
1
weqweasdas/rsf_pi0_mistrav_02_prompt0
Viewer
•
Updated
•
1
•
1
weqweasdas/rsf_gemma_2b_iter1
Viewer
•
Updated
•
1
•
3
Viewer
•
Updated
•
1
•
1
weqweasdas/preference_dataset_mix2
Viewer
•
Updated
•
528k
•
45
•
3
Viewer
•
Updated
•
116k
•
1
weqweasdas/preference_dataset_mixture2_and_safe_pku150k
Viewer
•
Updated
•
678k
•
1
weqweasdas/ultra_prompt_split
Viewer
•
Updated
•
60k
•
3
•
2
weqweasdas/preference_dataset_mixture
Viewer
•
Updated
•
256k
•
1