Part of the RLAIF Experimentation collection: research into RLAIF (Reinforcement Learning from AI Feedback) with the goal of Constitutional AI and sycophancy resistance.
This LoRA adapter was converted to GGUF format from TitleOS/RLAIF_Patriot_Experiment_LoRA via TitleOS's Gguf-my-lora-cutting-edge space.
Refer to the original adapter repository for more details.
# with cli
llama-cli -m base_model.gguf --lora RLAIF_Patriot_Experiment_LoRA-q8_0.gguf (...other args)
# with server
llama-server -m base_model.gguf --lora RLAIF_Patriot_Experiment_LoRA-q8_0.gguf (...other args)
To learn more about LoRA usage with the llama.cpp server, refer to the llama.cpp server documentation.
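Before launching either binary, it can help to confirm that both GGUF files are where the command expects them. The following is a minimal dry-run sketch, not part of the original adapter repository: the helper names `lora_cmd` and `check_files` are hypothetical, and `base_model.gguf` is assumed to be a GGUF build of the base model. It only prints the command using the `-m` and `--lora` flags shown above and never invokes llama.cpp itself.

```shell
#!/bin/sh
# Assumed file names; override them through the environment if yours differ.
BASE_MODEL="${BASE_MODEL:-base_model.gguf}"
LORA_ADAPTER="${LORA_ADAPTER:-RLAIF_Patriot_Experiment_LoRA-q8_0.gguf}"

# Hypothetical helper: print the command line for a llama.cpp binary
# (llama-cli or llama-server) with the base model and the LoRA adapter.
lora_cmd() {
    printf '%s -m %s --lora %s\n' "$1" "$BASE_MODEL" "$LORA_ADAPTER"
}

# Hypothetical helper: return 0 only if both GGUF files exist,
# reporting each missing file on stderr.
check_files() {
    status=0
    for f in "$BASE_MODEL" "$LORA_ADAPTER"; do
        if [ ! -f "$f" ]; then
            echo "missing file: $f" >&2
            status=1
        fi
    done
    return "$status"
}

# Dry run: show the command and whether its inputs are present.
if check_files; then
    echo "ready: $(lora_cmd llama-cli)"
else
    echo "not ready: $(lora_cmd llama-cli)"
fi
```

Once the check passes, the printed line can be run as-is, with any further arguments appended as in the commands above.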
Quantization: 8-bit (q8_0)
Base model: google/gemma-3n-E4B