TitleOS/RLAIF_Patriot_Experiment_LoRA-Q8_0-GGUF

This LoRA adapter was converted to GGUF format from TitleOS/RLAIF_Patriot_Experiment_LoRA via TitleOS's Gguf-my-lora-cutting-edge space. Refer to the original adapter repository for more details.

Use with llama.cpp

# with cli
llama-cli -m base_model.gguf --lora RLAIF_Patriot_Experiment_LoRA-q8_0.gguf (...other args)

# with server
llama-server -m base_model.gguf --lora RLAIF_Patriot_Experiment_LoRA-q8_0.gguf (...other args)
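
Both commands above fail late, after llama.cpp has started loading, if either file path is wrong. A minimal sketch of a wrapper that checks the paths first is shown below. The function name `run_with_lora` and the `LLAMA_BIN` override are hypothetical names introduced here for illustration; only `-m` and `--lora` come from the commands above.

```shell
# Hypothetical helper (not part of llama.cpp): verify that the base model and
# the adapter exist, then launch llama.cpp with the adapter applied.
# LLAMA_BIN is an assumed override; it defaults to llama-cli.
run_with_lora() {
    base="$1"
    adapter="$2"
    shift 2
    for f in "$base" "$adapter"; do
        if [ ! -f "$f" ]; then
            echo "missing file: $f" >&2
            return 1
        fi
    done
    # Any remaining arguments are passed through unchanged.
    "${LLAMA_BIN:-llama-cli}" -m "$base" --lora "$adapter" "$@"
}
```

For example, `run_with_lora base_model.gguf RLAIF_Patriot_Experiment_LoRA-q8_0.gguf -p "Hello"` would run the CLI, and setting `LLAMA_BIN=llama-server` beforehand would start the server with the same checks.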

To learn more about LoRA usage with the llama.cpp server, refer to the llama.cpp server documentation.
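
As one illustration of what that documentation covers: to the best of my knowledge, llama-server exposes a `/lora-adapters` endpoint that lists the loaded adapters on `GET` and accepts a list of `{"id", "scale"}` objects on `POST` to change an adapter's scale at runtime. The sketch below assumes that endpoint, and assumes the server is reachable on port 8080 of localhost. The helper names and the `SERVER_URL` variable are hypothetical; check the server documentation for your llama.cpp version before relying on this.

```shell
# Hypothetical helpers; the endpoint and payload shape are assumptions based
# on the llama.cpp server documentation and may differ between versions.

# Build the JSON body: a one-element list with the adapter id and its scale.
lora_scale_payload() {
    printf '[{"id": %d, "scale": %s}]\n' "$1" "$2"
}

# Send the new scale to a running server (SERVER_URL is an assumed override).
set_lora_scale() {
    curl -s -X POST "${SERVER_URL:-http://localhost:8080}/lora-adapters" \
        -H 'Content-Type: application/json' \
        -d "$(lora_scale_payload "$1" "$2")"
}
```

With a server started as shown earlier, `set_lora_scale 0 0.5` would request half strength for the first adapter, and `set_lora_scale 0 0` would effectively disable it.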

GGUF

Model size: 38.4M params
Architecture: gemma3n
Quantization: 8-bit (Q8_0)

Model tree for TitleOS/RLAIF_Patriot_Experiment_Q8_0-GGUF

Base model: google/gemma-3n-E4B
Finetuned: google/gemma-3n-E4B-it
Adapter: TitleOS/RLAIF_Patriot_Experiment_LoRA
Adapter: this model

Collection including TitleOS/RLAIF_Patriot_Experiment_Q8_0-GGUF

RLAIF Experimentation: research into RLAIF (Reinforcement Learning from AI Feedback) with the goal of Constitutional AI and sycophancy resistance.