LoRA Backward Model (1000 samples)

This model is a LoRA-finetuned version of NousResearch/Llama-2-7b-hf, trained to predict the instruction (x) given the assistant response (y). This implements the backward model training from the paper:

Self-Alignment with Instruction Backtranslation

Dataset

timdettmers/openassistant-guanaco and extract pairs of the form:

### Output (y)
<assistant's answer>

### Instruction (x)
<human's original question>

Downloads last month: 2

Model tree for sijiasijia/lora-backward-1000

Base model

NousResearch/Llama-2-7b-hf

Adapter

(137)

this model

Paper for sijiasijia/lora-backward-1000

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 42