reciTAL/mlsum
Updated • 1.29k • 55
How to use kaixkhazaki/t5-small-turkish-summarisation-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM
base_model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-small")
model = PeftModel.from_pretrained(base_model, "kaixkhazaki/t5-small-turkish-summarisation-lora")This model is a fine-tuned version of google-t5/t5-small on the mlsum dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| 2.557 | 1.0 | 624 | 2.1852 | 18.4642 | 11.9852 | 17.839 | 18.0458 | 20.0 |
| 2.364 | 2.0 | 1248 | 2.1269 | 19.6197 | 13.135 | 18.9274 | 19.0553 | 19.9913 |
| 2.3235 | 3.0 | 1872 | 2.0928 | 19.5088 | 13.2191 | 18.8665 | 18.9558 | 20.0 |
| 2.3002 | 4.0 | 2496 | 2.0669 | 19.2649 | 12.9427 | 18.6374 | 18.6937 | 20.0 |
| 2.2803 | 5.0 | 3120 | 2.0542 | 19.427 | 13.1022 | 18.7882 | 18.8376 | 20.0 |
| 2.2731 | 6.0 | 3744 | 2.0412 | 19.4496 | 13.1266 | 18.8102 | 18.8668 | 20.0 |
| 2.2655 | 7.0 | 4368 | 2.0338 | 19.4302 | 13.1054 | 18.7905 | 18.8397 | 20.0 |
| 2.2611 | 8.0 | 4992 | 2.0323 | 19.4302 | 13.1054 | 18.7905 | 18.8397 | 20.0 |
| 2.2584 | 9.0 | 5616 | 2.0321 | 19.4302 | 13.1054 | 18.7905 | 18.8397 | 20.0 |
| 2.2533 | 10.0 | 6240 | 2.0317 | 19.4302 | 13.1054 | 18.7905 | 18.8397 | 20.0 |
Base model
google-t5/t5-small