H/W resources

by r2d209 - opened Sep 20, 2023

Discussion

r2d209

Sep 20, 2023

I would like to know what hardware you used to train this model:
which GPU (A100 or something else) and how much GPU memory it had.

byoussef

Owner Sep 20, 2023

Yes, it was trained on 7 A100 80GB GPUs, but that is a bit of overkill. I used them mainly because I was working with a very large custom dataset.
I have also successfully trained the same model on a T4 GPU using DeepSpeed, and you could use an even smaller GPU if you use PEFT (parameter-efficient fine-tuning).
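
For a rough sense of why PEFT lowers the GPU memory requirement, here is a back-of-the-envelope sketch. It is not a measurement of this model: the parameter count (1 billion) and the trainable fraction (1%) are hypothetical placeholders, and the function name is made up for illustration. It assumes the common mixed-precision Adam accounting of about 16 bytes per trainable parameter (2 for fp16 weights, 2 for fp16 gradients, 12 for fp32 master weights and the two Adam moments) and 2 bytes per frozen parameter, and it ignores activations, which depend on batch size and sequence length.

```python
BYTES_PER_GB = 1e9  # decimal gigabytes, to keep the arithmetic simple

# Assumed per-parameter costs for mixed-precision training with Adam.
BYTES_FROZEN = 2      # fp16 weights only, no gradients or optimizer state
BYTES_TRAINABLE = 16  # fp16 weights + fp16 grads + fp32 master copy + 2 Adam moments


def estimate_training_memory_gb(n_params: float, trainable_fraction: float = 1.0) -> float:
    """Rough lower bound on model-state memory in GB (activations excluded)."""
    if not 0.0 <= trainable_fraction <= 1.0:
        raise ValueError("trainable_fraction must be between 0 and 1")
    trainable = n_params * trainable_fraction
    frozen = n_params - trainable
    return (frozen * BYTES_FROZEN + trainable * BYTES_TRAINABLE) / BYTES_PER_GB


if __name__ == "__main__":
    n_params = 1e9  # hypothetical model size, not the size of this model
    full = estimate_training_memory_gb(n_params)
    peft = estimate_training_memory_gb(n_params, trainable_fraction=0.01)
    print(f"full fine-tuning: ~{full:.1f} GB of model state")
    print(f"PEFT-style (1% trainable): ~{peft:.1f} GB of model state")
```

Under these assumptions, most of the memory in full fine-tuning goes to gradients and optimizer state rather than the weights themselves. DeepSpeed can shard that state across GPUs or offload it to CPU memory, while PEFT avoids most of it by keeping the bulk of the weights frozen.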
