VRAM Requirements for Running the Model

by wilfoderek - opened Jan 10, 2025

How much VRAM is needed to run the model?
Is an H200 sufficient?
Thank you in advance!

Same question here, it seems the model params size is ~200GB, how much VRAM is needed? Thanks

MLX Community org Jan 16, 2025

@awni says it only has 37B parameters in memory. I'm not sure how that translates to GB. I'm gonna give it a try.

MLX Community org Jan 16, 2025

No that’s not quite right. It only needs to move 37B parameters from RAM to cache. To run this thing you need about 400GB of RAM

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment