I seem to be unable to get MTP working on VLLM.I get the error: 'GPUModelRunner' object has no attribute 'drafter'.
Anyone found a way around that problem?
no num_nextn_predict_layers in config either
· Sign up or log in to comment