How to use TheBloke/mpt-30B-instruct-GGML with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TheBloke/mpt-30B-instruct-GGML", dtype="auto")