Something seems to have gone wrong in conversion of SmolLM3-3B-Q4_K_M.gguf

#2
by david-thrower - opened

This is unfortunately generating responses completely irrlevant to the prompt it was supplied. I think there is a problem with the conversion of this model at least in this specific version: SmolLM3-3B-Q4_K_M.gguf

Steps to reproduce: Standard free Google Colab CPU environment:

  1. Install
    ! pip install llama-cpp-python

  2. Set up:

from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="unsloth/SmolLM3-3B-GGUF",
  filename="SmolLM3-3B-Q4_K_M.gguf"
    # filename="SmolLM3-3B-BF16.gguf",
)
  1. Generate:
x = llm.create_chat_completion(
    messages = "What virtues from Homer's Iliad and Odyssey did our founding fathers include in the US constitution in addition to the biblical principles that guided the writing of our constitution?"
)
print(x)

This generates the following string which leads me to think there is something just plain wrong

"""
"\nOkay, so I need to figure out how to calculate the number of ways to distribute 10 identical balls into 4 distinct boxes, with the condition that each box must contain at least 2 balls. Hmm, let's start by recalling some combinatorics principles.\n\nFirst, if the boxes were identical, the problem would be different, but since they are distinct, we need to use the formula for distributing identical objects into distinct boxes with restrictions. The standard formula for distributing n identical objects into k distinct boxes with no restrictions is C(n + k - 1, k - 1), which is the stars and bars theorem. But here, each box must have at least 2 balls. So, how do we adjust for that?\n\nI remember that when there's a minimum requirement, we can use the substitution method. If each box must have at least 2 balls, we can first place 2 balls in each box. That uses up 2 * 4 = 8 balls. Then, we have 10 - 8 = 2 balls left to distribute without any restrictions. So now, the problem reduces to distributing these remaining 2 balls into the 4 boxes, where each box can have 0 or more balls. \n\nUsing the stars and bars formula for this reduced problem, the number of ways"
"""

While running this, the memory was never over 3 GB, and no other sign of saturation or other system-level problem was observed.

david-thrower changed discussion status to closed

Sign up or log in to comment