ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases • Paper 2510.20270 • Published Oct 23