Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation Paper • 2604.05083 • Published 15 days ago
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time? Paper • 2603.19017 • Published Mar 19 • 3
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time? Paper • 2603.19017 • Published Mar 19 • 3
TikZilla Collection Text-Guided TikZ Graphics Program Generation for Scientific Figures • 6 items • Updated Mar 17
TikZilla Collection Text-Guided TikZ Graphics Program Generation for Scientific Figures • 6 items • Updated Mar 17