Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated a dataset 31 minutes ago: ricdomolm/SWE-bench_Verified-Cluster483
published a dataset 31 minutes ago: ricdomolm/SWE-bench_Verified-Cluster483
updated a dataset about 1 hour ago: ricdomolm/SWE-bench_Verified-Rescue6Organizations
1930 Coder
Fine-tuning the Talkie 13B 1930 model on agentic trajectories
Computational Arbitrage
Models and datasets for the paper "Computational Arbitrage in AI Model Markets"
mini-coder
Small models for agentic SWE research: https://ricardodominguez.github.io/blogs/minicoder.html
Training on the test task models
Models fine-tuned for multiple-choice question answering (mc) and mathematical reasoning (gsm8k): https://arxiv.org/abs/2407.07890