Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
3
Dmitriev
Danil
Follow
0 followers
ยท
1 following
https_shot
DanilDmitriev1999
AI & ML interests
NLP, Multilingual, mLM, Dialog systems, Graph NN
Recent Activity
reacted
to
atasoglu
's
post
with ๐
about 2 months ago
Introducing ToolsGen ๐ ๏ธ I built a tool to solve a problem I kept running into: creating quality datasets for training LLMs to use tools. ToolsGen takes your JSON tool definitions and automatically generates realistic user requests, corresponding tool calls, and evaluates them using an LLM-as-a-judge pipeline. It outputs datasets ready to use with Hugging Face. What makes it useful: - Generates realistic user requests + tool calls from JSON definitions - LLM-as-a-judge quality scoring with multi-dimensional rubrics - Multiple sampling strategies (random, parameter-aware, semantic) - OpenAI-compatible API support - Outputs JSONL with train/val splits Still early days (API isn't stable yet), but it's already helping me generate tool-calling datasets much faster. Check it out: https://github.com/atasoglu/toolsgen Happy to hear feedback or ideas!
updated
a collection
about 1 year ago
reasoning
updated
a collection
about 1 year ago
follow instructions
View all activity
Organizations
None yet
Danil
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 datasets
almost 2 years ago
HuggingFaceTB/cosmopedia
Viewer
โข
Updated
Aug 12, 2024
โข
31.1M
โข
46.8k
โข
650
knkarthick/topicsum
Viewer
โข
Updated
Dec 7, 2022
โข
241k
โข
97
โข
8
liked
a Space
about 4 years ago
Runtime error
1
AnyNameHack
๐ฅ
1