Arena Leaderboard
View the latest LMArena model leaderboard
A collection of LLM tools hosted as Hugging Face Spaces.
Track, rank and evaluate open LLMs and chatbots
Clone a voice and generate speech from your text
Launch a Streamlit web app interface
View and submit LLM evaluations
Pick a text splitter => visualize chunks. Great for RAG.
Create a model card for Hugging Face Hub
Explore LLM performance across hardware configurations
Explore and submit LLM benchmarks
Fine-tune large language models with a Gradio UI
Replace objects in images using prompts or reference images
High-fidelity Text-To-Speech
Identify named entities in text
Run a Phi3 model to answer your prompts
Chat with a visual AI assistant using text and images
Annotate and describe images with text prompts
Extract custom entities from text with zero-shot NER
Perform multiple NLP tasks on your text
Convert PDFs to a Hugging Face dataset
Generate instruction-response pairs from text
Quantize a Hugging Face model to GGUF and create a repo
Generate artist-style 3D meshes from any input model
Display a web page
Run Gemini Nano locally in your browser with Transformers.js
Answer questions about images using text prompts
Experiment with and compare different tokenizers
Generate speaker-labeled transcript from an audio file
Compare two faces and analyze facial attributes
Generate object masks on images with SAM2
All paper summaries read by Merve
Generate images from text prompts instantly
Generate images from your text prompt
Display a React app with TypeScript
Summarize text from a PDF URL
Generate chat responses using FalconMamba-7b model
LLM for long context
Convert text to audio and vice versa
Generate text based on prompts
Travel through the model latent space
Upload a paper to get reviews and vote on quality
Find datasets and models using semantic search
Convert models to Safetensors and open a PR
Convert text to natural-sounding speech audio
Generate spoken-style scripts from documents
Chat with a language model
Transcribe audio recordings into written text
Transcribe or translate audio and YouTube videos to text
RAG with source links inserted using LXT library.
Extract text from images using various OCR modes
Ask questions and get detailed answers
Deduplicate Hugging Face datasets in seconds
Generate text from audio recordings
Run code and get results with Qwen-2.5 code interpreter
Refine your prompts
Interact with the Aya family of models.
Compare Open LLM Leaderboard results
Display a loading screen with a spinner
Convert and upload models to Hugging Face
Add vectors to Hub datasets and do in-memory vector search.
Talk to Fixie.ai's Ultravox with WebRTC ⚡️
An analysis of LFS files on the Hub.
Demo for DocLayout-YOLO
Compress text prompts efficiently
Diffusion-based image restoration model
Prompt with Images in flux[dev]
Generate structured GitHub issues
PaliGemma2 LoRA finetuned on VQAv2
Interact with multiple chatbots simultaneously
Fantasy story generator
Generate and run Jupyter notebooks from natural language prompts
QwQ-32B-Preview
Generate and preview code from your app description
Search, load and play with transformer pipelines
Generate 3D models from images
Aligns the tokens of two sentences
Upgraded to v1.0!
Small and powerful reasoning LLM that runs in your browser
Chat with an AI model using text and images
Quickest way to test a naive RAG run with AutoRAG.
Next-generation reasoning model that runs locally in-browser
Generate descriptions from images and text prompts
In-browser unified multimodal understanding and generation.
Need to analyze data? Let a Llama-3.1 agent do it for you!
✨ [With v1.0.0] Accelerated TTS on Kokoro-82M
Chat with an AI that writes code and answers queries
Generate text and segment images using PaliGemma 2
The ultimate guide to training LLMs on large GPU clusters
Collection of marimo notebooks from a GitHub repository
Download and run a Hugging Face app
Magma-8B model for UI Agents
Answer questions using advanced AI
Blazingly Fast and Embarrassingly Simple Song Generation
Generate radial plots comparing language models
Contributing to OpenStreetMap with the help of AI
Convert images and sketches into graphics programs with TikZ
A bulk labelling interface for binary text classification
Generate any application by Vibe Coding it
Generate custom evaluations from your data easily!
Generate realistic dialogue from a script, using Dia!
Chat with AI and see its reasoning
Generate modified audio from text and voice
Expressive zero-shot TTS
Create and enrich structured datasets with AI
Submit model evaluations and view leaderboard results
https://nanonets.com/research/nanonets-ocr-s/
Visual Audio Question Answering
Translate text instantly between many languages
nanonets ocr / smoldocling / monkey ocr / typhoon ocr
Generate detailed captions for any image
gpt-oss-120b on AMD MI300X GPUs
Edit and enhance images based on descriptive instructions
Duplicate this leaderboard to initialize your own!
Match images to find similar pictures instantly
Generate code snippets from plain text prompts
Real-time video captioning powered by FastVLM
Visualize embeddings in 3D space, powered by EmbeddingGemma
Interactive timeline to explore the 🤗 Transformers models
Extract and convert document content from images
Run Granite-4.0-Micro 100% locally in your browser on WebGPU
Convert document images to HTML with Docling
Fast 4 step inference with Qwen Image Edit 2509
Configurable Generalist Agent, leader in AppWorld Benchmark
Generate custom speech from text, voice descriptions, or samples
Transcribe audio to text with multi-language timestamps
Run GPT-OSS-20B locally in your browser on WebGPU
Build, train, and run LLMs in the browser
Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU