8 11 7

Khalil Slimi

KhalilSlimi

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

upvoted a paper 29 days ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

upvoted a paper about 1 month ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

View all activity

Organizations

upvoted an article 3 days ago

Article

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

ServiceNow-AI

•

3 days ago

• 42

upvoted a paper 29 days ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published about 1 month ago • 75

upvoted a paper about 1 month ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 61

upvoted a paper 3 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

liked a dataset 3 months ago

ServiceNow-AI/eva

Viewer • Updated Mar 24 • 50 • 234 • 71

upvoted an article 3 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI

•

Mar 24

• 95

liked a dataset 3 months ago

ServiceNow-AI/EnterpriseOps-Gym

Viewer • Updated Apr 30 • 2.56k • 7.35k • 89

upvoted a paper 3 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

liked a model 6 months ago

ServiceNow-AI/Apriel-1.6-15b-Thinker

Image-Text-to-Text • 15B • Updated Dec 22, 2025 • 174 • 300

upvoted an article 6 months ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI

•

Dec 9, 2025

• 84

liked a model 7 months ago

ServiceNow-AI/Apriel-H1-15b-Thinker-SFT

Text Generation • 16B • Updated Nov 3, 2025 • 26 • 29

upvoted a paper 7 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 107

upvoted a collection 8 months ago

Apriel-1.5-15B-Thinker

Collection

3 items • Updated Oct 2, 2025 • 76

upvoted a paper 8 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 125

liked a Space 8 months ago

Apriel Chat

💬

ServiceNow-AI model chat

liked a model 8 months ago

ServiceNow-AI/Apriel-1.5-15b-Thinker

Image-Text-to-Text • 15B • Updated Oct 6, 2025 • 188 • 469

upvoted a paper 9 months ago

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

Khalil Slimi

AI & ML interests

Recent Activity

Organizations

KhalilSlimi's activity

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

A New Framework for Evaluating Voice Agents (EVA)

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Apriel Chat