The

Insights from the front-lines of applied AI research for agentic CX​​​​‌‍​‍​‍‌‍‌​‍‌‍‍‌‌‍‌‌‍‍‌‌‍‍​‍​‍​‍‍​‍​‍‌​‌‍​‌‌‍‍‌‍‍‌‌‌​‌‍‌​‍‍‌‍‍‌‌‍​‍​‍​‍​​‍​‍‌‍‍​‌​‍‌‍‌‌‌‍‌‍​‍​‍​‍‍​‍​‍​‍‌‍​‌‌‍‌​‌‍‌‌‍‍‌‌‍‍​‍‌‍‍‌‌‍‍‌‌​‌‍‌‌‌‍‍‌‌​​‍‌‍‌‌‌‍‌​‌‍‍‌‌‌​​‍‌‍‌‌‍‌‍‌​‌‍‌‌​‌‌​​‌​‍‌‍‌‌‌​‌‍‌‌‌‍‍‌‌​‌‍​‌‌‌​‌‍‍‌‌‍‌‍‍​‍‌‍‍‌‌‍‌​​‌‌‍​‌‍​‌‌‍​‍‌​​‍‌‌‍​‍‌‍​‌‍‌‍‌​‍‌‌‍​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌​‍‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌‌​‌‍‌‌​​‌‍‌‌​‌‌‍​‌‍​‌‌‍​‍‌​‌​​‍‌‍​‌‍‌‍‌‌​​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌​​‌‍​‌‌‌​‌‍‍​​‌‌‌​‌‍‍‌‌‌​‌‍​‌‍‌‌​‌‍​‍‌‍​‌‌​‌‍‌‌‌‌‌‌‌​‍‌‍​​‌​‍‌‌​​‍‌​‌‍‌‍​‌‌‍‌​‌‍‌‌‍‍‌‌‍‍​‍‌‍‌‍‍‌‌‍‌​​‌‌‍​‌‍​‌‌‍​‍‌​​‍‌‌‍​‍‌‍​‌‍‌‍‌​‍‌‌‍​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌​‍‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌‍‌‌​‌‍‌‌​​‌‍‌‌​‌‌‍​‌‍​‌‌‍​‍‌​‌​​‍‌‍​‌‍‌‍‌‌​​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌‍‌​​‌‍​‌‌‌​‌‍‍​​‌‌‌​‌‍‍‌‌‌​‌‍​‌‍‌‌​‍​‍‌‌

Unified reasoner evaluation science
Unified reasoner evaluation science

Ada's Unified Reasoner replaced its Modular Reasoner with a re-baselined eval harness and Legitimacy Classifier. Adversarial pass rate: 88% to 97%.

Read

Solve hard problems. Ship real things.​​​​‌‍​‍​‍‌‍‌​‍‌‍‍‌‌‍‌‌‍‍‌‌‍‍​‍​‍​‍‍​‍​‍‌​‌‍​‌‌‍‍‌‍‍‌‌‌​‌‍‌​‍‍‌‍‍‌‌‍​‍​‍​‍​​‍​‍‌‍‍​‌​‍‌‍‌‌‌‍‌‍​‍​‍​‍‍​‍​‍​‍‌‍​‌‌‍‌​‌‍‌‌‍‍‌‌‍‍​‍‌‍‍‌‌‍‍‌‌​‌‍‌‌‌‍‍‌‌​​‍‌‍‌‌‌‍‌​‌‍‍‌‌‌​​‍‌‍‌‌‍‌‍‌​‌‍‌‌​‌‌​​‌​‍‌‍‌‌‌​‌‍‌‌‌‍‍‌‌​‌‍​‌‌‌​‌‍‍‌‌‍‌‍‍​‍‌‍‍‌‌‍‌​​‌‌‍​‌‍​‌‌‍​‍‌​​‍‌‌‍​‍‌‍​‌‍‌‍‌​‍‌‌‍​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌​‍‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌‌​‌‍‌‌​​‌‍‌‌​‌‌‍​‌‍​‌‌‍​‍‌​‌​​‍‌‍​‌‍‌‍‌‌​​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌​​‌‍​‌‌‌​‌‍‍​​‌‌‍​‌‌​‌‍​‌​‍‍‌‌​‌‍‍‌‌‌​‌‍​‌‍‌‌​‌‍​‍‌‍​‌‌​‌‍‌‌‌‌‌‌‌​‍‌‍​​‌​‍‌‌​​‍‌​‌‍‌‍​‌‌‍‌​‌‍‌‌‍‍‌‌‍‍​‍‌‍‌‍‍‌‌‍‌​​‌‌‍​‌‍​‌‌‍​‍‌​​‍‌‌‍​‍‌‍​‌‍‌‍‌​‍‌‌‍​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌​‍‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌‍‌‌​‌‍‌‌​​‌‍‌‌​‌‌‍​‌‍​‌‌‍​‍‌​‌​​‍‌‍​‌‍‌‍‌‌​​‌‍‍‌‌​‌‌​‌‍‍‌‌‍‍‌‍‌‌‌​​‌‍​‌‌‍‌‌‍‌‌​‍‌‍‌​​‌‍​‌‌‌​‌‍‍​​‌‌‍​‌‌​‌‍​‌​‍‍‌‌​‌‍‍‌‌‌​‌‍​‌‍‌‌​‍​‍‌‌

Join a team building at the intersection of cutting-edge AI and the industry's largest customer conversation dataset.

Open roles
Auto-reply detection
Auto-reply detection

Ada's AI agents detect and suppress automated emails, preventing infinite loops (~8,000 per week) and enabling earlier, configurable survey delivery.

Custom metrics for conversation analysis
Custom metrics for conversation analysis

How Ada built a per-conversation quality-scoring judge that scales to millions of conversations and stays calibrated to each customer's definition of "good."

Delta evaluation: Production replay pipeline
Delta evaluation: Production replay pipeline

DE replays production conversations through modified prompts & models, using an LLM-as-judge. Verdicts aggregate into win-rate metrics; traces into themes.

LLM-powered language detection
LLM-powered language detection

Replacing Ada's fastText language detector with an LLM tool call lifted recall from 78.6% to 97.9%: call the tool first and condition on script, not confidence.

Vector database migration
Vector database migration

Migrated knowledge and coaching retrieval to turbopuffer and Cohere Embed v4, improving latency, recall, and headroom to scale per-tenant namespaces.