Insights from the front-lines of applied AI research for agentic CX
Ada's Unified Reasoner replaced its Modular Reasoner with a re-baselined eval harness and Legitimacy Classifier. Adversarial pass rate: 88% to 97%.
ReadSolve hard problems. Ship real things.
Join a team building at the intersection of cutting-edge AI and the industry's largest customer conversation dataset.
Open rolesAda's AI agents detect and suppress automated emails, preventing infinite loops (~8,000 per week) and enabling earlier, configurable survey delivery.
How Ada built a per-conversation quality-scoring judge that scales to millions of conversations and stays calibrated to each customer's definition of "good."
DE replays production conversations through modified prompts & models, using an LLM-as-judge. Verdicts aggregate into win-rate metrics; traces into themes.
Replacing Ada's fastText language detector with an LLM tool call lifted recall from 78.6% to 97.9%: call the tool first and condition on script, not confidence.
Migrated knowledge and coaching retrieval to turbopuffer and Cohere Embed v4, improving latency, recall, and headroom to scale per-tenant namespaces.