Confident AI offers an open-source package called DeepEval that enables engineers to evaluate or “unit test” their LLM applications’ outputs. Confident AI is our commercial offering and it allows you to log and share evaluation results within your org, centralize your datasets used for evaluation, debug unsatisfactory evaluation results, and run evaluations in production throughout the lifetime of your LLM application. We offer 10+ default metrics for engineers to plug and use.

Additional Info

Founder(s)Jeffrey Ip
Bootstrapped or Raised?Other
Team Size5
Year Founded2023
Company TaglineOpen-source evaluation infrastructure for LLMs
HiringNo