Confident AI offers an open-source package called DeepEval that enables engineers to evaluate or “unit test” their LLM applications’ outputs. Confident AI is our commercial offering and it allows you to log and share evaluation results within your org, centralize your datasets used for evaluation, debug unsatisfactory evaluation results, and run evaluations in production throughout the lifetime of your LLM application. We offer 10+ default metrics for engineers to plug and use.
Additional Info
Founder(s) | Jeffrey Ip |
Bootstrapped or Raised? | Other |
Team Size | 5 |
Year Founded | 2023 |
Company Tagline | Open-source evaluation infrastructure for LLMs |
Hiring | No |