Evaluate your AI directly in CI!

Post eval results to the API, get back progress graphs for embedding where-ever.

$ curl -X POST \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer $PROJECT_SECRET_KEY" \
    -d '
{
  "Branch": "main",
  "Evals": [
    {
      "Name": "Ping Pong",
      "Group": "Ping",
      "Sample": { "Input": "Ping", "Output": "Plong", "Expected": "Pong" },
      "Results": [{ "Score": 0.40, "Type": "SemanticSimilarity" }],
      "Duration": 9212263
    },
    {
      "Name": "Ding Dong",
      "Group": "Ping",
      "Sample": { "Input": "Ding", "Output": "Drong", "Expected": "Dong" },
      "Results": [{ "Score": 0.92, "Type": "SemanticSimilarity" }],
      "Duration": 3839519
    },
    {
      "Name": "Hello there",
      "Group": "Hello",
      "Sample": { "Input": "Hello", "Output": "Hi", "Expected": "Hello there" },
      "Results": [
        { "Score": 0.69, "Type": "SemanticSimilarity" },
        { "Score": 0.50, "Type": "LexicalSimilarity" }
      ],
      "Duration": 1165943
    }
  ]
}
' https://api.evals.fun/evals

Made in 🇩🇰 by maragu