Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Macy is a writer on the AI Team. She covers how AI is changing daily life and how to make the most of it. This includes writing about consumer AI products and their real-world impact, from ...