Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
David Reid does not work for, consult, own shares in or receive funding from any company or organization that would benefit from this article, and has disclosed no relevant affiliations beyond their ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results