Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
With this widget, any data point that carries JSON data can be displayed as desired. The display is driven by a template format, which can be thought of as a combination of HTML + JavaScript + CSS + ...