The Large Data Set Edexcel

The AI revolution is running out of data. What can researchers do?

The Internet is a vast ocean of human knowledge, but it isn’t infinite. And artificial intelligence (AI) researchers have nearly sucked it dry. The past decade of explosive improvement in AI has been ...

MIT Technology Review

This data set helps researchers spot harmful stereotypes in LLMs

A new multilingual tool aims to make it easier to evaluate AI models for bias in multiple languages. AI models are riddled with culturally specific biases. A new data set, called SHADES, is designed ...

Fierce Healthcare

OpenAI pushes further into healthcare with release of HealthBench to evaluate AI models

OpenAI, the maker of ChatGPT, released an open-source benchmark designed to measure the performance and safety of large language models in healthcare. The large data set, called HealthBench, goes ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

The AI revolution is running out of data. What can researchers do?

This data set helps researchers spot harmful stereotypes in LLMs

OpenAI pushes further into healthcare with release of HealthBench to evaluate AI models

Trending now