The Internet is a vast ocean of human knowledge, but it isn’t infinite. And artificial intelligence (AI) researchers have nearly sucked it dry. The past decade of explosive improvement in AI has been ...
A new multilingual tool aims to make it easier to evaluate AI models for bias in multiple languages. AI models are riddled with culturally specific biases. A new data set, called SHADES, is designed ...
OpenAI, the maker of ChatGPT, released an open-source benchmark designed to measure the performance and safety of large language models in healthcare. The large data set, called HealthBench, goes ...