AI Question Answering Models

There's a Benchmark Test That Measures AI 'Bullshit'—Most Models Fail

BullshitBench tests whether AI models can detect nonsensical questions—or if they'll confidently answer them anyway. The ...

AI agent could transform how scientists study weather and climate

Computer scientists and weather scientists have taken the first steps toward creating an AI agent capable of analyzing and ...

Which Is The Best AI For Medical Questions? Here’s The Winner

A new Stanford/Harvard study assessed 31 AI models. Here's the winner and the full list of AIs ranked by how well they answer complex clinical questions.

Search Engine Land

How to create answer-first content that AI models actually cite

AI has changed the way searching happens—and that, in turn, has changed how discovery works. With tools like Perplexity, ChatGPT, and Gemini, the way they crawl the web is completely different from ...

The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test exclusive Current-day LLMs are prediction engines and, as such, they can ...

How Businesses Can Prepare For AI Search

AI search is reshaping how people discover information and evaluate brands. The familiar path from query to website is ...

13d

How Chinese AI Chatbots Censor Themselves

Researchers from Stanford and Princeton found that Chinese AI models are more likely than their Western counterparts to dodge ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results