Researchers Have Ranked AI Models Based on Risk—and Found a Wild Range

15Aug/24Off

Researchers Have Ranked AI Models Based on Risk—and Found a Wild Range

Bo Li, an associate professor at the University of Chicago who specializes in stress testing and provoking AI models to uncover misbehavior, has become a go-to source for some consulting firms. These consultancies are often now less concerned with how smart AI models are than with how problematic—legally, ethically, and in terms of regulatory compliance—they can be.

Li and colleagues from several other universities, as well as Virtue AI, cofounded by Li, and Lapis Labs, recently developed a taxonomy of AI risks along with a benchmark that reveals how rule-breaking different large language models are. “We need some principles for AI safety, in terms of regulatory compliance and ordinary usage,” Li tells WIRED.

The researchers analyzed government AI regulations and guidelines, including those of the US, China, and the EU, and studied the usage policies of 16 major AI companies from around the world. ...

A company looking to use a LLM for customer service, for instance, might care more about a model’s propensity to produce offensive language when provoked than how capable it is of designing a nuclear device. ...

See the benchmarking site here: https://crfm.stanford.edu/helm/air-bench/latest/#/leaderboard

See the full story here: https://www.wired.com/story/ai-models-risk-rank-studies/

Filed under: Non-3D stories Comments Off

Comments (0) Trackbacks (0) ( subscribe to comments on this post )

Sorry, the comment form is closed at this time.

Trackbacks are disabled.

Can AI truly replicate the screams of a man on fire? Video game performers want their work protected » « MIT releases comprehensive database of AI risks

Pages

If your company is an ETC member, you can log in and see more news posts at www.etcentric.org

philip lelyveld The world of entertainment technology

Researchers Have Ranked AI Models Based on Risk—and Found a Wild Range

Pages

More posts