Categories: Wire Stories

MLCommons and AI Verify to collaborate on AI Safety Initiative

Agree to a memorandum of intent to collaborate on a set of AI safety benchmarks for LLMs


SAN FRANCISCO–(BUSINESS WIRE)–#ai–Today in Singapore, MLCommons® and AI Verify signed a memorandum of intent to collaborate on developing a set of common safety testing benchmarks for generative AI models for the betterment of AI safety globally.

A mature safety ecosystem includes collaboration across AI testing companies, national safety institutes, auditors, and researchers. The aim of the AI Safety benchmark effort that this agreement advances is to provide AI developers, integrators, purchasers, and policy makers with a globally accepted baseline approach to safety testing for generative AI.

“There is significant interest in the generative AI community globally to develop a common approach towards generative AI safety evaluations,” said Peter Mattson, MLCommons President and AI Safety working group co-chair. “The MLCommons AI Verify collaboration is a step-forward towards creating a global and inclusive standard for AI safety testing, with benchmarks designed to address safety risks across diverse contexts, languages, cultures, and value systems.”

The MLCommons AI Safety working group, a global group of academic researchers, industry technical experts, policy and standards representatives, and civil society advocates recently announced a v0.5 AI Safety benchmark proof of concept (POC). AI Verify will develop interoperable AI testing tools that will inform an inclusive v1.0 release which is expected to deliver this fall. In addition, they are building a toolkit for interactive testing to support benchmarking and red-teaming.

“Making first moves towards globally accepted AI safety benchmarks and testing standards, AI Verify Foundation is excited to partner with MLCommons to help our partners build trust in their models and applications across the diversity of cultural contexts and languages in which they were developed. We invite more partners to join this effort to promote responsible use of AI in Singapore and the world,” said Dr Ong Chen Hui, Chair of the Governing Committee at AI Verify Foundation.

The AI Safety working group encourages global participation to help shape the v1.0 AI Safety benchmark suite and beyond. To contribute, please join the MLCommons AI Safety working group.

About MLCommons

MLCommons is the world leader in building benchmarks for AI. It is an open engineering consortium with a mission to make AI better for everyone through benchmarks and data. The foundation for MLCommons began with the MLPerf® benchmarks in 2018, which rapidly scaled as a set of industry metrics to measure machine learning (ML) performance and promote transparency of ML and AI techniques. In collaboration with its 125+ members, global technology providers, academics, and researchers, MLCommons is focused on collaborative engineering work that builds tools for the entire AI industry through benchmarks and metrics, public datasets, and best practices.

About the AI Verify Foundation

The AI Verify Foundation aims to harness the collective power and contributions of the global open-source community to develop AI testing tools to enable responsible AI. The Foundation promotes best practices and standards for AI. The not-for-profit Foundation is a wholly owned subsidiary of the Infocommunications Media Development Authority of Singapore (IMDA).

Contacts

Kelly Berschauer

kelly@mlcommons.org

Alex

Recent Posts

HKPC New Industrialisation Unveils “Hong Kong Manufacturing Industries Development Study Report”

81% of Manufacturing Industries Yet to Embrace Smartification Solutions 7 Key Recommendations to Ignite New…

4 hours ago

NIA Unveils Path for Thai Innovation as It Enters 16th Year Aiming to Propel Thailand Towards Becoming an Innovative Nation

BANGKOK, THAILAND - Media Outreach Newswire - 16 September 2024 - The Ministry of Higher…

9 hours ago

Dusit Thani Bangkok partners with Porsche Thailand to offer ‘one-of-a-kind’ luxury limousine service for guests

Dusit’s reimagined flagship hotel sets a new standard in Thailand, becoming the first to offer…

10 hours ago

Every pip matters: Octa broker’s guide to market spreads

KUALA LUMPUR, MALAYSIA - Media OutReach Newswire - 16 September 2024 - No matter what…

10 hours ago

I-PRIMO Commemorates 25th Anniversary with the Grand Opening of Second Store at Suntec City

Explore I-PRIMO’s exquisite collection of over 200 beautifully crafted rings, each designed to capture the…

12 hours ago