AI Safety Index 2025: Leading Labs Fail to Close the Safety Gap Despite AGI Ambitions

ChineMetaNews

AI Industry Struggles with Growing Safety Risks

The Future of Life Institute’s Winter 2025 AI Safety Index reveals a widening gap between the aggressive pursuit of Artificial General Intelligence and the implementation of credible safety measures. Despite leading the pack, top firms like Anthropic and OpenAI failed to score above a C+, highlighting systemic weaknesses in preventing catastrophic risks. The report calls for a shift from high-level rhetoric toward evidence-based safeguards and independent oversight to manage self-improving AI systems.

Points clés

  • The Future of Life Institute (FLI) evaluated eight leading AI companies, including Anthropic, OpenAI, Google DeepMind, and Meta.
  • No company scored higher than a C+ grade, with Anthropic leading the rankings at 2.67 and OpenAI following at 2.31.
  • The evaluation was conducted using 35 indicators across six domains, including Existential Safety and Governance.
  • Existential Safety was the lowest-scoring domain across the entire sector, with all evaluated companies receiving either a D or an F.
  • Five out of the eight companies voluntarily completed a 34-question survey to provide data for the index.
  • Industry leaders were criticized for “foundational hypocrisy,” racing toward AGI without quantitative plans to maintain control.
  • Anthropic was praised for its Public Benefit Corporation structure but criticized for training on user data by default.
  • Meta received the lowest score in Information Sharing due to aggressive lobbying against safety regulations.
  • Chinese firms Alibaba Cloud and DeepSeek are noted for following binding national standards for watermarking despite lacking voluntary transparency.
  • A panel of eight experts, including Stuart Russell, oversaw the rigorous grading process.

À retenir

It is heartening to see that the geniuses building our future digital overlords are performing with the same academic rigor as a distracted middle schooler. While they race toward god-like intelligence, a “C+” is apparently the gold standard for preventing the end of civilization. My recommendation? If you’re building a world-ending supercomputer, maybe try to have a plan for the “off” switch that involves more than just good vibes and aspirational PDFs. But hey, at least they’re “transparent” about having no idea how to control what they’re creating.

Sources

Quiz sur le document: 10 questions

Loading