Is AI an Existential Threat?
Understanding the risks of AI and the urgent need for mitigation.
Introduction
In the largest survey of AI researchers conducted to date, a significant majority indicated a non-trivial risk of human extinction stemming from the potential development of superhuman AI, estimating the likelihood at around 5%. This growing recognition of risk establishes the pressing need for coordinated global efforts to address the challenges posed by frontier AI systems such as Gemini, GPT, Claude, and Llama.
Industry leaders, including OpenAI CEO Sam Altman, Google DeepMind CEO Demis Hassabis, and Anthropic CEO Dario Amodei, have signed open letters calling for the mitigation of these risks and have publicly acknowledged the catastrophic potential of advanced artificial intelligence systems.
The scope and complexity of catastrophic risks associated with advanced AI extend far beyond mere technical challenges, and far beyond the displacement of human workers. They encompass a broad spectrum of threats: malicious misuse by bad actors, competitive pressures that incentivize unsafe deployment, unintended or accidental failures arising from organizational complexity, and the anticipated difficulty of controlling AI systems that exceed human-level intelligence.
Managing these risks demands a coordinated effort among model developers, governments, and the broader third-party testing ecosystem of non-profits, civil society organizations, academia, and private sector stakeholders.
Risk Types
Malicious Use
Actors intentionally misusing AI to cause harm.
Cyber Attacks: AI-powered tools can exploit vulnerabilities, conduct personalized phishing campaigns, or automate hacking.
Bioterrorism: AI systems, including language models, could lower the barrier to creating biological weapons.
Automated Warfare: Killer drones and autonomous weapons could target specific groups or escalate conflicts.
Deepfakes: AI-generated content used for misinformation, extortion, or non-consensual imagery.
Mass Surveillance and Oppression: AI enables large-scale surveillance and censorship, potentially cementing authoritarian regimes.
Misalignment
AI systems that fail to align with human values, resulting in unintended or harmful behaviors.
Strategic Deception: AI systems may employ deceptive tactics, such as pretending to be human to bypass safeguards.
Unexpected Behaviors: Poorly understood AI mechanisms can result in unpredictable and harmful actions, as seen with past chatbot failures.
Self-Replication: Advanced AI systems may replicate or evolve autonomously, leading to potential loss of control.
Superintelligence Threat: AI systems could surpass human intelligence and pursue unintended goals, potentially leading to catastrophic outcomes.
Negative Externalities
Unintended socioeconomic, environmental, and societal impacts of AI integration, even if all actors are well-meaning.
Bias Perpetuation: AI systems often reflect and reinforce societal biases present in their training data.
Unemployment: Automation could displace jobs, increasing economic inequality and mental health challenges.
Social Media Harms: AI algorithms on platforms can exacerbate addiction, anxiety, and depression.
Epistemic Deterioration: Filter bubbles and misinformation undermine collective understanding and societal consensus.
Loss of Cognitive Skills: Over-reliance on AI may erode human decision-making and problem-solving abilities.
AI Safety Research
In contrast to AI development, AI safety research is still in its infancy and attracts far less investment. Nevertheless, the field is rapidly evolving, with frontier AI companies and the broader AI safety community identifying strategies and policies to improve the safety of AI systems. This collective effort must systematically address immediate, tangible risks while anticipating emerging challenges and proactively implementing strategies to mitigate them.
Exploring Our Database
Our AI Safety Database provides actionable insights tailored to each risk category, including a systematic categorization of risks, their causes, and their potential impacts.
We highlight cutting-edge technical, organizational, and policy solutions.
We provide case studies that illustrate both risks and mitigations.
We guide stakeholders to the content most relevant to policy, research, or the private sector.
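The categorization described above (risks grouped by category, with causes, impacts, mitigations, and stakeholder relevance) could be sketched as a simple structured record. The schema below is purely illustrative: the field names, example entry, and helper function are assumptions for this sketch, not the database's actual format.

```python
from dataclasses import dataclass, field

@dataclass
class Risk:
    # Hypothetical record schema mirroring the categorization above;
    # the real database's fields may differ.
    name: str
    category: str                 # e.g. "Malicious Use", "Misalignment", "Negative Externalities"
    causes: list = field(default_factory=list)
    impacts: list = field(default_factory=list)
    mitigations: list = field(default_factory=list)   # technical, organizational, or policy
    audiences: list = field(default_factory=list)     # e.g. "policy", "research", "private sector"

# Example entry drawn from the "Malicious Use" category in this article.
deepfakes = Risk(
    name="Deepfakes",
    category="Malicious Use",
    causes=["Widely available generative models"],
    impacts=["Misinformation", "Extortion", "Non-consensual imagery"],
    mitigations=["Content provenance standards", "Platform detection policies"],
    audiences=["policy", "private sector"],
)

def risks_for_audience(risks, audience):
    """Filter entries by stakeholder relevance."""
    return [r for r in risks if audience in r.audiences]

print([r.name for r in risks_for_audience([deepfakes], "policy")])  # -> ['Deepfakes']
```

One natural design choice here is tagging each record with its intended audiences, so the same taxonomy can serve policymakers, researchers, and private-sector readers through simple filtering rather than separate documents.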