All Articles

430 articles

Basic sections

Intro to AI safety
I: AI progress is leading to superintelligence
II: AI may end up opposed to us
III: Consequences could be major, including human extinction
IV: We need to get our act together
Understanding AI systems
Fundamentals
Notable AI systems
Capabilities and limitations
Learning mechanisms
Prompting techniques
Alignment techniques
Future AI
General intelligence
Superintelligence
Time scales
Agency and autonomy
The alignment problem
Foundations of alignment
Core challenges
Common misconceptions
Difficulty of alignment
Inner and outer alignment
Deception
Implications of superintelligence
Is this serious?
Extreme outcomes
Misuse
AI takeover scenarios
Pathways to risk
Positive futures
Objections and responses
Is smarter-than-human AI unrealistic?
Is AI alignment easy?
Why not just set AI goals?
Why not just control AI?
Dealing with misaligned AGI after deployment
Other issues from AI
Morality
Objections to AI safety research
Miscellaneous arguments
Other resources
About Us
Resources elsewhere
Research resources

Advanced sections

Beyond the basics
Interpreting language models
Mesa-optimizers and subagents
Decision theory
Mathematics of agents
Strategy and outcomes
Brain emulation
Human intelligence enhancement
Computer science
Values
AI consciousness
Making LLMs useful
Predictions about future AI
Timelines
Compute and scaling
Nature of AI
Takeoff
Takeover
Relative capabilities
Good outcomes
Catastrophic outcomes
Alignment research
Current techniques
Benchmarks and evals
Prosaic alignment
Interpretability
Agent foundations
Other alignment approaches
Organizations and agendas
Researchers
AI governance
Governance research
Compute governance
Major labs
International politics
Policies
Policy resources
Governance research organizations

Other articles

Deliberately Unlisted

AISafety.info

AISafety.info is a project founded by Rob Miles. The website is maintained by a global team of specialists and volunteers from various backgrounds who want to ensure that the effects of future AI are beneficial rather than catastrophic.

© AISafety.info, 2022

AISafety.info is an Ashgro Inc project. Ashgro Inc (EIN: 88-4232889) is a 501(c)(3) Public Charity incorporated in Delaware.