We Need Breadth-First AI Safety Plans
Depth-first plans lay out a path from here to aligned superintelligent AI. We need those kinds of plans. But depth-first plans depend on many assumptions: “We will make AI safe by doing step 1, then step 2, then step 3.” Step 1 only works under condition A, step 2 requires condition B, step 3 requires condition C. If A or B or C is false, the whole plan fails (and there’s a good chance we all die).
Consider Google’s safety plan from April 2025. To my knowledge, this is the best among the frontier AI companies’ plans.1
Google’s plan depends on a series of conditions:
Continue reading
