14 Frontier
15 Frontier
Recent work (roughly the last 12 months), newest first. Entries graduate into a topic chapter’s chronological timeline (Part II) once superseded or matured — keeping the book continuously evolving.
15.1 Signposts
Fast-moving paths shaping the next few years, each headed for a topic chapter.
- Recursive self-improvement / automated AI R&D — AI accelerating AI compresses oversight time and amplifies misalignment; the headline reason scalable oversight and capability thresholds exist. → Monitoring & Oversight
- AI control — assume a model may be scheming and design protocols safe regardless (Greenblatt et al., 2023). → Monitoring & Oversight
- Deceptive alignment / scheming — misaligned behavior that survives safety training (Hubinger et al., 2024). → Alignment
- Dangerous-capability evaluations — CBRN, cyber-offense, autonomy, self-proliferation; the trigger behind frontier safety frameworks. → Evaluation
- Safety cases — structured arguments that a system is safe enough to deploy (Hilton et al., 2025). → Systemic Safety & Governance
- Chain-of-thought monitorability — keeping reasoning legible for oversight, and the risk that optimization erodes it. → Interpretability
15.2 Intake queue
Dated entries below, newest first — one-line claim, why it matters, target topic. (Empty — populated during the continuous literature scan.)