AI Safety, Alignment, and Interpretability Breakthroughs 2025