Sitemap - 2023 - From AI to ZI

Rating my AI Predictions

The Future of From AI To ZI

Twelve Months in AI Safety

Unsafe AI as Dynamical Systems

AIs teams will probably be more superintelligent than individual AIs

[Research Update] Sparse Autoencoder features are bimodal

Explaining "Taking features out of superposition with sparse autoencoders"

Is behavioral safety "solved" in non-adversarial conditions?

Statistics for the Working Mathematician

Incorrectness Cascades - Three small follow-ups

Research Report: Incorrectness Cascades (Corrected)

I was Wrong, Simulator Theory is Real

Study 1b: This One Weird Trick does NOT cause incorrectness cascades

Study 1b Pre-registration

Research Report: Incorrectness Cascades

Pre-registering a study

Invocations: The Other Capabilities Overhang?

Early Results: Do LLMs complete false equations with false equations?

Corrigibility, Self-Deletion, and Identical Strawberries

Three of my beliefs about upcoming AGI

GPT-4: What we (I) know about it

Why do we assume there is a "real" shoggoth behind the LLM? Why not masks all the way down?

What's left for AGI besides scale?

Explaining SolidGoldMagikarp by looking at it from random directions

Addendum: More Efficient FFNs via Attention

No Really, Attention is ALL You Need

The Gallery for Painting Transformations

How does GPT-3 spend its 175B parameters?

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts