My Reservations about Discovering Latent Knowledge (Burns, Ye, et al)
aizi.substack.com
[AI Safety relevance rating: AI] This is my second post on Burns, Ye, et al’s recent preprint Discovering Latent Knowledge in Language Models Without Supervision. My first post summarizes what they did and what I liked about it.Thanks for reading From AI to ZI! Subscribe for free to receive new posts and support my work.
My Reservations about Discovering Latent Knowledge (Burns, Ye, et al)
My Reservations about Discovering Latent…
My Reservations about Discovering Latent Knowledge (Burns, Ye, et al)
[AI Safety relevance rating: AI] This is my second post on Burns, Ye, et al’s recent preprint Discovering Latent Knowledge in Language Models Without Supervision. My first post summarizes what they did and what I liked about it.Thanks for reading From AI to ZI! Subscribe for free to receive new posts and support my work.