Emmanuel Ameisen @mlpowered
Interpretability/Finetuning @AnthropicAI Previously: Staff ML Engineer @stripe, Wrote BMLPA by @OReillyMedia, Head of AI at @InsightFellows, ML @Zipcar mlpowered.com/book/ San Francisco, CA Joined June 2017-
Tweets2K
-
Followers11K
-
Following245
-
Likes7K
This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!
Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
We evaluated an early version of Claude Mythos Preview for risk assessment during a limited window in March 2026. We estimated a 50%-time-horizon of at least 16hrs (95% CI 8.5hrs to 55hrs) on our task suite, at the upper end of what we can measure without new tasks.
@__lightyear__ The reason the NLA showed these results on opus is that it was trained on transcripts where it ended up needing to infer the user's language. That's not true for the neuronpedia models (paper has more details)
Interpreting model activations is important to understand why a model is doing what its doing. Traditionally, we've done this with supervised methods (probing for a specific context), or unsupervised sparse decompositions (dictionary learning). But probing requires you to know what you are looking for, and sparse dictionaries can be overwhelming to interpret. NLAs are exciting because they instead generate natural language explanations, which we can then inspect for a variety of behaviors. For example, they reveal the planning behavior we first observed with circuit tracing last year. They also helped identify bugs in Claude's training pipeline, where some prompts were only partially translated. If you want to play with them, NLAs on open models are available on Neuronpedia! neuronpedia.org/llama3.3-70b-i…
New Anthropic research: Natural Language Autoencoders. Models like Claude talk in words but think in numbers. The numbers—called activations—encode Claude’s thoughts, but not in a language we can read. Here, we train Claude to translate its activations into human-readable text.
Interpreting language models can feel like stumbling through a dark forest - sometimes you just wish you had a flashlight! In our new post, we introduce HeadVis, our latest flashlight for studying attention heads.
How do LLMs store attributed of entities? And how do they compare different attributes in context? It turns out they mostly store information about a given entity over its own token, which allows for easy lookups. But in addition to the current entity's information, models also store information about the previous entity. That might seem redundant, but it actually enables a model to identify relationships between the current entity and the previous entity in one step!
Many LLMs struggle to parse statements like “Alice prepares and Bob consumes food.” Ask them “Who consumes food?” and they'll get it wrong What’s up with that? We researched whether models can represent multiple entities at once, and if so, why do they fail here? 🧵
Do LMs plan without verbalizing their plans? I'll be at ICLR presenting work with @mlpowered using circuit tracing to reveal latent planning—from choosing "a" vs "an" based on a planned-for word, to rhyming poetry—and how these abilities grow with scale: openreview.net/forum?id=H0B7p…
Made this 30 second video of Claude Design just by pasting in the Claude Design blog post and some tweets from @AnthropicAI employees Kinda speechless.
Anthropic’s Opus 4.7 just seized the #1 spot on the Vals Index with a score of 71.4%, a massive jump from the previous best (67.7%). It also ranks #1 on Vibe Code Bench, Vals Multimodal, Finance Agent, Mortgage Tax, SAGE, SWE-Bench, and Terminal Bench 2.
🧵New Anthropic Fellows research: We studied mechanisms of "introspective awareness" in LLMs. LLMs can sometimes detect steering vectors injected into their residual stream. But is this worthy of being called introspection, or attributable to some uninteresting confound?👇
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025.
Proud to work at a place that stands behind its values. 🇺🇸
A statement on the comments from Secretary of War Pete Hegseth. anthropic.com/news/statement…
I used to bite my tongue and hold my breath. Scared to rock the boat and make a mess. I stood for nothing, so I fell for everything. 🎶
AI is not a normal technology, and Anthropic’s mission is to make sure that it serves the long-term benefit of humanity. Doing so requires making tough decisions, and standing up for what we think is right. This is us doing that.
A statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War. anthropic.com/news/statement…
Chris Albon @chrisalbon
92K Followers 3K Following Field notes on generating knowledge with AI at https://t.co/4E9DwWIDG7 | Director, ML & Data @Wikimedia
vicki @vboykis
59K Followers 1K Following I move vectors to different machines sometimes. Founding ml engineer in recsys/search. building ✨I like Nutella.
Jeremy Howard @jeremyphoward
319K Followers 7K Following 🇦🇺 Co-founder: @AnswerDotAI/@FastDotAI ; Prev: Professor@UQ; @kaggle founding president; founder @fastmail/@enlitic/… https://t.co/16UBFTX7mo
Riley Goodside @goodside
215K Followers 3K Following Screenshots of chatbots since 2022. Formerly: Google DeepMind, Scale
Radek Osmulski @radekosmulski
30K Followers 615 Following LLMs and retrieval by day and other genres of AI when I get the chance 🧪 Senior AI Eng @NVIDIAAI 🏫 @fastdotai trained DL Eng 📝 https://t.co/By87iXx5Pu
Hamel Husain @HamelHusain
50K Followers 3K Following Evals Evals Evals - https://t.co/Zrmp6LRd9c About Me: https://t.co/P6WyeKkyTa
Delip Rao e/σ @deliprao
69K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Sanyam Bhutani @bhutanisanyam1
42K Followers 1K Following 👨💻 Working on llama models @AIatMeta | Previously: @h2oai, @weights_biases 🎙 Podcast @ctdsshow 👨🎓 Fellow @fastdotai 🎲 Grandmaster @Kaggle
Eugene Yan @eugeneyan
28K Followers 664 Following MTS @AnthropicAI. Prev: Principal Applied Scientist @Amazon, led ML @ Alibaba, Lazada, Healthtech startup.
merve @mervenoyann
88K Followers 5K Following (mer-veh) open-sourceress at @huggingface 🧙🏻♀️ DM me for any feedback about HF 🤗 https://t.co/MhrMkGTm7p
Eric Jang @ericjang11
135K Followers 4K Following
Sean J. Taylor @seanjtaylor
44K Followers 4K Following model measurement @OpenAI. Formerly @MotifAnalytics @Lyft and @Facebook. Keywords: Experiments, Causal Inference, Statistics, Machine Learning, Economics.
Alexandr Wang @alexandr_wang
515K Followers 858 Following chief ai officer @meta, founder @scale_ai. rational in the fullness of time
clem 🤗 @ClementDelangue
408K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
👩💻 Paige Bai... @DynamicWebPaige
75K Followers 2K Following ✨ AI should be about empowering humans, building understanding, and making dreams realities. 👩💻 DevX Eng. Lead @GoogleDeepMind ex-@GitHub || views = my own!
Nathan Benaich @nathanbenaich
72K Followers 35K Following solo member of superinvestment staff @airstreet @airstreetpress @stateofai @raais
Andrew Carr 🤸 @andrew_n_carr
26K Followers 5K Following co-founder leading science @getcartwheel co-founder advisor @arcade_ai Past: Codex @OpenAI, Brain @GoogleAI, world ranked Tetris player
Tanishq Mathew Abraha... @iScienceLuvr
89K Followers 1K Following CEO @SophontAI | Founder @MedARC_AI | PhD at 19 (2023) | ex Research Director Stability AI | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6Qb
swyx 🔜 @aiDotEngin... @swyx
169K Followers 4K Following achieve ambition with intentionality, intensity, integrity & insanity. affiliations: - @dxtipshq - @cognition - @temporalio - @aidotengineer - @latentspacepod
Daniel Collins @novipyro
69 Followers 4K Following
Boyd Kane is in Londo... @beyarkay
477 Followers 967 Following human eyes should see the 22nd century. MATS9 w/ Alex (Turner|Cloud), writer of essays, spinner of satellites
Sam Brown @samizdis
42 Followers 410 Following
Koji @littleironwaltz
12 Followers 139 Following SWE in Tokyo 🇯🇵 obsessed with Claude Code, fueled by good music 🎧 previously in London & New Haven.
John @1obachevsky
1 Followers 502 Following
7384254c @7384254b
625 Followers 739 Following 🌅⏩⏩⏩h+ made of metal, wood, goopy stuff, and ceramic 𒀭w𒀭
Vǫktuneyra @voktuneyra
61 Followers 773 Following
Sam jackson @SmokeyMcNubbins
165 Followers 2K Following enterprise UX/digital fluency/solutions, AI, musician, producer/engineer, a7iii-er, helping
Moira @Vera28765582815
11 Followers 246 Following
dirceu @dirceu
983 Followers 542 Following Brazilian who became Canadian. Building AI agents at @Shopify. I like systems thinking, simplifying things, and JRPGs that don't require reflexes.
Morgane @sokosomi
0 Followers 2K Following
Sean @70ftoaks
12 Followers 112 Following
A* is in SF 7/13-7/20 @BeforeMorning__
346 Followers 1K Following ~~Mechanical eng + aerospace eng + philosophy student~~LLM, art, and interdisciplinary research enjoyer ***still learning***
henosis @hen0s1s
108 Followers 1K Following building to share intelligence & accelerate adoption model connoisseur prompting: @HenosisChat
Sichu Lu @lu_sichu
5K Followers 6K Following dms open nlab fan account/arxiv surveyor/pubmed enjoyer,two culture bridger, vacuous high gossiper,dearth of any domain expertise,reluctant g theorist,gpu poor
Nate Delaney-Busch @NateDBusch
44 Followers 318 Following Striving to build a more just, hopeful, and kind future. @MATSprogram Research Manager. Views are my own.
Deirdre💧🔥💨 @flowinguphill
5K Followers 5K Following Societal shifts, climate change, and AI from complex systems perspective. Practice satyagraha. Prev: Center for Nonlinear Studies / LANL, @SFIScience
Ian Jorre Beyst @ianbeyst
1K Followers 1K Following Mathematician. I design algorithms. Interested in energy markets, greentech, thermodynamics, history, etc
Tim @atomami0192
0 Followers 139 Following
Yusuf Ozuysal @yusufozuysal
425 Followers 2K Following Tinkering mode back on, previously @SnowflakeDB Cortex AI, @Neeva and @googlechrome
Mohamed @mekkcyber
842 Followers 842 Following ML Research Engineer @whitecircle | prev @huggingface 🤗 | Msc @ParisSaclay | Making LLMs Faster & Smaller & Safer
Alex Levin @AlexLev90212361
0 Followers 15 Following
Shankar V @shankysaurus
23 Followers 407 Following
Eve Rosa @EveRosa262836
4 Followers 208 Following
Risab Biswas @risab_biswas
85 Followers 209 Following ML Research @P360_Solutions | Prev: Master’s Thesis Research at @UofGlasgow | LLMs, Agents & Product Engineering
notasnowflakeschance @notasnowflaks
48 Followers 3K Following
Wei-Chuang Chan @JamwayChuang
67 Followers 290 Following
Lucas Bandarkar @LucasBandarkar
338 Followers 388 Following PhD student @uclaNLP — ML / #NLProc / multilingual @AIatMeta, @GoogleResearch
Ryan Peters @ryanpirl
177 Followers 94 Following Reverse engineering intelligent (learning) systems.
Daylilly @Daylilly06
0 Followers 35 Following
Xiaosha Evelyne Li @EvelyneXSLi
19 Followers 87 Following CSE (ECE) | Music Tech | GaTech - towards Machine Listening and Creating
Alberto Díez_ CEO of... @AdmAdg5
36 Followers 2K Following Founder & CEO, Global Authority UNITED NATIONS SDG Global Advisor FGO World Elevare Award Miami Diamond Excellence World Prize: Agency of the Year, Madrid
YuchenQuan @QuanYuchen27918
10 Followers 181 Following A third-year student in the College of Artificial Intelligence of China University of Petroleum, Peking. Major in Image Processing, Diffusion Models.
bebek34 @yudan_syah
8 Followers 53 Following
Pulseon @pulseon_dev
8 Followers 317 Following
François Chollet @fchollet
702K Followers 826 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
Sebastian Raschka @rasbt
468K Followers 1K Following ML/AI research engineer. Ex stats professor. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW) & reasoning (https://t.co/5TueQKx2Fk)
Yann LeCun @ylecun
1.2M Followers 787 Following Professor at NYU & Executive Chairman at AMI Labs. Ex-Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
AK @_akhaliq
507K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5XOCi
vicki @vboykis
59K Followers 1K Following I move vectors to different machines sometimes. Founding ml engineer in recsys/search. building ✨I like Nutella.
Jeremy Howard @jeremyphoward
319K Followers 7K Following 🇦🇺 Co-founder: @AnswerDotAI/@FastDotAI ; Prev: Professor@UQ; @kaggle founding president; founder @fastmail/@enlitic/… https://t.co/16UBFTX7mo
Riley Goodside @goodside
215K Followers 3K Following Screenshots of chatbots since 2022. Formerly: Google DeepMind, Scale
Hamel Husain @HamelHusain
50K Followers 3K Following Evals Evals Evals - https://t.co/Zrmp6LRd9c About Me: https://t.co/P6WyeKkyTa
Soumith Chintala @soumithchintala
309K Followers 1K Following Building new things @thinkymachines. Also dabble in robotics at NYU. Cofounded @PyTorch. AI is delicious when it is accessible and open-source.
Google DeepMind @GoogleDeepMind
1.5M Followers 278 Following The engine room of @Google. Building AI safely and responsibly to solve the world’s most complex problems. Join us: https://t.co/jUHQA27iBL
Richard Socher @RichardSocher
120K Followers 1K Following Building self-improving superintelligence CEO @recursive_si and @youdotcom MP @aixventuresHQ Ex: Stanford Adj Prof, Chief Scientist at Salesforce, CEO MetaMind
Eric Jang @ericjang11
135K Followers 4K Following
Sean J. Taylor @seanjtaylor
44K Followers 4K Following model measurement @OpenAI. Formerly @MotifAnalytics @Lyft and @Facebook. Keywords: Experiments, Causal Inference, Statistics, Machine Learning, Economics.
AI Pub @ai__pub
71K Followers 339 Following AI papers and AI research explained, for technical people. Get hired by the best AI companies: https://t.co/MySVjUGOQ3
Michael Nielsen @michael_nielsen
119K Followers 5K Following Searching for the numinous 🇦🇺 🇨🇦, currently live in 🇺🇸 Research @AsteraInstitute https://t.co/maezekzRUb https://t.co/2dWwZKrvrn
Horace He @cHHillee
51K Followers 590 Following @thinkymachines Formerly @PyTorch "My learning style is Horace twitter threads" - @typedfemale
Ferenc Huszár @fhuszar
43K Followers 1K Following Founder & Supreme Leader of Technical Staff at Reasonable. Professor on leave from @Cambridge_CL. Alum of @Twitter, Magic Pony, @Balderton
rohan anil @_arohan_
43K Followers 2K Following member of technical staff & co-founder of @coreautoai - and continuing to aspire to understand deep learning.
Michael Andregg @michaelandregg
10K Followers 800 Following ceo of eon | human emulation pbc https://t.co/M7nhgJxMlO prev: optical supercomputers/networking/robotics, high-speed mass production electron microscopy
Viacheslav Sinii @ummagumm_a
109 Followers 300 Following
mark bissell @MarkMBissell
2K Followers 879 Following aspiring gentleman scientist, not necessarily in that order || prev: @GoodfireAI @PalantirTech @UniofOxford @WilliamsCollege
Ekdeep Singh Lubana @EkdeepL
3K Followers 1K Following Member of Technical Staff @GoodfireAI; Previously: Postdoc / PhD at Center for Brain Science, Harvard and University of Michigan
Goodfire @GoodfireAI
24K Followers 29 Following Using interpretability to understand, learn from, and design AI.
Jack Merullo @jack_merullo_
2K Followers 410 Following Interpretability @GoodfireAI was a Phd @BrownUniversity
Zheng Zhao @zhengzhao97
531 Followers 296 Following PhD Candidate @Edin_CDT_NLP @edinburghnlp | former intern @AIatMeta @amazon | working on LLMs
Ryota Kanai @kanair
10K Followers 3K Following CEO of Araya: https://t.co/os1GKxTrsd Consciousness Scientist: https://t.co/q6uxCveOHK, Funding Neurotech: https://t.co/0BLUmfFdhS
Nova DasSarma (p̄/de... @dropbella
685 Followers 231 Following Your Friendly Neighborhood Systems Architect · Rotate your passwords · Use two factor authentication · DM me about backups
Joshua Achiam @jachiam0
27K Followers 1K Following Freedom, flourishing, and abundance. Chief Futurist @openai. Main author of https://t.co/cKuSh21yaz
Eric J. Michaud @ericjmichaud_
4K Followers 1K Following Trying to make deep neural networks among the best understood objects in the universe. 💻🤖🧠👽🔭🚀
Kelsey Piper @KelseyTuoc
65K Followers 1K Following We're not doomed, we just have a big to-do list.
Arthur Conmy @ArthurConmy
8K Followers 2K Following soon @anthropicai prev: fixing things @googledeepmind
johnny @johnnylin
549 Followers 4 Following @neuronpedia someone told me you're supposed to reply to all the replies to yoru tweets to boost ur twitter rank or something. im not gonna do that
Michael Hanna @michaelwhanna
824 Followers 508 Following PhD student at the University of Amsterdam / ILLC, interested in computational linguistics and (mechanistic) interpretability.
Lee Sharkey @leedsharkey
3K Followers 2K Following Scruting matrices @ Goodfire | Previously: cofounded Apollo Research
Simon Boehm @Si_Boehm
3K Followers 117 Following
David Bau @davidbau
7K Followers 285 Following Computer Science Professor at Northeastern, Ex-Googler. Believes AI should be transparent. @[email protected] @davidbau.bsky.social https://t.co/wmP5LV0pJ4
Alex Tamkin @AlexTamkin
10K Followers 2K Following machine learning, science & society @AnthropicAI | recently: Clio, Anthropic Economic Index, Claude Artifacts | prev: phd @StanfordAILab, @stanfordnlp
Nicholas Turner @nicholasturner0
328 Followers 369 Following Research Scientist - ML, Mechanistic Interpretability, Neuroscience ||| Tweets do not represent the views of my employer ||| he/him
Adam Pearce @adamrpearce
6K Followers 382 Following @anthropicai, previously: google brain, @nytgraphics and @bbgvisualdata
Wes Gurnee @wesg52
4K Followers 234 Following Trying to read Claude’s mind. Interpretability at @AnthropicAI Prev: Optimizer @MIT, Byte-counter @Google
Jack Lindsey @Jack_W_Lindsey
18K Followers 253 Following Neuroscience of AI brains @AnthropicAI. Previously neuroscience of real brains @cu_neurotheory.
Dylan HadfieldMenell @dhadfieldmenell
5K Followers 3K Following Associate Prof @MITEECS working on value (mis)alignment in AI systems; Safety & Alignment Advisor at https://t.co/vt2gVrVr9f; @[email protected]; he/him
Liv @livgorton
6K Followers 420 Following ✨ asking sand to show its work // currently @AnthropicAI, prev @GoodfireAI // creating a more beautiful future
Samuel Marks @saprmarks
5K Followers 148 Following AI safety research @AnthropicAI, leading Cognitive Oversight team. Previously: postdoc with @davidbau, math PhD at @Harvard.
levent @__alpoge__
4K Followers 104 Following idiot. cuda og, harvard val, morgan prize, society of fellows, 1 hilbert problem so far, creating friendly, SAFE, delightful, supergenius ..things @anthropicai
METR @METR_Evals
26K Followers 32 Following We work to scientifically measure whether and when AI systems might threaten catastrophic harm to society. Nonprofit.
John Schulman @johnschulman2
76K Followers 2K Following Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Jan Leike @janleike
133K Followers 335 Following AI research @AnthropicAI. Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.
Keller Jordan @kellerjordan0
18K Followers 442 Following CIFAR-10 fanatic Pretraining @OpenAI OpCo LLC.
Kamal Ndousse @kandouss
4K Followers 543 Following AI @AnthropicAI Social learning enthusiast. Opinions and dumb jokes my own.
Leo Gao @nabla_theta
13K Followers 580 Following working on AGI alignment. prev: GPT-Neo, the Pile, LM evals, RL overoptimization, scaling SAEs to GPT-4, interp via circuit sparsity. EleutherAI cofounder.
Shannon Yang @shannonyangsky
2K Followers 5K Following Building talent & community in AI safety. Currently @AISecurityInst, prev. @AnthropicAI. Philosophy, Politics, and Economics alumna @UniofOxford.
James Bradbury @jekbradbury
17K Followers 9K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.
Tom Brown @NotTomBrown
27K Followers 432 Following Co-founder and Chief Compute Officer @AnthropicAI
Neel Nanda @NeelNanda5
41K Followers 122 Following Mechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!
Joshua Batson @thebasepoint
6K Followers 679 Following trying to understand evolved systems (🖥 and 🧬) interpretability research @anthropicai formerly @czbiohub, @mit math
Jonathan Gray @jgrayatwork
2K Followers 376 Following Inference @AnthropicAI. Previously: @MetaAI (FAIR), @openai


































