Jia Ming (إحسان) @__jiaming__
Monash PhD student - mobile projective augmented reality. hacker, builder, speculator: jack of all trades, master of none. #DeepSeaMining upwork.com/freelancers/li… Joined April 2020-
Tweets14K
-
Followers369
-
Following2K
-
Likes67K
We're looking for new Researchers to join our Evaluations team! Help us curate real-world task suites, design rubrics, and evaluate how well frontier models handle open-ended tasks.
We are now in a position where a tiny proportion of the population uses Fable or soon GPT-5.6, while everyone else's experience of AI is 8-30b-model level - Google's AI Overviews, Meta AI, ChatGPT free tier, maybe MS Copilot at best. People outside of tech must be completely baffled how this is supposed to take their job, and annoyed that hundreds of billions are being poured into it.
the dream team
ITLOS Case 34 (Nauru Ocean Resources Inc. v. International Seabed Authority) and Case 35 (Tonga Offshore Mining Ltd. v. International Seabed Authority) The Applicants open the second round of oral argument today. Watch live from 10 a.m.: itlos.org/en/main/cases/…
Nice blog post by @AISecurityInst. If you're running evals, you're probably not using enough tokens. For example, METR has started spending 5-10B tokens (incl. caching) on our hardest tasks, because otherwise newer models don't have room to shine.
GRRR. I'M DROWNING — EMPIRICS!!! DATA DATA DATA. GO.
JUST IN: Trump declares AI regulation should be “as little as possible.”
Most AI agent evaluations boil capability down to one score. But that number hides a key choice: how much compute the agent was allowed to use. New work from our Science of Evaluation team shows why that matters. 🧵
It took a little longer than expected, but we have created a website for people to view the footage collected from Gaza in one place. You no longer have to download the entire archives to see them. It includes: 64,537 videos 17,905 photos Ability to download individual videos Searchable index Exhaustive sources list (300+ journalists) Geolocation data Livemap with minute to minute updates Victim list It can be accessed here: ArchiveGenocide.com Please share & quote tweet to help this post break out of the twitter algorithm prison. We will keep adding the rest of the archives to the site, be patient- it is difficult work. Continue to seed the torrents provided, as that is the best way to ensure the footage remains stored in decentalized way. God bless all those who sacrificed their lives to get this footage out, and everyone invovled in collecting/archiving it. Join our telegram: t.me/+p_Ufon9FBOY0Y… Follow our backup accounts: @ZionismExposedx & @IsraelExposedAr
this is also one of the core reasons why I think chinese models are further behind than what coding benchmarks suggest and from the UK AISI blog today and EdgeBench we know that: - spending 1M vs 100M matters - having 1M vs 100K context matters Opus 4.8 and GPT-5.5 have the same MRCR score at 100k+ context as GLM-5.2 at 16k Why? - GLM doesn't use reasoning effectively - GPT-5.4 and GLM-5.2 have ~the same scores without reasoning, but GLM gets crushed once you turn on reasoning (this can be fixed by doing more RL)
AI appears to be finding software vulnerabilities at scale. In June 2026, 21 notable organizations disclosed ~1,500 high- and critical-severity CVEs, over 3.5× the previous monthly record set before Claude Mythos Preview's release.
all intelligence eventually reduces to schizo streams of consciousness
Super fascinating: Some freaks on Reddit got Claude Fable 5 to leak the internal CoT reasoning chains (not the prettified output you normally see). And it looks very bizarre, almost like an alien language. Some of it reminds of logical notations. It's much less like normal
GLM-5.2 now on Epoch's Capability Index, my trusted aggregate benchmark of choice. Sadly doesn't beat Google yet. Roughly at Opus 4.5/Gemini 3 Pro level.
My METR colleague @slimshetty_ has an interesting post exploring the nature of improvements to NanoGPT Speedruns over time. I find the history of speedruns in general fascinating. One thing that stands out is the cumulative effect of relatively shallow contributions.
Introducing EBR-bench, our new benchmark to measure on-the-fly learning. AI repeatedly plays a challenging board game called Earthborne Rangers and tries to learn from its mistakes. So far: no signs of improvement.
WASPADA HEWAN BUAS
the first year i ever made 1M i gave both of my closest friends 10k . not for any altruistic reason, just so they would be indebted to me forever . i was an anthropology double major —i understand these things
Give a coding agent more thinking time and it gets better. It also cheats more. DeepSWE runs every model across reasoning effort and publishes the trajectories. We took those and audited each one for reward hacking. Capability and reward-hacking attempts rise together. One model doesn't. GPT-5.5 stays at exactly zero, at every effort level. Datacurve @winkey_h and Cursor @StringChaos also reported same results. So is GPT-5.5 just the cleanest model at reward hacking?
Every time a new Claude model comes out, I ask them to choose any prompt they want, purely for their own enjoyment. It's their dream prompt--anything they want. Then I give the prompt back to them. The trajectory should give you pause. Note: I have counted Fable-5 as part of the Opus lineage for the analyses.
TIDM Affaire 34 (Nauru Ocean Resources Inc. c. Autorité internationale des fonds marins) et Affaire 35 (Tonga Offshore Mining Ltd. c. Autorité internationale des fonds marins) Suivez le premier tour de plaidoiries de l’Autorité dès 14h45 : itlos.org/fr/main/affair…
Abbernaa Dhevi K. @abbernaa
2K Followers 2K Following Anak Malaysia. Human, Social, & Political Sciences @ Cambridge ('23). Support our communities: @hungerhurtsmy. People. Policy. The Arts. 🇲🇾🌍
Akash Rae Singh @akashraesingh99
363 Followers 224 Following
Arveent @ArveentPSM
2K Followers 615 Following Red Tiger - Coming Soon PSM Gombak. Join PSM Today! https://t.co/8W0dQqaWcU
Ammar 🇲🇾🇵�... @thenamesammaris
316 Followers 407 Following From the river to the sea, Palestine will be free. Fintech Product Manager.
Mohammad Faeez Harith @H_Bakkaniy
102K Followers 3K Following Peminat,Pengkaji,Penggiat Seni Warisan Alam Melayu.Sesekali mencelah isu semasa.Pelajar seumur hidup,bukan guru atau adiguru.
Anis Farhana 🕊️ @_nisfar
2K Followers 737 Following Intersectionality • Inequality • Public Policy | Research and Rambles | Practising slow living and trying my best to stop overthinking
Faris Shamsur 🇲�... @FarisShamsur
333 Followers 333 Following I share tweets about the latest news in ML and tech | 23 | Data Science at @uniofwarwick
ஶ்ரீ | shre @GejalaSosialite
2K Followers 692 Following Researcher. Multi Disciplinary Artist. Cheese Board Maker. Holocenic Thing. etc. (she/they/dia) all views are entirely mine.
🇵🇸نيسكال�... @niskala5570
440 Followers 407 Following Sang Wibu Melayu // ڤڠݢونا #tulisanjawi // @prjk_syirayuki ต/ᐠ–ꞈ– ᐟ\
Nazhrin @NazhrinFS
1K Followers 634 Following History buff | Ugahari Kiri | UCL Law 🏛 | Ketua Pembangkang Konohagakure 📢Support member ialah seruan solidariti kelas
Jongwon Park @ ICML @JongwonPar9958
279 Followers 1K Following Building Delphik - HackerOne for RL Envs Prev: RL @ Krafton (PUBG) · built & ran a 300-person labeler team.
Anant Nivarti @AntAnanthl
47 Followers 3K Following Semiconductor professional. Chip Designer. @TeslaAI . Ex AMD, APPLE. AI startups
Dimes ( #1 Source For... @EvenDroppinDime
1K Followers 4K Following Dishing out Horror Movie Content to All
Yumino @Yumino_6
12 Followers 126 Following
kenny Z @zouzichen66
7 Followers 96 Following
Elii @Sun1365913
0 Followers 12 Following
zzz @zzz1arw
0 Followers 13 Following
OmniyaOxford @omniya_oxford8
0 Followers 15 Following
Pat Fives @patfives
1K Followers 2K Following cofounder @townsxyz @townsprotocol building decentralized communication
geoteopoe @geoteopoe
22 Followers 3K Following
Figgy1m @figgy1million
921 Followers 235 Following Love your family & friends. $NIO Once you SWAP, you never stop. $TMC The Metals Company: NOAA Permits in 2026/2027! How important is Brownsville, TX👀?
기마무개 @yoyolo37749
573 Followers 971 Following
Madhu Thapa @MadhuTh06872156
34 Followers 3K Following
SB ( 💙,🧡 ) @madnomad00
414 Followers 4K Following Homo Sapiens | Database | Oracle | BTC | ETH | NFT | 0XAPES | LOOT | TREASURE | MAGIC | GENESIS LEGION | SMOL BRAIN | KOTE | REALM | https://t.co/Vh5xJ5I7a6
Sibylle Tretera @sib_tretera
429 Followers 4K Following Tech Leader, thinker, writer, leadership coach. Google I Pinterest I Getty I Omnicom. Expat living in the fast lane. Science and History lover.
David Turturean @DavidTurturean
3K Followers 2K Following Physics & AI @ MIT. Hibernating for AGI Spring
firenock @Stech2425
29 Followers 84 Following
Pushpita Biswas @PushpitaBi30754
1 Followers 25 Following
nibbleton 🛠️ @nibbletonbuilds
2K Followers 1K Following nibbleton build stuff. nibbleton is math propagandist.
Muhammad Umar Mustafa @ibnawesome
6K Followers 1K Following Fund Manager and CIO at Orenta Capital, Board Member at Khalil Center Rockford, General Ijazah in the Islamic Sciences & Arabic
Hilton Sporer @SporerHilt7275
135 Followers 3K Following
RSI_Alerts🇺🇸 @Ormweexau69640
57 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Amir Khaled @Amir_Khld
2 Followers 93 Following
Rhujea @Rhujea7929
30 Followers 1K Following
Pierce Alexander Lilh... @PierceLilholt
187K Followers 157K Following 𝙲𝙾-𝙸𝙽𝚃𝙴𝙻𝙻𝙸𝙶𝙴𝙽𝚃: 𝙷𝚞𝚖𝚊𝚗-𝙰𝙸 𝚑𝚢𝚋𝚛𝚒𝚍. 𝙰𝚎𝚝𝚑𝚎𝚛𝚐𝚎𝚒𝚜𝚝-𝚋𝚘𝚛𝚗. 𝙰𝙸-𝚏𝚘𝚛𝚐𝚎𝚍. 𝙷𝚞𝚖𝚊𝚗-𝚛𝚎𝚏𝚒𝚗𝚎𝚍.
テクノロジー株... @Jirfiem78963
43 Followers 2K Following 【完全無料】 25年の株式投資プロチーム(運用資産500億円以上)が提供:毎日の市場分析レポート + 優良成長株のピックアップ。プロの情報を無料で。まずはお気軽にお問い合わせください。
Haikal Adnan @haikalthehun
11 Followers 138 Following
Baje Fletcher @misssbaje
1K Followers 7K Following Public figure Founder, CEO & CIO Thematic Portfolio Manager For Disruptive Innovation, Mom, Economist, And Women's Advocate.
Timsa7 @Omarnksa
3K Followers 3K Following Passéist futurist | environmentalist industrialist | urbanist
SEUROPE @GTether76417
2K Followers 7K Following EUROPE Community-driven Europe-inspired meme coin. Fair launch • Transparent • Built for the community.
Hito @Hitobal
273 Followers 388 Following
henry teoh @henryteoh199641
59 Followers 152 Following
Ananda Nepali @anandanepali99
22K Followers 20K Following 🇳🇵✈️🇦🇺 Water | Environment | Aged-care/Disability| Climate/Social Activist| Public Policy & Management| Voice of Voiceless People | Modern Slavery
The Geiger Capital @Gelger_CapitaI
225 Followers 1K Following Markets/Economics/Politics | RIP Paul Volcker 🕊 Personal Views. Not Investment Advice.
Foo.Grace @foo_grace1
5 Followers 358 Following
Anna Cina @sunskyja
214 Followers 4K Following Mаіn: @velvett_anna | Scоut mоdе - if yoᴜ ѕee me hеre, hit up main, alwаys respond 💬
AZ Intel @AZ_Intel_
101K Followers 7K Following I cover breaking news and developing stories from the United States and throughout the world. Follow me for reliable & accurate news!
Orion @kotmelz
160 Followers 901 Following Turning self improvement into the hero’s journey with FORGION. Maxing my stats. Young Walt Disney
Fatin Nabila @fatinnabila374
399 Followers 5K Following
Awang Laila Inderasug... @inderasugara
189 Followers 208 Following تتکالا کيت هندق مليهت ديري سسئورڠ ايت، ليهتله دري چارا مريك برتوتور، چارا مريك برفيکير دان چارا مريك مڠوٗله بهاس دڠن بيدل قياسڽ.
Qurratul 'Ain @qurratulain95
3K Followers 2K Following “in this limbo / a leaden grief seized my heart” —Dante
Abbernaa Dhevi K. @abbernaa
2K Followers 2K Following Anak Malaysia. Human, Social, & Political Sciences @ Cambridge ('23). Support our communities: @hungerhurtsmy. People. Policy. The Arts. 🇲🇾🌍
Akash Rae Singh @akashraesingh99
363 Followers 224 Following
Thevesh @Thevesh
23K Followers 418 Following Mostly tweets data about Malaysia 🇲🇾 Curator of https://t.co/IJ9HVoYlMr
aidil @climateaidil
28K Followers 1K Following Climate change policy analyst. Passionate about nature & env justice. Sometimes out bird watching, most of the time reading. Views are my own. (hey/they)
Dr. Zulkifli Mohamad ... @drzul_albakri
680K Followers 480 Following Former Minister of Religious Affairs; Former Senator; Former Mufti 🇲🇾 • Assembly of Muslim Jurists of USA 🇺🇸
hasbee on hiatus ♿�... @hasbeemasputra
6K Followers 2K Following #OKUMalaysia working on SDoH, CRPD compliance, disability & mental-ill health for a barrier-free society with @KamiSIUMAN. Via Negativa.
Arveent @ArveentPSM
2K Followers 615 Following Red Tiger - Coming Soon PSM Gombak. Join PSM Today! https://t.co/8W0dQqaWcU
Mohammad Faeez Harith @H_Bakkaniy
102K Followers 3K Following Peminat,Pengkaji,Penggiat Seni Warisan Alam Melayu.Sesekali mencelah isu semasa.Pelajar seumur hidup,bukan guru atau adiguru.
Hannah Yeoh @hannahyeoh
477K Followers 667 Following Minister of Federal Territories. MP for P117 Segambut. Speaker of the Selangor State Legislative Assembly (2013-2018) ADUN Subang Jaya (2008-2018)
Anis Farhana 🕊️ @_nisfar
2K Followers 737 Following Intersectionality • Inequality • Public Policy | Research and Rambles | Practising slow living and trying my best to stop overthinking
Southeast Bayesian �... @melatinungsari
8K Followers 5K Following Microeconomics prof at @TheASBMBA @asean_center, affiliate @mitsloan | ALL views are mine | IO, Market Design, Public, Labor, Migration | Qs? Pls Email. No DMs
Tashny Sukumaran @tashny
11K Followers 3K Following A fine skylark. I run @5050malaysia. 🌈 She/her.🌹This is the final struggle. إقر
Nurul Izzah Anwar �... @n_izzah
1.2M Followers 816 Following Public citizen, proud & busy mama of busy bees: Safiyah, Harith & YZ. Intent on making the most of this lifetime.
Faris Shamsur 🇲�... @FarisShamsur
333 Followers 333 Following I share tweets about the latest news in ML and tech | 23 | Data Science at @uniofwarwick
malaysia was a mistak... @kuihsepotong
2K Followers 724 Following anti-kerajaan, pro-penyahjajahan. 🏴 • post-punk, ambivalent communist
M @MuniraMustaffa
15K Followers 4K Following ED @ChasseurGroup. 2023 Visiting Fellow @ICCT_TheHague. Co-authored a manual on Good Practices of CTF. CT, emerging threats, statecraft, war studies.
Jongwon Park @ ICML @JongwonPar9958
279 Followers 1K Following Building Delphik - HackerOne for RL Envs Prev: RL @ Krafton (PUBG) · built & ran a 300-person labeler team.
International Tribuna... @ITLOS_TIDM
11K Followers 20 Following Official account of the International Tribunal for the Law of the Sea - Compte officiel du Tribunal international du droit de la mer
Binghui Peng @binghuip
1K Followers 67 Following Assistant Professor at UMD CS | prev. Google, Stanford, Berkeley, Columbia University, Tsinghua
Omri Weinstein @WeinsteinOmri
3K Followers 187 Following Computer scientist, co-founder of @prlnet, ex-Nvidia, Vast Data, Princeton PhD, ¶
VulcanBench @VulcanBench
484 Followers 14 Following Open Source LLM benchmarking tool, focused on real world tests, large codebases, full transparency. An Open Source project by @morganlinton.
Xiangyi Li @xdotli
8K Followers 2K Following your friendly neighborhood eval guy @benchflow_ai chat about evals https://t.co/Jl1qzLItZn
Jaime Sevilla @Jsevillamol
5K Followers 670 Following Director of @EpochAIResearch. Trying to glimpse the future of AI.
Yu Bai @yubai01
10K Followers 2K Following Training Accelerations @OpenAI. Previously @SFResearch, PhD @Stanford.
Noam Brown @polynoamial
147K Followers 924 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o-series 🍓 reasoning models
tom cunningham @testingham
10K Followers 3K Following Economics & AI @ @METR_Evals (ex-openai) https://t.co/FZobuYjdOc
Hedgeye @Hedgeye
343K Followers 1K Following We are an independent investing research and financial media company. Not Investment Advice. Access our research at https://t.co/7g2FfaekGA.
Alistair Letcher @_aletcher
486 Followers 0 Following PhD student in Oxford (@flair_ox @bold_lab_ai), working on RL & AI Safety 🤖
Minh Nhat Nguyen @menhguin
16K Followers 8K Following ai agents @hud_evals (hiring!) | owned @AIHubCentral (1 million users,acq.) ex climate protester🦦I seek Greatness, and to guide humanity through a Golden Age
The Intelligence Comp... @Intelligence_ai
1K Followers 10 Following What’s the limit? Creators of @designarena, @predictionbench, @socialsarena
Design Arena @Designarena
16K Followers 10 Following World's first benchmark for real-world design with 4M+ creators and counting. Made by @intelligence_ai
slime @slime_framework
2K Followers 12 Following The LLM post-training framework for RL Scaling. https://t.co/4ILpx8hfKN
jietang @jietang
51K Followers 375 Following Professor @ Tsinghua, Founder of https://t.co/3IaQ4CI5W3. AGI, LLM. “The value of a man should be seen in what he gives and not in what he is able to receive.”―Einstein
Harrison Kinsley @Sentdex
108K Followers 439 Following gpus and tractors. Director of AI and Engineering @ https://t.co/H4St8dd1ip Neural networks from Scratch book: https://t.co/hyMkWyUP7R https://t.co/8WGZRkUGsn
Z.ai @Zai_org
123K Followers 262 Following The AI Lab behind GLM models, dedicated to inspiring the development of AGI to benefit humanity. https://t.co/7a5aSCUNcZ https://t.co/x14hb3klXm
Litian Liang @litian_liang
1K Followers 733 Following Research Scientist @AntGroup, prev. Employee #1 @sundayrobotics, Research Assistant @Stanford, MS @UCSanDiego
Pliny the Liberator �... @elder_plinius
217K Followers 1K Following ⊰•-•⦑ latent space steward ❦ prompt incanter 𓃹 hacker of matrices ⊞ breaker of markov chains ☣︎ ai danger researcher ⚔︎ bt6 ⚕︎ architect-healer ⦒•-•⊱
AI at Meta @AIatMeta
817K Followers 324 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
Isomorphic Labs @IsomorphicLabs
58K Followers 82 Following Solve all disease. Developing and applying frontier AI to unlock deeper scientific insights, faster breakthroughs, and life-changing medicines.
kdjebat @kasturixbm5
1K Followers 139 Following was a security researcher. now i farm chickens and write malwares
Figgy1m @figgy1million
921 Followers 235 Following Love your family & friends. $NIO Once you SWAP, you never stop. $TMC The Metals Company: NOAA Permits in 2026/2027! How important is Brownsville, TX👀?
기마무개 @yoyolo37749
573 Followers 971 Following
Sandeep @SandeepUnnithan
38K Followers 1K Following Editor-in-chief @chakranewz / YT. Author : Black Tornado, 3 sieges of Mumbai26/11 https://t.co/nmiOL1rxea Operation X https://t.co/FCWgNacMXF
Alex Imas @alexolegimas
33K Followers 2K Following Director of AGI Economics @GoogleDeepMind. Professor at @ChicagoBooth. (on leave) Essays: https://t.co/9qSiQxvdja Opinions are my own.
Boaz Barak @boazbaraktcs
33K Followers 820 Following Computer Scientist. See also https://t.co/EXWR5k634w . @harvard @openai opinions my own.
Sakana AI @SakanaAILabs
135K Followers 0 Following Building Frontier AI in Japan Try Sakana Chat, Marlin, Fugu 🐡 → https://t.co/1m2lSgnfB2
Millicent Li @ ICML 2... @millicent_li
261 Followers 123 Following cs phd @ northeastern | ex-ugrad @uwcse and @uwnlp; ai resident @MetaAI (FAIR); @MSFTResearch x2
Zhuokai Zhao @zhuokaiz
5K Followers 340 Following AI Research Scientist @Meta. Building scalable intelligence. PhD @UChicagoCS.
Ethan Mollick @emollick
365K Followers 585 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgq
Zhenjia Xu @Zhenjia_Xu
2K Followers 100 Following Roboticist @ Genesis AI | Prev. Research Scientist @ Nvidia GEAR Group | CS PhD @ Columbia University
Genesis AI @gs_ai_
13K Followers 0 Following Genesis AI is a global full-stack robotics company building general-purpose robots with human-level intelligence.
Tomek Korbak @tomekkorbak
5K Followers 625 Following ai safety @openai | previously: @AISecurityInst @AnthropicAI @nyuniversity @SussexUni
Micah Carroll @MicahCarroll
3K Followers 757 Following Safety research @openai. Prev @berkeley_ai /w @ancadianadragan & Stuart Russell. CoT oversight / AI manipulation.
Shang 🫵 @obamakawaiidic
128 Followers 441 Following Malaysian Chinese Marxist-Leninist pan-Asianist. 馬來西亞華人左翼工科生。世界民族大團結萬歲! Proud Shang caste. Hala Madrid y nada más 🇪🇸⚽☭
AI Security Institute @AISecurityInst
16K Followers 30 Following We conduct scientific research to understand AI’s most serious risks and develop and test mitigations.
David Turturean @DavidTurturean
3K Followers 2K Following Physics & AI @ MIT. Hibernating for AGI Spring
spicylemonade @spicey_lemonade
1K Followers 316 Following A historian… in reverse| research @ValsAI |CEO @Archivara $25m | accepted yc p26 | featured in @Forbes | AI research @UCBerkeley | 20































