Ziang Niu @MaxwellAng1
A crazy fan of statistics | Grad Student @Penn | studying Stat and Math | Joined May 2019
Tweets: 329 | Followers: 123 | Following: 511 | Likes: 2K
edgePython 0.2.6 released. Thanks to everyone who is sharing issues via Github or just letting me know of desired features / improvements. github.com/pachterlab/edg…
Congratulations to the R Core Team on receiving the 2026 Rousseeuw Prize for Statistics. For over 30 years, R Core has maintained one of the most important pieces of infrastructure in statistics. Little recognition, thousands of volunteer hours, and enormous impact. Thank you!
As I have pointed out many times publicly, single cell foundation model performance will scale with the number of perturbations, not the number of cells. We barely have ~100k perturbations in the public domain and it is reasonable to expect we need millions to truly go OOD. Factor in also: modalities, time points, combinations etc. Short term, there is little to do using public data. Best medium term option, consider semi-mechanistic modelling: arxiv.org/abs/2501.19178 Long term, …, wait for the press release ;)
Do single-cell foundation models obey scaling laws? A somewhat thought-provoking new Nature Methods study by the Crawford lab suggests that, for current single-cell foundation models, the answer may be “not really.” Across a broad range of architectures and downstream tasks,
I see the idea of empirical Bayes is shining there!
Today we release Rhaister, an elegant statistical model that predicts drug phenotypes in new contexts w/ accuracies comparable to experimental assays. And dropping Emerald Bay, a 2M cell dataset measuring long time-course phenotypes across 1000s of drug-cell line interactions.
1/ We built the Genomics × AI blog so the genomics + ML community can share work fast and actually discuss it — incremental results, negative results, tutorials — without waiting on a publisher. Posts are live, more landing over the coming weeks: genomicsxai.github.io
A flaw in Pearson-Δ may be overstating progress in single-cell perturbation prediction models. Pearson warned about this in the 19th century: reusing the same controls induces spurious correlation. Split the controls, and much of the claimed prediction power fades. Link below 👇
In AI-guided discovery, models often turn huge candidate pools into shortlists for costly validation. We ask: can we put an error budget on AI-generated shortlists before running the experiment? For example: • Can we keep failed hits below 10%? • How many candidates should we test to get enough true positives? • How far down the list can we go before expecting too many false positives? • If we already have a fixed top-K list, how many are likely wrong? 📢 Excited to share TxConformal, a framework to turn AI scores into shortlists with controlled/estimated false positives, even in tasks where new candidates differ from past experimental data. This is joint work with amazing @KexinHuang5 @jure @EmmanuelCandes , in collaboration with Genentech @nate_diamant @gabo_scalia. We test it across proteins, genetic perturbations, regulatory DNA, clinical trials, ADMET, and antibacterial virtual screening. In a prospective A. baumannii screen at Genentech, TxConformal estimated 80.3 false positives before wet-lab validation; the experiment found 91, within the 90% CI. Preprint: biorxiv.org/content/10.648… Code: github.com/ying531/TxConf… 🧵[1/n] 👇
@YingJin531 Very nice work! I think the loop of AI-assisted simulation and validation in the scientific process is taking shape! Statistics is still very useful at the end of simulation/generation, both for controlling the number of wrongly chosen scientific targets and for planning validation experiments efficiently.
Excited to share our ICML 2026 Hypothesis Testing Workshop in Seoul, this July! @icmlconf 🎉This workshop aims to bring together researchers developing modern hypothesis testing methodology and applying it to machine learning problems such as robustness, distribution shift, security, medicine, and LLM evaluation. In other words, if you care about how we make ML claims rigorous, this workshop is for you. We now have four confirmed speakers: Arthur Gretton @ArthurGretton, Yao Xie @yaoxie21851119, Bo Li @uiuc_aisecure, and Yisong Yue @yisongyue. The organizing team includes Xiuyuan Cheng (Duke), Feng Liu @AlexFengLiu1, Lester Mackey @LesterMackey, Shayak Sen @shayaksen, Danica J. Sutherland @d_j_sutherland, and Nathaniel Xu (UBC). 📌 Submission deadline: 10 May 2026 📌 Notification: 26 May 2026 📌 Camera-ready: 17 June 2026 📌 Workshop date: July 10 or 11, 2026 (TBA) 🚩Check more information below! 🔗Website: testing.ml 🔗Submission Portal: openreview.net/group?id=ICML.… We’re also recruiting PC members/reviewers. 🔗 Reviewer interest form: docs.google.com/forms/d/e/1FAI… 🏁Please feel free to share this with colleagues, collaborators, and students who may be interested. #ICML #ICML26
1/ Happy to release StatsClaw — an open-source multi-agent workflow for building statistical software with AI. w/ @Maple_Optboy Site: statsclaw.ai Paper: bit.ly/statsclaw
@linstonwin @captgouda24 @ben_golub So true.
Last month, a final-year CSE student DM’d me in panic. “Applied to 60+ summer internships. Not a single test link or interview call.” We spent 2 evenings rewriting his applications with Claude. Result: 6 interview invites in the next 10 days (5 startups and Microsoft). Here are the exact 7 prompts that turned it around👇
Yesterday I shared a Claude skill for academic slides. Now, the underlying guide — no AI needed, works for anyone. 📄 Best Practices for Academic & Analytical Presentations (Free PDF) bit.ly/4cp3QPe → Action titles, structured argument, exhibit discipline, citations
Roman Vershynin: A friendly proof of the Berry-Esseen theorem arxiv.org/abs/2602.06234 arxiv.org/pdf/2602.06234 arxiv.org/html/2602.06234
The UCL IMSS Annual Lecture will take place on the 27th April with a keynote from @LesterMackey. The theme is 'Computational Statistics and Machine Learning', and we will have talks from Alessandro Barp, Paula Cordero Encinar & Po-Ling Loh. imss2026.github.io @stats_UCL
Semiparametric KSD test: unifying score and distance-based approaches for goodness-of-fit testing ift.tt/RL4P8ID
🧠 Can Omni Large Language Models (OLLMs) truly reason the same way across audio, vision, and text modalities? 🚀 Introducing XModBench: a tri-modal multiple-choice benchmark (text ↔ vision ↔ audio) designed to test cross-modal consistency and capability in omni-modal LLMs. 🔍 Covering 60,828 QA pairs across five task families (perception, spatial reasoning, temporal reasoning, linguistic, external knowledge). 🤯 Results show that even top models like Gemini 2.5 Pro struggle: (i) < 60% accuracy on spatial & temporal reasoning (ii) Sharp performance drop when the same content is given as audio rather than text (iii) Systematic imbalance — reasoning is far less consistent under vision/audio contexts than text 📘 Paper: arxiv.org/pdf/2510.15148 🌐 Project: xingruiwang.github.io/projects/XModB… 🤗 Huggingface: huggingface.co/datasets/RyanW…
Proud work with @RenZhimei
Ziang Niu, Zhimei Ren: Assumption-lean weak limits and tests for two-stage adaptive e... arxiv.org/abs/2505.10747 arxiv.org/pdf/2505.10747 arxiv.org/html/2505.10747
Zonghao (Hudson) Chen @Hudson19990518
179 Followers 319 Following PhD candidate at University College London, Center for Foundational Artificial Intelligence.
massimiliano concas @MConcas20642
2 Followers 37 Following
Jingchu Gai @Jingchug
300 Followers 771 Following PhD, CMU Machine Learning Department | Prev: Undergraduate, Peking University B.S Math.
Nancy E @zhongerlao32090
23 Followers 1K Following soft chaos & golden retriever energy 🐾 100% follow back
smile @Smilex_P
149 Followers 6K Following
LY @YantoLiem11
165 Followers 6K Following
Shimeng Huang @s24huang
35 Followers 328 Following Postdoctoral fellow at the Institute of Science and Technology Austria (ISTA)
Rosa Salla @rosetax99
16 Followers 236 Following
Molly S @NansakM97524
117 Followers 3K Following
George Brooks @CasperB86
12 Followers 471 Following
Bob Lily @BobLily07019142
8 Followers 541 Following
Rich Ma @RichMa1738895
26 Followers 383 Following
@unknown_economist @UN_Economist
9 Followers 247 Following
Karl @tomatoxuhs
20 Followers 935 Following
Chao @cq5211
47 Followers 3K Following
Weihan Zhang ㌠ @weihanzhang0427
188 Followers 2K Following Interested in semiparametric theory and causal inference. Biostatistics M.A. student @UCBerkeley. Alumnus @PKU_SG.
Yijie @Yjg_oo
6 Followers 280 Following
Leonardo Cooper @zlxie19
0 Followers 71 Following
Zhimei Ren @RenZhimei
1K Followers 321 Following Assistant professor @Wharton stats. Former postdoc @UChicago & PhD from @Stanford.
Ying Jin @YingJin531
1K Followers 336 Following Assistant professor @wharton stats. | Postdoc @harvard ds, PhD @stanford stats. | uncertainty quantification, causal inference, multiple testing, replicability.
Shuangning Li @ShuangningLi
964 Followers 253 Following Assistant Professor of Econometrics & Statistics @ChicagoBooth | Former Postdoc @HarvardStats | PhD in Statistics @Stanford
Sirio Legramanti @SirioLegramanti
836 Followers 550 Following Assistant Professor in #Statistics @UniBergamo Main research interests: #Bayesian inference, dimensionality reduction, #networks, ABC, #privacy
C @euniverrse
31 Followers 2K Following
Cong (Clarence) Jiang @statsCong
176 Followers 1K Following Postdoctoral researcher at HSPH 🎓 | PhD in Statistics & Actuarial Science from @UWaterloo | Connect with me on my website ⬇️ https://t.co/TWEi3p0e4e
Mei Wang @Changhang650915
305 Followers 816 Following
Xuebo Wang @XueboW19017
41 Followers 1K Following Biostat PhD Student @EmoryRollins. Stat/AI Genetics & Genomics.
Tayshus @Tayshus285469
133 Followers 4K Following
Talysleth @talysleth52028
9 Followers 1K Following
Bill Xiao @Stat_BillXiao
1 Followers 226 Following Interested in statistics, actuarial science, and network science.
Yousuf Harun @yousufovee007
73 Followers 685 Following PhD Student at RIT | Continual/ Lifelong Machine Learning
Wei Fan @wei_fan0618
27 Followers 66 Following PhD Student @wharton Statistics | Undergrad @ USTC-SGY
Yu Huang @yuhuang42
637 Followers 696 Following PhD student @Wharton Statistics | Prev @Tsinghua_Uni
Michele Guindani 🇺... @mguindani
3K Followers 1K Following Statistician. Bayesian. Professor. UCLA. Views are my own. Scientific communication is mostly through other media. Joined 2008 (verify below).
Shoaib Meraj Sami @shoaib_sami
67 Followers 4K Following PhD student @WVU. https://t.co/QdrtnGI97X and https://t.co/F02MGln8Xk @BUET . Former Engineer @Nuclear Power Plant Company Bangladesh Limited
blank @TonyZhu8Sydney
91 Followers 5K Following
Yu Gui @YuChicago1234
150 Followers 665 Following Postdoc @Wharton | PhD @UChicago Stats | uncertainty quantification, causal inference, transfer learning, multi-modal learning
Tyler Farghly @ ISBA @tylerfarghly
572 Followers 493 Following Postdoc @Sierra_ML_Lab (@inria_paris / @ENS_ULM) stochastics, diffusion models, mcmc, optimisation musician prev @oxcsml @UniofOxford @antikythera_xyz
Joe Blitzstein @stat110
17K Followers 5K Following Statistics professor at Harvard; statistician and data scientist; probability and paradoxes; Bayesian frequentist reconciliation; chess.
Triad sou. @triadsou
2K Followers 3K Following My interests: Biostatistics; Bioinformatics; Survival Analysis; Meta-Analysis; Diagnostic Statistics; Causal Inference. https://t.co/sdyqW4VlL0
SpaceArcher @ArcherYiYang
18 Followers 105 Following
Siu Lun Chau @Chau9991
581 Followers 566 Following Assistant Professor in Statistical Machine Learning @ CCDS, NTU Singapore. Previously @CISPA, @oxcsml, @AmazonUK and @MPI_IS.
Epicrispr Bio @EpicrisprBio
301 Followers 21 Following Epicrispr Bio is a biotechnology company developing ultra-compact therapies to modulate gene expression in vivo using the smallest known Cas protein
Feng Zhang @zhangf
56K Followers 72 Following Engineer and molecular biologist. Curious about the world and optimistic to make it better.
Arc Institute @arcinstitute
46K Followers 71 Following A full-stack institute for AI and biology research.
nature @Nature
2.7M Followers 3K Following Research, News, and Commentary from Nature, the international science journal For daily science news, get Nature Briefing: https://t.co/wGmQlQ8a4D
Halıcıoğlu Data Sc... @HDSIUCSD
2K Followers 171 Following HDSI at UC San Diego lays scientific foundations in data science to address ways to solve the world's most pressing problems through data science outcomes.
Schmidt Science Fello... @SchmidtFellows
6K Followers 615 Following We are developing the next generation of science leaders to transcend disciplines, advance discoveries, and solve the world's most pressing problems.
Matthew Stephens @mstephens999
5K Followers 149 Following Professor in Statistics and Human Genetics at University of Chicago
Niko McCarty. @NikoMcCarty
51K Followers 138 Following Biotechnologist & writer. Founding Editor @AsimovPress // Podcast: The New Biology // Microgrants at https://t.co/XriNjEMYEZ and https://t.co/iEv1FYzCBS
Nature Biotechnology @NatureBiotech
324K Followers 3K Following Publishing the best of biotech science and business. Find us on Bluesky, Facebook & Instagram. Part of @SpringerNature and @NaturePortfolio.
Jake P. Taylor-King @wildtypehuman
2K Followers 340 Following Co-founder at Relation (@relationrx) - applying #singlecell (check @scTrends_update !) #genetics #crispr #machinelearning to #drugdiscovery and #healthcare.
Nima Alidoust @nalidoust
4K Followers 642 Following CEO and Co-Founder, @tahoe_ai, Princeton PhD *15 Woman, Life, Freedom
Yusuf Roohani @yusufroohani
2K Followers 478 Following Machine Learning & Biological Discovery. Previously Associate Director, ML @arcinstitute, PhD @StanfordAILab
Anshul Kundaje @anshulkundaje
31K Followers 3K Following Federally funded academic research is the innovation engine of the US economy. Reform is welcome. Destruction will have long term consequences.
Probability and Stati... @probnstat
81K Followers 701 Following Sharing insights on Probability, Statistics, ML, DL and AI research. Subscribe for recent research paper discussions at $2/month. DM to collaborate.
Stanford AI+Biomedici... @Stanford_AI_Bio
2K Followers 15 Following Weekly invited talks on AI X Biomedicine. Organized by @KKuanPang @YanayRosen @zoe_piran @mdbereket @arpi_ta_s @ElanaPearl @anshulkundaje @james_y_zou @jure
Alan Murphy @Al_Murphy_
572 Followers 525 Following Postdoctoral Research Scientist, Koo lab at Cold Spring Harbor Laboratory | Lead Editor Genomics x AI Blog | Deep Learning for genomics
Alex Chan @alex8chan
3K Followers 1K Following Econ prof @HarvardHBS; @nberpubs; PhD @Stanford; ex-Fortune 5 Healthcare SVP; ex-@McKinsey; 🐈⬛, 🍀 and 🚂 enthusiast
Rahul Satija @satijalab
28K Followers 353 Following Core Member, New York Genome Center; Professor, Biology, NYU
Rafael Irizarry @rafalab
29K Followers 490 Following Applied statistician. I tweet data-driven observations, data science educational materials, academic research updates, and the occasional joke.
Pierre Boyeau @pierreboyeau
235 Followers 262 Following EECS PhD student @Berkeley working on single cell genomics
Ming "Tommy" Tang @tangming2005
46K Followers 3K Following Director of bioinformatics at AstraZeneca. YouTube at chatomics. On my way to helping 1 million people learn bioinformatics. Also talks about leadership.
Anastasios Nikolas An... @ml_angelopoulos
8K Followers 2K Following Measuring intelligence @arena. Statistics, model evaluation. Formerly @Berkeley_EECS, @StanfordEng, student researcher @GoogleDeepMind.
Lior Pachter @lpachter
71K Followers 2K Following Bren Professor of Computational Biology @caltech. Blog at https://t.co/FFQzhEsmhi. Tweets represent my views, not my employer's. #methodsmatter
Jonathan Pritchard @jkpritch
15K Followers 343 Following My lab at Stanford studies human population genetics and complex traits.
Charles Fulco @FulcoCharles
297 Followers 150 Following Functional genomics for drug discovery @sanofi by way of @bmsnews, @HMS_SysBio, and @eric_lander lab at @broadinstitute. #sanofiEmployee
Vivian Wu @vivianwubeijing
54K Followers 3K Following Founder of @dashengmedia 大聲.Youtube: https://t.co/ROnjbnAZH9. Was BBC HK Bureau Head. China Editor @initiumnews. Politics Reporter @scmpnews
Ben Moll @ben_moll
27K Followers 1K Following Sir John Hicks Professor of Economics, @LSEecon. Macroeconomics with distribution(s). Coeditor of the American Economic Review.
Katalin Susztak @KSusztak
7K Followers 4K Following Physician-scientist at the University of Pennsylvania. Determined to understand chronic kidney disease development
Chenggang Xu 许成... @ChenggangXu2024
44K Followers 109 Following political economist @ SCCEI, Stanford U, taught@ LSE, HKU, TsingHua, SNU, NTHU, Hebrew U
Zongming Ma @ZongmingMa
215 Followers 124 Following Occasionally a teacher, always a student. Data, model, method, theory. I enjoy all aspects of learning from data and experience.
Nicolò Cesa-Bianchi @NicoloCB
2K Followers 144 Following Professor at the University of Milan, Italy Machine learning algorithms
Adi Wyner @adiwyner
2K Followers 297 Following Professor of Statistics and Data Science. Co-Faculty director of Wharton Sports Analytics and Business Initiative.
Daron Acemoglu @DAcemogluMIT
364K Followers 330 Following Institute Professor @MIT, @MITEcon. Co-Director of @MITShapingWork. Author of Why Nations Fail, The Narrow Corridor, and Power & Progress.
Jesse Engreitz @jengreitz
4K Followers 396 Following Assistant Professor @stanford Genetics and BASE Initiative. Mapping the regulatory code of the human genome to understand heart development and disease.
罗玉凤 @Menghuanlangqi2
149K Followers 654 Following A typical member of the "low-end population"; nothing much to say. Contact email: [email protected] YouTube channel: https://t.co/PtHCxLMxGy
INFORMS @INFORMS
13K Followers 2K Following INFORMS is the association for those who apply science, math, technology, and analytics to solve the world’s most critical challenges.
Leena C Vankadara @leenaCvankadara
295 Followers 413 Following Lecturer (Assistant Professor) @GatsbyUCL; Applied Scientist II @Amazon Research; Previously @uni_tue and @MPI_IS; https://t.co/4kC5e6okE7
Nancy Zhang @NancyZh60672287
1K Followers 124 Following Genomics, Computational Biology, Professor of Statistics and Data Science at UPenn
Wilson Hernández B. @WilsonHernandeB
3K Followers 2K Following PhD student at @Penn (Criminology) | @GRADEPeru. Crime, policing, gender-based violence, and justice. My tweets, my opinion. I gave a TED Talk: https://t.co/d0TKnji5kh
Molly Gasperini @MollyGasp
2K Followers 893 Following Assoc Dir @AllenInstitute, prev Sr Scientist @CajalNeuro, prev @OctantBio, prev PhD @jshendure. Genome engineering, fxnal genomics tech dev, synbio, neuro, PNW.