Somesh Misra / ERP.ai @MathproBro
chief researcher at https://t.co/85QLNI0SE9 | working at the intersection of business processes, neural network topologies & machine learning | erp.ai | San Francisco, CA | Joined February 2013
Tweets 2K | Followers 830 | Following 270 | Likes 1K
Mathematics as a field is going to have to reorient itself in light of powerful AI. But a slight pushback to Gowers's comment: "If LLMs are at the point where they can solve 'gentle problems', ...the lower bound for contributing to mathematics will now be to prove something that LLMs can’t prove, rather than simply to prove something that nobody has proved up to now and that at least somebody finds interesting." Mathematics is infinite and thus inexhaustible. With powerful AIs doing the heavy lifting, more of the burden shifts towards taste and asking the right question. It becomes possible to discover something everyone else missed simply by looking in the right place. In mathematical physics, for instance, an Einstein inspired by the equivalence principle might not have to toil for a decade to invent general relativity, but could have equations proposed, their solutions found, and scenarios validated against their Newtonian limits. Rather than raising the bar for problem-solving, AI has opened up contributing to mathematics through ideation and generation.
But if AI mathematics continues to progress at anything like its current rate -- which is what I expect to happen -- then we will face a crisis very soon, and mathematics departments, who owe a duty of care to their students, should be urgently preparing for it.
@xuanalogue looked at your CLIPS paper, so yes, an AI that truly infers a student's hidden goals and epistemic state might enable persistence instead of enabling shortcuts. :)
sarvam is doing some phenomenal work. seeing positive commentary on r/LocalLLaMA too
📢 Open-sourcing the Sarvam 30B and 105B models! Trained from scratch with all data, model research and inference optimisation done in-house, these models punch above their weight in most global benchmarks plus excel in Indian languages. Get the weights at Hugging Face and
Paper: “Demystifying Oversmoothing in Attention-Based Graph Neural Networks” (NeurIPS 2023, spotlight) By Xinyi Wu, Amir Ajorlou, Zihui Wu & @jababi at MIT/Caltech. Key move: they model attention-based GNNs as nonlinear time-varying dynamical systems and use joint spectral radius theory to prove oversmoothing is inevitable for GCNs, GATs, and graph transformers. Covers ReLU, LeakyReLU, GELU, SiLU. No architectural trick escapes it. The only way out is rethinking how depth is applied. 📄 arxiv.org/abs/2305.16102
Everyone thought attention would solve oversmoothing in GNNs. It doesn’t. It can’t. Rigorous proof: expressive power in attention-based GNNs collapses exponentially with depth. GATs, graph transformers - none are immune. The real insight? Depth shouldn’t be uniform. A boundary node sitting between two communities needs 2 layers. An interior node in a dense cluster might need 10. Treating them the same is the actual problem. Structure should dictate depth. Not the other way around.
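A minimal sketch of the collapse described in this thread, under loud assumptions: it keeps only the attention-style aggregation step (random row-stochastic weights on a fixed graph) and drops the weight matrices and nonlinearities that the paper actually analyzes. The ring graph, the weight range, and the names `random_attention` and `spread` are all hypothetical choices made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
num_nodes, num_features, depth = 6, 3, 100

# Ring graph with self-loops: node i attends to itself and its two neighbours.
adjacency = np.eye(num_nodes)
for i in range(num_nodes):
    adjacency[i, (i + 1) % num_nodes] = 1.0
    adjacency[i, (i - 1) % num_nodes] = 1.0

def random_attention(adjacency, rng):
    """Row-stochastic matrix with random positive weights on existing edges."""
    scores = rng.uniform(0.5, 1.5, size=adjacency.shape) * adjacency
    return scores / scores.sum(axis=1, keepdims=True)

def spread(features):
    """Largest gap between any two nodes, taken over all feature columns."""
    return float((features.max(axis=0) - features.min(axis=0)).max())

features = rng.normal(size=(num_nodes, num_features))
spread_before = spread(features)

# A fresh attention matrix at every layer (time-varying, like learned attention).
for _ in range(depth):
    features = random_attention(adjacency, rng) @ features

spread_after = spread(features)
print(f"spread before: {spread_before:.3e}, after {depth} layers: {spread_after:.3e}")
```

Every layer here is a weighted average over neighbours, so the gap between nodes can never grow and in practice shrinks quickly with depth. Changing the attention weights at each layer does not prevent this, which is the intuition behind the thread's claim.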
This nomenclature always confused me! NP-hard sounds like it's a subset of NP, but NP means verifiable, and NP-hard means hard to solve. Knuth suggested three names, "Herculean", "Formidable", and "Arduous", and sent out a poll to people in the theory community. One write-in suggestion was "Hard-Ass Problems" (Hard As Satisfiability). Bell Labs won with "NP-hard" and they've been confusing people ever since. The real NP-hard problem was naming NP-hard.
Underlying reason: Continuity and symmetry induce equivalence classes over inputs. Transformers collapse nearby sequences into the same representation orbit. Perplexity is invariant on these orbits. Correctness is not. This was never about Perplexity the company. It is about algebra, group actions, and quotient spaces.
Paper link: arxiv.org/abs/2601.22950 cc @PetarV_93 Thank you for formalizing something many of us felt but could not prove.
Perplexity is not always right. It can appear confident and rigorous, and it can score extremely well by its own metric, while still producing an incorrect prediction. This is not a bug or a training artifact. The result comes from the paper “Perplexity Cannot Always Tell Right from Wrong”
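A toy illustration of the point, not the paper's construction: perplexity is the exponential of the mean negative log-probability a model assigns to the emitted tokens, so it measures confidence rather than correctness. The per-token probabilities below are hypothetical numbers chosen for the example.

```python
import math

def perplexity(token_probs):
    """exp of the mean negative log-probability of the emitted tokens."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# Hypothetical probabilities a model assigns to its own output tokens.
confident_wrong = [0.99, 0.98, 0.99, 0.97]  # fluent and confident, but incorrect
hesitant_right = [0.60, 0.55, 0.70, 0.50]   # less confident, but correct

ppl_wrong = perplexity(confident_wrong)
ppl_right = perplexity(hesitant_right)
print(f"confident wrong answer: {ppl_wrong:.3f}")
print(f"hesitant right answer:  {ppl_right:.3f}")
```

Nothing in the computation looks at whether the answer is right, so the confident wrong sequence scores better than the hesitant correct one.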
This insight leads to a set of fundamental, group-theory-based results. I have tried to characterize which forms of node-level memorization are inevitable in GNNs and which require symmetry breaking. Paper coming after review.
Hot take: a lot of GNN memorization isn’t learned at all. It’s forced. Graph symmetry + training dynamics decide what a GNN can and cannot memorize — before data even enters the picture.
Three claims/theorems about deep learning that seem difficult to disprove and even harder to prove:
A) Gradient descent does more than minimize loss. It reshapes geometry by collapsing directions that are irrelevant to the task (gradient flow induces anisotropic contraction in the pullback metric, with decay along directions orthogonal to the loss gradient).
B) Symmetry does not need to be imposed. When data and objectives are invariant, training dynamics tend to uncover quotient structure implicitly (optimization trajectories concentrate on equivalence classes induced by approximate group orbits, even without architectural equivariance).
C) Memorization is not storage. It is the emergence of extremely sharp decision geometry confined to negligible-volume regions (interpolation is achieved via high-curvature decision boundaries localized to sets of vanishing measure in input space).
These are not easy theorems. But they feel like the right ones to chase.
Genuinely looking for advice, counterexamples, or references from people thinking deeply about this: @levie_ron @kamalikac @rsalakhu @ok1zjf @neelnanda5 @mmbronstein
A doubly stochastic matrix only redistributes values. It cannot amplify them or destroy them. Geometrically, it is a soft mixture of permutations. It shuffles and mixes, but conserves total signal. Identity is one extreme case of this. So mHC does not abandon the identity idea. It generalizes it. Identity becomes a stable geometric object instead of a single point. That is the breakthrough: deep learning stability enforced by geometry, not tricks.
That learned matrix gets applied again and again across layers. Now depth is no longer identity plus correction. It is repeated application of an unconstrained matrix. We are back to the original instability problem. mHC fixes this by using geometry. Instead of letting the matrix that replaces the identity be any learned matrix, it restricts that matrix to a special set: the doubly stochastic matrices. No math needed. Here is the intuition.
The DeepSeek mHC paper is a real breakthrough, and the reason is geometric, not architectural. Early neural networks were just repeated matrix multiplications: x <- W x. Depth was unstable. ResNets changed one line: x <- x + F(x) which linearizes to x <- (I + W)x. That single identity term is what made deep learning scale. Hyper-Connections broke this by replacing identity with a learned matrix, turning depth back into unconstrained matrix products. mHC fixes this in a principled way. Instead of identity or an arbitrary matrix, mHC uses a doubly stochastic one. Doubly stochastic matrices form the Birkhoff polytope. They are convex combinations of permutations. Geometrically, the residual stream undergoes conservative transport and mixing, not amplification or decay. Identity is just one extreme point of this space. Under composition, stability is preserved. mHC does not abandon identity. It generalizes it into a stable geometric object. This is not an engineering trick. It is linear algebra and geometry doing the real work
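The stability argument can be checked numerically with a small sketch. It does not reproduce the mHC architecture: it only compares repeated multiplication by doubly stochastic matrices with repeated multiplication by unconstrained random matrices. Using Sinkhorn-style normalization to obtain doubly stochastic matrices is an assumption made for this illustration, and `sinkhorn` is a hypothetical helper name.

```python
import numpy as np

def sinkhorn(matrix, iterations=200):
    """Alternately normalize rows and columns of a strictly positive matrix."""
    m = np.asarray(matrix, dtype=float)
    for _ in range(iterations):
        m = m / m.sum(axis=1, keepdims=True)  # rows sum to 1
        m = m / m.sum(axis=0, keepdims=True)  # columns sum to 1
    return m

rng = np.random.default_rng(0)
width, depth = 4, 50

x = rng.normal(size=width)
x_ds = x.copy()    # stream mixed by doubly stochastic matrices
x_free = x.copy()  # stream hit by unconstrained matrices

for _ in range(depth):
    mix = sinkhorn(rng.uniform(0.1, 1.0, size=(width, width)))
    free = rng.normal(size=(width, width))
    x_ds = mix @ x_ds
    x_free = free @ x_free

total_before, total_after = x.sum(), x_ds.sum()
norm_start = np.linalg.norm(x)
norm_ds = np.linalg.norm(x_ds)
norm_free = np.linalg.norm(x_free)

print(f"total signal: {total_before:.6f} -> {total_after:.6f}")
print(f"norm with doubly stochastic mixing: {norm_ds:.3e}")
print(f"norm with unconstrained matrices:   {norm_free:.3e}")
```

Columns summing to one conserve the total signal, and a doubly stochastic matrix never increases the Euclidean norm, so the product stays bounded at any depth. The unconstrained product carries no such guarantee.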
Anand Jhajharia @AnandJh09
0 Followers 4 Following
Kaveh Hassani @KavehHassani
268 Followers 495 Following Research Scientist @ Meta Superintelligence Labs
Lakshmi @Lakshmi_lik
347 Followers 266 Following The only limit to our realization of tomorrow is our doubts of today
sarthak 🌉 @_sarthak4
1K Followers 3K Following AI for Enterprise IT @decawork / prev: bits pilani, nvidia, ema
彭婷 @pengting614
1 Followers 19 Following
Mr_Black111 @aka_mr_black
46 Followers 330 Following Building Crosstats | Multi-channel YouTube analytics | Aspiring AI generalist | Vibe coder | Podcast junkie | Geopolitics
Achyut Tiwari @theachyuttiwari
328 Followers 1K Following Building AI infrastructure for Geohazard intelligence. Founder @Geoliquefy. Emergent Ventures Fellow @mercatus. Curating https://t.co/vMMiSIBWig
NNO @NavigatorAGI
278 Followers 3K Following
Sitesh Shrivastava @siteshps
1K Followers 691 Following
Jiri Fajtl @ok1zjf
127 Followers 883 Following From lens to inference | Fixing catastrophic interference in ANN | Research scientist @ Kingston uni London vi/vim/neovim
Rushikesh @rushikeshg10
1K Followers 673 Following swe • Backend, System Design and Low lvl stuff • ML • https://t.co/GDO1NaR7O0
Alejandro @Alexpi
110 Followers 209 Following
rahulr @rahul95ram
23 Followers 789 Following Engineering the Full stack of life # know a lil bit more about robotics, product, design and philosophy
Azim @MrSupaFast
32 Followers 545 Following FullStack Web Dev, Gen Ai(very little), AWS(very little)
Moonshine 🇮🇳... @moonshines00
62 Followers 1K Following Bazball? In this economy? Student of Gautam Gambhir School of Leadership. Member of Rajasthan Royals Lobby.
Harikrishnan Ramadasa... @Haramdis
0 Followers 207 Following
Akshay Sahu @akshaysahu
16 Followers 866 Following
Lee Altenberg, Ph.D. ... @AltenbergLee
6K Followers 5K Following Theoretical biologist researching evolution. Information & Computer Sciences/Mathematics/Ecology Evolution & Conservation Biology, U Hawai`i@Mānoa Herr/Prof/Dr
Manish @SharmaManish___
17 Followers 38 Following SWE-2 @Google | 🚀Building the future with #Al & #Tech | Sharing tips on #Innovation & #CareerGrowth | DM for collabs! Let's connect & create #TechForGood
Bhavesh Shah 🇮🇳 @brdshah
230 Followers 835 Following Bharat first, NaMo next. Hindu first, tolerance next. RT ×= endorsement
Raghu Dhulipala @DhulipalaRaghu
0 Followers 52 Following
Kushal @Kushal_Chordiya
81 Followers 587 Following Software engineer, Meta. Lifts books, weights and occasionally spirits. LetterBoxd target audience.
phantom.observer 🌌... @VairagyaSadhana
2K Followers 959 Following Engineer exploring the realms of space, technology, & society Writing to simplify science, demystify tech, and spark meaningful conversations💡 #spaceshost
CULLINAN @garvitaryan
191 Followers 2K Following • Physics student • Dharma • Humanity first • highly ambitious • AWAKENED OPTIMIST •
Yashwanth Chennuru @yash1th_tweets
127 Followers 6K Following Web3 Yeah! | Electronics & Comp @mahindrauni
Anand Venkatraman @Anand_Venkatram
2K Followers 2K Following
Karan @KaranJanthe
333 Followers 951 Following figuring out my next adventure | Serial Key presser | DM's Open
Summer_boy @Summer_boy_play
2 Followers 87 Following
omkar shukla @shukla_omkar
157 Followers 205 Following Product Management, Mathematics, Sports and Reading!
Naveen @_naveendn
20 Followers 502 Following
Paras Bansal @parasbansal33
191 Followers 3K Following
Mohit Shah @mohit2501
67 Followers 475 Following
Sai Krishna @SaiKris01015746
21 Followers 471 Following
Maithra Raghu @maithra_raghu
21K Followers 532 Following Cofounder and CEO @Samaya_AI. Formerly Research Scientist Google Brain (@GoogleAI), PhD in ML @Cornell.
Tiberiu Mușat @Tiberiu_Musat_
768 Followers 953 Following Trying to figure out how AI works 🔍🧠 Currently at @ETH Zurich, previously @EPFL 🇨🇭 LLMs, interpretability, emergence, grokking 🤖
Gauri Tripathi @Gauri_the_great
3K Followers 498 Following AI Researcher | Working on speech models | Creating for the love of it
Paria Rashidinejad @paria_rd
1K Followers 545 Following Assistant Professor @USC; Research Scientist @AIatMeta FAIR; PhD @berkeley_ai @CHAI_Berkeley
Vipul Vaibhaw @vaibhaw_vipul
14K Followers 2K Following Founding Engineer @pre6ai Open source ❤️. Math and Systems. Most posts are notes to myself.
Ezgi Korkmaz @EzgiKorkmazAI
3K Followers 0 Following Machine Learning Researcher, PhD in Machine Learning. Reinforcement Learning. Been at @UCL | @GoogleDeepmind | @UCBerkeley
Astera Institute @AsteraInstitute
10K Followers 29 Following We empower visionary, high-leverage science and technology projects with the capacity to create transformative progress for human civilization.
Dileep George @dileeplearning
16K Followers 1K Following Head of AI @AsteraInstitute Prev: AGI @DeepMind, cofounder @vicariousai (acqd by Alphabet), cofounder @Numenta. IIT-Bombay, MS&PhD Stanford. https://t.co/IlsczdBtZo
Nicole Levin @nicilevv
269 Followers 226 Following Platform @pebble_bed VC. I notice things that are about to matter. Then I create experiences around them.
Chirag Arora @iChiragArora
379 Followers 1K Following 23, open to work. I love coffee, math, python and her ofc. contributing @aoagents. try out https://t.co/asEb6Qk1S2
smitha milli @SmithaMilli
3K Followers 448 Following research scientist, meta (fair) opinions are my own 🥺 👉👈
SAIR @SAIRfoundation
4K Followers 32 Following Terence Tao & Nobel, Turing, Fields laureates advancing scientific discovery & guiding AI with scientific principles. Grounding intelligence. Scaling discovery.
Gergely Orosz @GergelyOrosz
337K Followers 3K Following Writing @Pragmatic_Eng, the #1 software engineering newsletter on Substack. Author of @EngGuidebook. Formerly Uber & Skype.
Centre for Credible A... @ccaiwut
562 Followers 31 Following
Subbarao Kambhampati ... @rao2z
28K Followers 72 Following AI researcher & teacher @SCAI_ASU. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmf6y Bsky: rao2z
Manasi Sharma @ ICLR ... @ManasiSharma_
534 Followers 336 Following research engineer @scale_AI, working on reasoning for frontier models, agents, rl | prev @stanford, @StanfordAILab, @mitll, @Columbia
California Institute ... @CIMCAI
3K Followers 16 Following
Bartosz Naskręcki @nasqret
11K Followers 422 Following Mathematician | Vice-Dean @UAM_Poznan | Researcher @ccaiwut | Owner of https://t.co/lEspgf36Pg | Mathematics, AI and programming
Yaashaa Golovanov @Golovanov_ammoc
13K Followers 801 Following Director, AMMOC - An International Math Circle in honor of Vladimir Igorevich Arnold and Jerrold Eldon Marsden.
Zechen Zhang @ZechenZhang5
2K Followers 1K Following Building the future for human and AI collaboration @orch_research @Harvard
Adam Dziedzic @adam_dziedzic
285 Followers 100 Following I'm a researcher, software developer, systems designer & engineer. I have a passion for machine/deep learning, databases, technology, traveling, sport & music.
Adarsh Jamadandi @adarshjamadandi
175 Followers 287 Following PhD Student @irisa_lab. Graph Representation Learning and Geometric Deep Learning. You can find me tuning my model’s weights or weights at the gym.
Neel Somani @neelsomani
20K Followers 253 Following Formal methods & ML research. Prev: Founder of Eclipse, QR at Citadel. Proud Cal Bear.
Christian Szegedy @ChrSzegedy
44K Followers 3K Following #deeplearning, #ai research scientist. Opinions are mine.
Math, Inc. @mathematics_inc
13K Followers 0 Following Solve math, solve everything. Dedicated to superintelligence via autoformalization
Jiri Fajtl @ok1zjf
127 Followers 883 Following From lens to inference | Fixing catastrophic interference in ANN | Research scientist @ Kingston uni London vi/vim/neovim
Sri M @SriMspeaks
13K Followers 3 Following Spiritual guide, Social reformer, Educationist, Author and Speaker. Recipient of Padma Bhushan 2020.
Chen Sun 🤖 @ChenSun92
3K Followers 426 Following Research Scientist @GoogleDeepMind Discussing RL, memory, openendedness, continual learning,automated science ex-IMO(Canada) ex-neuroscientist Views are my own
xuan (ɕɥɛn / sh-ye... @xuanalogue
10K Followers 1K Following Assistant Professor at NUS. Scaling cooperation for an increasingly automated future. PhD @ MIT ProbComp / CoCoSci. Pronouns: 祂/伊
Nima Dehmamy @nimatabari
170 Followers 197 Following physicist working on ML for science at IBM Research.
Robin Walters @RobinSFWalters
495 Followers 199 Following Asst. Prof. at Khoury College of CS at Northeastern
Artificial Intelligen... @SciFi
21K Followers 22 Following New Artificial Intelligence papers from https://t.co/gDZs9w7xd4: expert systems, theorem proving. Thank you to arXiv for use of its open access interoperability.
Rose Yu @yuqirose
10K Followers 609 Following Machine Learning Prof @UCSanDiego, Scholar @amazon, Previously @google, @Northeastern, @Caltech, @USC, #Physics-Guided #AI, MIT TR-35 Innovator.
Jianke Yang @jiankeyang
15 Followers 12 Following
Aria @ariahalwong
611 Followers 887 Following Anthropic Fellow | prev. MATS 9.0 Scholar, princeton math, engineering/quant
Ali Behrouz @behrouz_ali
8K Followers 1K Following Research Intern @Google, Ph.D. Student @Cornell_CS, interested in machine (continual) learning and understanding what is called intelligence.
Russ Salakhutdinov @rsalakhu
112K Followers 181 Following CSO @ Sooth Labs, Professor @ CMU, President Elect ICML Board, Ex-VP of Research @ Meta (Multimodal LLMs, AI Agents), ex-Director of AI at @Apple.
Ruihan Wu @ruihan_w
55 Followers 36 Following I am a postdoctoral researcher at the University of California, San Diego
return of the researc... @byebyescaling
3K Followers 2K Following ilya is right. bye bye scaling, back to empirical deep learning monkë | prev upenn, quant
Machine Learning Stre... @MLStreetTalk
39K Followers 741 Following MLST is by Dr. Tim Scarfe @ecsquendor w/ cameos from @DoctorDuggar https://t.co/5YCv2SdFwN (early access/priv.discord) - Sponsor us!
Yi Tay @YiTayML
56K Followers 87 Following research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.
Shubhendu Trivedi @_onionesque
10K Followers 897 Following Cultivated Abandon. Twitter interests: Machine learning research, applied mathematics, mathematical miscellany, ML for physics/chemistry, books.
Kamalika Chaudhuri @kamalikac
6K Followers 3K Following Researcher, Google Deepmind. Formerly, Director FAIR @ Meta. Former Professor at UCSD. Researcher in AI privacy, security, and generalization.