Jamie Simon @learning_mech

doing fundamental science of deep learning | PhD from Berkeley | can catch a whole egg in my mouth
jamiesimon.io | Berkeley, CA | Joined December 2023

Tweets: 140 | Followers: 1K | Following: 75 | Likes: 195

Jamie Simon @learning_mech

an hour ago

I quite like this line of work! blends "mechinterp" and "learning mechanics" approaches to fundamental science of deep learning.

Scaling laws describe how loss changes with scale. Do neurons inside models change predictably too? We study vision and language models up to 30B params and find systematic scaling in neuron universality, specialization, and selectivity. Paper+code: avdravid.github.io/rosetta-neuron… 1/n

2 15 55 24K 25

0 0 6 164 1

David Klindt @klindt_david

2 weeks ago

I thoroughly enjoyed reading this recent paper by @yasamanbb et al (arxiv.org/pdf/2602.15029) that derives analytically why certain latent variables must lead to geometry in word embeddings. (getting Fourier modes even with open boundary but exponential kernel is neat!) I think it would be great to compare this to some of @prfsanjeevarora et al's work on this (eg tinyurl.com/4az4325n) More broadly, I have been thinking about the right data generating process for language. For vision, we have latent spaces with great manifold structure (eg the SO3 pose of an object) and nonlinear mixing functions. But for language? Are there really any continuous latent variables? What is the "DSprites" of language? Is it all just co-occurrence stats or is there something more in LLM word embeddings?

5 58 500 28K 443

Jamie Simon @learning_mech

3 weeks ago

@KrzakalaF ah, lovely! I quite like this sort of "posit a simpler dynamics and study it" sort of approach. very physicsy! I like Misha Belkin's NFA/RFM stuff (which this reminds me of) for the same reason. possible path towards an answer to our Open Dir 1?? learningmechanics.pub/openquestions/…

0 1 5 537 4

Jamie Simon @learning_mech

3 weeks ago

@electro_vansh ah, thx! unf don't think I'll have the time, but honored :)

0 0 0 11 0

Jamie Simon @learning_mech

a month ago

did you know that with a few modifications, you can get the Ising model to simulate cells fighting to the death? one of my favorite side projects of all time: jamiesimon.io/blog/cell-figh…

11 80 606 63K 328

Jamie Simon @learning_mech

3 weeks ago

@alexdong @imbue_ai @KuninDaniel arxiv.org/abs/2604.21691 :)

0 0 1 18 0

Good Work @goodworkmb

4 weeks ago

Leaked Sam Altman messages (2023)

10 19 454 70K 34

Imbue @imbue_ai

a month ago

Mechanistic interpretability aspires to be the biology of deep learning. @KuninDaniel and @learning_mech say that an emerging theory of deep learning they and their team call 🛠️ learning mechanics 🛠️ will be the physics.

2 3 22 2K 10

varun @varunneal

a month ago

crazy how you can pinpoint the exact curvature of a trillion dimensional model based on how wiggly the loss curve is

2 10 157 14K 91

Jamie Simon @learning_mech

a month ago

@MattHausmannAtx (yes, look up the Cellular Potts model)

0 0 0 42 0

Jamie Simon @learning_mech

a month ago

@MattHausmannAtx psh who would want to

1 0 1 241 0

Jamie Simon @learning_mech

a month ago

@reson8Labs oh they can hug

1 0 0 351 0

Jamie Simon @learning_mech

a month ago

@guzmansalv me too. my favorite method is having an immune system!

0 0 2 479 0

Jamie Simon @learning_mech

a month ago

for the bright-eyed and bushy-tailed: there's a Learning Mechanics discord! young academics who want to do research in this area should especially consider joining + starting convos. discord.gg/GTHfUnf7hz

0 2 31 2K 29

Daniel Kunin @KuninDaniel

a month ago

Excited to share that our paper “Sequential Group Composition: A Window into the Mechanics of Deep Learning” was accepted to ICML 2026 in Seoul! Co-led with @giovannimarchet and @AdeleMyersPhD @hopfbifurcator @ninamiolane Paper: arxiv.org/abs/2602.03655

8 47 236 72K 182

Good Work @goodworkmb

a month ago

Palantir office speedrun

203 2K 35K 2.5M 4K

Jamie Simon @learning_mech

a month ago

ditto. props to @justanotherlaw for taking a second look :) (though I also found merit in the criticisms in the first version.) hopeful we can eventually (hopefully soon enough...) make contact w/ AI alignment + governance, whose noble causes we would v much like to aid.

Alex Atanasov @ABAtanasov

a month ago

This is a great post and I especially respect the author for updating his view when presented with new information. I strongly encourage young researchers interested in interpretability, science of DL, and safety to look at it. lesswrong.com/posts/6SRq7mZ9…

3 10 160 12K 172

0 0 10 1K 1

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

a month ago

It seems we're at a stage where deep learning is evolving from alchemy into an engineering discipline; this is an exciting paper which lays out that a scientific theory is emerging for Deep Learning. Paper: arxiv.org/abs/2604.21691 Tweet: x.com/learning_mech/…

Jamie Simon @learning_mech

a month ago

1/ Deep learning is going to have a scientific theory. We can see the pieces starting to come together, and it's looking a lot like physics! We're releasing a paper pulling together these emerging threads and giving them a name: learning mechanics. 🔨 arxiv.org/pdf/2604.21691 🔧

53 292 2K 303K 2K

1 3 25 2K 15

Jamie Simon @learning_mech

a month ago

yeah, totally! I once messaged everyone on facebook with my first and last name. I eventually made a big group chat! v ethnically + geographically diverse. probs the closest I've gotten as an adult to meeting a truly random slice of the US.

Devon ☀️ @devonzuegel

a month ago

Jury selection is cool. It's probably the closest you ever get to seeing a true random sample of the population

2 1 47 5K 2

0 0 7 817 0

Tim Duignan @TimothyDuignan

a month ago

Aren’t diffusion models explicitly derived from a correspondence with physics and entirely consistent with how physics says you should model systems over a range of scales (i.e. Mori-Zwanzig theory: Langevin dynamics with a fitted vector field)? What more do you want?

Pedro Domingos @pmddomingos

a month ago

Sorry, but these correspondences between AI and physics are vacuous. People have been making them since (at least) the 80s, and they always come to nothing.