hoagy @HoagyCunningham

alignment attempter London Joined April 2022

Tweets

103
Followers

152
Following

161
Likes

845

hoagy @HoagyCunningham

4 months ago

Makes a lot of sense but still interesting to see. Wonder if you could remove this part of representation, training for non-toxicity even when residual stream is in that region, measuring success by difficulty of training away the change - has anyone tried to do similar?

Andrew Lee @a_jy_l

4 months ago

1 3 13 1K 3

1 0 0 301 0

hoagy @HoagyCunningham

6 months ago

Niels Bohr, on the surprising complexity of autoregressive text generation.

0 0 7 184 1

hoagy @HoagyCunningham

7 months ago

First paper, finally released🥳🥰

AK @_akhaliq

7 months ago

2 41 171 38K 87

2 2 14 558 1

hoagy @HoagyCunningham

a year ago

Alignment strategy generator! "New top secret MIRI agenda hinted at. All we know is it involves {NN arch}, {math area} & {phil topic}". Fill at random & distribute..

1 0 3 171 0

Harshal Nandigramwar @hnanacc

343 Followers 247 Following ai @intel labs, prev: ai @cariad_tech, masters @Uni_Stuttgart, building @todackcom, @themelioai

Matthew Clarke @Matthew05049818

0 Followers 2K Following

Younesse Kaddar @you_kad

58 Followers 422 Following Theoretical Computer Science Student

StellSoul_3 @stellsoul92816

19 Followers 484 Following

nick @birdbearfish

70 Followers 917 Following thinking about thinking things

Aleks Petrov @AleksPPetrov

166 Followers 446 Following PhD student at @OxfordTVG, @UniofOxford. I am working on Trustworthy Machine Learning, championing friendly AI and polite robots

Kai Arulkumaran @kaixhin

7K Followers 5K Following Researcher, programmer, DJ, transhumanist. @ArayaGlobal; formerly @imperialcollege/@MSFTResearch/@TwitterResearch/@facebookai/@DeepMind/@nnaisense.

Krystof Mitka @krystof_mitka

120 Followers 509 Following Currently completing undergraduate double degree in Applied Mathematics and Computer Science in 🇳🇱

krishna soham @iamkrishnasoham

107 Followers 928 Following i compute therefore i am

Jonathan Downing @downing_jonny

143 Followers 830 Following ML PhD @UniofOxford

Gabriele Sarti @gsarti_

2K Followers 2K Following PhD Student @GroNLP 🐮, core dev of @InseqLib (https://t.co/tTjrg26ygQ). Interpretability ∩ HCI ∩ #NLProc. Prev: @AmazonScience, @Aindo_AI, @ItaliaNLP_Lab.

Guillaume Corlouer @tkrdan

209 Followers 283 Following Understanding neural networks

Bradley Brown @brad19brown

82 Followers 280 Following Professional bit rearranger 👨‍💻 | CS PhD Student at the University of Oxford

tm @morfhow

4 Followers 93 Following 天

Adrián F. Amil @adriamilcar

435 Followers 2K Following PhD candidate in neuroAI at @DondersInst. Working on comp neuro and DL theory. Also into drone photogrammetry, filmmaking, philosophy, and space exploration.

Alex Loftus @AlexLoftus19

60 Followers 271 Following Data Scientist / Machine Learning Engineer

Rishi @rishih2o

55 Followers 1K Following

Alpaca 🦙 (e/delve) @LeetAlpaca3

211 Followers 3K Following Servant of the future. If I blocked you, I probably didn’t mean to. (Big block list from long ago) Tweets are my own.

Technical AI Safety C.. @tais_2024

128 Followers 28 Following On 5th-6th April 2024, TAIS will bring together leading AI safety experts in Tokyo to discuss how to make AI safe, beneficial, and aligned with human values.

Harry Mayne @HarryMayne5

119 Followers 425 Following Interpretability @oiioxford @uniofoxford. PhD student. Previously @Cambridge_Uni

_original @original1126948

9 Followers 1K Following

Mikołaj Piórczyńsk.. @AjPiorczynski

4 Followers 297 Following

Vedang Lad @vedanglad

218 Followers 357 Following MIT, computer science, physics, mathematics, art, photography, cross country, track and field

Sweet Kristy @SweetKrist23248

575 Followers 5K Following Am a happy girl 👧 for life 🥹❤️

Eric J. Michaud @ericjmichaud_

1K Followers 771 Following PhD student at MIT. Trying to make deep neural networks among the best understood objects in the universe. 💻🤖🧠👽🔭🚀

James Fox @James_D_Fox

128 Followers 820 Following CS PhD student at Oxford University

Sun 乌龟 💖 @suntzoogway

2K Followers 1K Following Shepherd the finite through the local minima of imperfect information/ Aligning all beings w/ the lovepill I'm developing/ Putting cult back into culture

Wes Gurnee @wesg52

3K Followers 198 Following Optimizer @MIT @ORCenter PhD student thinking about Mechanistic Interpretability, Optimization, and Governance.

Grog @cosell2013

83 Followers 920 Following a

Wendy Sun @wendy_sunq

49 Followers 289 Following AI & Math at MIT | @MIT_CSAIL | AI safety

Ole Jorgensen @ojorgy

197 Followers 748 Following AI Safety Researcher

Jakub Smékal @jakub_smekal

500 Followers 1K Following I'm researching how to understand and control large-scale AI systems with physics and cognitive science. PhD student @Stanford

Stuart Ritchie.. @StuartJRitchie

36K Followers 1K Following Research Comms @AnthropicAI

Jammehpapa @jammehpapa154

428 Followers 4K Following Believe in your lord Jesus Christ And A Lover Of Jesus Christ ✝️🙏

Ebrimajatt831 @ebrimajatt45935

621 Followers 5K Following Jesus Christ is the king of all kings and he shall come back soon ✝️✝️🙏

Andrew Lee @a_jy_l

262 Followers 435 Following CS PhD student @UMich advised by @radamihalcea. Prev Intern: @MetaAI x2, @MSFTResearch

Wentao Wang @wentaow10

80 Followers 922 Following PhD student @NYUDataScience

Michaël Trazzi @MichaelTrazzi

12K Followers 24 Following AI Alignment https://t.co/cAS4FnR5yf

Alex Makelov @AMakelov

122 Followers 147 Following SERI MATS, prev @MIT @Harvard Creator of https://t.co/yKCQG7hqPm Machine learning, theoretical computer science, math.

Dave @Dave73387426037

678 Followers 5K Following

ɢʀɛǟȶK̶i̶n̶g.. @GreatKingCnut

472 Followers 2K Following But the sea came up as usual and disrespectfully drenched the king's feet and shins. I want the good ending pls, not the bad one. transhumanist, ML, RL, lmao

Charlie O'Neill @charles0neill

344 Followers 1K Following Maths + Comp Sci + Economics @ ANU. Using mech interp to build hierarchical planning modules into transformers

Josh @JoshPurtell

727 Followers 2K Following ML Researcher. Ex: Cyber microexit, Yale Math. Hiring in ML Ars longa

Arjun Panickssery is .. @panickssery

1K Followers 2K Following Researching scalable oversight @MATSprogram | prev @METR_Evals @ai_risks | spaced repetition | AI safety | https://t.co/p887k6EsFs

Lucre Snooker @LucreSnooker

3K Followers 1K Following

Dalton brown @Daltonbrown944

185 Followers 2K Following AI Alignment, AI ethics

wombo combo @marquisedgelord

88 Followers 799 Following cured count keyserling's insomnia, woke kant from his dogmatic slumber, helped ur mum cum; my views represent the coherent extrapolated volition of humanity

wireheading @wireheading

67 Followers 267 Following engineer on a trading sidequest

mebubo @mebubo

10 Followers 465 Following

hoagy @hoagyd

314 Followers 772 Following I'm trying to find my keys

AI Safety Institute @AISafetyInst

529 Followers 29 Following We’re building a team of world leading talent to tackle some of the biggest challenges in AI safety - come and join us.

typedfemale @typedfemale

23K Followers 479 Following a really exciting new account "have you ever thought you might be like scott alexander? very smart, but can't do math" - anon

Eric J. Michaud @ericjmichaud_

1K Followers 771 Following PhD student at MIT. Trying to make deep neural networks among the best understood objects in the universe. 💻🤖🧠👽🔭🚀

Wes Gurnee @wesg52

3K Followers 198 Following Optimizer @MIT @ORCenter PhD student thinking about Mechanistic Interpretability, Optimization, and Governance.

main @main_horse

8K Followers 474 Following AGI Believer. Haven't applied @OpenAI. Likes are not always endorsement.

Andrew Lee @a_jy_l

262 Followers 435 Following CS PhD student @UMich advised by @radamihalcea. Prev Intern: @MetaAI x2, @MSFTResearch

Samuel Marks @saprmarks

696 Followers 79 Following Postdoc studying interpretability for AI safety under @davidbau. PhD in math from @harvard. Previously director of technical programs at https://t.co/FxRv4QgERO.

Andy Lapsa @AndyLapsa

6K Followers 51 Following CEO/Co-Founder @stoke_space - Husband/dad/engineer building 100% reusable rockets

Konstantin Mishchenko @konstmish

4K Followers 564 Following Research Scientist at Samsung AI Center Outstanding Paper Award @icmlconf 2023 Action Editor @TmlrOrg I tweet about ML papers and math

Alex Makelov @AMakelov

122 Followers 147 Following SERI MATS, prev @MIT @Harvard Creator of https://t.co/yKCQG7hqPm Machine learning, theoretical computer science, math.

Emmett Shear @eshear

103K Followers 708 Following ∀∃,∃∀

Lucre Snooker @LucreSnooker

3K Followers 1K Following

SpaceX @SpaceX

34.5M Followers 113 Following SpaceX designs, manufactures and launches the world’s most advanced rockets and spacecraft

Wei Dai @weidai11

7K Followers 82 Following wrote Crypto++, b-money, UDT. thinking about existential safety and metaphilosophy. blogging at https://t.co/mBVFhriJVf

Mira Murati @miramurati

274K Followers 526 Following CTO @OpenAI

Luke Muehlhauser @lukeprog

8K Followers 293 Following Open Philanthropy Senior Program Officer, AI Governance and Policy

Joe Barnard 🚀 @joebarnard

87K Followers 481 Following Intern @ https://t.co/MZrwGWouCr

Lucia Quirke @lucia_quirke

212 Followers 81 Following Neural network interpretability researcher at EleutherAI

hoagy @hoagyd

314 Followers 772 Following I'm trying to find my keys

RFH🦎👁‍🗨.. @hollowearthterf

38K Followers 985 Following Artemis Cultist. Loosh Harvester. Peattard. At eternal war with Rome. Infinite love is the only truth everything else is illusion!!!!!🇨🇳🏴󠁧󠁢󠁥󠁮󠁧󠁿🇮🇱🇦🇷

Buck Shlegeris @bshlgrs

1K Followers 199 Following CEO at Redwood Research, working on technical research for AI safety.

Joe Benton @JoeJBenton

248 Followers 63 Following Alignment Science at Anthropic | Previously PhD at University of Oxford

Lauro @laurolangosco

883 Followers 677 Following Working on AI safety and science of deep learning @CambridgeMLG. Here to discuss ideas and have fun.

akbir. @akbirkhan

883 Followers 999 Following dumbest overseer in the loop @UCL_DARK (he/him)

Herbie Bradley @herbiebradley

690 Followers 602 Following a generalist agent | AI governance & safety @AISafetyInst | PhD student @Cambridge_Uni @AI4ER_CDT | formerly @AiEleuther

Chris Meserole @chrismeserole

3K Followers 775 Following Executive Director, Frontier Model Forum | Former Director, Brookings A.I. & Emerging Tech Initiative

Arthur Conmy @ArthurConmy

1K Followers 652 Following @ Google DeepMind

David Krueger @DavidSKrueger

13K Followers 4K Following Cambridge faculty - AI alignment, deep learning, and existential safety. Formerly Mila, FHI, DeepMind, ElementAI, AISI.

Scott Manley @DJSnM

513K Followers 538 Following Internet Rocket Scientist, Gamer, Astronomer, Dad, Scotsman. Makes videos about science and video games.... at the same time! https://t.co/5p7T8YmtuC

Meg Tong @megtong_

158 Followers 189 Following Working at @Anthropic!

Daniel Murfet @danielmurfet

699 Followers 508 Following Mathematician at the University of Melbourne. Working on Singular Learning Theory and AI alignment.

Alex Turner @Turn_Trout

997 Followers 39 Following Research scientist on the scalable alignment team at Google DeepMind. All views are my own.

Ryan Kidd @ryan_kidd44

945 Followers 822 Following Co-Director at @MATSprogram | Board Member at https://t.co/26oYPZwxVx | PhD in physics | Accelerate AI alignment + build a better future for all

Quintin Pope @QuintinPope5

3K Followers 186 Following ML researcher focusing on natural language modeling and alignment.

Jesse Hoogland @jesse_hoogland

857 Followers 1K Following Researcher and decel working on developmental interpretability. Executive Director @ Timaeus

Lewis Hammond @lrhammond

753 Followers 1K Following Research Director @coop_ai / DPhil Candidate @CompSciOxford and @HertfordCollege / Affiliate @FHIOxford and @GovAI_

shako @shakoistsLog

6K Followers 2K Following real truth about it is, no one gets it right https://t.co/1Qoefmwmw2

Walter Goodwin @goodwin_ml

101 Followers 138 Following AI & teaching robots, Uni of Oxford.

PhD at 19 |
Founder and CEO at @MedARC_AI |
Research Director at @StabilityAI |
@kaggle Notebooks GM |
Biomed. engineer @ 14 |
TEDx talk➡https://t.co/xPxwKTq6Qb

Tanishq Mathew Abraha.. @iScienceLuvr

Mikhail Parakhin @MParakhin

17K Followers 21 Following

Jamie Bernardi @The_JBernardi

915 Followers 655 Following Doing AI Governance research, ex-Co-Founder, Bluedot Impact. Climber, guitarist and sporadic musician. he/him.

Yi Ma @YiMaTweets

71K Followers 123 Following Chair Professor in AI, Director of IDS, Head of CS, HKU; Professor of EECS, Berkeley; Author of Book: High-Dim Data Analysis, https://t.co/gwaqMJp8av.

Tesla AI @Tesla_AI

148K Followers 14 Following Developing & deploying autonomy at scale in vehicles, robots & more

Nina Rimsky @NinaRimsky

439 Followers 317 Following Interested in AI safety + interpretability

The Suburban Pirate @suburbanpirate

941 Followers 674 Following East Dulwich Retrofit Coordinator. #1stMillionHP

aidan ewart @aidanprattewart

96 Followers 455 Following ai safety lukewarm takes

Logan Smith @loganriggssmith

29 Followers 12 Following Aligning AIs before they align us. 🤖🎯 #SafeSkynet

#1 Rudin Prologue Sta.. @Robert_AIZI

67 Followers 115 Following Learning AI Safety, blogging at https://t.co/VQL0UnGSFi, making bad jokes here. Math PhD. Any pronouns.

Marius Hobbhahn @MariusHobbhahn

2K Followers 994 Following Director/CEO at Apollo Research @apolloaisafety Ph.D. student of Machine Learning @PhilippHennig5; AI safety/alignment

Stefan Schubert @StefanFSchubert

28K Followers 2K Following Philosophy, psychology, and effective altruism.

🌈ze.zima @ze_zima

a day ago

Emotional to be sharing this finally. It's so incredibly special to me. It's a piece that muses on reincarnation, extra-dimensional perspectives, and the interconnectedness of lives. The music is 'Back to body' by Bethany Ley, also released today.💕(1/?)

74 461 3K 170K 870

River Kenna @the_wilderless

2 years ago

@icantsay Yeah my suspicion is that emotions are fat soluble, and if you eat to deal with them, they come back in force when the fat burns off

4 14 163 0 89

Nathan 🔍 @NathanpmYoung

5 days ago

Future House now has a sign so we can know when the next train is. It’s cool when people just build stuff like this.

5 1 78 4K 9

alth0u🤸 @alth0u

6 days ago

i like poets who weren't tortured

2 5 61 3K 0

Alex Christian @alexschristian

4 years ago

For some reason I had to turn Andrew Cuomo’s mad coronavirus poster into a Jenny Holzer installation:

1 2 5 0 0

Ed Bithell @Ed_Bithell

4 days ago

@DalrympleWill Don't miss the perfect opportunity to tweet Sala Francés' painting the Expulsion of the Jews from Spain! museodelprado.es/en/the-collect…

1 0 1 58 0

Daniel @growing_daniel

a week ago

Data is way more like oil now than when that was originally said

34 43 818 124K 47

mostly quiet @babarganesh

a week ago

why do they call them down comforters and not *ducks for cover*

0 2 16 469 0

Dwarkesh Patel @dwarkesh_sp

2 years ago

@tsungxu @tferriss @lexfridman @joerogan lol, so you're saying I should stay in SF?

2 0 4 0 0

mostly quiet @babarganesh

2 weeks ago

"i got covid right off the bat"

0 2 10 540 0

sympathetic opposition @sympatheticopp

2 weeks ago

a witch has put me under a curse of silence about anything that is remotely unpleasant & i dont feel like talking about

5 2 66 2K 1

aidan ewart @aidanprattewart

2 weeks ago

my (real) (actual) government put this out today what a timeline

1 0 8 549 1

Glenn @GlennLuk

3 weeks ago

That so many people buy into this reflects: (i) how we as a country are now so far removed from manufacturing that we understand so little of it (as demonstrated here and many similar replies), and (ii) the difference between ideas and execution.