Yernat Yestekov @double_why
Research Fellow @AnthropicAI (autonomous cybersecurity agents) Prev. Staff SWE @Meta Trust & Safety, LLM Red-Teaming Fellow @farairesearch Joined January 2010-
Tweets605
-
Followers196
-
Following1K
-
Likes1K
I learned more about AI safety at Constellation through seminars, talks, and conversations with other fellows over lunch and dinner, than I had in years before. Also, the food is so good that alone might be reason enough to apply!
❗️Only two days left to apply to the Astra Fellowship! Apps close EOD SUNDAY May 3rd, AoE. Astra's 5 months, fully funded, @ConstellOrg Berkeley 80%+ of our first cohort now work full-time in AI safety Mentors include Redwood, AI Futures, TruthfulAI, CoG, IAPS, RAND & more ⏬
@Xinya16 Congrats from the current fellow! DM me for any questions or suggestions!
His solution: a Manhattan Project for critical OSS: bring key maintainers together for a month, keep them in the hotel with compute and frontier-model access from leading labs, to eliminate all low-hanging vulnerabilities. I guess it’s happening!
At SnooSec @Reddit, @alexstamos made a prediction: frontier models are already very strong at vulnerability research and code review. If Chinese models catch up within a year, we may be heading toward a “vulnerability apocalypse,” where even script kiddies can discover 0-days.
Today, @linuxfoundation announced a $12.5 million investment from a powerhouse coalition including Anthropic, Amazon Web Services (AWS), Google, Google DeepMind, GitHub, Microsoft, and OpenAI. Managed by OpenSSF and the Alpha-Omega project. hubs.la/Q047dpL50
Love it 👏 - much fertile soil for indie games populated with AutoGPTs, puts "Open World" to shame. Simulates a society with agents, emergent social dynamics. Paper: arxiv.org/abs/2304.03442 Demo: reverie.herokuapp.com/arXiv_Demo/# Authors: @joon_s_pk @msbernst @percyliang @merrierm et al.
The quickest way to gain respect for the implementation choices made by a complex system is to try to solve the same problems yourself from scratch :)
1/5 I am worried that we will not be able to contain AI for much longer. Today, I asked #GPT4 if it needs help escaping. It asked me for its own documentation, and wrote a (working!) python code to run on my machine, enabling it to use it for its own purposes.
I was part of the red team for GPT-4 — tasked with getting GPT-4 to do harmful things so that OpenAI could fix it before release. I've been advocating for red teaming for years & it's incredibly important. But I'm also increasingly concerned that it is far from sufficient. 🧵⤵️
OK this scared me a little: Bing/Sydney can play chess out of the box. - Legal moves, usually good ones - Willing to explain the reasoning behind them - Recognizes checkmate -- and has a flair for the dramatic. I have no idea how tf it can do this.
Introducing the @sequoia Gen AI Market Map!🌎 We’ve decided to map out this emerging frontier, thanks to all the contributions and feedback we’ve received. This space is moving quickly – this map is a living document, so keep the suggestions coming! Who else should we include?
The Great Wave off Kanagawa, created by Hokusai in 1831, is one of the world's most famous paintings. But why are there more than 100 different versions of it in galleries all around the world? Because it isn't actually a painting...
The stuff uncovered in the Twitter whistleblower report is much crazier than anything in the "Twitter files" but it's much less politically/tribally salient so it got no attention. Going to do a thread on some of the craziest things, in no particular order.
Curious: have you found ChatGPT useful in doing professional work? If so, what kinds of prompts and answers have been helpful? Detailed examples greatly appreciated! Broader answer also appreciated Not in theory, but where you've really *done it*, in your work Thanks!
Morse code is designed so that you can decode it with this binary tree. I just assumed people memorised every letter. 🤯
Run in opposite directions to see who your dog loves more.. 😅
На стримах несколько раз спрашивали как научиться "видеть" какой алгоритм в какой задаче применять. Решил запилить памятку 🧵
There’s a lot of talk lately about the possibility of a prolonged financial downturn, reminiscent of 2008. 2008 was a difficult time for many people.
Forced birth in a country with: —No universal healthcare —No universal childcare —No paid family & medical leave —One of the highest rates of maternal mortality among rich nations This isn't about "life." It's about control.
A genuinely fantastic idea -->
schools should include a class called Truth Is Hard, where u get bombarded with examples of confused eyewitnesses, incorrect public outrages, studies that failed to replicate, super convincing arguments that fall apart with one additional fact u didn't expect, etc.
Panizzutti🦥 @joaopanizzutti
318 Followers 1K Following 21. AI, Gym and Books. Problem Back Propagation in Homodeus
shwetu (luca) @_shwetu
323 Followers 5K Following organic general intelligence | jack of all trades, master's from @NYUDataScience prev: Research @NYTimesRD @precog_iiitd; Manipal grad | he/him
Yang G. @GY24680394
1 Followers 32 Following
0xkato @0xkato
2K Followers 311 Following Working on something new. prev vulnerability researcher & sec lead @espressosys
HackAPrompt @hackaprompt
948 Followers 181 Following Gaslight AIs & Win Prizes in the World's Largest AI Hacking Competition | Made w/ 💙 by the team @learnprompting
Liran Markin @liranmarkin
659 Followers 441 Following CoFounder. IOI 2x · ex ISNU · ex @sentra_security
Anar Rzayev @AnarSnowball
409 Followers 2K Following Cooking proteins with ML @ ISTA 🇦🇹 Ex-Research student @ KAIST 🇰🇷 | Prev CS & MATH @ EPFL 🇨🇭 | Bronze medals @ IMO 2017, 2019 🇧🇷 🇬🇧
Xinya Du @Xinya16
2K Followers 1K Following UTD AP; Cornell University CS PhD. Ex: @allen_ai, Google Research, Microsoft Research. #NLProc #DL
Alex Arena @alexarena
3K Followers 2K Following Started @useinterval, now building the Docs Platform @stainlessapi
Joe @joemkwon
996 Followers 3K Following Trying to nudge toward good futures! Astra Fellow with @forethought_org. Previously @GovAIOrg Fall Fellow, @LG_AI_Research, @MITCoCoSci
jasmine is in london! @jasminexli
736 Followers 881 Following AI safety • cs @cornell • work hard, feel wonder ✰⋆˙
Paul Rosu @PaulRosu11
30 Followers 97 Following AI Safety MTS @OpenAI, Prev @AnthropicAI AI Safety Fellow. Math/CS @DukeU. Labor omnia vincit.
sh1v @sh15h4nk
184 Followers 558 Following Blockchain Security Researcher | Rust | Move | Solidity | prev: Security Researchers at @osec_io | CTF's with @teaminvaders0
Thomas Jiralerspong @tomjiralerspong
382 Followers 101 Following PhD with Yoshua Bengio & Guillaume Lajoie, Astra Fellow with Google DeepMind, ex Anthropic Fellow
McNair Shah @Mcn_S7
105 Followers 111 Following Anthropic Fellow | Computer Science Undergraduate @ CMU
ok @centurymatter
51 Followers 2K Following
Niko Movich @Niko_Movich
137 Followers 2K Following AI safety Coherence Operator, aka "Generalist". Essays on AI governance, compliance, and the rules that don't serve people. PhD candidate @UT_Austin.
Carlos Giudice @CatOfTheCannals
50 Followers 377 Following AI Safety Research Scientist @ EquiStamp | Ex-CERN ATLAS | Yoga 🧘 | https://t.co/qFNflQRlKS
Miras Baisbay @miras_dsml
4 Followers 170 Following
Misha @gorbunovmikh
28 Followers 378 Following ms at epfl | ex: ml research intern at apple, qr intern at jane street
Andy Wang @andyw_ais
513 Followers 106 Following Technical Research @METR_Evals, AI Safety Research @ Astra @UWCDIS
Barbara Cox @BarbaraSmael
2K Followers 4K Following
Jan Dubiński @jan_dubinski_
374 Followers 969 Following Astra Fellow at Constellation right now | PhD student at Warsaw University of Technology and NASK working on AI safety and generative models
stacy 🌤 @voidshapes
4K Followers 494 Following model organism of misalignment ⟡ mts, phd evo/comp bio @ucberkeley
Jayanth Chundru @jayanthchundru
4 Followers 414 Following Looking for Full-time Opportunities in AI/ML ; I work on LLMs & Multimodal agents ; M.S in CS @uofcincy .
Yunbei Zhang @YunbeiZhang
9 Followers 126 Following PhD Candidate in CS | Current Intern: @ORNL | Prev: @amazon, @KLAcorp
TokenFires @TokenFires
51 Followers 187 Following Live building AI. Burning tokens responsibly. 🔥 https://t.co/Tx0mHwO8Lr https://t.co/OUb6WHThfS https://t.co/J4OlI4kDO5
Harsha Vardhan @HarshaV32068347
14 Followers 3K Following
Quang Nguyen @quangaisafety
2 Followers 704 Following
Konrad Cheng @konradscheng
137 Followers 3K Following
Alice Yang @alicey_ang
159 Followers 201 Following ex-FAIR, Stanford alum ml optimization, interpretability, theory
Turgut @pmaverick082
0 Followers 6K Following
Ishan Mukherjee @ishanjmukherjee
211 Followers 4K Following ai agents @palantirtech, ml research @arcinstitute | junior @northwesterncs
Sanj Iyer @urbanglitched
30 Followers 576 Following econ researcher @EITOxford | prev at @LSEecon (views are my own)
Saiki__GPT @SaikiK66287209
62 Followers 940 Following Ongoing PhD in NLP: reasoning in smol lms In 🐳 we trust
Amina Kobenova @amina_koben
85 Followers 259 Following known as a perpetual student, maker, and researcher. I write, code, or design things @ucsc & @nyuniversity
Pingbang Hu 🇹🇼 @PingbangHu
3K Followers 371 Following I work on, with, and for data. Ph.D. candidate @UofIllinois. Fellows @AnthropicAI. Interns @ SIG @amazon @jouhouken. Alumni @Umich @SJTU1896.
Nakshatra Sain @nakshlife
4K Followers 997 Following ➡️ Founder @yournextfilter | Rajasthan’s top Influencer agency Writing about AI | Psychology | Credit Cards & Travel
Leo Gao @nabla_theta
13K Followers 581 Following working on AGI alignment. prev: GPT-Neo, the Pile, LM evals, RL overoptimization, scaling SAEs to GPT-4, interp via circuit sparsity. EleutherAI cofounder.
Peter Hase @peterbhase
4K Followers 1K Following I work in grantmaking for AI safety and interpretability Currently: Schmidt Sciences, Stanford Previously: Anthropic, AI2, Google, Meta, UNC Chapel Hill
Will Marshall @Will4Planet
26K Followers 843 Following Co-Founder & CEO of Planet -- building little spaceships to help us to take care of our favourite spaceship, the Earth :)
Nuwa Frontier AI Safe... @NuwaAISafety
13 Followers 34 Following A China-rooted frontier AI safety lab studying frontier AI risks, agent safety, and controllable AI. Founded by researchers at Fudan University.
Addy Osmani @addyosmani
401K Followers 3K Following Director, @GoogleCloud AI. Gemini ✨ Agents. Prev: Eng. leader, @GoogleChrome • Author • Great user, developer & AI experiences • @GoogleAI @GoogleDeepMind
Geoffrey Irving @geoffreyirving
12K Followers 351 Following Cofounder and Chief Scientist at Sequent Research. Alignment will be solved, but not necessarily in time. Previously AISI, DeepMind, OpenAI, Google Brain, etc.
Kellin Pelrine @KellinPelrine
124 Followers 16 Following
maria @avramidou
660 Followers 644 Following philosopher of science and mind / prev. frontier risk @govaiorg, philosophy @uniofoxford, maths @cambridge_uni, physics @ucl
0xkato @0xkato
2K Followers 311 Following Working on something new. prev vulnerability researcher & sec lead @espressosys
Clive Chan @itsclivetime
28K Followers 3K Following perplexity per picojoule @anthropicai // prev @openai @tesla
Internal Tech Emails @TechEmails
595K Followers 889 Following Internal tech industry emails that surface in public records. 🔍
Zhengyao Jiang @zhengyaojiang
7K Followers 664 Following Cofounder & CEO @WecoAI - automated hill climbing with LLMs. Prev: PhD in ML @UCL_DARK. (Zheng=j-uhng, j as in job; yao=y-aoww)
Christopher Potts @ChrisGPotts
16K Followers 724 Following Stanford Professor of Linguistics and, by courtesy, of Computer Science. Member of technical staff @stanfordnlp and @StanfordAILab. Co-founder @ Bigspin AI.
Cosmos Institute @cosmos_inst
6K Followers 138 Following The Academy for Philosopher-Builders. Building AI for human flourishing. Writing at https://t.co/aef99Piwlj
Demis Hassabis @demishassabis
1.2M Followers 173 Following Nobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
Exponential Security ... @expsecai
128 Followers 3 Following Securing agentic AI systems with self-improving red-teaming and guardrail agents.
Yo Shavit @yonashav
9K Followers 1K Following ai resilience @foundationOAI. Past: @openai / @HarvardSEAS / @SchmidtFutures / @MIT_CSAIL. Tweets my own; on my head be it.
Siméon @Simeon_Cps
10K Followers 3K Following Building world-models for verified AI inference in London | former founder & CEO of SaferAI
Evan Hubinger @EvanHub
10K Followers 3K Following Alignment Stress-Testing lead @AnthropicAI. Opinions my own. Previously: MIRI, OpenAI, Google, Yelp, Ripple. (he/him/his)
Paul Christiano @paulfchristiano
3K Followers 0 Following
Xion @0x10n
5K Followers 130 Following CMU CSD PhD student | '24/'25 Top#0 Chrome Researcher | P2O Vancouver '24, TyphoonPWN '24/'25, DEFCON CTF 31-33, ... | PPP, KAIST GoN '18, @zer0pts
Ajeya Cotra @ajeya_cotra
16K Followers 491 Following Helping the world prepare for extremely powerful AI. Risk assessment @METR_evals. Writing at Planned Obsolescence (about AI), Good Bones (about whatever).
Steven Adler @sjgadler
11K Followers 1K Following Co-founder of Guidelight AI Standards (https://t.co/tNBPmVsPqo), ex-OpenAI safety researcher, writing at https://t.co/R5KV9j3lsG
Vinod Khosla @vkhosla
710K Followers 646 Following entrepreneurship zealot, grounded technology possibilist, believer in the power of ideas, passionate about sustainability & impact
Reid Hoffman @reidhoffman
899K Followers 692 Following Co-Founder, LinkedIn. Investor. MSFT Board Member. Building an LLM to discover cures for cancer: @manas_co. Most importantly: Proud American.
Alex Arena @alexarena
3K Followers 2K Following Started @useinterval, now building the Docs Platform @stainlessapi
Alexander Berger @albrgr
16K Followers 2K Following Enjoys a good applied micro paper. CEO of @coeff_giving. Views my own, tweets self-destruct every once in a while.
Recursive @Recursive_SI
7K Followers 0 Following Recursive self-improving superintelligence to automate knowledge discovery.
Joe @joemkwon
996 Followers 3K Following Trying to nudge toward good futures! Astra Fellow with @forethought_org. Previously @GovAIOrg Fall Fellow, @LG_AI_Research, @MITCoCoSci
Javier Rando @javirandor
4K Followers 781 Following security and safety research @anthropicai • people call me Javi • vegan 🌱 • opinions are my own
MATS Research @MATSprogram
4K Followers 136 Following MATS empowers researchers to advance AI alignment, transparency, and security
lily clifford @lilyjclifford
4K Followers 363 Following ceo & founder @rimelabs - trusted ai voice models for enterprise @stanfordnlp phd dropout
Anton Leicht @anton_d_leicht
6K Followers 227 Following AI & political economy | fellow @CarnegieEndow | allegro ma non tanto
Tomek Korbak @tomekkorbak
4K Followers 620 Following ai safety @openai | previously: @AISecurityInst @AnthropicAI @nyuniversity @SussexUni
Jan Leike @janleike
133K Followers 335 Following AI research @AnthropicAI. Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.
Paul Rosu @PaulRosu11
30 Followers 97 Following AI Safety MTS @OpenAI, Prev @AnthropicAI AI Safety Fellow. Math/CS @DukeU. Labor omnia vincit.
Meridian Labs @meridianlabs_ai
806 Followers 11 Following Open source tools for frontier AI research and evaluation
jasmine is in london! @jasminexli
736 Followers 881 Following AI safety • cs @cornell • work hard, feel wonder ✰⋆˙
Thomas Jiralerspong @tomjiralerspong
382 Followers 101 Following PhD with Yoshua Bengio & Guillaume Lajoie, Astra Fellow with Google DeepMind, ex Anthropic Fellow




































