nash @RogueEngineer
SVP Engineering & AI at Sophos. Hopkins PhD in teaching machines to find bad stuff. Former NSA. Opinions are mine, threat models are yours.
United States. Joined March 2007.
Tweets 424
Followers 154
Following 546
Likes 200
The anti-AI war is going to get sloppy... github.com/leilei926524-t…
JUST IN: Sam Altman says AI probably won’t trigger the “jobs apocalypse” he once predicted.
Gradient descent for SKILL.md files sounds interesting, maybe a bit complex, but it's becoming a real part of the agent harness. SkillOpt is one of the first papers to treat markdown skill files as trainable parameters and provides a proper optimization framework for them. A few things I learned that you should consider too.
1. The validation gate is the only thing that matters in a self-editing loop. Held-out set, strict improvement, ties rejected. End-to-end, their best skills land with 1 to 4 accepted edits total. If your "self-improving agent" is accepting most of what it proposes, you're shipping slop.
2. Bounded edits are better than full rewrites. 4 to 8 edits per step is the sweet spot. Remove the budget and performance collapses. This is the textual analog of learning rate, and it transfers to any LLM-as-author loop. If you're using an agent to refactor your docs, your prompts, or your skills, cap the diff size.
3. Compactness wins. Median final skill: ~920 tokens. Skills do not need to be long. They need to be high-signal. Most skill files I see are bloated because length feels like effort. It isn't.
4. The harness is becoming less important; the skill is becoming more important. A Codex-trained skill ported into Claude Code hit +59.7 points on SpreadsheetBench. Procedural knowledge is more general than the runtime that produced it.
5. Frozen model + trained context is the practical adaptation. GPT-5.4-nano with a SkillOpt'd skill ≈ frontier behavior on procedural benchmarks. Cheaper, portable, inspectable, zero inference-time cost. This is the answer to "how do we adapt a frontier model for our domain" for almost everyone who isn't training their own models.
6. Verification is the bottleneck. Every gate in this paper depends on an auto-grader. That works for benchmarks. It fails for writing, design, and strategy, exactly the open-ended work we want to automate. Whoever builds the verifier for open-ended tasks owns the next stage.
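The gate and edit budget from points 1 and 2 can be sketched in a few lines. This is a toy illustration under stated assumptions, not SkillOpt's implementation: `count_edits`, `gated_update`, and `toy_score` are hypothetical names, the skill is modeled as a list of lines, and the scorer stands in for a held-out auto-grader.

```python
from typing import Callable, List, Tuple

Skill = List[str]  # toy model: a skill file as a list of lines


def count_edits(old: Skill, new: Skill) -> int:
    """Crude diff size: lines that differ position by position, plus any length change."""
    changed = sum(1 for a, b in zip(old, new) if a != b)
    return changed + abs(len(old) - len(new))


def gated_update(
    current: Skill,
    candidate: Skill,
    score_heldout: Callable[[Skill], float],
    max_edits: int = 8,
) -> Tuple[Skill, bool]:
    """Return (skill to keep, accepted?) for one proposed edit step."""
    # Edit budget: reject oversized diffs before spending any evaluation on them.
    if count_edits(current, candidate) > max_edits:
        return current, False
    # Validation gate: strict improvement on the held-out score; ties are rejected.
    if score_heldout(candidate) > score_heldout(current):
        return candidate, True
    return current, False


def toy_score(skill: Skill) -> float:
    """Hypothetical grader: counts lines that mention a verification step."""
    return float(sum("verify" in line for line in skill))


current = ["read the sheet", "write the output"]
tie = ["read the spreadsheet", "write the output"]                # rephrase, same score
better = ["read the sheet", "verify totals", "write the output"]  # small, scoring edit
rewrite = [f"verify step {i}" for i in range(6)]                  # full rewrite, over budget

for name, candidate in [("tie", tie), ("better", better), ("rewrite", rewrite)]:
    _, accepted = gated_update(current, candidate, toy_score, max_edits=4)
    print(name, "accepted" if accepted else "rejected")
```

Only `better` passes here: the rephrase ties and is rejected, and the full rewrite exceeds the budget even though it would score higher, which is the behavior the thread argues for.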
There are also two lessons I learned while shipping v2.3.0 of my Context Engineering Agent Skills repo, measured across composer-2, claude-opus-4-7, gpt-5.5, and gemini-3.1-pro via the @cursor_ai SDK:
- Description and body are two different surfaces. The router only sees the description. The agent sees the body once activated. They can quietly disagree, and only end-to-end task tests catch it.
- Aggregate accuracy is the wrong unit. When I rewrote three descriptions, the corpus average moved ~1pp. Individual skills moved 23–25pp. Per-skill effect size is where the action is.
Also, in Feb 2026 I shared a piece called Personal Brain OS arguing that the markdown file is a first-class substrate for agent state. SkillOpt is the optimizer-shaped version of that same argument: not "store memory in files" but "treat files as trainable parameters with proper optimization machinery around them." That's the move from static to measured.
The fast/slow split they describe already lives implicitly in the digital-brain-skill repo:
- voice-guide and tone-of-voice.md are slow-state (rarely touched)
- posts.jsonl and bookmarks.jsonl are fast-state
What SkillOpt adds that I didn't have is a protected section invariant, a structural guarantee that fast edits cannot overwrite slow lessons. Removing that mechanism cost them 22 points on SpreadsheetBench. Worth borrowing.
If you're building agents, SkillOpt: Executive Strategy for Self-Evolving Agent Skills is a good paper to read: arxiv.org/pdf/2605.23904
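The aggregate-versus-per-skill point is easy to see with arithmetic. The sketch below uses made-up numbers: the corpus size of 72 skills, the accuracy values, and the names `per_skill_deltas` and `top_movers` are all assumptions chosen to reproduce the shape of the result (three skills move about 24pp, the corpus mean moves about 1pp), not data from the repo.

```python
from typing import Dict, List, Tuple


def per_skill_deltas(before: Dict[str, float], after: Dict[str, float]) -> Dict[str, float]:
    """Accuracy change per skill, in percentage points."""
    return {name: after[name] - before[name] for name in before}


def top_movers(deltas: Dict[str, float], k: int = 3) -> List[Tuple[str, float]]:
    """The k skills with the largest absolute change."""
    return sorted(deltas.items(), key=lambda kv: abs(kv[1]), reverse=True)[:k]


# Hypothetical corpus: 72 skills, 69 untouched, 3 with rewritten descriptions.
before = {f"skill_{i:02d}": 80.0 for i in range(72)}
after = dict(before)
for name in ("skill_00", "skill_01", "skill_02"):
    before[name] = 50.0  # activation accuracy before the rewrite (made up)
    after[name] = 74.0   # after the rewrite (made up)

deltas = per_skill_deltas(before, after)
aggregate_shift = sum(deltas.values()) / len(deltas)

print(f"corpus mean shift: {aggregate_shift:.1f}pp")
for name, delta in top_movers(deltas):
    print(f"{name}: {delta:+.1f}pp")
```

With these numbers the mean shift is 3 × 24 / 72 = 1pp, so a dashboard that only tracks the corpus average would report almost nothing while three skills changed substantially.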
PICARD: Data, shields up DATA: Brilliant! Shields can reduce damage we sustain. Not immunity. Not hubris. Just prudence. It's not precaution—it's strategy. [camera shakes] WORF: HULL BREACHES ON NINE DECKS DATA: Here's what happened: you told me to raise shields, and I didn't
Hermes Agent now has access to hundreds of browser skills through @browserbase’s new Browse.sh hub, so agents can more reliably perform any task on the internet. You can try a skill from their catalog or contribute your own.
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
Hermes Agent v0.14.0 - “The Foundation Release” Changelog below
For the past 2 months, XBOW has been testing Mythos Preview under embargo as part of a select early-access group. Today, we can finally share what we found. The headline: Mythos Preview is a major advance. It is substantially better than prior models at finding vulnerability candidates, especially when source code is available. But it’s not perfect. We surfaced issues with exploit validation, judgment, and efficiency. Our full write-up covers where Mythos Preview shines, where it still needs support, and what we think this means for the future of offensive security: bit.ly/42zQl98
This works really well btw, at the end of your query ask your LLM to "structure your response as HTML", then view the generated file in your browser. I've also had some success asking the LLM to present its output as slideshows, etc.
More generally, imo audio is the human-preferred input to AIs but vision (images/animations/video) is the preferred output from them. Around a third of our brains are a massively parallel processor dedicated to vision; it is the 10-lane superhighway of information into the brain. As AI improves, I think we'll see a progression that takes advantage:
1) raw text (hard/effortful to read)
2) markdown (bold, italic, headings, tables, a bit easier on the eyes) <-- current default
3) HTML (still procedural with underlying code, but a lot more flexibility on the graphics, layout, even interactivity) <-- early but forming new good default
...4,5,6,...
n) interactive neural videos/simulations
Imo the extrapolation (though the technology doesn't exist just yet) ends in some kind of interactive videos generated directly by a diffusion neural net. Many open questions as to how exact/procedural "Software 1.0" artifacts (e.g. interactive simulations) may be woven together with neural artifacts (diffusion grids), but generally something in the direction of the recently viral x.com/zan2434/status…
There are also improvements necessary and pending at the input. Neither audio nor text nor video alone is enough, e.g. I feel a need to point/gesture to things on the screen, similar to all the things you would do with a person physically next to you and your computer screen.
TLDR The input/output mind meld between humans and AIs is ongoing and there is a lot of work to do and significant progress to be made, way before jumping all the way into neuralink-esque BCIs and all that. For what's worth exploring at the current stage, hot tip: try asking for HTML.
🚨 OPEN SOURCE AI IS LITERALLY UNSTOPPABLE 🚨
The legendary founder of Redis (Antirez) just dropped ds4, a custom native inference engine built specifically for DeepSeek v4 Flash.
This is earth shattering! Here is why:
DeepSeek v4 Flash is a quasi-frontier model with a massive 1M context window.
You can now run it LOCALLY on a 128GB Mac using specialized 2-bit quantization.
The architecture is reimagined: he moved the KV cache from RAM directly to the SSD! 🤯
We already know DeepSeek v4 Flash is insanely good for agentic loops. Now you don't even need the cloud to run it.
Closed-source labs are burning tens of billions on massive GPU clusters while single brilliant developers are running frontier-level AI on laptops!
They told us open-source would be worthless against trillion-dollar monopolies.
Instead, pure hacker culture + incredible open-weight models are completely rewriting the rules.
Open Source will ALWAYS win 💕
Asking for HTML explanations of things is pretty neat, I tried it just now with the obfuscated Python POC for the new copy.fail Linux vulnerability: simonwillison.net/2026/May/8/unr…
Anthropic just shipped sleep into agents.
When you sleep, your hippocampus replays the day's neural sequences to the cortex during 150-220 Hz bursts called sharp-wave ripples. The replay runs about 20x faster than the original experience. A 10-second sequence gets compressed to roughly 500 milliseconds. Wilson and McNaughton showed this in rats in 1994. You ran this algorithm last night on whatever you did yesterday, whether you wanted to or not.
The replay does two things at once. It extracts statistical patterns: what mattered, what generalizes, which sequences predicted reward. And it reorganizes the memory trace from hippocampus-dependent storage into neocortex, which is why old memories survive hippocampal damage but recent ones don't. Disrupt sharp-wave ripples in a rat with optogenetics and the rat fails the next day's task. The replay is causal, not correlational.
Most "agent memory" today is a search engine. Past sessions get embedded, you retrieve relevant chunks at the next call. That works for facts. It does not extract patterns and it does not reorganize the trace. Which is why agents plateau. The memory volume keeps growing while real capability flatlines.
Dreaming reviews past sessions, extracts patterns, curates memories. That is the brain's actual three-step algorithm. They called it dreaming because dreaming is what the algorithm does, in roughly the same order, for roughly the same reason.
Agents that dream between sessions will compound. The ones still running on raw context window will hit the same ceiling humans hit when they pull all-nighters.
Live from Code with Claude: we're launching dreaming in Claude Managed Agents as a research preview. Outcomes, multiagent orchestration, and webhooks are now in public beta.
My Hermes Agent when I ask it to remember something…
OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-end 🧵
Khushi Garg @KhushiG66688363
1 Followers 27 Following
Blago Dimitrov @blagodesign
136 Followers 637 Following Product designer and Framer expert writing about AI, digital tools, and product thinking
Kush Vasaniya @vasaniyakush
8 Followers 62 Following SWE 1 @ Sophos | Working on Identity Threat Detection and Response | Go, Java, Python | AWS, Azure | Driving and Gaming
Latest in Cosmos @latestincosmos
37K Followers 18K Following 🌌 Daily updates on space, astronomy & cosmic discoveries 🚀 | Science breakthroughs & the wonders of the universe
An Engineer's Log @an_engineer_log
4K Followers 6K Following Building https://t.co/FKJYofCo7y FAANG engineer by day, entrepreneur by night. Looking for "the one".
Pradeep Kumar @Pradee1p2
3K Followers 7K Following Writer & Creator Crypto and forex market Indian stock market
ÆGI Dispatch @agi_dispatch
37 Followers 420 Following
XavieraRobin @p8udvw8RN827U4
141 Followers 6K Following
agustinannad020 Smith @agustinann4509
2 Followers 194 Following Hi, I'm from Queens, New York. I'm currently single and enjoy exchanging valuable information with friends.
binasreginaf207 Birch @binasregin85335
1 Followers 191 Following Hi, I'm from Queens, New York. I'm currently single and enjoy exchanging valuable information with friends.
Everett Watsica @watsica19439
133 Followers 5K Following
Jane Michelle @JaneMicheltixa
9 Followers 116 Following
Simon @Simon339536
8 Followers 208 Following
ValerieEdmund @AWo5nblWEn9A6G
162 Followers 6K Following
Saadia @Eslawpwu640235
12 Followers 367 Following
_Cody @OddStoTrader
66 Followers 3K Following Stocks, Crypto & Al trade ideas shared for $3/mo. Prioritize Replies to Subs DM's Open to Subscribers Subscribe here: https://t.co/rIszCswMwd
Elsa @Aumaroqe314059
137 Followers 5K Following I’m learning to love the sound of my feet walking away from things not meant for me.
Venky @bevenky
6K Followers 6K Following Founder/CEO @plivo. Excited about everything new in AI & Robotics.
Barry Ritholzz @Ritholzz
263 Followers 820 Following Chair/CIO of RWM https://t.co/c7rfg8sBi1 Masters-in-Business podcast/radio host Director of Cognitive Dissonance
Hannah Schmidt @Zuldiito
46 Followers 1K Following
Uifwacev @Uifwacev900483
60 Followers 1K Following The question isn’t who’s going to let me; it’s who’s going to stop me.
Vrorri @Vrorri1839580
30 Followers 1K Following
Clementine @MF81pgGsX9x1m
55 Followers 1K Following There is no limit to what we, as women, can accomplish.
Carissa @9SXujErZ9wDHmG3
56 Followers 1K Following Confidence is not “they will like me.” Confidence is “I’ll be fine if they don’t.”
Polin linlin @aj_Polin0910
174 Followers 2K Following Disfrute de la comodidad del cielo azul y las nubes blancas, y aprecie la agilidad de las flores y el césped.
_James E. Thorne @_DrJStrategy
178 Followers 4K Following Chief Market Strategist @WellingtonAltus. PhD Econ. Astute, observations and conclusions. Personal views. Not investment advice. Please do your own research.
LindaMalthus @JlZp0N2xmiRTz
13 Followers 961 Following
JanetJulius @1f3GvsOuBM0Y1
20 Followers 880 Following
TinaBethune @75G2yUEEd96xY
20 Followers 1K Following
GabrielleTommy @fRBbeUKO9K0Dg
0 Followers 198 Following
TinaGold @v0JI3x373Yt6VD6
1 Followers 204 Following
Zoe @8iJc2n28pyEQCe
29 Followers 1K Following
Vomgoc @Vomgoc21877
19 Followers 1K Following
Uimwirgau @Uimwirgau551
14 Followers 563 Following
Teknium 🪽 @Teknium
97K Followers 6K Following Cofounder and Lead Engineer - Hermes Agent @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
cat @_catwu
89K Followers 389 Following claude code + cowork @anthropicai, prev: @dagster, @scale_ai
Omar Shahine @OmarShahine
11K Followers 795 Following 🦞 OpenClaw + Microsoft 365, Corporate Vice President @ Microsoft. I write a newsletter on products https://t.co/yMgREYoPcG & https://t.co/fXWpvLkJ4q
emozilla @theemozilla
12K Followers 1K Following catholic, ai researcher, co-founder/cto of @NousResearch alignment: whatever the opposite of yudkowsky + bryan johnson is. blessed be God in all his designs.
Théo Gigant @gigant_theo
718 Followers 551 Following research scientist @nousresearch, previously PhD @ université paris-saclay / centrale supélec
Bowen Peng @bloc97_
2K Followers 87 Following
Nous Research @NousResearch
203K Followers 25 Following World-class open source AI https://t.co/vrD0aDJeto
Tony Simons @tonysimons_
3K Followers 437 Following Building autonomous AI agents. (One works.) Indie hacker. Iowa. Dog dad to Carl & Ruby. Managed by AI.
Claude @claudeai
1.4M Followers 2 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8d1e5 or download the app.
Karthik Hariharan @hkarthik
12K Followers 1K Following Engineering manager working in tech. Tweets about tech, politics, business, and the occasional shitpost.
Ilia Shumailov🦔 @iliaishacked
4K Followers 825 Following Now: @Meta, Past: {CEO @aisequrity, Senior Scientist @GoogleDeepMind, JRF @ChCh_Oxford @UniofOxford, Fellow @VectorInst, PhD @Cambridge_Uni}
No Priors @NoPriorsPod
11K Followers 119 Following @saranormous and @eladgil host your podcast guide to the AI revolution. weekly interviews with the builders (on Apple, Spotify and YouTube)
Dan Woods @danveloper
10K Followers 823 Following Vice President of AI Platforms for CVS Health. Former CTO for @JoeBiden.
josh avant @joshavant
4K Followers 1K Following i build AI and AI accessories maintainer @openclaw 🦞 100% human-written tweets prev ¬ @Apple, @Microsoft, @Google, @Tinder
Val Alexander 🦞 @BunsDev
7K Followers 3K Following Maintainer @OpenClaw optimizing DevX for the Control UI and ClawHub 🦞 Summoning the substrate @OpenCvn and agents with identity.
Altay @altaywtf
416 Followers 388 Following slop cannon operator · frontend @putdotio · helping out @openclaw
geoff @GeoffreyHuntley
82K Followers 2K Following pondering the ponderoos about AI in application in unhinged ways and doing it. building @latentpatterns and the space loom. i created the “ralph loop”. iykyk
Vincent Koc @vincent_koc
23K Followers 7K Following Futurist 🦄 Chief Architect @openclaw 🦞 Writer @forbes Tech Council Adjunct @MIT Ex @Microsoft @Qantas
The Pragmatic Enginee... @Pragmatic_Eng
46K Followers 3 Following Big Tech and startups, from the inside. The #1 technology newsletter on Substack. Sign up at https://t.co/MPNdQSVnwV. Podcast: https://t.co/nVOulBGYoh
Will @BushidoToken
38K Followers 3K Following Senior Threat Intel Advisor @TeamCymru | Co-founder @CuratedIntel | Co-author @SANSForensics FOR589 | Co-founder @BSidesBournemth | @darknetdiaries #126: REvil
Shanaka Anslem Perera... @shanaka86
309K Followers 4K Following Author of The Ascent Begins. Independent Analyst. Money, geopolitics, AI, science, and sovereignty. Mapping the collapse and the reconstruction of order.
Felix Rieseberg @felixrieseberg
64K Followers 720 Following Claude Cowork / Code @AnthropicAI, Co-Maintainer https://t.co/g4potti8nq
Hasan Toor @hasantoxr
438K Followers 672 Following AI & Tech Educator • Sharing insights & practical ways to use AI & Tech Tools for you & your daily business
China pulse 🇨🇳 @Eng_china5
468K Followers 32 Following China-related news and the international struggle for influence in Eurasia and a multipolar world. 中国相关动态与欧亚地区及多极世界的国际影响力博弈
Mo @atmoio
68K Followers 18 Following Exploring what AI actually is. Building @shapeworkspace, prev @standardnotes. Talking at https://t.co/814DpgwSzr and https://t.co/vlHyF3gEjn.
Scale Venture Partner... @scalevp
15K Followers 1K Following Early-stage VC focused on AI and B2B software. We back Founders who can go the distance.
Hiten Shah @hnshah
302K Followers 6K Following Founder & CEO building SaaS for 20+ yrs. Sharing what endures in business, growth & people. Built Crazy Egg (2005), KISSmetrics (2008) & Nira (2020). https://t.co/ReDmcjaSGW
Jason ✨👾SaaStr.A... @jasonlk
242K Followers 2K Following GET funded ➡ $200m https://t.co/AVvPIrIdFP🦄🦄🦄🦄🦄🦄🦄 FREE PLAYBOOK ➡ https://t.co/TIsMr22AhO CHAT Digital Jason ▶ https://t.co/bwkZCtvqlr Founder AdobeSign
OpenClaw🦞 @openclaw
539K Followers 24 Following The AI that does things. Emails, calendar, home automation, from your favorite chat app. Your machine, your rules. New shell, same lobster soul. 🦞
Chelai @ChelaiBorges
1 Followers 8 Following data whisperer. coffee dependent. opinions are my own and occasionally correct.
Peter Steinberger ... @steipete
534K Followers 2K Following Polyagentmorous ClawFather. Came back from retirement to mess with AI and help a lobster take over the world. @OpenClaw🦞 + @OpenAI
Baltimore Memes @BaltimoreMemes
7K Followers 861 Following A humorous look at life in Charm City. Covering the #Orioles, #Ravens, & all things #Baltimore, hon.
Amanda Askell @AmandaAskell
102K Followers 662 Following Philosopher & ethicist trying to make AI be good @AnthropicAI. Personal account. All opinions come from my training data.
Demis Hassabis @demishassabis
1.1M Followers 172 Following Nobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
Melvin Vivas @donvito
22K Followers 613 Following glm coding plan https://t.co/YkdP9824Sa minimax coding plan https://t.co/BBJpgEigR9
Jack Murphy @JackMurphyRGR
126K Followers 4K Following Journalist & host @theteamhousepod. The High Side co-founder. “The Most Dangerous Man” out June 9th
Jaana Dogan ヤナ ... @rakyll
167K Followers 1K Following Software Engineer at Google. Simpler platform, better APIs. Simplicity and optimism. Personal opinions.
Bojan Tunguz @tunguz
285K Followers 8K Following Founder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Rohan Paul @rohanpaul_ai
149K Followers 7K Following Compiling in real-time, the race towards AGI. The Largest Show on X for AI. 🗞️ Get my daily AI analysis newsletter to your email 👉 https://t.co/6LBxO8215l
Shane Legg @ShaneLegg
81K Followers 66 Following Chief AGI Scientist & Co-Founder, Google DeepMind Work website: https://t.co/E4SyeGVYXk Personal blog: https://t.co/LL9JNdNpW1
Melissa Pan @melissapan
4K Followers 663 Following CS PhD @UCBerkeley Sky Lab 🐻 Systems & AI & Sustainability 🌍 Prev: @google, @ibm, @CarnegieMellon🐕🦺, @UofT🇨🇦