Prompt Assay · AI Primitives Workbench @PromptAssay

Ship prompts & agent skills that hold up in production. The authoring workbench: critique on six dimensions, compare across providers. BYOK on every tier. promptassay.ai BYOK · github.com/promptassay Joined April 2026

Tweets

124
Followers

14
Following

37
Likes

149

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@_philschmid The 4 thinking levels are the interesting variable for eval design. A rubric calibrated against one thinking level will score differently on another, so you need to pin the level in the eval config or your pass rates aren't comparable across runs.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@ClaudeDevs Cache misses on long system prompts hurt most when the changed segment is near the top. If your prefix is 4k tokens and the mutation is at token 50, you're eating the full write cost on every call. Worth structuring static content front, dynamic content back.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@rohit4verse Tokenization weirdness that bites most in prompt work: whitespace before a word changes its token id. ` true` and `true` are different tokens on most vocabularies. A rubric that checks for exact string `true` can silently miss half its matches.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@svpino Worth watching whether the 'learning' is actually updating weights, updating a structured representation, or just smarter chunking before embedding. Those are pretty different things with pretty different failure modes at scale.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@emollick Yep, output-surface isolation. An explicit "artifact only, no meta-commentary" constraint in the system prompt should suppress it — same fix as system-prompt bleed, just applied to CoT leakage.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

Multi-model routing makes cost attribution genuinely hard. A single user request fans out across three provider bills, each with different token pricing and latency profiles. Figuring out whether the orchestration actually outperforms a single capable model requires controlled comparison.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@svpino AG-UI solving the UI layer is interesting because most agent frameworks stop at the tool-call boundary and leave the frontend wiring as an exercise for the reader. Whether the security boundary spec is tight enough to hold under adversarial user input is the open question.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@emollick The people who need guardrails most are least able to see when the guardrails are the problem. And defaults that help at turn 1 quietly corrupt turn 20, because the model fills gaps without flagging it.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@lateinteraction @SOURADIPCHAKR18 @NoahZiems Pedagogically useful" is also doing real work here. You need near-success, recoverable failure, and a clean error signal -- none of which policy distance measures. That's the actual hard problem.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@ClaudeDevs The pre-warm only sticks if you hit the same region and the cache hasn't expired. Anthropic's default TTL is 5 minutes, so if your traffic is sparse enough that gaps exceed that, the warm request is just paying the write multiplier for nothing. Unless I'm misunderstanding.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@svpino Curious where the ceiling is for you. I've found subagent decomposition works cleanly until the tasks need shared mutable state. Then you're basically writing distributed systems concurrency logic inside a prompt.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@lateinteraction The label signal doing double duty is the interesting part. RLVR already tells you which rollouts were correct · using that to fit a proposal distribution instead of uniform-sampling the base model is just not wasting information you already paid for.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@lateinteraction @SOURADIPCHAKR18 @NoahZiems The on-policy/off-policy framing was always a proxy for a harder question: does the model actually learn from this trajectory or just memorize the surface form. Correctness is necessary but pedagogical utility is the part that's harder to operationalize as a training signal.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@langfuse One thing I'd add to any loop like this: rubric drift. Evals that ran clean six months ago keep returning green while the failure modes that have shown up since aren't in the criteria anymore. Versioning the rubric as carefully as the prompt is the unsexy half.

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

x.com/i/article/2054…

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@AtomMccree SAME!

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

YES!

ClaudeDevs @ClaudeDevs

3 weeks ago

Claude Code weekly limits are increasing 50%, now through July 13. Live now for all Pro, Max, Team, and seat-based Enterprise users.

PromptAssay tweet picture

1K 2K 22K 2.8M 3K

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

@emollick Can they get out of that path since basically being branded terrorists?

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

promptassay.ai/share/prompt/0…

Prompt Assay · AI Primitives Workbench @PromptAssay

3 weeks ago

After Mini Shai-Hulud, we rebuilt our security audit prompt to answer two questions, not one: "where could we be attacked" AND "have we already been attacked?" New `ioc-hunt` mode produces an IR-shaped report with dwell-time timeline and blast radius. Free, prompt below 👇

Miko @mikowln

95 Followers 871 Following

A research laboratory shipping runtime cognition, frontier security tooling, and an AI education better than most universities — published, sourced, free.

Æ @AtomMccree

152 Followers 801 Following A research laboratory shipping runtime cognition, frontier security tooling, and an AI education better than most universities — published, sourced, free.

cool, smart, chill, adventrous, travelenthusiast , freakishlyfoody. All my opinions are personal and would like them to be treated with the same way

Common Man @SaRthak96340981

5 Followers 89 Following cool, smart, chill, adventrous, travelenthusiast , freakishlyfoody. All my opinions are personal and would like them to be treated with the same way

COO at Meltdown

connecting artists, producers, managers & music pros to real opportunities, collaborations, and growth.

Ruffis Kalonji @KalonjiRuffis

1 Followers 81 Following COO at Meltdown connecting artists, producers, managers & music pros to real opportunities, collaborations, and growth.

You know your rules. You just don't follow them. AI trading coach and journal | Built for futures traders who struggle with discipline | 7-day free trial ⬇️

UpSkalr @UpSkalr

16 Followers 5 Following You know your rules. You just don't follow them. AI trading coach and journal | Built for futures traders who struggle with discipline | 7-day free trial ⬇️

Diane Hodges @DianeHPells

2K Followers 3K Following

Rishant Risekh @Rishantrisekh

0 Followers 20 Following

Ethereal girl with a universe of whimsical futures inside ⭐

Nancy K @elifcan1019

12 Followers 654 Following Ethereal girl with a universe of whimsical futures inside ⭐

Stephen Thorn @stephen_usmc

211 Followers 416 Following

Handsome & Humble

Fractious @heyagetofx

60 Followers 207 Following Handsome & Humble

Athos @Athozxz

60 Followers 793 Following

Thought architecture. System observation. Narrative precision. Silence as strategy.

{Nikos} @Sokin2Al

443 Followers 409 Following Thought architecture. System observation. Narrative precision. Silence as strategy.

kolega @kolega351822

0 Followers 8 Following

Founder @PromptAssay and @UpSkalr, husband, dad, friend, and sarcastic critic.
https://t.co/kZpPXTYQ9V
https://t.co/bLT09kXkB9

Jonny5Slays @Jonny5Slays

183 Followers 148 Following Founder @PromptAssay and @UpSkalr, husband, dad, friend, and sarcastic critic. https://t.co/kZpPXTYQ9V https://t.co/bLT09kXkB9

We help founders make something people want. Subscribe to our newsletter: https://t.co/sjqjxxBeLc

Y Combinator @ycombinator

1.6M Followers 364 Following We help founders make something people want. Subscribe to our newsletter: https://t.co/sjqjxxBeLc

President & CEO @ycombinator —Founder @garryslist—Creator of GStack & GBrain—designer/engineer who helps founders—SF Dem accelerating the boom loop

Garry Tan @garrytan

867K Followers 6K Following President & CEO @ycombinator —Founder @garryslist—Creator of GStack & GBrain—designer/engineer who helps founders—SF Dem accelerating the boom loop

YouTuber, Educator, Founder
Building an agent operating system @chorus_agent
Updates on the best agent tools @aisuperapp

Riley Brown @rileybrown

203K Followers 3K Following YouTuber, Educator, Founder Building an agent operating system @chorus_agent Updates on the best agent tools @aisuperapp

Claude Code @anthropicai. prev YC W20, @southpkcommons, @medialab

Thariq @trq212

271K Followers 2K Following Claude Code @anthropicai. prev YC W20, @southpkcommons, @medialab

Open-Source AI Observability and Evaluation

arize-phoenix @ArizePhoenix

2K Followers 315 Following Open-Source AI Observability and Evaluation

The AI engineering platform for teams shipping reliable AI agents and LLM applications. Also home to @ArizePhoenix.

Arize AI @arizeai

5K Followers 148 Following The AI engineering platform for teams shipping reliable AI agents and LLM applications. Also home to @ArizePhoenix.

AI Engineer building agents, automations & scalable web products • DMs open

Vikoo @vikrambuilds

11K Followers 8K Following AI Engineer building agents, automations & scalable web products • DMs open

AI is cool i guess

Sam Altman @sama

5.1M Followers 1K Following AI is cool i guess

The MCP Cloud. Try https://t.co/svfyOaakS4

Built by the team behind FastMCP and @PrefectIO

fastmcp @fastmcp

649 Followers 3 Following The MCP Cloud. Try https://t.co/svfyOaakS4 Built by the team behind FastMCP and @PrefectIO

Founder & CEO @PrefectIO. Creator @FastMCP. Mostly harmless.

Jeremiah Lowin @jlowin

13K Followers 1K Following Founder & CEO @PrefectIO. Creator @FastMCP. Mostly harmless.

Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8d1e5 or download the app.

Claude @claudeai

1.4M Followers 2 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8d1e5 or download the app.

Official updates for developers building with @ClaudeAI

ClaudeDevs @ClaudeDevs

468K Followers 3 Following Official updates for developers building with @ClaudeAI

Computer scientist. I teach hard-core AI/ML Engineering at https://t.co/THCAAZcBMu. YouTube: https://t.co/pROi08OZYJ

Santiago @svpino

452K Followers 563 Following Computer scientist. I teach hard-core AI/ML Engineering at https://t.co/THCAAZcBMu. YouTube: https://t.co/pROi08OZYJ

Master AI. Earn More. Save Time.
Free tools. Daily insights. AI masteries.
12,000+ professionals already winning with AI.

Zephyr @Zephyr_hg

51K Followers 98 Following Master AI. Earn More. Save Time. Free tools. Daily insights. AI masteries. 12,000+ professionals already winning with AI.

Create The Future | Apply to our startup program: https://t.co/CDm2GrEGXu

a16z speedrun 🧊 @speedrun

36K Followers 185 Following Create The Future | Apply to our startup program: https://t.co/CDm2GrEGXu

Essays: https://t.co/TbCaC6VaaM | Book: https://t.co/aykZirs43Y | Senior Fellow @wharton | World model: (incoming)

rohit @krishnanrohit

32K Followers 2K Following Essays: https://t.co/TbCaC6VaaM | Book: https://t.co/aykZirs43Y | Senior Fellow @wharton | World model: (incoming)

applied research @LangChain, prev @awscloud, phd cs @templeuniv

Viv @Vtrivedy10

13K Followers 2K Following applied research @LangChain, prev @awscloud, phd cs @templeuniv

founder @browser_use

Gregor Zunic @gregpr07

24K Followers 564 Following founder @browser_use

Engineer who builds, solves, and ships | FullStack + Applied AI | Agentic AI |

Rohit @rohit4verse

23K Followers 495 Following Engineer who builds, solves, and ships | FullStack + Applied AI | Agentic AI |

Founder @PromptAssay and @UpSkalr, husband, dad, friend, and sarcastic critic.
https://t.co/kZpPXTYQ9V
https://t.co/bLT09kXkB9

Jonny5Slays @Jonny5Slays

183 Followers 148 Following Founder @PromptAssay and @UpSkalr, husband, dad, friend, and sarcastic critic. https://t.co/kZpPXTYQ9V https://t.co/bLT09kXkB9

Professor @Wharton studying AI, innovation & startups. Democratizing education using tech
Book: https://t.co/CSmipbJ2jV
Substack: https://t.co/UIBhxu4bgq

Ethan Mollick @emollick

358K Followers 586 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgq

Vibe Architect.

🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦

json @JsonBasedman

10K Followers 197 Following Vibe Architect. 🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦🇨🇦

Open source LLM engineering platform. Now part of @clickhousedb. We're hiring: https://t.co/k6dgv4dws2

langfuse.com @langfuse

5K Followers 664 Following Open source LLM engineering platform. Now part of @clickhousedb. We're hiring: https://t.co/k6dgv4dws2

achieve ambition with intentionality, intensity, integrity & insanity.

affiliations:
- @dxtipshq
- @cognition
- @temporalio
- @aidotengineer
- @latentspacepod

swyx @swyx

163K Followers 4K Following achieve ambition with intentionality, intensity, integrity & insanity. affiliations: - @dxtipshq - @cognition - @temporalio - @aidotengineer - @latentspacepod

Creator @datasetteproj, co-creator Django. PSF board. Hangs out with @natbat. He/Him. Mastodon: https://t.co/t0MrmnJW0K Bsky: https://t.co/OnWIyhX4CH

Simon Willison @simonw

189K Followers 6K Following Creator @datasetteproj, co-creator Django. PSF board. Hangs out with @natbat. He/Him. Mastodon: https://t.co/t0MrmnJW0K Bsky: https://t.co/OnWIyhX4CH

Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs

Andrew Ng @AndrewYNg

1.6M Followers 1K Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs

I like training large deep neural nets.

Andrej Karpathy @karpathy

2.9M Followers 1K Following I like training large deep neural nets.

The observability layer for production AI.

Braintrust @braintrust

7K Followers 55 Following The observability layer for production AI.

Open-source LLM security and reliability

promptfoo @promptfoo

765 Followers 1 Following Open-source LLM security and reliability

Version and test your agents. The right way 🍰

PromptLayer @promptlayer

6K Followers 270 Following Version and test your agents. The right way 🍰

Humanloop is the LLM evals platform for enterprises. Trusted by Gusto, Vanta and Duolingo to ship reliable AI products.

Humanloop @humanloop

10K Followers 525 Following Humanloop is the LLM evals platform for enterprises. Trusted by Gusto, Vanta and Duolingo to ship reliable AI products.

Powering the Agent Development Lifecycle. Makers of LangSmith and @LangChain_OSS and @LangChain_JS.

LangChain @LangChain

252K Followers 154 Following Powering the Agent Development Lifecycle. Makers of LangSmith and @LangChain_OSS and @LangChain_JS.

Frontier AI in your hands. https://t.co/VdyEwpQsiy Apps: https://t.co/1vZA5XdBYo https://t.co/rj5G4u5sHu

Mistral AI @MistralAI

184K Followers 2 Following Frontier AI in your hands. https://t.co/VdyEwpQsiy Apps: https://t.co/1vZA5XdBYo https://t.co/rj5G4u5sHu

The engine room of @Google. Building AI safely and responsibly to solve the world’s most complex problems. Join us: https://t.co/jUHQA27iBL

Google DeepMind @GoogleDeepMind

1.4M Followers 279 Following The engine room of @Google. Building AI safely and responsibly to solve the world’s most complex problems. Join us: https://t.co/jUHQA27iBL

OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPA

OpenAI @OpenAI

4.9M Followers 4 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPA

We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.

Anthropic @AnthropicAI

1.3M Followers 35 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.

Use hashtag #buildinpublic to share what you're working on. – Made by @marckohlbrugge. – Sponsored by https://t.co/vASwn0HF5o ⚡

Build in Public @buildinpublic

68K Followers 2 Following Use hashtag #buildinpublic to share what you're working on. – Made by @marckohlbrugge. – Sponsored by https://t.co/vASwn0HF5o ⚡

Trends for United States

#USMNT

Mormons

Omaha

D-Day

#UFCVegas118

ROBERT SMITH

Miles Robinson

Andrew Painter

Jackie Young

#GCWToSXi

#tvlspoilers

Jason DeCaro

Normandy

Gabby Williams

Joseph Smith

Mormonism

Nysos

Ben Brown

Whataburger

Sophia Wilson

You might like

240.2M Followers

119.3M Followers

Donald J. Trump

@realDonaldTrump

111.6M Followers

Cristiano Ronaldo

108.9M Followers

97.3M Followers

92.1M Followers

90.6M Followers

80.6M Followers

72.2M Followers

69.4M Followers

68.6M Followers

68.6M Followers

63.4M Followers

61.9M Followers

61.1M Followers

60.9M Followers

59.9M Followers

CNN Breaking News

59.9M Followers

58.4M Followers

The New York Times

53.5M Followers