Ben Edelman @EdelmanBen

Agent Security lead @ U.S. Center for AI Standards and Innovation. Prev: science of deep learning PhD @ Harvard
benjaminedelman.com
Joined March 2014

Tweets 112 · Followers 638 · Following 53 · Likes 146

Ben Edelman @EdelmanBen

a month ago

This paper does a fantastic job conveying that: (1) Deep learning abounds in miraculous empirical regularities (2) A beautiful scientific theory has emerged over the past decade to explain the miracles (3) Yet most fundamental questions remain mysteries. The best is yet to come.

Jamie Simon @learning_mech

a month ago

1/ Deep learning is going to have a scientific theory. We can see the pieces starting to come together, and it's looking a lot like physics! We're releasing a paper pulling together these emerging threads and giving them a name: learning mechanics. 🔨 arxiv.org/pdf/2604.21691 🔧

Ben Edelman @EdelmanBen

2 months ago

@zicokolter At CAISI we started using the phrase "agent hijacking" for prompt injections of agents because it avoids the inevitable confusion about the prompt injection vs jailbreak distinction (not to even mention direct vs indirect), and conveys impact more directly for a lay audience.

Ben Edelman @EdelmanBen

2 months ago

@zicokolter Yep agreed it's all the same underlying vulnerability; instruction hierarchy-style distinctions (app developer / user / external content) are "just" an abstraction. (I was also involved with the new paper, btw)

Ben Edelman @EdelmanBen

2 months ago

@zicokolter Post where Simon Willison coined prompt injection: simonwillison.net/2022/Sep/12/pr…. Paper where Greshake et al. coined indirect prompt injection: arxiv.org/abs/2302.12173

Ben Edelman @EdelmanBen

2 months ago

@zicokolter Fwiw, my understanding is that the original coinage of prompt injection was focused on contexts where the untrusted data comes from an untrusted user. Then Greshake et al. coined IPI to highlight the case where the attacker leverages data likely to be retrieved at inference time.

Ben Edelman @EdelmanBen

4 months ago

Excited to be part of this initiative. Join our team to advance the frontier of agent security research and standards! usajobs.gov/job/856267900

Director Michael Kratsios @mkratsios47

4 months ago

The future of AI is agentic, and America is leading the way to make it secure and interoperable. A new AI Agent Standards Initiative is launching this week @NIST to drive industry-led standards and open protocols that build trust and advance innovation. nist.gov/news-events/ne…

Tony Wang @TonyWangIV

4 months ago

Excited to share @NIST+CAISI’s initial public draft on how to run and report results of automated evals. If you have opinions on evals, we’d love your feedback — help us improve the AI evals ecosystem! Public comments accepted through March 31st via ai800-2@NIST.gov. more in🧵

Boaz Barak @boazbaraktcs

4 months ago

One of the best places if you have technical background and care about AI going well!

Ben Edelman @EdelmanBen

4 months ago

People sometimes ask me how to leverage a technical background to jump into U.S. AI policy. As of this week my answer is straightforward: apply to join us at CAISI! We're a startup within government, and we're doing a hiring surge.

Dwarkesh Patel @dwarkesh_sp

4 months ago

Seems like a great opportunity for technical talent to come into government and help the USG make sound, technically informed decisions on AI

Samuel Hammond 🦉 @hamandcheese

4 months ago

CAISI is hiring for a bunch of exciting new roles, from partnerships to technical experts in AI x bio / chem and more. They're serious about bringing in strong researchers & engineers and letting them do good work. Based in DC or SF: nist.gov/caisi/careers-…

Ben Edelman @EdelmanBen

4 months ago

My Agent Security team is hiring Research Engineers & Scientists. Other teams are hiring people with strong technical backgrounds too: Frontier Assessment, Cyber, Chem/Bio, Applied Systems, and Partnerships. Job postings are listed here: nist.gov/caisi/careers-…

Ben Edelman @EdelmanBen

4 months ago

The United States is the center of the AI revolution. We need dedicated public servants to ensure our government is smart on AI issues.

Ben Edelman @EdelmanBen

4 months ago

At CAISI, we're the U.S. government's leading experts on agent security. We published this RFI so deployers, developers, and experts can provide insights that inform our research and NIST guidelines development. Responses due March 9th!

Peter Cihon @pcihon

4 months ago

CAISI has published an RFI about securing AI agents. It seeks insights from AI agent deployers, developers, and computer security researchers. Questions address the current threat landscape, mitigations, measurements, and other security considerations unique to AI agents.

Peter Cihon @pcihon

5 months ago

CAISI is recruiting an intern to support an agent security standards project. Position closes Jan. 15 for a February start. Please help spread the word. Details in thread:

Ben Edelman @EdelmanBen

5 months ago

@boazbaraktcs Since I organized this by model family branding (GPT) rather than developer (OpenAI), I think the move would be to add a separate o-series line. And don't get me started about Sonnet vs Opus