Scale AI @scale_AI, Twitter Profile

Scale AI @scale_AI

2 years ago

Today at TransformX, we announced a huge step forward for the open source ML community: we are partnering with @StabilityAI to release the first large language model trained with human feedback. carper.ai/instruct-gpt-a… 1/4

Scale AI @scale_AI

2 years ago

Reinforcement learning from human feedback (RLHF) powers the highest-performing language models. arxiv.org/abs/2009.01325

Scale AI @scale_AI

2 years ago

Scale partnered with OpenAI on InstructGPT (openai.com/blog/instructi…), and we’re excited to make these techniques available to everyone.

Scale AI @scale_AI

2 years ago

We’ll release our first trained model with Stability AI soon. If you want to start tinkering with RLHF now, we’re also helping develop trlX (github.com/CarperAI/trlx), the open source library for reinforcement learning with transformers.

Frank Taylor @frank_sf

a year ago

@scale_AI has this been released?
