Anthropic’s Claude adds a prompt playground to quickly improve your AI apps

5:11 PM PDT • July 9, 2024

Prompt engineering became a hot job last year in the AI industry, but it seems Anthropic is now developing tools to at least partially automate it.

Anthropic released several new features on Tuesday to help developers create more useful applications with the startup’s language model, Claude, according to a company blog post. Developers can now use Claude 3.5 Sonnet to generate, test and evaluate prompts, using prompt engineering techniques to create better inputs and improve Claude’s answers for specialized tasks.

Language models are pretty forgiving when you ask them to perform some tasks, but sometimes small changes to the wording of a prompt can lead to big improvements in the results. Normally you’d have to figure out that wording yourself, or hire a prompt engineer to do it, but this new feature offers quick feedback that could make finding improvements easier.

The features are housed within Anthropic Console under a new Evaluate tab. Console is the startup’s test kitchen for developers, created to attract businesses looking to build products with Claude. One of the features, unveiled in May, is Anthropic’s built-in prompt generator; this takes a short description of a task and constructs a much longer, fleshed out prompt, utilizing Anthropic’s own prompt engineering techniques. While Anthropic’s tools may not replace prompt engineers altogether, the company said it would help new users, and save time for experienced prompt engineers.

Within Evaluate, developers can test how effective their AI application’s prompts are in a range of scenarios. Developers can upload real-world examples to a test suite or ask Claude to generate an array of AI-generated test cases. Developers can then compare how effective various prompts are side-by-side, and rate sample answers on a five-point scale.

A prompt being fed generated data to find good and bad responses.

In an example from Anthropic’s blog post, a developer identified that their application was giving answers that were too short across several test cases. The developer was able to tweak a line in their prompt to make the answers longer, and apply it simultaneously to all their test cases. That could save developers lots of time and effort, especially ones with little or no prompt engineering experience.

Anthropic CEO and co-founder Dario Amodei said prompt engineering was one of the most important things for widespread enterprise adoption of generative AI in an interview from Google Cloud Next earlier this year. “It sounds simple, but 30 minutes with a prompt engineer can often make an application work when it wasn’t before,” said Amodei.

More TechCrunch

Rediff, once a pioneer of internet services in India, sells majority stake for $3M

Manish Singh

3 hours ago

Payments infrastructure firm Infibeam Avenues has acquired a majority 54% stake in Rediff.com for up to $3 million, a dramatic twist of fate for the 28-year-old business that was the…

Rediff, once a pioneer of internet services in India, sells majority stake for $3M

Crypto

Terraform Labs co-founder and crypto fugitive Do Kwon set for extradition to South Korea

Kate Park

5 hours ago

The ruling confirmed an earlier decision in April from the High Court of Podgorica which rejected a request to extradite the crypto fugitive to the United States.

Terraform Labs co-founder and crypto fugitive Do Kwon set for extradition to South Korea

Apps

Meta’s Threads crosses 200 million active users

Ivan Mehta

5 hours ago

A day after Meta CEO Mark Zuckerberg talked about his newest social media experiment Threads reaching “almost” 200 million users on the company’s Q2 2024 earnings call, the platform has…

Meta’s Threads crosses 200 million active users

TechCrunch Disrupt 2024

Connect with Google Cloud, Aerospace, Qualcomm and more at Disrupt 2024

Cindy Zackney

11 hours ago

TechCrunch Disrupt 2024 will be in San Francisco on October 28–30, and we’re already excited! Disrupt brings innovation for every stage of your startup journey, and we could not bring you this…

Connect with Google Cloud, Aerospace, Qualcomm and more at Disrupt 2024

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

Cody Corrall

Alyssa Stringer

15 hours ago

A comprehensive list of 2024 tech layoffs

Enterprise

Intel to lay off 15,000 employees

Maxwell Zeff

15 hours ago

Intel announced it would layoff more than 15% of its staff, or 15,000 employees, in a memo to employees on Thursday. The massive headcount is part of a large plan…

AI music startup Suno claims training model on copyrighted music is ‘fair use’

Lauren Forristal

16 hours ago

Following the recent lawsuit filed by the Recording Industry Association of America (RIAA) against music generation startups Udio and Suno, Suno admitted in a court filing on Thursday that it did, in…

AI music startup Suno claims training model on copyrighted music is ‘fair use’

Hardware

iPad sales help bail out Apple amid a continued iPhone slide

Brian Heater

17 hours ago

In spite of a drop for the quarter, iPhone remained Apple’s most important category by a wide margin.

iPad sales help bail out Apple amid a continued iPhone slide

Venture

How filming a cappella concerts and dance recitals led Northzone’s newest partner Molly Alter to a career in VC

Rebecca Szkutak

19 hours ago

Molly Alter wears a lot of hats. She’s a mocumentary filmmaker working on a project about an alternate reality where charades is big business. She’s a caesar salad connoisseur and…

How filming a cappella concerts and dance recitals led Northzone’s newest partner Molly Alter to a career in VC

Microsoft now lists OpenAI as a competitor in AI and search

Maxwell Zeff

20 hours ago

Microsoft has a long and tangled history with OpenAI, having invested a reported $13 billion in the ChatGPT maker as part of a long-term partnership. As part of the deal,…

Microsoft now lists OpenAI as a competitor in AI and search

Startups

Sequoia-backed Knowde raises Series C at a valuation cut

Rebecca Szkutak

21 hours ago

The San Jose-based startup raised $60 million in a round that values it lower than the $500 million valuation it garnered in its most recent round, according to multiple sources.

Sequoia-backed Knowde raises Series C at a valuation cut

Transportation

Self-driving truck startup Aurora Innovation to sell up to $420M in shares ahead of commercial launch

Rebecca Bellan

21 hours ago

Self-driving technology company Aurora Innovation is looking to raise hundreds of millions in additional capital as it races toward a driverless commercial launch by the end of 2024. Aurora is…

Self-driving truck startup Aurora Innovation to sell up to $420M in shares ahead of commercial launch

Apps

Twitter disappears from Mac App Store

Lauren Forristal

22 hours ago

X (formerly Twitter) can no longer be accessed in the Mac App Store, suggesting that it has been officially delisted. Searches for both “Twitter” and “X” on Apple’s platform no…

Google brings Gemini-powered search history and Lens to Chrome desktop

Ivan Mehta

22 hours ago

Google Thursday said that it is introducing new Gemini-powered features for Chrome’s desktop version, including Lens for desktop, tab compare for shopping assistance, and natural language integration for search history.…

The EU’s AI Act is now in force

Natasha Lomas

1 day ago

The European Union’s risk-based regulation for applications of artificial intelligence has come into force starting from today.

Biotech & Health

Healx, an AI-enabled drug discovery platform for rare diseases, raises $47M

Paul Sawers

1 day ago

The company also said it has received regulatory clearance to start Phase 2 clinical trials for a new drug in the U.S. later this year.

Healx, an AI-enabled drug discovery platform for rare diseases, raises $47M

Enterprise

EU greenlights HPE’s $14B Juniper Networks acquisition

Paul Sawers

1 day ago

The European Commission (EC) has given the go-ahead to HPE’s planned megabucks acquisition of Juniper Networks.

EU greenlights HPE’s $14B Juniper Networks acquisition

Zuckerberg says Meta will need 10x more computing power to train Llama 4 than Llama 3

Ivan Mehta

1 day ago

Meta, which develops one of the biggest foundational open source large language models, Llama, believes it will need significantly more computing power to train models in the future. Mark Zuckerberg…

Zuckerberg says Meta will need 10x more computing power to train Llama 4 than Llama 3

Climate

Axle Energy’s sprint to decarbonize the grid lights up with $9M seed led by Accel

Natasha Lomas

1 day ago

Axle Energy is a B2B, back-end infrastructure business focused on connecting flexible assets, such as electric vehicles and home batteries, to energy markets that aren’t otherwise available for consumers to…

Axle Energy’s sprint to decarbonize the grid lights up with $9M seed led by Accel

OpenAI pledges to give U.S. AI Safety Institute early access to its next model

Kyle Wiggers

1 day ago

OpenAI CEO Sam Altman says that OpenAI is working with the U.S. AI Safety Institute, a federal government body that aims to assess and address risks in AI platforms, on…

OpenAI pledges to give U.S. AI Safety Institute early access to its next model

Anthropic’s Claude adds a prompt playground to quickly improve your AI apps

More TechCrunch

Get the industry’s biggest tech news

TechCrunch Daily News

Startups Weekly

TechCrunch Fintech

TechCrunch Mobility

Tags