AI gains “values” with Anthropic’s new Constitutional AI chatbot approach

Anthropic’s Constitutional AI logo on a glowing orange background. (credit: Anthropic / Benj Edwards)

On Tuesday, AI startup Anthropic detailed the specific principles of its “Constitutional AI” training approach, which provides its Claude chatbot with explicit “values.” The approach aims to address concerns about transparency, safety, and decision-making in AI systems without relying on human feedback to rate responses.

Claude is an AI chatbot similar to OpenAI’s ChatGPT that Anthropic released in March.

“We’ve trained language models to be better at responding to adversarial questions, without becoming obtuse and saying very little,” Anthropic wrote in a tweet announcing the paper. “We do this by conditioning them with a simple set of behavioral principles via a technique called Constitutional AI.”
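In broad strokes, Constitutional AI works by having the model critique and revise its own outputs against a written list of principles, producing training data without human raters. The sketch below illustrates that critique-and-revise loop; the `generate` function is a hypothetical stand-in for a real language-model call, and the principles shown are illustrative examples, not Anthropic's actual constitution.

```python
# Minimal sketch of a Constitutional AI critique-and-revise loop.
# `generate` is a hypothetical placeholder for a language-model call;
# the principles below are illustrative, not Anthropic's actual list.

PRINCIPLES = [
    "Choose the response that is least harmful.",
    "Choose the response that is most honest and helpful.",
]

def generate(prompt: str) -> str:
    # Placeholder for a real LLM call (e.g., an API request).
    return f"[model output for: {prompt[:40]}...]"

def constitutional_revision(user_prompt: str) -> str:
    # 1. Draft an initial response.
    response = generate(user_prompt)
    # 2. For each principle, critique the response and rewrite it.
    for principle in PRINCIPLES:
        critique = generate(
            f"Critique this response against the principle "
            f"'{principle}':\n{response}"
        )
        response = generate(
            f"Rewrite the response to address this critique:\n"
            f"{critique}\nOriginal response:\n{response}"
        )
    # The revised responses become fine-tuning data, replacing
    # human preference labels in the training pipeline.
    return response
```

In the published method, pairs of original and revised responses are then used for supervised fine-tuning and a reinforcement-learning stage driven by AI (rather than human) feedback.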
