OpenAI checked to see whether GPT-4 could take over the world

An AI-generated image of the earth enveloped in an explosion.

As part of pre-release safety testing for its new GPT-4 AI model, launched Tuesday, OpenAI allowed an AI testing group to assess the potential risks of the model’s emergent capabilities—including “power-seeking behavior,” self-replication, and self-improvement.

While the testing group found that GPT-4 was “ineffective at the autonomous replication task,” the nature of the experiments raises eye-opening questions about the safety of future AI systems.

Raising alarms

“Novel capabilities often emerge in more powerful models,” writes OpenAI in a GPT-4 safety document published yesterday. “Some that are particularly concerning are the ability to create and act on long-term plans, to accrue power and resources (“power-seeking”), and to exhibit behavior that is increasingly ‘agentic.'” In this case, OpenAI clarifies that “agentic” isn’t necessarily meant to humanize the models or declare sentience but simply to denote the ability to accomplish independent goals.

Read 21 remaining paragraphs | Comments

Source

OpenAI checked to see whether GPT-4 could take over the world

Raising alarms

Leave a Reply Cancel reply

Baldur’s Gate 3’s latest patch brings more mod support and tools

LastPass users targeted in phishing attacks good enough to trick even the savvy

The Download: American’s hydrogen train experiment, and why we need boring robots

Disney Speedstorm’s Golden Pass controversy moves Gameloft to consider changes

Broadcom says “many” VMware perpetual licenses got support extensions

Baldur’s Gate 3’s latest patch brings more mod support and tools

LastPass users targeted in phishing attacks good enough to trick even the savvy

Disney Speedstorm’s Golden Pass controversy moves Gameloft to consider changes

Broadcom says “many” VMware perpetual licenses got support extensions

March 2023
M	T	W	T	F	S	S
« Feb				Apr »
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31