Google’s PaLM-E is a generalist robot brain that takes commands

A robotic arm controlled by PaLM-E reaches for a bag of chips in a demonstration video. (credit: Google Research)

On Monday, a group of AI researchers from Google and the Technical University of Berlin unveiled PaLM-E, a multimodal embodied visual-language model (VLM) with 562 billion parameters that integrates vision and language for robotic control. They claim it is the largest VLM ever developed and that it can perform a variety of tasks without the need for retraining.

According to Google, when given a high-level command, such as “bring me the rice chips from the drawer,” PaLM-E can generate a plan of action for a mobile robot platform with an arm (developed by Google Robotics) and execute the actions by itself.

PaLM-E does this by analyzing raw data from the robot’s camera, with no pre-processed scene representation required. That removes the need for a human to pre-process or annotate the visual data first and allows for more autonomous robotic control.
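To make the pipeline concrete, here is a minimal sketch of the *interface* a model like PaLM-E presents: one multimodal call maps raw camera pixels plus a natural-language command to a sequence of low-level robot actions. The model itself is stubbed out with a canned plan for the article's example command; the `Action` type, skill names, and `plan` function are all hypothetical illustrations, not Google's actual API.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Action:
    skill: str   # hypothetical low-level skill, e.g. "navigate", "pick"
    target: str  # object or location the skill acts on

def plan(camera_rgb: List[List[List[int]]], instruction: str) -> List[Action]:
    """Stub standing in for the VLM: raw pixels + text in, action plan out.

    A real model would encode the image into the language model's
    embedding space and decode plan steps autoregressively; here we
    just return a canned plan for the article's example command.
    """
    if "rice chips" in instruction:
        return [
            Action("navigate", "drawer"),
            Action("open", "drawer"),
            Action("pick", "rice chips"),
            Action("navigate", "user"),
        ]
    return []

steps = plan(camera_rgb=[[[0, 0, 0]]],
             instruction="bring me the rice chips from the drawer")
print([s.skill for s in steps])  # → ['navigate', 'open', 'pick', 'navigate']
```

The key point the sketch captures is that no separate perception stage hands the planner a symbolic scene description: the image goes in as pixels, and the plan comes out as text-like action tokens.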