
Nvidia unveils a system for training robots

Researchers from Nvidia, Carnegie Mellon University, and the University of California, Berkeley have introduced ENPIRE, a framework that lets AI coding agents improve robot control policies on real hardware.

The system runs a closed loop: the robot performs a task, the environment automatically evaluates the result and returns to its initial state, and the AI agent analyzes errors, rewrites the code, and launches the next series of trials.

How ENPIRE works

In robotics, training on real hardware remains expensive and slow. After a failed attempt, the scene must be returned to its initial state, the result checked, the algorithm changed, and the trial run again. Part of this work usually requires engineers to step in.

ENPIRE brings to the physical world an approach that Nvidia calls AutoResearch: AI agents write code, test it, and improve it over successive iterations. Unlike in a purely digital environment, however, each experiment here involves real robots, cameras, objects, grasping errors, friction, and other physical constraints.

The framework consists of four modules:

  • Environment is responsible for automatic scene reset, result verification, logging, and safety interfaces;
  • Policy Improvement launches the improvement of the control policy;
  • Rollout evaluates the policy on one or several physical robots;
  • Evolution allows agents to analyze logs, look for ideas in the literature, change the training infrastructure, and fix code.

After the initial setup of the environment, the loop can run without constant human supervision. The agent receives data from video, trajectories, and the reward function, proposes a new hypothesis, changes the code, tests the result on the robot, and saves the changes if they improve the metric.
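The loop described above can be sketched in a few lines of Python. This is only an illustration under loud assumptions: the robot is replaced by a simulated environment, the "policy" is a single number, and the coding agent is replaced by a random perturbation. The names `Policy`, `SimulatedEnvironment`, `propose_change`, and `improvement_loop` are hypothetical and do not come from ENPIRE.

```python
import random
from dataclasses import dataclass


@dataclass
class Policy:
    gain: float  # hypothetical single parameter standing in for the policy code


class SimulatedEnvironment:
    """Stand-in for the Environment module: reset, verification, logging."""

    def __init__(self, target_gain: float, seed: int = 0):
        self.target_gain = target_gain
        self.rng = random.Random(seed)
        self.log = []

    def reset(self) -> None:
        # On real hardware this would return the scene to its initial state.
        pass

    def rollout(self, policy: Policy) -> bool:
        # Success becomes more likely as the gain approaches the hidden target.
        error = abs(policy.gain - self.target_gain)
        success = self.rng.random() < max(0.0, 1.0 - error)
        self.log.append((policy.gain, success))
        return success


def evaluate(env: SimulatedEnvironment, policy: Policy, trials: int = 20) -> float:
    """Run a series of trials and return the measured success rate."""
    successes = 0
    for _ in range(trials):
        env.reset()
        successes += env.rollout(policy)
    return successes / trials


def propose_change(policy: Policy, rng: random.Random) -> Policy:
    # Stand-in for the agent reading logs and rewriting code.
    return Policy(gain=policy.gain + rng.uniform(-0.2, 0.2))


def improvement_loop(env, policy, iterations: int = 30, seed: int = 1):
    """Keep a candidate only if it improves the measured metric."""
    rng = random.Random(seed)
    best_score = evaluate(env, policy)
    history = [best_score]
    for _ in range(iterations):
        candidate = propose_change(policy, rng)
        score = evaluate(env, candidate)
        if score > best_score:
            policy, best_score = candidate, score
        history.append(best_score)
    return policy, history


if __name__ == "__main__":
    env = SimulatedEnvironment(target_gain=1.0)
    final_policy, history = improvement_loop(env, Policy(gain=0.3))
    print(final_policy, history[0], history[-1])
```

Because a change is saved only when the metric improves, the best recorded score never decreases. With noisy physical trials, a real system would likely need more runs per candidate before trusting an improvement.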

Why automatic verification and reset are needed

A key element of ENPIRE is the automation of two operations: verifying the result and returning the scene to its initial state. The first is needed so that the system can determine on its own whether the task has been completed. For example, in the cable tie scenario, the evaluation function combined a detector, a segmentation model, and verification by two cameras. This way the agent received a success or error signal without manual labeling of each run.
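To make the idea of an automatic success signal concrete, here is a minimal sketch of a check that combines a detector score, a segmentation overlap, and agreement between two cameras. The article does not say how ENPIRE combines these signals, so the data fields, the thresholds, and the "both cameras must agree" rule are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class CameraObservation:
    detector_confidence: float  # hypothetical: detector score for the fastened tie
    mask_overlap: float         # hypothetical: overlap of the segmented tie with the target region


def camera_says_success(obs: CameraObservation,
                        min_confidence: float = 0.8,
                        min_overlap: float = 0.5) -> bool:
    """One camera reports success only if both models clear their thresholds."""
    return (obs.detector_confidence >= min_confidence
            and obs.mask_overlap >= min_overlap)


def verify_task(cam_a: CameraObservation, cam_b: CameraObservation) -> bool:
    """Assumed rule: both views must agree, so one occluded view cannot fake a success."""
    return camera_says_success(cam_a) and camera_says_success(cam_b)


if __name__ == "__main__":
    good = CameraObservation(detector_confidence=0.95, mask_overlap=0.7)
    occluded = CameraObservation(detector_confidence=0.95, mask_overlap=0.1)
    print(verify_task(good, good))      # both views pass
    print(verify_task(good, occluded))  # second view fails the overlap check
```

Requiring agreement trades some missed successes for fewer false positives, which matters when the signal drives unattended training.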

Automatic reset allows running many attempts in a row. After a failed action, the robot must return the object or scene to a state suitable for the next experiment. Without this, training on real hardware quickly runs into the need for constant human involvement.

As Decrypt noted, in the first stage a human helps the agent build reusable tools: a reset procedure and a reward function. These are then reused, and the agent takes over further improvement of the policy.

What was shown on the robots

In real-world experiments, the team tested ENPIRE on several manipulation tasks. Push-T checks whether the robot can push a T-shaped object into a target zone. Pin Insertion requires inserting pins into holes 4 mm in diameter. The team also demonstrated GPU installation and cable tie handling.

According to the Nvidia project page, in real manipulation tasks the system completed the task in 99% of cases when the agent was given up to eight attempts, each informed by previous errors. The metric reflects the system's ability to recover from failures and retry with context, not the accuracy of a single isolated attempt.
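A "success within up to k attempts" metric of this kind can be computed as follows. This is a sketch of one plausible definition, not the project's evaluation code: the function name `success_within_k` and the sample data are invented for illustration.

```python
from typing import List


def success_within_k(episodes: List[List[bool]], k: int) -> float:
    """Fraction of episodes in which any of the first k attempts succeeded.

    Each episode is the ordered list of attempt outcomes for one task instance.
    """
    if not episodes:
        raise ValueError("at least one episode is required")
    if k < 1:
        raise ValueError("k must be at least 1")
    solved = sum(any(attempts[:k]) for attempts in episodes)
    return solved / len(episodes)


if __name__ == "__main__":
    # Invented outcomes: True marks a successful attempt.
    episodes = [
        [True],
        [False, True],
        [False, False, False, True],
        [False] * 8,
    ]
    print(success_within_k(episodes, 1))
    print(success_within_k(episodes, 8))
```

On the invented data, only one of four episodes succeeds on the first attempt, while three of four succeed within eight. That gap is why such a figure should not be read as single-attempt accuracy.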

The team compared three coding agents: Codex on GPT-5.5, Claude Code on Opus 4.7, and Kimi Code on Kimi K2.6. The evaluation was run on the AutoEnvBench benchmark using the Push-T and Pin Insertion tasks.

The researchers also tested ENPIRE in RoboCasa, a simulator of household tasks such as opening cabinets and drawers and switching kitchen appliances on or off. In these scenarios ENPIRE outperformed Nvidia's GR00T and CaP-X, an agent system that uses tools but does not run a full automatic research cycle.

Eight robots accelerated training

A separate part of the work is devoted to scaling to a fleet of robots. Nvidia ran an experiment on eight robotic stations with two manipulators. Each station had its own hardware, computer, and AI coding agent.

The stations exchanged results via Git: a successful idea or code change could quickly spread between agents. This approach made it possible to reduce training time. According to Decrypt, the transition from one robot to eight reduced the time to master Push-T from about five to two hours. For Pin Insertion the time dropped from more than 90 minutes to about 40 minutes.
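The reported times translate into a speedup and a per-robot efficiency with simple arithmetic. The sketch below uses the rounded figures quoted above (5 hours to 2 hours, 90 minutes to 40 minutes). Since the source says "about" and "more than", the results are rough estimates. "Parallel efficiency" here is the standard speedup divided by the number of workers, not a metric taken from the paper.

```python
def speedup(time_single: float, time_fleet: float) -> float:
    """How many times faster the fleet reached the goal than one robot."""
    return time_single / time_fleet


def parallel_efficiency(time_single: float, time_fleet: float, robots: int) -> float:
    """Speedup per robot: 1.0 would mean perfectly linear scaling."""
    return speedup(time_single, time_fleet) / robots


if __name__ == "__main__":
    # Times in minutes, rounded as quoted in the article.
    for task, t1, t8 in [("Push-T", 300, 120), ("Pin Insertion", 90, 40)]:
        s = speedup(t1, t8)
        e = parallel_efficiency(t1, t8, robots=8)
        print(f"{task}: speedup {s:.2f}x, efficiency {e:.0%}")
```

With these inputs, eight robots give roughly a 2.5x and 2.25x speedup, or about 31% and 28% of ideal linear scaling.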

Limitations

The authors emphasized that scaling does not solve every problem. While agents read logs, write and debug code, or wait for a response from the underlying language model, the robots and computing resources sit partly idle. As the number of robots grows, GPU activity increases, but the average utilization of the robots themselves falls. Teams of agents spend more of their time summarizing results from other branches and coordinating, and less on physical runs.

Another limitation is the growth in token consumption. A larger fleet of robots brings the policy to a working state faster but requires more tokens because of reading logs, sharing ideas, and coordination between agents.

In addition, ENPIRE has so far been shown on a limited set of manipulation tasks. Its results do not mean that robots can already independently master arbitrary physical skills in an open environment without engineering preparation.

As a reminder, in June Nvidia introduced the Isaac GR00T Reference Humanoid Robot, a research reference design for developing and testing the skills of humanoid robots. The configuration included a Unitree H2 Plus body and tactile five-fingered hands by Sharpa Wave.

Earlier Unitree introduced "the world's first ready-for-mass-production" piloted robot. The android is able to move on two and four limbs.

Source: ForkLog

18-06-2026