Physical AI Powering the New Age of Industrial Operations 2025
Page 10 of 26 · WEF_Physical_AI_Powering_the_New_Age_of_Industrial_Operations_2025.pdf
While technological advances are unlocking
previously unachievable applications, the true shift
is not merely in what is now technically feasible,
but in what is economically viable. As highlighted
in Figure 2, the future of intelligent robotics is
defined by simplified deployment and more intuitive
human–machine interaction, enabling reductions in
implementation time and greater scalability. As physical AI supports a wider variety of
operations and becomes easier to deploy –
requiring fewer specialized skills and less task-
specific customization – automation becomes viable
across a much broader range of operations. This
evolution does not just enable new use cases, it
redefines the overall economics of automation.
1.3 Limitations yet to be resolved
Even as rapid advances continue, data scarcity, 3D
spatial intelligence and dexterity remain challenges
to be solved in the coming years:
–Data scarcity: Large language models (LLMs)
thrived because of the ability to scrape large
amounts of data from the internet. These LLMs
have now read many websites and ingested
many books. Physical AI also thrives on high-
quality data, yet curated robotics datasets
remain limited and costly, because they have
to be collected in the real world. This challenge
is rapidly being overcome through advances
in synthetic data generation. Photorealistic
rendering and domain randomization help to
simulate varying lighting, textures and object
shapes in virtual environments to teach robots
how to grasp items in diverse real-world
conditions. Combined with open-source efforts
and the accelerated deployment of real-world
robotic fleets, these developments are set to
close the data gap and dramatically enhance
learning efficiency. For example, as a first step,
robotic companies such as Sanctuary AI10
started with teleoperation – having an operator
control one or more robots – while collecting
data. The goal is to use this data to train the
robots to operate autonomously at a later point.
Developers can use technologies provided
by Nvidia and other companies to produce
numerous plausible futures or variations
based on real or synthetic data, serving as
an automatic data multiplier grounded in
real physics.
–3D spatial intelligence: Data scarcity is
chief among the reasons that perceiving,
reasoning within and interacting with complex
3D environments remains a demanding task. However, progress is accelerating. Start-ups
such as World Labs and Covariant, as well as
academic leaders, are using simulation, real-
world data and multimodal AI architectures to
enable robust spatial understanding. Vision–
language–action (VLA) models are rapidly
evolving as a promising path, with foundation
models poised to unlock generalizable spatial
reasoning capabilities.
–Generalizable dexterity with high degrees
of freedom: Achieving human-level dexterity
remains a frontier challenge due to the
mechanical, sensory and computational
constraints. For example, robotic hands must
operate with high degrees of freedom (DoF) –
often more than 20 joints – which makes real-
time motion planning, force control and collision
avoidance very complex. Crucially, progress
in 3D spatial intelligence is an enabler of
dexterity: fine manipulation depends on precise
perception of object geometry, pose and
occlusions. Foundation models that integrate 3D
scene understanding with manipulation planning
will enable robots to better select stable grasps,
adapt to object variability and execute corrective
strategies when conditions change.
While advances in robotics continue and
technological breakthroughs keep pushing
boundaries, addressing emerging challenges is
crucial for sustainable adoption. Key areas requiring
attention include cybersecurity. Vulnerability to
cyberthreats increases as factories and warehouses
become increasingly software-defined and robotic
systems become more interconnected. Ensuring
robust cybersecurity measures is essential
to protect against potential disruptions and
data breaches. Vision–language–
action (VLA) models
are rapidly evolving
as a promising
path, with
foundation models
poised to unlock
generalizable
spatial reasoning
capabilities.
Physical AI: Powering the New Age of Industrial Operations
10
Ask AI what this page says about a topic: