Spatial computing
Comment
Stakeholder Type

Spatial computing

1.5.1

Sub-Field

Spatial computing

To convincingly merge complex virtual elements with real-world environments, XR devices will need to understand and interact with their surroundings. These capabilities will be powered by AI that can use various sensor technologies to create and use spatial content while perceiving and reasoning spatially.

Future Horizons:

×××

5-yearhorizon

3D reconstruction and geometric reasoning progresses rapidly

In the near term, low-cost headsets become commercially viable at higher scale. Then the demand for better spatial-computing capabilities from a range of AI-application areas drives rapid progress in 3D reconstruction and geometric reasoning. This brings some high-value XR use cases, such as in manufacturing and production, into practical reach, making the field increasingly commercially attractive and spurring major new investments.

10-yearhorizon

XR goes mainstream

XR achieves mainstream adoption and becomes widely used in retail, education and healthcare. Highly realistic and physically accurate “digital twins” of real-world objects and locations become widely used by urban planners, engineers and manufacturers for real-time monitoring, design and process optimisation.

25-yearhorizon

Virtual and physical reality are intermeshed

The boundaries between virtual and physical reality dissolve. Everyday objects and environments have persistent digital layers that are accessible and modifiable through XR devices, and virtual elements are permanently and seamlessly interwoven into physical reality. XR evolves from immersive simulations to a foundational platform for human experience, transforming learning, creativity, work and entertainment.

Algorithmic advances now make it possible to use inexpensive cameras and motion sensors to simultaneously track a device’s location and map its surroundings.18 Advanced computer vision can also convert 2D images captured from moving cameras into detailed 3D models of spaces and objects19,20 and use them to reconstruct photorealistic 3D scenes from any point of view,21,22 enabling the virtualisation of entire environments. AI can also layer these 3D models with semantic labels that categorise objects and features within the environment,23 and seamlessly integrate virtual elements into scenes by allowing them to collide with or be occluded by real-world objects.24 Many of these capabilities are now integrated into freely available software development kits from large technology and gaming companies.25,26

However, developing a truly immersive XR experience will require AI models that can go beyond simply mapping and reconstructing 3D environments to understanding and reasoning about them. Fortunately, solving these problems will be crucial for a wide range of AI applications, including self-driving cars, robots and drones, so considerable resources are already going into this problem. Vision-language models trained on large amounts of image and text data appear capable of some degree of spatial reasoning.27,28 But “world models”, which learn rich representations of an environment’s spatial and physical properties to make predictions of how it will evolve over time, could be even more promising.29,30

Spatial computing - Anticipation Scores

The Anticipation Potential of a research field is determined by the capacity for impactful action in the present, considering possible future transformative breakthroughs in a field over a 25-year outlook. A field with a high Anticipation Potential, therefore, combines the potential range of future transformative possibilities engendered by a research area with a wide field of opportunities for action in the present. We asked researchers in the field to anticipate:

  1. The uncertainty related to future science breakthroughs in the field
  2. The transformative effect anticipated breakthroughs may have on research and society
  3. The scope for action in the present in relation to anticipated breakthroughs.

This chart represents a summary of their responses to each of these elements, which when combined, provide the Anticipation Potential for the topic. See methodology for more information.