Structure from Motion and SLAM

How does a robot know where it is? How can we build a 3D model from photos? This chapter tackles the fundamental problem of recovering 3D structure and camera motion from 2D images.

What is this chapter about? We learn to calibrate cameras, estimate camera poses, optimize 3D reconstructions, and build maps in real time for autonomous navigation.

Why does this matter? These techniques enable:

Augmented reality: Placing virtual objects in real scenes requires knowing the camera pose
Autonomous vehicles: Self-driving cars must estimate their own motion and the 3D geometry of the scene around them
Photogrammetry: Creating 3D models from drone or phone photos
Robot navigation: SLAM lets robots explore unknown environments

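How the topics connect: We start with camera calibration, which measures a camera's intrinsic parameters such as focal length and principal point. Pose estimation then recovers the camera's position and orientation relative to known 3D points. Bundle adjustment jointly refines the camera poses and the 3D point positions by minimizing reprojection error. SLAM performs this estimation incrementally, in real time, as the robot explores.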
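One quantity ties these four topics together: the reprojection error, the pixel distance between where a 3D point is observed and where the camera model predicts it should appear. The sketch below is a minimal illustration, assuming an ideal pinhole camera with no lens distortion. The function names (`project`, `reprojection_error`) and the numeric values are hypothetical, chosen only for this example. Calibration supplies `fx, fy, cx, cy`, pose estimation supplies `R` and `t`, and bundle adjustment would adjust poses and points together to make this error small.

```python
import math

def project(point_world, R, t, fx, fy, cx, cy):
    """Project a 3D world point to pixel coordinates (ideal pinhole model)."""
    # World frame -> camera frame: X_cam = R * X_world + t
    xc, yc, zc = (
        sum(R[i][j] * point_world[j] for j in range(3)) + t[i]
        for i in range(3)
    )
    if zc <= 0:
        raise ValueError("point is behind the camera")
    # Perspective division, then apply the intrinsic parameters
    return (fx * xc / zc + cx, fy * yc / zc + cy)

def reprojection_error(points_world, observations, R, t, fx, fy, cx, cy):
    """Root-mean-square pixel distance between predicted and observed points."""
    total = 0.0
    for X, (u_obs, v_obs) in zip(points_world, observations):
        u, v = project(X, R, t, fx, fy, cx, cy)
        total += (u - u_obs) ** 2 + (v - v_obs) ** 2
    return math.sqrt(total / len(points_world))

# Hypothetical setup: identity rotation, camera at the world origin.
R_identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
intrinsics = dict(fx=500.0, fy=500.0, cx=320.0, cy=240.0)
points = [(0.0, 0.0, 2.0), (1.0, 0.0, 4.0)]

# Synthetic observations generated with the true pose.
observations = [project(X, R_identity, (0.0, 0.0, 0.0), **intrinsics) for X in points]

# The true pose explains the observations exactly; a shifted pose does not.
err_true = reprojection_error(points, observations, R_identity, (0.0, 0.0, 0.0), **intrinsics)
err_shifted = reprojection_error(points, observations, R_identity, (0.1, 0.0, 0.0), **intrinsics)
print(err_true, err_shifted)
```

A wrong pose guess produces a larger error than the true pose, which is the signal an optimizer follows. Real systems typically add a lens distortion model and use a nonlinear least-squares solver rather than evaluating the error by hand.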

Chapter 11: Structure from Motion and SLAM

Chapter Overview

Chapter Roadmap

Camera Calibration

Pose Estimation

Bundle Adjustment

SLAM
