Real-Time Radiance Field Rendering
Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, George Drettakis
Read the Paper on arXiv3D Gaussian Splatting (Kerbl et al., SIGGRAPH 2023) represents scenes as collections of anisotropic 3D Gaussians, enabling real-time novel view synthesis at quality competitive with the best Neural Radiance Field (NeRF) methods while rendering at >100 FPS at 1080p resolution on a single RTX 3090 GPU.
The fundamental insight is replacing neural networks with an explicit, point-based scene representation. Each scene is modeled by 1 to 5 million 3D Gaussians, where every Gaussian carries learnable attributes: a position (mean) , a full 3D covariance matrix (parameterized via a rotation quaternion and a scale vector ), an opacity , and 48 spherical harmonic coefficients encoding view-dependent color (16 SH basis functions per RGB channel at degree ). Each Gaussian stores approximately 59 scalar parameters, and the total scene representation occupies 50-200 MB depending on complexity. A custom tile-based CUDA rasterizer projects and composites these Gaussians in real time without any neural network evaluation at render time.
Key advances:
3D Gaussian Splatting achieves state-of-the-art visual quality on standard benchmarks — 33.32 dB PSNR on Mip-NeRF 360 (comparable to Mip-NeRF 360's 33.09 dB), 27.41 dB on Tanks & Temples, and 29.41 dB on Deep Blending — while being ~100-1000x faster to render than NeRF methods. It bridges the gap between quality and interactivity for the first time, enabling applications in VR, gaming, digital twins, and real-time 3D content creation.
Click any topic to jump in
Anisotropic Gaussians parameterized by mean, rotation, scale, opacity, and SH — a differentiable scene representation.
Per-Gaussian SH coefficients (degree 3, 48 scalars) encode smooth view-dependent color without MLPs.
3D→2D projection with Jacobian covariance + alpha compositing gives closed-form volume rendering.
16×16 tile sort and early termination turn O(N·P) splatting into a linear GPU workload.
Gradient-triggered clone/split and opacity pruning let the primitive count self-tune to scene complexity.
L1 + D-SSIM loss with Adam and density control converges 100× faster than NeRF at matching quality.
Upgrade to PixelBank Premium to unlock this content.