BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting

Beihang University (BUAA)
Teaser
BlockGaussian reconstructs city-scale scenes from massive multi-view images and enables high-quality novel view synthesis from arbitrary viewpoints, as illustrated in the surrounding images. Compared to existing methods, our approach reduces reconstruction time from hours to minutes while achieving superior rendering quality in most scenes.

Abstract

The recent advancements in 3D Gaussian Splatting (3DGS) have demonstrated remarkable potential in novel view synthesis tasks. The divide-and-conquer paradigm has enabled large-scale scene reconstruction, but significant challenges remain in the scene partitioning, optimization, and merging processes. This paper introduces BlockGaussian, a novel framework incorporating a content-aware scene partition strategy and visibility-aware block optimization to achieve efficient and high-quality large-scale scene reconstruction. Specifically, our approach accounts for the variation in content complexity across different regions and balances the computational load during scene partitioning, enabling efficient scene reconstruction. To tackle the supervision mismatch issue during independent block optimization, we introduce auxiliary points during individual block optimization to align with the ground-truth supervision, which enhances the reconstruction quality. Furthermore, we propose a pseudo-view geometry constraint that effectively mitigates rendering degradation caused by airspace floaters during block merging. Extensive experiments on large-scale scenes demonstrate that our approach achieves state-of-the-art results in both reconstruction efficiency and rendering quality.

Challenges

Our goal is to address the existing challenges in the large-scale scene novel view synthesis task under the divide-and-conquer paradigm. a) Imbalanced reconstruction complexity across blocks: the density of content varies significantly across scene regions. Content-dense areas require finer subdivision granularity to ensure reconstruction fidelity, while content-sparse regions benefit from coarser partitioning to improve computational efficiency. b) Supervision mismatch in block-wise optimization: after scene partitioning, the content of a single training view may span multiple blocks. Due to visibility constraints, the full training image no longer provides ideal supervision when optimizing an individual block. c) Quality degradation in fusion results: floaters in the airspace are a major cause of quality degradation in the fused results. Since each block is optimized independently, these floaters fit the training views well but degrade the quality of synthesized novel views, especially in boundary regions.
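The supervision mismatch in (b) can be made concrete with a toy example: when part of a training view depicts content belonging to a neighboring block, a full-image photometric loss wrongly penalizes the current block for not reproducing that content. The NumPy sketch below is illustrative only; the visibility mask and `masked_photometric_loss` are hypothetical constructs for exposition, not the auxiliary-point mechanism the paper actually proposes.

```python
import numpy as np

def masked_photometric_loss(render, gt, visible_mask):
    """L1 loss restricted to pixels whose content lies inside the block.

    render, gt: (H, W) float images; visible_mask: (H, W) bool array,
    True where the pixel's content belongs to the current block.
    """
    diff = np.abs(render - gt)
    return diff[visible_mask].mean()

# Toy view: the right half of the image shows a neighboring block's content.
gt = np.zeros((4, 8))
gt[:, 4:] = 1.0                    # out-of-block content is bright
render = np.zeros((4, 8))          # this block correctly renders nothing there
mask = np.zeros((4, 8), dtype=bool)
mask[:, :4] = True                 # only the left half is in-block

full_loss = np.abs(render - gt).mean()                    # penalizes missing content
masked_loss = masked_photometric_loss(render, gt, mask)   # spurious penalty removed
```

Here the full-image loss is 0.5 even though the block's own content is rendered perfectly, while the masked loss is zero, which is exactly the mismatch that independent block optimization must resolve.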

Overview

Method overview
Overview of our proposed method. We first divide the entire scene and allocate viewpoints with Content-Aware Scene Partition, which jointly considers the complexity of the scene content and the distribution of computational load across blocks. We then optimize each block independently, either sequentially on a single GPU or in parallel across multiple GPUs. During block optimization, we introduce auxiliary point clouds (aux pts) to address the supervision mismatch issue, and apply the Pseudo-View Geometry Constraint to supervise airspace regions and mitigate floater artifacts. Finally, the optimized results from all blocks are merged into a comprehensive Gaussian representation of the entire scene, enabling interactive novel view synthesis.
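The intuition behind content-aware partitioning can be sketched as a density-adaptive recursive split: regions with many sparse reconstruction points (a proxy for content complexity) are subdivided further, while empty regions stay coarse, so each block carries a comparable workload. The following minimal 2D sketch is a simplification under assumed inputs; `partition`, `max_pts`, and the longest-axis split criterion are illustrative choices, not the paper's exact algorithm.

```python
import numpy as np

def partition(points, bounds, max_pts):
    """Recursively split a 2D region until each block holds <= max_pts points.

    points: (N, 2) array of sparse scene points (content-density proxy);
    bounds: (xmin, ymin, xmax, ymax), half-open in both axes.
    Dense regions receive finer blocks; sparse regions remain coarse.
    """
    xmin, ymin, xmax, ymax = bounds
    inside = points[
        (points[:, 0] >= xmin) & (points[:, 0] < xmax)
        & (points[:, 1] >= ymin) & (points[:, 1] < ymax)
    ]
    if len(inside) <= max_pts:
        return [bounds]
    # Split along the longer axis to keep blocks roughly square.
    if xmax - xmin >= ymax - ymin:
        xm = 0.5 * (xmin + xmax)
        return (partition(inside, (xmin, ymin, xm, ymax), max_pts)
                + partition(inside, (xm, ymin, xmax, ymax), max_pts))
    ym = 0.5 * (ymin + ymax)
    return (partition(inside, (xmin, ymin, xmax, ym), max_pts)
            + partition(inside, (xmin, ym, xmax, ymax), max_pts))

rng = np.random.default_rng(0)
dense = rng.uniform(0.0, 0.25, size=(400, 2))   # a content-dense corner
sparse = rng.uniform(0.25, 1.0, size=(40, 2))   # mostly empty airspace
pts = np.vstack([dense, sparse])
blocks = partition(pts, (0.0, 0.0, 1.0, 1.0), max_pts=100)
```

Running this yields many small blocks around the dense corner and a few large ones elsewhere, mirroring how variable subdivision granularity balances per-block optimization cost.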

Comparison With SOTA

Quality comparison with other methods on the Mill19 and UrbanScene3D datasets
Quantitative comparison of novel view synthesis results on Mill19 and UrbanScene3D datasets. The best, the second best, and the third best results are highlighted in red, orange and yellow.
Quantitative comparison of novel view synthesis results on Mill19 and UrbanScene3D datasets. We present the optimization time OptTime (hh:mm), the number of final points ($10^6$) and the allocated VRAM (GB) during evaluation.

Rendering Comparison

Dynamic Comparison

BibTeX

@misc{wu2025blockgaussianefficientlargescalescene,
      title={BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting}, 
      author={Yongchang Wu and Zipeng Qi and Zhenwei Shi and Zhengxia Zou},
      year={2025},
      eprint={2504.09048},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2504.09048}, 
}