Neural 3D Reconstruction and Rendering

Rui Li

Last updated on Nov 23, 2021

Novel view rendering at a high resolution of 4032$\times$3024. Training takes 8 hours on 4 NVIDIA V100 GPUs with an 80-image dataset at full resolution.

Neural rendering and differentiable rendering have become a popular and powerful toolkit for high-quality photorealistic 3D reconstruction and novel view synthesis. The main pain points of neural rendering projects are computational efficiency and scalability to large-scale, high-resolution scenes. Our project aims to improve both, enabling efficient, large-scale photorealistic 3D reconstruction, urban reconstruction, and 3D intrinsic decomposition and editing.

Efficient High-resolution Depth Rendering

Full-resolution, efficient rendering of scene depth: 4032$\times$3024 in less than 1 minute on 1 GPU.

System Design Key Points

Sampling Strategies: uniform sampling, importance sampling, random sampling, selective sampling
1. Selective sampling: based on the previous loss and the density gradient
Coordinate System: a single convention in which [X, Y, Z] is [Right, Down, Forward]. The camera origin is [0, 0, 0] in camera coordinates, so the ray direction through pixel (u, v) is [(u - uc) / focal, (v - vc) / focal, 1], where (uc, vc) is the image center (principal point).
Data Streaming: load all images and samples into CPU RAM, then iteratively transfer batches to CUDA (VRAM), so that image datasets of any scale can be handled without occupying too much VRAM. Several dataset formats are supported: COLMAP, 7-Scenes, ScanNet, and FalconViz's UAV dataset.
Ray Shuffle: image order and pixel order are shuffled randomly, which can improve 3D reconstruction performance.
Representation Methods [Fully Connected MLP, ACORN, NSVF, etc.]: MLP and ACORN are supported now. (Idea: octree -> arbitrary tree, e.g., an XGBoost boosting tree with an objective.)
Learning Rate Scheduler: multiple learning rate schedulers are supported; the ReduceLROnPlateau scheduler is used right now.
Multi-GPU training and rendering
Activation Function (relu, rrelu, leaky_relu): activation functions are used in several places in the system, e.g., for RGB, density, network output, and network layers. ReLU is replaced with RReLU for randomized negative density outputs.
Helper functions: save results for each epoch (done), save args.
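
The coordinate convention and the ray shuffle described in the key points above can be illustrated with a minimal sketch in plain Python. It is not the project's implementation: the names `ray_direction` and `shuffled_ray_batches`, the fixed seed, and the toy image size are assumptions made for illustration. The sketch enumerates every ray in memory, which only works at toy scale. A real system would presumably shuffle array indices on the CPU and move each batch to the GPU, as the data streaming point describes.

```python
import random
from typing import Iterator, List, Tuple

Ray = Tuple[int, int, int]  # (image index, pixel column u, pixel row v)


def ray_direction(u: float, v: float, uc: float, vc: float,
                  focal: float) -> Tuple[float, float, float]:
    """Camera-space ray direction for pixel (u, v).

    Follows the [Right, Down, Forward] convention: the camera sits at the
    origin, so the direction is [(u - uc) / focal, (v - vc) / focal, 1].
    """
    return ((u - uc) / focal, (v - vc) / focal, 1.0)


def shuffled_ray_batches(num_images: int, width: int, height: int,
                         batch_size: int, seed: int = 0) -> Iterator[List[Ray]]:
    """Yield batches of rays with image order and pixel order mixed together."""
    # Enumerate every (image, pixel) pair once; hypothetical toy-scale approach.
    rays = [(i, u, v)
            for i in range(num_images)
            for v in range(height)
            for u in range(width)]
    # One global shuffle randomizes both image order and pixel order.
    random.Random(seed).shuffle(rays)
    # Hand out fixed-size batches; the last batch may be smaller.
    for start in range(0, len(rays), batch_size):
        yield rays[start:start + batch_size]


# Usage: two 4x3 images, batches of 8 rays, assumed principal point (2.0, 1.5).
for batch in shuffled_ray_batches(num_images=2, width=4, height=3, batch_size=8):
    directions = [ray_direction(u, v, uc=2.0, vc=1.5, focal=4.0)
                  for _, u, v in batch]
```

Shuffling across images as well as pixels means each batch mixes rays from many viewpoints, which is one plausible reading of why the shuffle helps reconstruction.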

More results to show:

Rui Li

Research Scientist, Peng Cheng Laboratory, Shenzhen, China