🌀 VGGT-Ω

🐙 GitHub Repository | Project Page

Upload a video or a set of images to create a 3D reconstruction of a scene or object. VGGT-Ω takes these images and generates a 3D point cloud, along with estimated camera poses.

Getting Started:

  1. Upload Your Data: Use the "Upload Video" or "Upload Images" buttons on the left to provide your input. Videos will be automatically split into individual frames using the selected sampling rate.
  2. Preview: Your uploaded images will appear in the gallery on the left.
  3. Reconstruct: Click the "Reconstruct" button to run camera and depth inference and build the first GLB scene.
  4. Visualize: The point cloud and camera poses will appear in the viewer on the right. You can rotate, pan, zoom, and download the GLB file.
  5. Adjust Visualization (Optional): After reconstruction, adjust the visualization options and click "Update Visual" to refresh the GLB without rerunning inference.

Please note: The demo limits Max Points by default to keep the UI responsive; increase Max Points if you need a denser point cloud. Visualizing very dense point clouds may take longer due to third-party rendering, which is independent of VGGT-Ω's processing time.

0.5 2

Reconstruction (Point Cloud and Camera Poses)

Please upload a video or images, then click Reconstruct.

2 100
500 10000