🌀 VGGT-Ω

🐙 GitHub Repository | Project Page

Upload a video or a set of images to create a 3D reconstruction of a scene or object. VGGT-Ω takes these images and generates a 3D point cloud, along with estimated camera poses.

Getting Started:

Upload Your Data: Use the "Upload Video" or "Upload Images" buttons on the left to provide your input. Videos will be automatically split into individual frames using the selected sampling rate.
Preview: Your uploaded images will appear in the gallery on the left.
Reconstruct: Click the "Reconstruct" button to run camera and depth inference and build the first GLB scene.
Visualize: The point cloud and camera poses will appear in the viewer on the right. You can rotate, pan, zoom, and download the GLB file.
Adjust Visualization (Optional): After reconstruction, adjust the visualization options and click "Update Visual" to refresh the GLB without rerunning inference.

Please note: The demo limits Max Points by default to keep the UI responsive; increase Max Points if you need a denser point cloud. Visualizing very dense point clouds may take longer due to third-party rendering, which is independent of VGGT-Ω's processing time.

Click any row to load an example.

Examples

Upload Video	Video Sampling FPS	Upload Images	Confidence Threshold (%)	Filter Black Background	Filter White Background	Show Camera	Filter Sky	Max Points (K points)