Video To SimReady

Overview

The Video to SimReady workflow creates a simulation-ready 3D asset from a short video clip. By analyzing frames across the video, the system extracts multiple viewpoints of an object and uses multi-view 3D reconstruction techniques to build a complete mesh.

<aside> 🔬 ⚠️ This feature is still in development and may have limitations. Results may vary depending on video quality, lighting consistency, and object motion. For production use, Image to SimReady is recommended.

</aside>

How It Works

Frame Extraction — Key frames are automatically extracted from the video at regular intervals or at points of maximum viewpoint diversity.
Multi-View Reconstruction — The extracted frames are processed as a multi-view input set, using structure-from-motion and neural reconstruction techniques to infer 3D geometry.
Mesh and Texture Generation — The reconstructed 3D geometry is meshed, cleaned, and textured using the same pipeline as Image to SimReady.
SimReady Pipeline — The asset is then processed through the physics and collision generation stage.

Best Practices for Input Video

Keep the object centered in the frame throughout the video.
Move slowly around the object to capture all angles — avoid fast motion or abrupt cuts.
Use consistent, even lighting — avoid shadows or reflections that obscure the object surface.
Use a plain, contrasting background.
Keep the object stationary — do not rotate or move the object itself during recording.
Recommended duration: 10–30 seconds for complete 360° coverage.

Learn more in the Physics and Collision Generation documentation.