How to generate a 3D scene with MIDI-3D from the command line? What are the detailed steps?

2025-08-28

1.5 K

Using MIDI-3D from the command line involves two key stages:

Execute the following command (as an example of a cartoon style image):

python -m scripts.grounding_sam --image assets/example_data/Cartoon-Style/04_rgb.png --labels "lamp sofa table dog" --output ./segmentation.png
Parameter Description:
– -image: Input image path
– -Labels.: Space-separated list of object names
– -output: Where the generated segmentation map is saved

Use the core reasoning script:

python -m scripts.inference_midi --rgb 00_rgb.png --seg 00_seg.png --output-dir "./output" --do-image-padding
Advanced Tips:
- add-do-image-paddingParameters optimize the quality of edge object generation
- The output directory automatically generates subfolders with timestamps to avoid file overwriting

Typical generation takes about 40-60 seconds, and it is recommended to prioritize the use of officially provided example data for process testing.

Quick query station AI tool