Short Explanation
Use SMS when the robot must plan through latent physical properties such as shape, contact, or material instead of relying on purely geometric state.
Scan, Materialize, Simulate turns RGB-D observations into a reconstructed scene, infers geometry and material properties, and runs Genesis-based simulation to optimize physically grounded robot actions.
Core parameters, trigger timing, and visual before/after demo references.
Use SMS when the robot must plan through latent physical properties such as shape, contact, or material instead of relying on purely geometric state.
Prepare the scene, image, video, sensor stream, prompt, or configuration expected by the original project.
Read the produced visualization, prediction, map, trajectory, mask, grasp pose, or other documented artifact.
A quick-run style example for the documentation page.
Readable controls and the meaning of each returned artifact.
experimentselectbilliardsSelects the experiment pipeline, such as billiards or quadrotor.
presetselectliteControls simulation scale, optimization loops, and rendering fidelity.
outputpathDestination JSON path for experiment artifacts and metrics.
rgb_d_framespathOptional RGB-D observations used by the scan and reconstruction stage.
scene_representationReconstructed geometry or scene handle used for downstream simulation.
material_estimatesInferred material attributes used in the physics model.
action_parametersOptimized control or interaction parameters produced by the solver.
objective_valueSimulation-derived score used to select or compare candidate actions.
Official resources, deployment steps, academic context, citation, and source-reported benchmark numbers.
# Relative-path local entry for the SMS framework deployment cd tools/scan_materialize_simulate ./setup.sh sms_env python -m sms_framework.main --experiment billiards --preset lite --output tools/scan_materialize_simulate/outputs/billiards_result.json python -m sms_framework.main --experiment quadrotor --preset lite --scene_id 0 --start_positions 4 --output tools/scan_materialize_simulate/outputs/quadrotor_result.json
{
"tool": "scan_materialize_simulate",
"status": "ok",
"results": [
{
"label": "Physically grounded scene materialization",
"score": 0.87,
"output": "Reconstructed scene state, inferred materials, optimized action parameters, experiment JSON"
}
],
"timing": {
"runtime": "Official runtime notes include 1.1 s/iteration for billiards Nelder-Mead optimization, 8.2 s/iteration for quadrotor optimization on an RTX 4090, and about 3 minutes for 60-frame mapping.",
"device": "documented in source benchmark when available"
},
"artifacts": {
"visualization": "tools/scan_materialize_simulate/runs/visualization.png",
"raw_predictions": "tools/scan_materialize_simulate/runs/predictions.json"
}
}Paper identity and contribution summary.
@misc{scan_materialize_simulateYEAR,
title={Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning},
author={Author},
year={YEAR},
note={Add venue or arXiv identifier},
url={https://arxiv.org/abs/2505.14938}
}Only compact, source-reported numbers are shown here.
| Dataset | Metric | Value | Runtime | Source |
|---|---|---|---|---|
| Billiards, 18 scenes x 30 seeds | Task objective, SMS predicted / realized | 5.5 (12.7) cm / 20.5 (17.9) cm | Nelder-Mead averages 1.1 s/iteration, converges in 15-20 of 30 iterations | Official SMS paper, Table 1 |
| Billiards, per-scene best | SMS realized vs baseline realized | 5.5 (8.3) cm vs 28.2 (21.7) cm | PyBullet simulation timestep 0.0025 s | Official SMS paper, Table 1 |
| Quadrotor landing, 4 scenes x 10 starts | Landing success rate, SMS vs visual prompting | 100% / 80% / 90% / 90% vs 50% / 50% / 60% / 50% | Genesis optimization averages 8.2 s/iteration on RTX 4090 | Official SMS paper |
| Scene reconstruction | Scan/mapping setup | 60 RGB-D images; about 3 s/frame optimization; about 3 min total mapping | FR3 controller 400 Hz for billiards setup | Official SMS paper |
Official SMS paper, Table 1, billiards/quadrotor experiment details, wrapper source, SMS framework README, bundled Genesis assets, tests, and experiment entrypoints.
Visual references from the original tool. Click any image to inspect the original size.