Short Explanation
Use ActPerMoMa when a robot should actively move to reduce uncertainty before grasping or manipulating an object.
ActPerMoMa chooses informative base and camera motions so a mobile manipulator can improve object perception before acting.
Core parameters, trigger timing, and visual before/after demo references.
Use ActPerMoMa when a robot should actively move to reduce uncertainty before grasping or manipulating an object.
Prepare the scene, image, video, sensor stream, prompt, or configuration expected by the original project.
Read the produced visualization, prediction, map, trajectory, mask, grasp pose, or other documented artifact.
Readable controls and the meaning of each returned artifact.
scene_statepathCurrent perception state or simulator observation.
target_objecttextObject or goal that drives the active perception objective.
viewpoint_actionMotion selected to improve perception of the target.
manipulation_actionAction proposed after the perception state is sufficiently informative.
Official resources, deployment steps, academic context, citation, and source-reported benchmark numbers.
# Follow the official ActPerMoMa environment setup, then run the provided policy/evaluation scripts in Isaac Sim.
{
"tool": "actpermoma",
"status": "ok",
"results": [
{
"label": "Active perception for mobile manipulation",
"score": 0.87,
"output": "Active perception or manipulation action"
}
],
"timing": {
"runtime": "The paper reports path/view statistics such as dtotal and vtotal rather than a single wall-clock latency.",
"device": "documented in source benchmark when available"
},
"artifacts": {
"visualization": "tools/actpermoma/runs/visualization.png",
"raw_predictions": "tools/actpermoma/runs/predictions.json"
}
}Paper identity and contribution summary.
@misc{actpermoma2023,
title={ActPerMoMa: Active Perception for Mobile Manipulation},
author={Author},
year={2023},
note={CoRL 2023},
url={https://arxiv.org/abs/2310.00433}
}Only compact, source-reported numbers are shown here.
| Dataset | Metric | Value | Runtime | Source |
|---|---|---|---|---|
| Simple scenes, 500 episodes | Success / abort / grasp-failure rate | 95.4% / 1.4% / 3.2% | dtotal 3.59+/-1.69 m; vtotal 12.67+/-5.39 | Official CoRL 2023 paper, Table III |
| Complex scenes, 500 episodes | Success / abort / grasp-failure rate | 92.6% / 2.2% / 5.2% | dtotal 4.57+/-2.51 m; vtotal 16.20+/-8.65 | Official CoRL 2023 paper, Table III |
| Complex scenes, hard grasps, 500 episodes | Success / abort / grasp-failure rate | 61.8% / 29.6% / 8.6% | dtotal 7.19+/-4.17 m; vtotal 24.81+/-13.24 | Official CoRL 2023 paper, Table III |
Official CoRL 2023 paper, Table III, Isaac Sim setup, and active-perception policy evaluation scripts.
Visual references from the original tool. Click any image to inspect the original size.