Short Explanation
Use TAPIR-style tracking during the robot approach phase to keep a target pixel locked despite motion or camera shake.
Tracks arbitrary target points through a live video stream so closed-loop control can correct motion under disturbance.
Core parameters, trigger timing, and visual before/after demo references.
Use TAPIR-style tracking during the robot approach phase to keep a target pixel locked despite motion or camera shake.
Prepare the scene, image, video, sensor stream, prompt, or configuration expected by the original project.
Read the produced visualization, prediction, map, trajectory, mask, grasp pose, or other documented artifact.
A quick-run style example for the documentation page.
Readable controls and the meaning of each returned artifact.
initial_target_pixeltext[120, 150]Initial 2D point selected on the target object.
camera_stream_framefileLive frame or buffered video sequence to track through.
inference_modeselectmock_no_jaxSelects the installed JAX/TAPIR backend or a mock tracking fallback.
current_tracked_pixelCurrent 2D target coordinate for closed-loop correction.
is_occludedWhether the point is estimated to be hidden.
confidenceTracking confidence for the current frame.
total_displacementMeasured shift from the initial point over the tracked clip.
Official resources, deployment steps, academic context, citation, and source-reported benchmark numbers.
python tools/tapir/run.py --video tools/tapir/examples/stream.mp4 --initial-point 120,150 --output tools/tapir/runs/tracks.json
{
"tool": "tapir",
"status": "ok",
"results": [
{
"label": "Point tracking for visual servoing",
"score": 0.87,
"output": "Tracked point, occlusion flag, confidence"
}
],
"timing": {
"runtime": "The official project page reports about 40 FPS when tracking 256 points on a 256x256 video in online mode; the submitted spreadsheet also describes about 20 ms and 30-60 Hz output frequency.",
"device": "documented in source benchmark when available"
},
"artifacts": {
"visualization": "tools/tapir/runs/visualization.png",
"raw_predictions": "tools/tapir/runs/predictions.json"
}
}Paper identity and contribution summary.
@misc{tapir2023,
title={TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement},
author={Author},
year={2023},
note={DeepMind 2023},
url={https://arxiv.org/abs/2306.08637}
}Only compact, source-reported numbers are shown here.
| Dataset | Metric | Value | Runtime | Source |
|---|---|---|---|---|
| TAP-Vid benchmark | Average Jaccard (AJ) | 60.2 / 62.9 / 88.3 / 73.3 on Kinetics / DAVIS / Kubric / RGB-Stacking | TAPIR | Official TAPIR project page |
| TAP-Vid baseline comparison | Average Jaccard (AJ) | 46.6 / 38.4 / 65.4 / 59.9 for TAP-Net; 35.3 / 42.0 / 59.1 / 37.3 for PIPs | Official comparison table | Official TAPIR project page |
Official TAPIR project benchmark table, mock point-tracking input, trajectory output, occlusion flag, and confidence output.
Visual references from the original tool. Click any image to inspect the original size.