Short Explanation
Upload one RGB image and ZoeDepth predicts a metric depth map that can be used for 3D perception, obstacle reasoning, or scene geometry estimation.
ZoeDepth estimates metric depth from a single RGB image by combining relative depth priors with metric depth prediction, enabling zero-shot transfer across indoor and outdoor scenes.
Core parameters, trigger timing, and visual before/after demo references.
Upload one RGB image and ZoeDepth predicts a metric depth map that can be used for 3D perception, obstacle reasoning, or scene geometry estimation.
Prepare the scene, image, video, sensor stream, prompt, or configuration expected by the original project.
Read the produced visualization, prediction, map, trajectory, mask, grasp pose, or other documented artifact.
A quick-run style example for the documentation page.
Readable controls and the meaning of each returned artifact.
imagefileSingle RGB image to convert into a metric depth prediction.
modelselectZoeD_NCheckpoint variant. ZoeD_N is commonly used for NYU-style indoor depth, while ZoeD_NK targets mixed indoor/outdoor transfer.
pretrained_resourceselectTorch HubChooses whether the model is loaded from Torch Hub, local checkpoint, or the repository config.
outputpathWhere to save the raw depth and rendered visualization.
depth_mapPer-pixel metric depth values, typically in meters after model-specific scaling.
visualizationA colored image for inspection; color is for readability, not the raw numeric result.
model_variantThe checkpoint used, which affects indoor/outdoor generalization.
Official resources, deployment steps, academic context, citation, and source-reported benchmark numbers.
# Relative-path local entry for the ZoeDepth tool folder python tools/zoedepth/sanity.py # Torch Hub inference path: python tools/zoedepth/examples/infer_depth.py --image tools/zoedepth/examples/input.jpg --model ZoeD_N --output tools/zoedepth/runs/depth_output.png # Gradio UI path: python -m tools/zoedepth.ui.app # Evaluation examples: python tools/zoedepth/evaluate.py -m zoedepth -d nyu python tools/zoedepth/evaluate.py -m zoedepth_nk -d nyu # Suggested repository layout when adding local files: # tools/zoedepth/README.md # tools/zoedepth/sanity.py # tools/zoedepth/evaluate.py # tools/zoedepth/ui/app.py # tools/zoedepth/examples/input.jpg # tools/zoedepth/runs/depth_output.png # This page documents the path. The static page does not execute ZoeDepth.
{
"tool": "zoedepth",
"status": "ok",
"depth_map": [
{
"label": "Metric depth estimation",
"score": 0.87,
"output": "Metric depth map"
}
],
"timing": {
"runtime": "PyTorch inference depends on backbone and resolution; the paper reports ZoeDepth variants from 42M parameters (Swin2-T) to 345M parameters (BEiT-L).",
"device": "documented in source benchmark when available"
},
"artifacts": {
"visualization": "tools/zoedepth/runs/visualization.png",
"raw_predictions": "tools/zoedepth/runs/predictions.json"
}
}Paper identity and contribution summary.
@misc{zoedepth2023,
title={ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth},
author={Shariq Farooq Bhat and Reiner Birkl and Diana Wofk and Peter Wonka and Matthias Muller},
year={2023},
note={arXiv:2302.12288, 2023},
url={https://arxiv.org/abs/2302.12288}
}Only compact, source-reported numbers are shown here.
| Dataset | Metric | Value | Runtime | Source |
|---|---|---|---|---|
| NYU Depth V2 | delta1 / REL / RMSE / log10 | 0.955 / 0.075 / 0.270 / 0.032 for ZoeD-M12-N | 42M-345M parameters depending on backbone | ZoeDepth paper |
| KITTI | REL | 0.057 for universal ZoeD-M12-NK | Single-image PyTorch inference | ZoeDepth paper |
ZoeDepth paper, NYU/KITTI quantitative tables, Torch Hub entries, sanity scripts, evaluation scripts, Gradio UI, and model configs.
Visual references from the original tool. Click any image to inspect the original size.