Short Explanation
Given one RGB image, Depth Anything predicts a dense relative depth map.
Foundation depth model for robust relative depth prediction from a single RGB image.
Core parameters, trigger timing, and visual before/after demo references.
Given one RGB image, Depth Anything predicts a dense relative depth map.
Prepare the scene, image, video, sensor stream, prompt, or configuration expected by the original project.
Read the produced visualization, prediction, map, trajectory, mask, grasp pose, or other documented artifact.
A quick-run style example for the documentation page.
Readable controls and the meaning of each returned artifact.
img_pathfileInput RGB image file.
encoderselectvitlBackbone variant (vits, vitb, vitl).
outdirpathDirectory for exported depth maps.
depth_mapPredicted per-pixel relative depth.
vis_depthColorized depth map for visualization.
Official resources, deployment steps, academic context, citation, and source-reported benchmark numbers.
python run.py --img-path tools/depth-anything/examples/input.jpg --encoder vitl --outdir tools/depth-anything/runs
{
"tool": "depth-anything",
"status": "ok",
"depth_map": [
{
"label": "Monocular depth estimation",
"score": 0.87,
"output": "Depth map"
}
],
"timing": {
"runtime": "The official README reports inference time on V100 / A100 / RTX4090 TensorRT as 12 / 8 / 3 ms for Small, 13 / 9 / 6 ms for Base, and 20 / 13 / 12 ms for Large.",
"device": "documented in source benchmark when available"
},
"artifacts": {
"visualization": "tools/depth-anything/runs/visualization.png",
"raw_predictions": "tools/depth-anything/runs/predictions.json"
}
}Paper identity and contribution summary.
@misc{depthanything2024,
title={Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data},
author={Lihe Yang and Bingyi Kang and Zilong Huang and et al.},
year={2024},
note={CVPR 2024 / arXiv:2401.10891},
url={https://arxiv.org/abs/2401.10891}
}Only compact, source-reported numbers are shown here.
| Dataset | Metric | Value | Runtime | Source |
|---|---|---|---|---|
| KITTI zero-shot benchmark | AbsRel / delta1 | 0.076 / 0.947 for Depth Anything-L; 0.080 / 0.939 for Depth Anything-B | Large and Base encoders | Official README |
| NYUv2 zero-shot benchmark | AbsRel / delta1 | 0.043 / 0.981 for Depth Anything-L; 0.046 / 0.979 for Depth Anything-B | Large and Base encoders | Official README |
Official checkpoints, run script, and zero-shot benchmark tables from the README.
Visual references from the original tool. Click any image to inspect the original size.