Depth Estimation

Type	Output	Use cases
Relative depth	Ordering only (nearer/farther)	Visual effects, background separation
Metric (absolute) depth	Actual distance (meters)	Robotics, autonomous driving

Implementation

Depth Anything V2 (Transformers)

from transformers import pipeline
from PIL import Image

# Depth estimation pipeline
pipe = pipeline("depth-estimation", model="depth-anything/Depth-Anything-V2-Small-hf")

image = Image.open("image.jpg")
result = pipe(image)

depth_map = result["depth"]  # PIL image
depth_map.save("depth_output.png")

MiDaS (PyTorch Hub)

import torch
import cv2

# Load the MiDaS model
model = torch.hub.load("intel-isl/MiDaS", "DPT_Large")
midas_transforms = torch.hub.load("intel-isl/MiDaS", "transforms")
transform = midas_transforms.dpt_transform

model.eval()
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = model.to(device)

# Inference
image = cv2.imread("image.jpg")
image_rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
input_tensor = transform(image_rgb).to(device)

with torch.no_grad():
    prediction = model(input_tensor)
    prediction = torch.nn.functional.interpolate(
        prediction.unsqueeze(1),
        size=image.shape[:2],
        mode="bicubic",
    ).squeeze()

depth_map = prediction.cpu().numpy()
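
The `depth_map` array produced above holds unbounded floating-point values, so it usually needs rescaling before it can be saved or viewed as an image. A minimal sketch of min-max normalization to 8-bit follows. The helper name `to_uint8` and the small demo array are illustrative assumptions, not part of MiDaS.

```python
import numpy as np

def to_uint8(depth_map):
    """Min-max normalize a depth map to the 0..255 range (hypothetical helper)."""
    depth = np.asarray(depth_map, dtype=np.float32)
    d_min, d_max = float(depth.min()), float(depth.max())
    # A constant map has no usable range; return zeros instead of dividing by zero.
    if d_max - d_min < 1e-8:
        return np.zeros(depth.shape, dtype=np.uint8)
    scaled = (depth - d_min) / (d_max - d_min)
    return np.round(scaled * 255.0).astype(np.uint8)

# Stand-in for a real prediction, used only to exercise the helper.
demo = np.array([[2.0, 4.0], [6.0, 10.0]])
vis = to_uint8(demo)
```

With the MiDaS snippet, the result could then be written out with `cv2.imwrite("depth_output.png", to_uint8(depth_map))`.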

Comparison of Related Models

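Model	Approach	Speed	Accuracy	Notes
MiDaS	DPT (ViT + Dense Prediction)	Moderate	High	General-purpose relative depth
Depth Anything V2	DINOv2-based	Fast	Very high	Recent; recommended for general use
ZoeDepth	MiDaS + Metric Head	Moderate	High	Supports metric depth
UniDepth	Universal metric depth	Moderate	High	No camera intrinsics required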

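What is the difference between relative and absolute depth?

Relative depth only gives an ordering, such as "A is closer than B." Absolute (metric) depth gives actual distances, such as "3.2 m to A, 5.7 m to B." Most general-purpose models output relative depth, so when absolute depth is needed, a metric depth model must be used.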
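
To make the distinction concrete, the sketch below treats relative depth as values known only up to an unknown scale and shift, and recovers metric distances by fitting those two numbers against a few known measurements. This is a simplified illustration, not how metric depth models work internally. The function name `fit_scale_shift` and all sample values are hypothetical, and the relative scores are assumed to grow with distance. Models such as MiDaS predict inverse depth, in which case the fit would be made against 1/distance instead.

```python
def fit_scale_shift(relative, metric):
    """Least-squares fit of metric ~= scale * relative + shift (hypothetical helper)."""
    n = len(relative)
    mean_r = sum(relative) / n
    mean_m = sum(metric) / n
    var_r = sum((r - mean_r) ** 2 for r in relative)
    cov = sum((r - mean_r) * (m - mean_m) for r, m in zip(relative, metric))
    scale = cov / var_r
    shift = mean_m - scale * mean_r
    return scale, shift

# Hypothetical relative scores for three points (larger = farther, by assumption).
relative = [0.10, 0.35, 0.60]
# Hypothetical ground-truth distances in meters for the same points.
metric = [3.2, 5.7, 8.2]

# The relative values already give the correct ordering of the points...
same_order = sorted(range(3), key=lambda i: relative[i]) == sorted(range(3), key=lambda i: metric[i])

# ...but distances in meters only become available after fitting scale and shift.
scale, shift = fit_scale_shift(relative, metric)
recovered = [scale * r + shift for r in relative]
```

Without such reference measurements, the scale and shift stay unknown, which is why a relative depth map alone cannot answer "how many meters away?".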

Reference Papers

Paper	Venue/Year	Link
MiDaS: Towards Robust Monocular Depth	IEEE T-PAMI 2022	arXiv:1907.01341
Depth Anything	CVPR 2024	arXiv:2401.10891
Depth Anything V2	NeurIPS 2024	arXiv:2406.09414
