DEPTH ANYTHING 3... IS COMING... #508
Replies: 3 comments 3 replies
-
|
... finally! Can't wait until its implemented within IW3. Let's wait and see... . Thanks for sharing @diverswan9! Cheers... TeeJay-NLD |
Beta Was this translation helpful? Give feedback.
-
|
Amazing results! And the best part is that with Depth Anything 3 we can have an even better Video Depth Anything model! Let's remember that Depth Anything 3 had been in development for a very long time and it was too late to benefit from some of the things that have come out recently. The first thing to highlight is that Depth Anything 3 was trained on low resolution:
The second thing is that it uses an old version of DINO:
The newest DINOv3 has 3 things that I really missed in DINOv2: 1. Patch Size 16, instead of 14, see Table 2 We won't have to use resolutions divisible by 14 as with DA2 (training 518 × 518) or DA3 (training 504 × 504). 2. Stability of results even at very high resolutions. DINOv3 (ViT-H+) remains stable even at a resolution of 7168 × 4096, see Figure 17. This opens the way for training and inference at, for example, 1280×720 or 3840 × 2160 resolution (divisible by 16 but not by 14). 3. Even better depth estimation results, confirmed on the most commonly used test data, see Table 12 Now try to imagine these results from the table above combined with the DA3 results from Table 3 And that's not all... Two people have already submitted requests for Video Depth Anything based on DINOv3: I am also preparing to submit a request to the Video Depth Anything researchers in the coming days. My request will include many more solutions than DINOv3. One of the many things I want to propose are the latest training datasets I have collected in my repository: Video Depth Estimation Rankings and 2D to 3D Video Conversion Rankings. And that's just the beginning of what I'll be offering Video Depth Anything researchers. I hope that someone will support my request, which I will make in a few days, and add something more from themselves. |
Beta Was this translation helpful? Give feedback.
-
|
hope Depth Anything 3 will address the common issue of missing vertical sharp edges in depth maps. All current models often fail to capture small, thin contours, which leads to distorted curved lines when the image is converted into 3D side-by-side format. |
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
-
https://openreview.net/forum?id=yirunib8l8
pdf - https://openreview.net/pdf?id=yirunib8l8
Beta Was this translation helpful? Give feedback.
All reactions