Literature DB >> 19299858

Make3D: learning 3D scene structure from a single still image.

Ashutosh Saxena1, Min Sun, Andrew Y Ng.   

Abstract

We consider the problem of estimating detailed 3D structure from a single still image of an unstructured environment. Our goal is to create 3D models that are both quantitatively accurate as well as visually pleasing. For each small homogeneous patch in the image, we use a Markov Random Field (MRF) to infer a set of "plane parameters" that capture both the 3D location and 3D orientation of the patch. The MRF, trained via supervised learning, models both image depth cues as well as the relationships between different parts of the image. Other than assuming that the environment is made up of a number of small planes, our model makes no explicit assumptions about the structure of the scene; this enables the algorithm to capture much more detailed 3D structure than does prior art and also give a much richer experience in the 3D flythroughs created using image-based rendering, even for scenes with significant nonvertical structure. Using this approach, we have created qualitatively correct 3D models for 64.9 percent of 588 images downloaded from the Internet. We have also extended our model to produce large-scale 3D models from a few images.

Mesh:

Year:  2009        PMID: 19299858     DOI: 10.1109/TPAMI.2008.132

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  17 in total

1.  An Intelligent Body Posture Analysis Model Using Multi-Sensors for Long-Term Physical Rehabilitation.

Authors:  Chin-Feng Lai; Ren-Hung Hwang; Ying-Hsun Lai
Journal:  J Med Syst       Date:  2017-03-14       Impact factor: 4.460

2.  Differential 3D Facial Recognition: Adding 3D to Your State-of-the-Art 2D Method.

Authors:  J Matias Di Martino; Fernando Suzacq; Mauricio Delbracio; Qiang Qiu; Guillermo Sapiro
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2020-04-13       Impact factor: 6.226

3.  A proto-object based saliency model in three-dimensional space.

Authors:  Brian Hu; Ralinkae Kane-Jackson; Ernst Niebur
Journal:  Vision Res       Date:  2016-01-19       Impact factor: 1.886

4.  Local spectral anisotropy is a valid cue for figure-ground organization in natural scenes.

Authors:  Sudarshan Ramenahalli; Stefan Mihalas; Ernst Niebur
Journal:  Vision Res       Date:  2014-08-29       Impact factor: 1.886

5.  Rethinking Shape From Shading for Spoofing Detection.

Authors:  J Matias Di Martino; Qiang Qiu; Guillermo Sapiro
Journal:  IEEE Trans Image Process       Date:  2020-12-11       Impact factor: 10.856

6.  Basic level scene understanding: categories, attributes and structures.

Authors:  Jianxiong Xiao; James Hays; Bryan C Russell; Genevieve Patterson; Krista A Ehinger; Antonio Torralba; Aude Oliva
Journal:  Front Psychol       Date:  2013-08-29

Review 7.  Deep Learning-Based Monocular Depth Estimation Methods-A State-of-the-Art Review.

Authors:  Faisal Khan; Saqib Salahuddin; Hossein Javidnia
Journal:  Sensors (Basel)       Date:  2020-04-16       Impact factor: 3.576

8.  Road Scene Simulation Based on Vehicle Sensors: An Intelligent Framework Using Random Walk Detection and Scene Stage Reconstruction.

Authors:  Yaochen Li; Zhichao Cui; Yuehu Liu; Jihua Zhu; Danchen Zhao; Jian Yuan
Journal:  Sensors (Basel)       Date:  2018-11-05       Impact factor: 3.576

9.  High Level 3D Structure Extraction from a Single Image Using a CNN-Based Approach.

Authors:  J A de Jesús Osuna-Coutiño; Jose Martinez-Carranza
Journal:  Sensors (Basel)       Date:  2019-01-29       Impact factor: 3.576

10.  UASOL, a large-scale high-resolution outdoor stereo dataset.

Authors:  Zuria Bauer; Francisco Gomez-Donoso; Edmanuel Cruz; Sergio Orts-Escolano; Miguel Cazorla
Journal:  Sci Data       Date:  2019-08-29       Impact factor: 6.444

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.