Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Author: Derek Hoiem

Publisher: Morgan & Claypool Publishers

Published: 2011

Total Pages: 172

ISBN-13: 1608457281

DOWNLOAD EBOOK

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions


Multimodal Scene Understanding

Multimodal Scene Understanding

Author: Michael Yang

Publisher: Academic Press

Published: 2019-07-17

Total Pages: 0

ISBN-13: 9780128173589

DOWNLOAD EBOOK

Multimodal Scene Understanding: Algorithms, Applications and Deep Learning presents recent advances in multi-modal computing, with a focus on computer vision and photogrammetry. It provides the latest algorithms and applications that involve combining multiple sources of information and describes the role and approaches of multi-sensory data and multi-modal deep learning. The book is ideal for researchers from the fields of computer vision, remote sensing, robotics, and photogrammetry, thus helping foster interdisciplinary interaction and collaboration between these realms. Researchers collecting and analyzing multi-sensory data collections - for example, KITTI benchmark (stereo+laser) - from different platforms, such as autonomous vehicles, surveillance cameras, UAVs, planes and satellites will find this book to be very useful.


Reconstruction and Analysis of 3D Scenes

Reconstruction and Analysis of 3D Scenes

Author: Martin Weinmann

Publisher: Springer

Published: 2016-03-17

Total Pages: 250

ISBN-13: 3319292463

DOWNLOAD EBOOK

This unique work presents a detailed review of the processing and analysis of 3D point clouds. A fully automated framework is introduced, incorporating each aspect of a typical end-to-end processing workflow, from raw 3D point cloud data to semantic objects in the scene. For each of these components, the book describes the theoretical background, and compares the performance of the proposed approaches to that of current state-of-the-art techniques. Topics and features: reviews techniques for the acquisition of 3D point cloud data and for point quality assessment; explains the fundamental concepts for extracting features from 2D imagery and 3D point cloud data; proposes an original approach to keypoint-based point cloud registration; discusses the enrichment of 3D point clouds by additional information acquired with a thermal camera, and describes a new method for thermal 3D mapping; presents a novel framework for 3D scene analysis.


Stereo Scene Flow for 3D Motion Analysis

Stereo Scene Flow for 3D Motion Analysis

Author: Andreas Wedel

Publisher: Springer Science & Business Media

Published: 2011-08-17

Total Pages: 133

ISBN-13: 0857299654

DOWNLOAD EBOOK

This book presents methods for estimating optical flow and scene flow motion with high accuracy, focusing on the practical application of these methods in camera-based driver assistance systems. Clearly and logically structured, the book builds from basic themes to more advanced concepts, culminating in the development of a novel, accurate and robust optic flow method. Features: reviews the major advances in motion estimation and motion analysis, and the latest progress of dense optical flow algorithms; investigates the use of residual images for optical flow; examines methods for deriving motion from stereo image sequences; analyses the error characteristics for motion variables, and derives scene flow metrics for movement likelihood and velocity; introduces a framework for scene flow-based moving object detection and segmentation; includes Appendices on data terms and quadratic optimization, and scene flow implementation using Euler-Lagrange equations, in addition to a helpful Glossary.