Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Author: Derek Hoiem

Publisher: Morgan & Claypool Publishers

Published: 2011-09-09

Total Pages: 171

ISBN-13: 160845729X

DOWNLOAD EBOOK

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions


Appearance-based 3-D Object Recognition and Pose Estimation

Appearance-based 3-D Object Recognition and Pose Estimation

Author: Chiranji Lal Chowdhary

Publisher: LAP Lambert Academic Publishing

Published: 2011-01

Total Pages: 76

ISBN-13: 9783843391153

DOWNLOAD EBOOK

In Computer Vision, Image Processing, Artificial Intelligence and Neural Networks object recognition is one of the most successful applications of image or object analysis and understanding.The recognition system typically involves some sort of sensor, the use of a model database in which all the objects "models" representations are saved, and a decision-making ability.When a sensor views an object the digitized image is processed so as to represent it in the same way as the models are represented in the databases.Then a recognition algorithm tries to find the model to which the object best matches.For the view-based recognition, the representations take into account the appearance of the object. To achieve 3D Object recognition(3DOR) the pose of objects are also saved in the database.In general two 3DOR techniques. They are Geometric feature-based approach and Appearance- based approach.The geometric feature-based approach uses properties of shape of object i.e. lines, curves, and vertices for object recognition descriptions.But appearance-based 3DOR is the combined effects of objects shape, reflectance properties, pose and the illumination.


Computer Vision -- ECCV 2014

Computer Vision -- ECCV 2014

Author: David Fleet

Publisher: Springer

Published: 2014-08-14

Total Pages: 855

ISBN-13: 331910599X

DOWNLOAD EBOOK

The seven-volume set comprising LNCS volumes 8689-8695 constitutes the refereed proceedings of the 13th European Conference on Computer Vision, ECCV 2014, held in Zurich, Switzerland, in September 2014. The 363 revised papers presented were carefully reviewed and selected from 1444 submissions. The papers are organized in topical sections on tracking and activity recognition; recognition; learning and inference; structure from motion and feature matching; computational photography and low-level vision; vision; segmentation and saliency; context and 3D scenes; motion and 3D scene analysis; and poster sessions.


Toward Category-Level Object Recognition

Toward Category-Level Object Recognition

Author: Jean Ponce

Publisher: Springer

Published: 2007-01-25

Total Pages: 622

ISBN-13: 3540687955

DOWNLOAD EBOOK

This volume is a post-event proceedings volume and contains selected papers based on presentations given, and vivid discussions held, during two workshops held in Taormina in 2003 and 2004. The 30 thoroughly revised papers presented are organized in the following topical sections: recognition of specific objects, recognition of object categories, recognition of object categories with geometric relations, and joint recognition and segmentation.


3D Object Recognition and Pose Estimation Using Feature Descriptor Regression in a Bayes' Framework

3D Object Recognition and Pose Estimation Using Feature Descriptor Regression in a Bayes' Framework

Author: Sergi Segura Morros

Publisher:

Published: 2012

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

[ANGLÈS] In this thesis, we have tried to find a suitable method to solve typical applications of pose recognition for cars, face, or facades. We have looked for efficient algorithms that allow us to solve the problem without the need of reconstructing a 3D model of the object and therefore, with a much lower computational load. The method mimics the first steps of a 3D reconstruction, where we need to take pictures of the object at different orientations, but, instead of building a computationally complex 3D model of the object, we use the information extracted in the feature descriptors of each image to estimate the feature appearance at unknown poses. We can take advantage of the fact that descriptors change their values when a change in the orientation of the object occurs, and predict the values at orientations for which the ground truth information is not available. The method is separated in two parts, the Off-line and the On-line Stage. In the Off-line Stage, we take pictures in a few known poses of the object to recognize, and we establish a track for each feature along the available images. For each feature track, we build a regression function that will estimate the value of the feature at unavailable poses. In the On-line Stage, a test image is input to the system. We extract its features and compare them with the features of the available training poses to establish correspondences. Once this matching is done, and following the principles on which SIFT features are matched, we compute the Euclidean distance between each feature in the track and the test image to find the most similar one. In order to achieve a more accurate result, we estimate the value of the feature at the poses that are not available by applying the regression function at those orientations. The pose estimation is conceived as an optimization problem as we have to minimize the error function given by the distance between the estimated descriptor and the current one. As the error function presents various local minima (the error function is not perfectly concave), we divide it into windows and then choose the global minimum among them, retrieving in this way the correct pose of the test image. The other main reason to divide the domain in sub-intervals is to maximize the number of tracks used. By embedding the minimization inside a Bayesian framework, we can estimate the probability of the actual pose given the feature descriptors of the test image.


3-D Object Recognition: Representation and Matching

3-D Object Recognition: Representation and Matching

Author: International Business Machines Corporation. Research Division

Publisher:

Published: 1998

Total Pages: 35

ISBN-13:

DOWNLOAD EBOOK

Abstract: "Three-dimensional object recognition problem addresses a number of significant research issues in computer vision: representation of a 3D object, identification of the object, robust estimation of its pose, and registration of multiple views of the object for automatic model construction. This paper surveys the previous work in three important topics of computer vision: representation, matching, and pose estimation of a 3D object. It also presents an overview of the free-form surface matching problem, and describes Cosmos, our framework for representing and recognizing free-form objects. This vision system recognizes arbitrarily curved 3D rigid objects from a single view using dense surface data. We present both the theoretical aspects of this work and the experimental results of a prototype recognition system based on Cosmos."