Face Detection and Adaptation

Face Detection and Adaptation

Author: Cha Zhang

Publisher: Morgan & Claypool Publishers

Published: 2010-10-10

Total Pages: 140

ISBN-13: 1608451348

DOWNLOAD EBOOK

Face detection, because of its vast array of applications, is one of the most active research areas in computer vision. In this book, we review various approaches to face detection developed in the past decade, with more emphasis on boosting-based learning algorithms. We then present a series of algorithms that are empowered by the statistical view of boosting and the concept of multiple instance learning. We start by describing a boosting learning framework that is capable to handle billions of training examples. It differs from traditional bootstrapping schemes in that no intermediate thresholds need to be set during training, yet the total number of negative examples used for feature selection remains constant and focused (on the poor performing ones). A multiple instance pruning scheme is then adopted to set the intermediate thresholds after boosting learning. This algorithm generates detectors that are both fast and accurate. We then present two multiple instance learning schemes for face detection, multiple instance learning boosting (MILBoost) and winner-take-all multiple category boosting (WTA-McBoost). MILBoost addresses the uncertainty in accurately pinpointing the location of the object being detected, while WTA-McBoost addresses the uncertainty in determining the most appropriate subcategory label for multiview object detection. Both schemes can resolve the ambiguity of the labeling process and reduce outliers during training, which leads to improved detector performances. In many applications, a detector trained with generic data sets may not perform optimally in a new environment. We propose detection adaption, which is a promising solution for this problem. We present an adaptation scheme based on the Taylor expansion of the boosting learning objective function, and we propose to store the second order statistics of the generic training data for future adaptation. We show that with a small amount of labeled data in the new environment, the detector's performance can be greatly improved. We also present two interesting applications where boosting learning was applied successfully. The first application is face verification for filtering and ranking image/video search results on celebrities. We present boosted multi-task learning (MTL), yet another boosting learning algorithm that extends MILBoost with a graphical model. Since the available number of training images for each celebrity may be limited, learning individual classifiers for each person may cause overfitting. MTL jointly learns classifiers for multiple people by sharing a few boosting classifiers in order to avoid overfitting. The second application addresses the need of speaker detection in conference rooms. The goal is to find who is speaking, given a microphone array and a panoramic video of the room. We show that by combining audio and visual features in a boosting framework, we can determine the speaker's position very accurately. Finally, we offer our thoughts on future directions for face detection. Table of Contents: A Brief Survey of the Face Detection Literature / Cascade-based Real-Time Face Detection / Multiple Instance Learning for Face Detection / Detector Adaptation / Other Applications / Conclusions and Future Work


Boosting-Based Face Detection and Adaptation

Boosting-Based Face Detection and Adaptation

Author: Matthieu Salzmann

Publisher: Springer Nature

Published: 2022-06-01

Total Pages: 132

ISBN-13: 3031018095

DOWNLOAD EBOOK

Face detection, because of its vast array of applications, is one of the most active research areas in computer vision. In this book, we review various approaches to face detection developed in the past decade, with more emphasis on boosting-based learning algorithms. We then present a series of algorithms that are empowered by the statistical view of boosting and the concept of multiple instance learning. We start by describing a boosting learning framework that is capable to handle billions of training examples. It differs from traditional bootstrapping schemes in that no intermediate thresholds need to be set during training, yet the total number of negative examples used for feature selection remains constant and focused (on the poor performing ones). A multiple instance pruning scheme is then adopted to set the intermediate thresholds after boosting learning. This algorithm generates detectors that are both fast and accurate. We then present two multiple instance learning schemes for face detection, multiple instance learning boosting (MILBoost) and winner-take-all multiple category boosting (WTA-McBoost). MILBoost addresses the uncertainty in accurately pinpointing the location of the object being detected, while WTA-McBoost addresses the uncertainty in determining the most appropriate subcategory label for multiview object detection. Both schemes can resolve the ambiguity of the labeling process and reduce outliers during training, which leads to improved detector performances. In many applications, a detector trained with generic data sets may not perform optimally in a new environment. We propose detection adaption, which is a promising solution for this problem. We present an adaptation scheme based on the Taylor expansion of the boosting learning objective function, and we propose to store the second order statistics of the generic training data for future adaptation. We show that with a small amount of labeled data in the new environment, the detector's performance can be greatly improved. We also present two interesting applications where boosting learning was applied successfully. The first application is face verification for filtering and ranking image/video search results on celebrities. We present boosted multi-task learning (MTL), yet another boosting learning algorithm that extends MILBoost with a graphical model. Since the available number of training images for each celebrity may be limited, learning individual classifiers for each person may cause overfitting. MTL jointly learns classifiers for multiple people by sharing a few boosting classifiers in order to avoid overfitting. The second application addresses the need of speaker detection in conference rooms. The goal is to find who is speaking, given a microphone array and a panoramic video of the room. We show that by combining audio and visual features in a boosting framework, we can determine the speaker's position very accurately. Finally, we offer our thoughts on future directions for face detection. Table of Contents: A Brief Survey of the Face Detection Literature / Cascade-based Real-Time Face Detection / Multiple Instance Learning for Face Detection / Detector Adaptation / Other Applications / Conclusions and Future Work


Computer Vision -- ECCV 2014

Computer Vision -- ECCV 2014

Author: David Fleet

Publisher: Springer

Published: 2014-08-14

Total Pages: 855

ISBN-13: 331910599X

DOWNLOAD EBOOK

The seven-volume set comprising LNCS volumes 8689-8695 constitutes the refereed proceedings of the 13th European Conference on Computer Vision, ECCV 2014, held in Zurich, Switzerland, in September 2014. The 363 revised papers presented were carefully reviewed and selected from 1444 submissions. The papers are organized in topical sections on tracking and activity recognition; recognition; learning and inference; structure from motion and feature matching; computational photography and low-level vision; vision; segmentation and saliency; context and 3D scenes; motion and 3D scene analysis; and poster sessions.


Handbook of Digital Face Manipulation and Detection

Handbook of Digital Face Manipulation and Detection

Author: Christian Rathgeb

Publisher: Springer Nature

Published: 2022-01-31

Total Pages: 487

ISBN-13: 3030876640

DOWNLOAD EBOOK

This open access book provides the first comprehensive collection of studies dealing with the hot topic of digital face manipulation such as DeepFakes, Face Morphing, or Reenactment. It combines the research fields of biometrics and media forensics including contributions from academia and industry. Appealing to a broad readership, introductory chapters provide a comprehensive overview of the topic, which address readers wishing to gain a brief overview of the state-of-the-art. Subsequent chapters, which delve deeper into various research challenges, are oriented towards advanced readers. Moreover, the book provides a good starting point for young researchers as well as a reference guide pointing at further literature. Hence, the primary readership is academic institutions and industry currently involved in digital face manipulation and detection. The book could easily be used as a recommended text for courses in image processing, machine learning, media forensics, biometrics, and the general security area.


The Development of Face Processing

The Development of Face Processing

Author: Gudrun Schwarzer

Publisher: Hogrefe & Huber Publishing

Published: 2003

Total Pages: 168

ISBN-13:

DOWNLOAD EBOOK

This book draws together, for the first time, the latest scientific findings from leading international researchers on how face recognition develops. It is only in recent years that methods acceptable in experimental psychology have been developed for studying this vital and unique process. While other publications have concentrated on computer modeling and of face processing and the like, this one is unique in that it looks at fundamental (and so far unanswered) questions such as: What are the roots of and reasons for our ability to recognize faces? How much of this ability is learned and how much innate? By connecting studies on face processing in infancy with those on the development of face processing, it thus bridges the gap between face processing research and visual perceptual development. Leading researchers from USA and Europe who have conducted pioneering work in these domains describe results and anticipate future inquiry, covering topics such as fundamental cognitive abilities in infancy, development of face processing from infancy to adulthood, and the effects of expertise on face recognition.


Computer Vision – ECCV 2020

Computer Vision – ECCV 2020

Author: Andrea Vedaldi

Publisher: Springer Nature

Published: 2020-11-15

Total Pages: 836

ISBN-13: 3030585557

DOWNLOAD EBOOK

The 30-volume set, comprising the LNCS books 12346 until 12375, constitutes the refereed proceedings of the 16th European Conference on Computer Vision, ECCV 2020, which was planned to be held in Glasgow, UK, during August 23-28, 2020. The conference was held virtually due to the COVID-19 pandemic. The 1360 revised papers presented in these proceedings were carefully reviewed and selected from a total of 5025 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.


Handbook of Research on Applied Intelligence for Health and Clinical Informatics

Handbook of Research on Applied Intelligence for Health and Clinical Informatics

Author: Thakare, Anuradha Dheeraj

Publisher: IGI Global

Published: 2021-10-22

Total Pages: 470

ISBN-13: 1799877108

DOWNLOAD EBOOK

Currently, informatics within the field of public health is a developing and growing industry. Clinical informatics are used in direct patient care by supplying medical practitioners with information that can be used to develop a care plan. Intelligent applications in clinical informatics facilitates with the technology-based solutions to analyze data or medical images and help clinicians to retrieve that information. Decision models aid with making complex decisions especially in uncertain situations. The Handbook of Research on Applied Intelligence for Health and Clinical Informatics is a comprehensive reference book that focuses on the study of resources and methods for the management of healthcare infrastructure and information. This book provides insights on how applied intelligence with deep learning, experiential learning, and more will impact healthcare and clinical information processing. The content explores the representation, processing, and communication of clinical information in natural and engineered systems. This book covers a range of topics including applied intelligence, medical imaging, telehealth, and decision support systems, and also looks at technologies and tools used in the detection and diagnosis of medical conditions such as cancers, diabetes, heart disease, lung disease, and prenatal syndromes. It is an essential reference source for diagnosticians, medical professionals, imaging specialists, data specialists, IT consultants, medical technologists, academicians, researchers, industrial experts, scientists, and students.


Fitting the Mind to the World

Fitting the Mind to the World

Author: Colin W. G. Clifford

Publisher: Oxford University Press

Published: 2005-05-05

Total Pages: 388

ISBN-13: 9780198529699

DOWNLOAD EBOOK

"This book brings together a collection of studies from international researchers who demonstrate the brain's remarkable capacity to adapt its representation of the visual world in response to changes in its environment."--BOOK JACKET.


The Sixth International Symposium on Neural Networks (ISNN 2009)

The Sixth International Symposium on Neural Networks (ISNN 2009)

Author: Hongwei Wang

Publisher: Springer Science & Business Media

Published: 2009-05-03

Total Pages: 904

ISBN-13: 3642012167

DOWNLOAD EBOOK

This volume of Advances in Soft Computing and Lecture Notes in Computer th Science vols. 5551, 5552 and 5553, constitute the Proceedings of the 6 Inter- tional Symposium of Neural Networks (ISNN 2009) held in Wuhan, China during May 26–29, 2009. ISNN is a prestigious annual symposium on neural networks with past events held in Dalian (2004), Chongqing (2005), Chengdu (2006), N- jing (2007) and Beijing (2008). Over the past few years, ISNN has matured into a well-established series of international conference on neural networks and their applications to other fields. Following this tradition, ISNN 2009 provided an a- demic forum for the participants to disseminate their new research findings and discuss emerging areas of research. Also, it created a stimulating environment for the participants to interact and exchange information on future research challenges and opportunities of neural networks and their applications. ISNN 2009 received 1,235 submissions from about 2,459 authors in 29 co- tries and regions (Australia, Brazil, Canada, China, Democratic People's Republic of Korea, Finland, Germany, Hong Kong, Hungary, India, Islamic Republic of Iran, Japan, Jordan, Macao, Malaysia, Mexico, Norway, Qatar, Republic of Korea, Singapore, Spain, Taiwan, Thailand, Tunisia, United Kingdom, United States, Venezuela, Vietnam, and Yemen) across six continents (Asia, Europe, North America, South America, Africa, and Oceania). Based on rigorous reviews by the Program Committee members and reviewers, 95 high-quality papers were selected to be published in this volume.