Learning Convolution Operators for Visual Tracking

Learning Convolution Operators for Visual Tracking

Author: Martin Danelljan

Publisher: Linköping University Electronic Press

Published: 2018-05-03

Total Pages: 81

ISBN-13: 9176853322

DOWNLOAD EBOOK

Visual tracking is one of the fundamental problems in computer vision. Its numerous applications include robotics, autonomous driving, augmented reality and 3D reconstruction. In essence, visual tracking can be described as the problem of estimating the trajectory of a target in a sequence of images. The target can be any image region or object of interest. While humans excel at this task, requiring little effort to perform accurate and robust visual tracking, it has proven difficult to automate. It has therefore remained one of the most active research topics in computer vision. In its most general form, no prior knowledge about the object of interest or environment is given, except for the initial target location. This general form of tracking is known as generic visual tracking. The unconstrained nature of this problem makes it particularly difficult, yet applicable to a wider range of scenarios. As no prior knowledge is given, the tracker must learn an appearance model of the target on-the-fly. Cast as a machine learning problem, it imposes several major challenges which are addressed in this thesis. The main purpose of this thesis is the study and advancement of the, so called, Discriminative Correlation Filter (DCF) framework, as it has shown to be particularly suitable for the tracking application. By utilizing properties of the Fourier transform, a correlation filter is discriminatively learned by efficiently minimizing a least-squares objective. The resulting filter is then applied to a new image in order to estimate the target location. This thesis contributes to the advancement of the DCF methodology in several aspects. The main contribution regards the learning of the appearance model: First, the problem of updating the appearance model with new training samples is covered. Efficient update rules and numerical solvers are investigated for this task. Second, the periodic assumption induced by the circular convolution in DCF is countered by proposing a spatial regularization component. Third, an adaptive model of the training set is proposed to alleviate the impact of corrupted or mislabeled training samples. Fourth, a continuous-space formulation of the DCF is introduced, enabling the fusion of multiresolution features and sub-pixel accurate predictions. Finally, the problems of computational complexity and overfitting are addressed by investigating dimensionality reduction techniques. As a second contribution, different feature representations for tracking are investigated. A particular focus is put on the analysis of color features, which had been largely overlooked in prior tracking research. This thesis also studies the use of deep features in DCF-based tracking. While many vision problems have greatly benefited from the advent of deep learning, it has proven difficult to harvest the power of such representations for tracking. In this thesis it is shown that both shallow and deep layers contribute positively. Furthermore, the problem of fusing their complementary properties is investigated. The final major contribution of this thesis regards the prediction of the target scale. In many applications, it is essential to track the scale, or size, of the target since it is strongly related to the relative distance. A thorough analysis of how to integrate scale estimation into the DCF framework is performed. A one-dimensional scale filter is proposed, enabling efficient and accurate scale estimation.


Pattern Recognition and Computer Vision

Pattern Recognition and Computer Vision

Author: Zhouchen Lin

Publisher: Springer Nature

Published: 2019-10-31

Total Pages: 655

ISBN-13: 3030316548

DOWNLOAD EBOOK

The three-volume set LNCS 11857, 11858, and 11859 constitutes the refereed proceedings of the Second Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2019, held in Xi’an, China, in November 2019. The 165 revised full papers presented were carefully reviewed and selected from 412 submissions. The papers have been organized in the following topical sections: Part I: Object Detection, Tracking and Recognition, Part II: Image/Video Processing and Analysis, Part III: Data Analysis and Optimization.


Computer Vision – ECCV 2016

Computer Vision – ECCV 2016

Author: Bastian Leibe

Publisher: Springer

Published: 2016-09-15

Total Pages: 915

ISBN-13: 331946454X

DOWNLOAD EBOOK

The eight-volume set comprising LNCS volumes 9905-9912 constitutes the refereed proceedings of the 14th European Conference on Computer Vision, ECCV 2016, held in Amsterdam, The Netherlands, in October 2016. The 415 revised papers presented were carefully reviewed and selected from 1480 submissions. The papers cover all aspects of computer vision and pattern recognition such as 3D computer vision; computational photography, sensing and display; face and gesture; low-level vision and image processing; motion and tracking; optimization methods; physics-based vision, photometry and shape-from-X; recognition: detection, categorization, indexing, matching; segmentation, grouping and shape representation; statistical methods and learning; video: events, activities and surveillance; applications. They are organized in topical sections on detection, recognition and retrieval; scene understanding; optimization; image and video processing; learning; action, activity and tracking; 3D; and 9 poster sessions.


Computer Vision – ECCV 2020 Workshops

Computer Vision – ECCV 2020 Workshops

Author: Adrien Bartoli

Publisher: Springer Nature

Published: 2021-01-02

Total Pages: 777

ISBN-13: 3030668231

DOWNLOAD EBOOK

The 6-volume set, comprising the LNCS books 12535 until 12540, constitutes the refereed proceedings of 28 out of the 45 workshops held at the 16th European Conference on Computer Vision, ECCV 2020. The conference was planned to take place in Glasgow, UK, during August 23-28, 2020, but changed to a virtual format due to the COVID-19 pandemic. The 249 full papers, 18 short papers, and 21 further contributions included in the workshop proceedings were carefully reviewed and selected from a total of 467 submissions. The papers deal with diverse computer vision topics. Part IV focusses on advances in image manipulation; assistive computer vision and robotics; and computer vision for UAVs.


Human-Robot Interaction

Human-Robot Interaction

Author: Gholamreza Anbarjafari

Publisher: BoD – Books on Demand

Published: 2018-07-04

Total Pages: 186

ISBN-13: 178923316X

DOWNLOAD EBOOK

This book takes the vocal and visual modalities and human-robot interaction applications into account by considering three main aspects, namely, social and affective robotics, robot navigation, and risk event recognition. This book can be a very good starting point for the scientists who are about to start their research work in the field of human-robot interaction.


Transactions on Intelligent Welding Manufacturing

Transactions on Intelligent Welding Manufacturing

Author: Shanben Chen

Publisher: Springer Nature

Published: 2024-01-26

Total Pages: 129

ISBN-13: 981996136X

DOWNLOAD EBOOK

The primary aim of this volume is to provide researchers and engineers from both academic and industry with up-to-date coverage of new results in the field of robotic welding, intelligent systems and automation. The book is mainly based on papers selected from the 2022 International Conference on Robotic Welding, Intelligence and Automation (RWIA’2022) in Shanghai and Lanzhou, China. The articles show that the intelligentized welding manufacturing (IWM) is becoming an inevitable trend with the intelligentized robotic welding as the key technology. The volume is divided into four logical parts: Intelligent Techniques for Robotic Welding, Sensing of Arc Welding Processing, Modeling and Intelligent Control of Welding Processing, as well as Intelligent Control and its Applications in Engineering.


Visual Object Tracking from Correlation Filter to Deep Learning

Visual Object Tracking from Correlation Filter to Deep Learning

Author: Weiwei Xing

Publisher: Springer Nature

Published: 2021-11-18

Total Pages: 202

ISBN-13: 9811662428

DOWNLOAD EBOOK

The book focuses on visual object tracking systems and approaches based on correlation filter and deep learning. Both foundations and implementations have been addressed. The algorithm, system design and performance evaluation have been explored for three kinds of tracking methods including correlation filter based methods, correlation filter with deep feature based methods, and deep learning based methods. Firstly, context aware and multi-scale strategy are presented in correlation filter based trackers; then, long-short term correlation filter, context aware correlation filter and auxiliary relocation in SiamFC framework are proposed for combining correlation filter and deep learning in visual object tracking; finally, improvements in deep learning based trackers including Siamese network, GAN and reinforcement learning are designed. The goal of this book is to bring, in a timely fashion, the latest advances and developments in visual object tracking, especially correlation filter and deep learning based methods, which is particularly suited for readers who are interested in the research and technology innovation in visual object tracking and related fields.


Image and Graphics Technologies and Applications

Image and Graphics Technologies and Applications

Author: Yongtian Wang

Publisher: Springer

Published: 2018-08-11

Total Pages: 674

ISBN-13: 981131702X

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 13th Chinese Conference on Image and Graphics Technologies and Applications, IGTA 2018, held in Beijing, China in April, 2018. The 64 papers presented were carefully reviewed and selected from 138 submissions. They provide a forum for sharing progresses in the areas of image processing technology; image analysis and understanding; computer vision and pattern recognition; big data mining, computer graphics and VR; as well as image technology applications.


Computer Vision – ECCV 2018

Computer Vision – ECCV 2018

Author: Vittorio Ferrari

Publisher: Springer

Published: 2018-10-06

Total Pages: 860

ISBN-13: 3030012409

DOWNLOAD EBOOK

The sixteen-volume set comprising the LNCS volumes 11205-11220 constitutes the refereed proceedings of the 15th European Conference on Computer Vision, ECCV 2018, held in Munich, Germany, in September 2018.The 776 revised papers presented were carefully reviewed and selected from 2439 submissions. The papers are organized in topical sections on learning for vision; computational photography; human analysis; human sensing; stereo and reconstruction; optimization; matching and recognition; video attention; and poster sessions.


Intelligent Computing Methodologies

Intelligent Computing Methodologies

Author: De-Shuang Huang

Publisher: Springer

Published: 2017-07-20

Total Pages: 781

ISBN-13: 3319633155

DOWNLOAD EBOOK

This three-volume set LNCS 10361, LNCS 10362, and LNAI 10363 constitutes the refereed proceedings of the 13th International Conference on Intelligent Computing, ICIC 2017, held in Liverpool, UK, in August 2017. The 212 full papers and 20 short papers of the three proceedings volumes were carefully reviewed and selected from 612 submissions. This third volume of the set comprises 67 papers. The papers are organized in topical sections such as Intelligent Computing in Robotics; Intelligent Computing in Computer Vision; Intelligent Control and Automation; Intelligent Agent and Web Applications; Fuzzy Theory and Algorithms; Supervised Learning; Unsupervised Learning; Kernel Methods and Supporting Vector Machines; Knowledge Discovery and Data Mining; Natural Language Processing and Computational Linguistics; Advances of Soft Computing: Algorithms and Its Applications - Rozaida Ghazali; Advances in Swarm Intelligence Algorithm; Computational Intelligence and Security for Image Applications in SocialNetwork; Biomedical Image Analysis; Information Security; Machine Learning; Intelligent Data Analysis and Prediction.