Dictionary Learning for Scalable Sparse Image Representation

Dictionary Learning for Scalable Sparse Image Representation

Author: Bojana Begovic

Publisher:

Published: 2016

Total Pages: 276

ISBN-13:

DOWNLOAD EBOOK

Modern era of signal processing has developed many technical tools for recording and processing large and growing amount of data together with algorithms specialised for data analysis. This gives rise to new challenges in terms of data processing and modelling data representation. Fields ranging from experimental sciences, astronomy, computer vision,neuroscience mobile networks etc., are all in constant search for scalable and efficient data processing tools which would enable more effective analysis of continuous video streams containing millions of pixels. Therefore, the question of digital signal representation is still of high importance, despite the fact that it has been the topic of a significant amount of work in the past. Moreover, developing new data processing methods also affects the quality of everyday life, where devices such as CCD sensors from digital cameras or cell phones are intensively used for entertainment purposes. Specifically, one of the novel processing tools is signal sparse coding which represents signals as linear combinations of a few representational basis vectors i.e., atoms given an overcomplete dictionary. Applications that employ sparse representation are many such as denoising, compression, and regularisation in inverse problems, feature extraction, and more. In this thesis we introduce and study a particular signal representation denoted as the scalable sparse coding. It is based on a novel design for the dictionary learning algorithm, which has proven to be effective for scalable sparse representation of many modalities such as high motion video sequences, natural and solar images. The proposed algorithm is built upon the foundation of the K-SVD framework originally designed to learn non-scalable dictionaries for natural images. The scalable dictionary learning design is mainly motivated by the main perception characteristics of the Human Visual System (HVS) mechanism. Specifically, its core structure relies on the exploitation of the spatial high-frequency image components and contrast variations in order to achieve visual scene objects identification at all scalable levels. The implementation of HVS properties is carried out by introducing a semi-random Morphological Component Analysis (MCA) based initialisation of the scalable dictionary and the regularisation of its atom’s update mechanism. Subsequently, this enables scalable sparse image reconstruction. In general, dictionary learning for sparse representations leads to state-of-the-art image restoration results for several different problems in the field of image processing. Experiments in this thesis show that these are equally achievable by accommodating all dictionary elements to tailor the scalable data representation and reconstruction, hence modelling data that admit sparse representation in a novel manner. Furthermore, achieved results demonstrateand validate the practicality of the proposed scheme making it a promising candidate for many practical applications involving both time scalable display, denoising and scalable compressive sensing (CS). Performed simulations include scalable sparse recovery for representation of static and dynamic data changing over time such as video sequences and natural images. Lastly, we contribute novel approaches for scalable denoising and contrast enhancement (CE), applied on solar images corrupted with pixel-dependent Poisson and zero-mean additive white Gaussian noise. Given that solar data contain noise introduced by charge-coupled devices within the on-board acquisition system these artefacts, prior to image analysis, have to be removed. Thus, novel image denoising and contrast enhancement methods are necessary for solar preprocessing.


Dictionary Learning Algorithms and Applications

Dictionary Learning Algorithms and Applications

Author: Bogdan Dumitrescu

Publisher: Springer

Published: 2018-04-16

Total Pages: 289

ISBN-13: 3319786741

DOWNLOAD EBOOK

This book covers all the relevant dictionary learning algorithms, presenting them in full detail and showing their distinct characteristics while also revealing the similarities. It gives implementation tricks that are often ignored but that are crucial for a successful program. Besides MOD, K-SVD, and other standard algorithms, it provides the significant dictionary learning problem variations, such as regularization, incoherence enforcing, finding an economical size, or learning adapted to specific problems like classification. Several types of dictionary structures are treated, including shift invariant; orthogonal blocks or factored dictionaries; and separable dictionaries for multidimensional signals. Nonlinear extensions such as kernel dictionary learning can also be found in the book. The discussion of all these dictionary types and algorithms is enriched with a thorough numerical comparison on several classic problems, thus showing the strengths and weaknesses of each algorithm. A few selected applications, related to classification, denoising and compression, complete the view on the capabilities of the presented dictionary learning algorithms. The book is accompanied by code for all algorithms and for reproducing most tables and figures. Presents all relevant dictionary learning algorithms - for the standard problem and its main variations - in detail and ready for implementation; Covers all dictionary structures that are meaningful in applications; Examines the numerical properties of the algorithms and shows how to choose the appropriate dictionary learning algorithm.


Dictionary Learning in Visual Computing

Dictionary Learning in Visual Computing

Author: Qiang Zhang

Publisher: Springer Nature

Published: 2022-05-31

Total Pages: 133

ISBN-13: 303102253X

DOWNLOAD EBOOK

The last few years have witnessed fast development on dictionary learning approaches for a set of visual computing tasks, largely due to their utilization in developing new techniques based on sparse representation. Compared with conventional techniques employing manually defined dictionaries, such as Fourier Transform and Wavelet Transform, dictionary learning aims at obtaining a dictionary adaptively from the data so as to support optimal sparse representation of the data. In contrast to conventional clustering algorithms like K-means, where a data point is associated with only one cluster center, in a dictionary-based representation, a data point can be associated with a small set of dictionary atoms. Thus, dictionary learning provides a more flexible representation of data and may have the potential to capture more relevant features from the original feature space of the data. One of the early algorithms for dictionary learning is K-SVD. In recent years, many variations/extensions of K-SVD and other new algorithms have been proposed, with some aiming at adding discriminative capability to the dictionary, and some attempting to model the relationship of multiple dictionaries. One prominent application of dictionary learning is in the general field of visual computing, where long-standing challenges have seen promising new solutions based on sparse representation with learned dictionaries. With a timely review of recent advances of dictionary learning in visual computing, covering the most recent literature with an emphasis on papers after 2008, this book provides a systematic presentation of the general methodologies, specific algorithms, and examples of applications for those who wish to have a quick start on this subject.


Selection-based Dictionary Learning for Sparse Representation in Visual Tracking

Selection-based Dictionary Learning for Sparse Representation in Visual Tracking

Author: Baiyang Liu

Publisher:

Published: 2012

Total Pages: 79

ISBN-13:

DOWNLOAD EBOOK

This dissertation describes a novel selection-based dictionary learning method with a sparse representation to tackle the object tracking problem in computer vision. The sparse representa- tion has been widely used in many applications including visual tracking, compressive sensing, image de-noising and image classification, and learning a good dictionary for the sparse rep- resentation is critical for obtaining high performance. The most popular existing dictionary learning algorithms are generalized from K-means, which compute the dictionary columns to minimize the overall target reconstruction error iteratively. For better discriminative capability to differentiate target-object (positive) from background (negative) data, a class of dictionary algorithms has been developed to learn the dictionary from both the positive and the negative data. However, these methods do not work well for visual tracking in a dynamic environment in which the background can change considerably between frames in a non-linear way. The background cannot be modeled statically with the usual linear models. In this tdissertation, I report on the development of a selection-based dictionary learning algorithm (K-Selection) that constructs the dictionary by choosing its columns from the training data. Each column is the most representative basis for the whole dataset, which also has a clear physical meaning. With locality-constraints, the subspace represented by the learned dictionary is not restricted to the training data alone, and is also less sensitive to outliers. The sparse representation based on this dictionary learning method supports a more robust tracker trained on the target-object data alone. This is because the learned dictionary has more discriminative power and can better distinguish the object from the background clutter. By extending the dictionary with encoded spatial information, I present a new tracking algorithm which is robust to dynamic appearance changes and occlusions. The performance of the proposed algorithms have been validated for several challenging visual tracking applications through a series of comparative experiments.


Sparse Modeling for Image and Vision Processing

Sparse Modeling for Image and Vision Processing

Author: Julien Mairal

Publisher: Now Publishers

Published: 2014-12-19

Total Pages: 216

ISBN-13: 9781680830088

DOWNLOAD EBOOK

Sparse Modeling for Image and Vision Processing offers a self-contained view of sparse modeling for visual recognition and image processing. More specifically, it focuses on applications where the dictionary is learned and adapted to data, yielding a compact representation that has been successful in various contexts.


Sparse Dictionary Learning and the Compact Support Neural Network

Sparse Dictionary Learning and the Compact Support Neural Network

Author: Hongyu Mou

Publisher:

Published: 2022

Total Pages: 0

ISBN-13:

DOWNLOAD EBOOK

Computer vision is an interdisciplinary scientific field that deals with how computers can gain high-levelunderstanding from digital images or videos. The tasks of computer vision include methods for acquiring, processing analyzing and understanding digital images. In this dissertation, we present approaches for obtaining meaningful representations in computer vision. The second chapter proposes a new method for dictionary learning that uses Feature Selection with Annealing (FSA) to control the representation sparsity directly. This method obtains a smaller reconstruction error than LARS or IRLS and the sparse representation can be used for image classification, where it obtains a smaller misclassification error than LARS and IRLS. In the third chapter, we introduce a novel neuron formulation that can be used for obtaining a tight representation near the training examples and thus preventing high confidence predictions on examples that are far away from the training data. The proposed a neuron uses ReLU as the activation function and has compact support, which means that its output is zero outside a bounded domain. We also show how to avoid difficulties in training a neural network with such neurons. Furthermore, the experimental findings on standard benchmark datasets show that the proposed approach has smaller test errors than the state-of-the-art competing methods and outperforms the competing methods in detecting out-of-distribution samples on two out of three datasets. In the fourth chapter, we prove that a neural network with such compact support neurons has the universal approximation property. This means that the network can approximate any continuous and integrable function with an arbitrary degree of accuracy. In the last chapter, we use the feature of CSNN for few shot learning by recognizing data from a new class as out-of-distribution data and it shows good performance.


On-Line Learning in Neural Networks

On-Line Learning in Neural Networks

Author: David Saad

Publisher: Cambridge University Press

Published: 2009-07-30

Total Pages: 412

ISBN-13: 9780521117913

DOWNLOAD EBOOK

On-line learning is one of the most commonly used techniques for training neural networks. Though it has been used successfully in many real-world applications, most training methods are based on heuristic observations. The lack of theoretical support damages the credibility as well as the efficiency of neural networks training, making it hard to choose reliable or optimal methods. This book presents a coherent picture of the state of the art in the theoretical analysis of on-line learning. An introduction relates the subject to other developments in neural networks and explains the overall picture. Surveys by leading experts in the field combine new and established material and enable nonexperts to learn more about the techniques and methods used. This book, the first in the area, provides a comprehensive view of the subject and will be welcomed by mathematicians, scientists and engineers, both in industry and academia.


Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016

Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016

Author: Sebastien Ourselin

Publisher: Springer

Published: 2016-10-17

Total Pages: 666

ISBN-13: 3319467263

DOWNLOAD EBOOK

The three-volume set LNCS 9900, 9901, and 9902 constitutes the refereed proceedings of the 19th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2016, held in Athens, Greece, in October 2016. Based on rigorous peer reviews, the program committee carefully selected 228 revised regular papers from 756 submissions for presentation in three volumes. The papers have been organized in the following topical sections: Part I: brain analysis, brain analysis - connectivity; brain analysis - cortical morphology; Alzheimer disease; surgical guidance and tracking; computer aided interventions; ultrasound image analysis; cancer image analysis; Part II: machine learning and feature selection; deep learning in medical imaging; applications of machine learning; segmentation; cell image analysis; Part III: registration and deformation estimation; shape modeling; cardiac and vascular image analysis; image reconstruction; and MR image analysis.


Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015

Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015

Author: Nassir Navab

Publisher: Springer

Published: 2015-09-28

Total Pages: 801

ISBN-13: 3319245740

DOWNLOAD EBOOK

The three-volume set LNCS 9349, 9350, and 9351 constitutes the refereed proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2015, held in Munich, Germany, in October 2015. Based on rigorous peer reviews, the program committee carefully selected 263 revised papers from 810 submissions for presentation in three volumes. The papers have been organized in the following topical sections: quantitative image analysis I: segmentation and measurement; computer-aided diagnosis: machine learning; computer-aided diagnosis: automation; quantitative image analysis II: classification, detection, features, and morphology; advanced MRI: diffusion, fMRI, DCE; quantitative image analysis III: motion, deformation, development and degeneration; quantitative image analysis IV: microscopy, fluorescence and histological imagery; registration: method and advanced applications; reconstruction, image formation, advanced acquisition - computational imaging; modelling and simulation for diagnosis and interventional planning; computer-assisted and image-guided interventions.