Semi-supervised Learning for Training CNNs with Few Data

Semi-supervised Learning for Training CNNs with Few Data

Author: Víctor García Satorras

Publisher:

Published: 2017

Total Pages:

ISBN-13:

DOWNLOAD EBOOK

Although Deep Learning has successfully been applied to many fields, it relies on large amounts of data. In this work we focus on two different research lines within the context of image classification that try to deal with this problem. a) The first part of the project is focused on Active Learning (AL), which is an extensive field within Machine Learning that tries to reduce the amount of labeling work by inter- actively querying the most informative samples from a large dataset. Most of the AL literature is based on uncertainty sampling methods which do not perform so well when applied to neural networks. In this project we present a density estimation approach for Active Learning that overcomes some of the sampling limitations re- lated to the uncertainty-based methods. b) The second part of the project is focused on a very recent field within deep learning called one-shot learning, which aims to correctly classify samples by just seeing one or few training samples from each class. In this work we present a simple non-linear learnable metric for one-shot learning that overcomes most of the state of the art results obtained with simple methods and is competitive in terms of accuracy to more complex ones. We also present a meta-learner architecture based on Graph Neural Networks for one-shot learning.


Introduction to Semi-Supervised Learning

Introduction to Semi-Supervised Learning

Author: Xiaojin Geffner

Publisher: Springer Nature

Published: 2022-05-31

Total Pages: 116

ISBN-13: 3031015487

DOWNLOAD EBOOK

Semi-supervised learning is a learning paradigm concerned with the study of how computers and natural systems such as humans learn in the presence of both labeled and unlabeled data. Traditionally, learning has been studied either in the unsupervised paradigm (e.g., clustering, outlier detection) where all the data are unlabeled, or in the supervised paradigm (e.g., classification, regression) where all the data are labeled. The goal of semi-supervised learning is to understand how combining labeled and unlabeled data may change the learning behavior, and design algorithms that take advantage of such a combination. Semi-supervised learning is of great interest in machine learning and data mining because it can use readily available unlabeled data to improve supervised learning tasks when the labeled data are scarce or expensive. Semi-supervised learning also shows potential as a quantitative tool to understand human category learning, where most of the input is self-evidently unlabeled. In this introductory book, we present some popular semi-supervised learning models, including self-training, mixture models, co-training and multiview learning, graph-based methods, and semi-supervised support vector machines. For each model, we discuss its basic mathematical formulation. The success of semi-supervised learning depends critically on some underlying assumptions. We emphasize the assumptions made by each model and give counterexamples when appropriate to demonstrate the limitations of the different models. In addition, we discuss semi-supervised learning for cognitive psychology. Finally, we give a computational learning theoretic perspective on semi-supervised learning, and we conclude the book with a brief discussion of open questions in the field. Table of Contents: Introduction to Statistical Machine Learning / Overview of Semi-Supervised Learning / Mixture Models and EM / Co-Training / Graph-Based Semi-Supervised Learning / Semi-Supervised Support Vector Machines / Human Semi-Supervised Learning / Theory and Outlook


Semi-Supervised Learning and Domain Adaptation in Natural Language Processing

Semi-Supervised Learning and Domain Adaptation in Natural Language Processing

Author: Anders Søgaard

Publisher: Morgan & Claypool Publishers

Published: 2013-05-01

Total Pages: 105

ISBN-13: 1608459861

DOWNLOAD EBOOK

This book introduces basic supervised learning algorithms applicable to natural language processing (NLP) and shows how the performance of these algorithms can often be improved by exploiting the marginal distribution of large amounts of unlabeled data. One reason for that is data sparsity, i.e., the limited amounts of data we have available in NLP. However, in most real-world NLP applications our labeled data is also heavily biased. This book introduces extensions of supervised learning algorithms to cope with data sparsity and different kinds of sampling bias. This book is intended to be both readable by first-year students and interesting to the expert audience. My intention was to introduce what is necessary to appreciate the major challenges we face in contemporary NLP related to data sparsity and sampling bias, without wasting too much time on details about supervised learning algorithms or particular NLP applications. I use text classification, part-of-speech tagging, and dependency parsing as running examples, and limit myself to a small set of cardinal learning algorithms. I have worried less about theoretical guarantees ("this algorithm never does too badly") than about useful rules of thumb ("in this case this algorithm may perform really well"). In NLP, data is so noisy, biased, and non-stationary that few theoretical guarantees can be established and we are typically left with our gut feelings and a catalogue of crazy ideas. I hope this book will provide its readers with both. Throughout the book we include snippets of Python code and empirical evaluations, when relevant.


Semi-Supervised Learning

Semi-Supervised Learning

Author: Olivier Chapelle

Publisher: MIT Press

Published: 2010-01-22

Total Pages: 525

ISBN-13: 0262514125

DOWNLOAD EBOOK

A comprehensive review of an area of machine learning that deals with the use of unlabeled data in classification problems: state-of-the-art algorithms, a taxonomy of the field, applications, benchmark experiments, and directions for future research. In the field of machine learning, semi-supervised learning (SSL) occupies the middle ground, between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no label data are given). Interest in SSL has increased in recent years, particularly because of application domains in which unlabeled data are plentiful, such as images, text, and bioinformatics. This first comprehensive overview of SSL presents state-of-the-art algorithms, a taxonomy of the field, selected applications, benchmark experiments, and perspectives on ongoing and future research.Semi-Supervised Learning first presents the key assumptions and ideas underlying the field: smoothness, cluster or low-density separation, manifold structure, and transduction. The core of the book is the presentation of SSL methods, organized according to algorithmic strategies. After an examination of generative models, the book describes algorithms that implement the low-density separation assumption, graph-based methods, and algorithms that perform two-step learning. The book then discusses SSL applications and offers guidelines for SSL practitioners by analyzing the results of extensive benchmark experiments. Finally, the book looks at interesting directions for SSL research. The book closes with a discussion of the relationship between semi-supervised learning and transduction.


Continual Semi-Supervised Learning

Continual Semi-Supervised Learning

Author: Fabio Cuzzolin

Publisher: Springer Nature

Published: 2022-09-27

Total Pages: 148

ISBN-13: 3031175875

DOWNLOAD EBOOK

This book constitutes the proceedings of the First International Workshop on Continual Semi-Supervised Learning, CSSL 2021, which took place as a virtual event during August 2021.The 9 full papers and 0 short papers included in this book were carefully reviewed and selected from 14 submissions.


Learning with Partially Labeled and Interdependent Data

Learning with Partially Labeled and Interdependent Data

Author: Massih-Reza Amini

Publisher: Springer

Published: 2015-05-07

Total Pages: 113

ISBN-13: 3319157264

DOWNLOAD EBOOK

This book develops two key machine learning principles: the semi-supervised paradigm and learning with interdependent data. It reveals new applications, primarily web related, that transgress the classical machine learning framework through learning with interdependent data. The book traces how the semi-supervised paradigm and the learning to rank paradigm emerged from new web applications, leading to a massive production of heterogeneous textual data. It explains how semi-supervised learning techniques are widely used, but only allow a limited analysis of the information content and thus do not meet the demands of many web-related tasks. Later chapters deal with the development of learning methods for ranking entities in a large collection with respect to precise information needed. In some cases, learning a ranking function can be reduced to learning a classification function over the pairs of examples. The book proves that this task can be efficiently tackled in a new framework: learning with interdependent data. Researchers and professionals in machine learning will find these new perspectives and solutions valuable. Learning with Partially Labeled and Interdependent Data is also useful for advanced-level students of computer science, particularly those focused on statistics and learning.


Medical Image Learning with Limited and Noisy Data

Medical Image Learning with Limited and Noisy Data

Author: Ghada Zamzmi

Publisher: Springer Nature

Published: 2022-09-21

Total Pages: 243

ISBN-13: 3031167600

DOWNLOAD EBOOK

This book constitutes the proceedings of the First Workshop on Medical Image Learning with Limited and Noisy Data, MILLanD 2022, held in conjunction with MICCAI 2022. The conference was held in Singapore. For this workshop, 22 papers from 54 submissions were accepted for publication. They selected papers focus on the challenges and limitations of current deep learning methods applied to limited and noisy medical data and present new methods for training models using such imperfect data.


Medical Image Computing and Computer Assisted Intervention – MICCAI 2022

Medical Image Computing and Computer Assisted Intervention – MICCAI 2022

Author: Linwei Wang

Publisher: Springer Nature

Published: 2022-09-15

Total Pages: 782

ISBN-13: 3031164407

DOWNLOAD EBOOK

The eight-volume set LNCS 13431, 13432, 13433, 13434, 13435, 13436, 13437, and 13438 constitutes the refereed proceedings of the 25th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2022, which was held in Singapore in September 2022. The 574 revised full papers presented were carefully reviewed and selected from 1831 submissions in a double-blind review process. The papers are organized in the following topical sections: Part I: Brain development and atlases; DWI and tractography; functional brain networks; neuroimaging; heart and lung imaging; dermatology; Part II: Computational (integrative) pathology; computational anatomy and physiology; ophthalmology; fetal imaging; Part III: Breast imaging; colonoscopy; computer aided diagnosis; Part IV: Microscopic image analysis; positron emission tomography; ultrasound imaging; video data analysis; image segmentation I; Part V: Image segmentation II; integration of imaging with non-imaging biomarkers; Part VI: Image registration; image reconstruction; Part VII: Image-Guided interventions and surgery; outcome and disease prediction; surgical data science; surgical planning and simulation; machine learning – domain adaptation and generalization; Part VIII: Machine learning – weakly-supervised learning; machine learning – model interpretation; machine learning – uncertainty; machine learning theory and methodologies.


Semi-supervised Learning Using Triple-Siamese Network

Semi-supervised Learning Using Triple-Siamese Network

Author: Debapriya Banerjee

Publisher:

Published: 2020

Total Pages: 27

ISBN-13:

DOWNLOAD EBOOK

Missing data problem is inevitable in mostly all research areas including Arti-cial Intelligence, Machine Learning and Computer Vision where we have modicum knowledge about the complete dataset. One of the key reasons of missing data in AI is insufficiency of accurately labeled data. To solve a classification problem using ML or training a Deep Neural Network model, we need a huge amount of labeled data. It is difficult to get labeled data but unlabeled data is inexpensive and available easily.It is usual that we get no more than a single element per class to train our models due to unavailability of enough labeled training data. Strict privacy control or accidental loss may also cause missing data problem. One of the ways of getting training data labeled is using human-in-the-loop, but budget constraints can prevent that option.The objective of this research is to recover the complete signal or missing labels of the dataset using state-of-the-art Machine Learning and Computer Vision techniques.We propose a novel network trained with a few instances of a class to perform Metric Learning. We then convert our dataset to a graph signal and recover the graph completely using Recovery algorithm in Graph Fourier Transform. Our approach performs significantly better than Graph Neural Network and other state-of-the-art techniques.


Supervised and Unsupervised Learning for Data Science

Supervised and Unsupervised Learning for Data Science

Author: Michael W. Berry

Publisher: Springer Nature

Published: 2019-09-04

Total Pages: 191

ISBN-13: 3030224759

DOWNLOAD EBOOK

This book covers the state of the art in learning algorithms with an inclusion of semi-supervised methods to provide a broad scope of clustering and classification solutions for big data applications. Case studies and best practices are included along with theoretical models of learning for a comprehensive reference to the field. The book is organized into eight chapters that cover the following topics: discretization, feature extraction and selection, classification, clustering, topic modeling, graph analysis and applications. Practitioners and graduate students can use the volume as an important reference for their current and future research and faculty will find the volume useful for assignments in presenting current approaches to unsupervised and semi-supervised learning in graduate-level seminar courses. The book is based on selected, expanded papers from the Fourth International Conference on Soft Computing in Data Science (2018). Includes new advances in clustering and classification using semi-supervised and unsupervised learning; Address new challenges arising in feature extraction and selection using semi-supervised and unsupervised learning; Features applications from healthcare, engineering, and text/social media mining that exploit techniques from semi-supervised and unsupervised learning.