 Title Pages
 Series Foreword
 Preface

1 Introduction to SemiSupervised Learning 
1 A Taxonomy for SemiSupervised Learning Methods 
3 SemiSupervised Text Classification Using EM 
4 Risks of SemiSupervised Learning: How Unlabeled Data Can Degrade Performance of Generative Classifiers 
5 Probabilistic SemiSupervised Clustering with Constraints 
6 Transductive Support Vector Machines 
7 SemiSupervised Learning Using SemiDefinite Programming 
8 Gaussian Processes and the NullCategory Noise Model 
9 Entropy Regularization 
10 DataDependent Regularization 
11 Label Propagation and Quadratic Criterion 
12 The Geometric Basis of SemiSupervised Learning 
13 Discrete Regularization 
14 SemiSupervised Learning with Conditional Harmonic Mixing 
15 Graph Kernels by Spectral Transforms 
16 Spectral Methods for Dimensionality Reduction 
17 Modifying Distances 
18 LargeScale Algorithms 
19 SemiSupervised Protein Classification Using Cluster Kernels 
20 Prediction of Protein Function from Networks 
25 Analysis of Benchmarks 
22 An Augmented PAC Model for SemiSupervised Learning 
23 MetricBased Approaches for SemiSupervised Regression and Classification 
24 Transductive Inference and SemiSupervised Learning 
25 A Discussion of SemiSupervised Learning and Transduction  References
 Notation and Symbols
 Contributors
 Index
Introduction to SemiSupervised Learning
Introduction to SemiSupervised Learning
 Chapter:
 (p.1) 1 Introduction to SemiSupervised Learning
 Source:
 SemiSupervised Learning
 Author(s):
Chapelle Olivier
Schölkopf Bernhard
Zien Alexander
 Publisher:
 The MIT Press
This chapter first presents definitions of supervised and unsupervised learning in order to understand the nature of semisupervised learning (SSL). SSL is halfway between supervised and unsupervised learning. In addition to unlabeled data, the algorithm is provided with some supervision information—but not necessarily for all examples. Often, this information will be the targets associated with some of the examples. Other forms of partial supervision are possible. For example, there may be constraints such as “these points have (or do not have) the same target.” The different setting corresponds to a different view of semisupervised learning: In succeeding chapters, SSL is seen as unsupervised learning guided by constraints. A problem related to SSL was introduced by Vapnik several decades ago—transductive learning. In this setting, a labeled training set and an unlabeled test set are provided. The idea of transduction is to perform predictions only for the test points.
Keywords: semisupervised learning, SSL, unlabeled data, algorithm, supervision information, partial supervision, constraints, Vapnik, transductive learning
MIT Press Scholarship Online requires a subscription or purchase to access the full text of books within the service. Public users can however freely search the site and view the abstracts and keywords for each book and chapter.
Please, subscribe or login to access full text content.
If you think you should have access to this title, please contact your librarian.
To troubleshoot, please check our FAQs, and if you can't find the answer there, please contact us.
 Title Pages
 Series Foreword
 Preface

1 Introduction to SemiSupervised Learning 
1 A Taxonomy for SemiSupervised Learning Methods 
3 SemiSupervised Text Classification Using EM 
4 Risks of SemiSupervised Learning: How Unlabeled Data Can Degrade Performance of Generative Classifiers 
5 Probabilistic SemiSupervised Clustering with Constraints 
6 Transductive Support Vector Machines 
7 SemiSupervised Learning Using SemiDefinite Programming 
8 Gaussian Processes and the NullCategory Noise Model 
9 Entropy Regularization 
10 DataDependent Regularization 
11 Label Propagation and Quadratic Criterion 
12 The Geometric Basis of SemiSupervised Learning 
13 Discrete Regularization 
14 SemiSupervised Learning with Conditional Harmonic Mixing 
15 Graph Kernels by Spectral Transforms 
16 Spectral Methods for Dimensionality Reduction 
17 Modifying Distances 
18 LargeScale Algorithms 
19 SemiSupervised Protein Classification Using Cluster Kernels 
20 Prediction of Protein Function from Networks 
25 Analysis of Benchmarks 
22 An Augmented PAC Model for SemiSupervised Learning 
23 MetricBased Approaches for SemiSupervised Regression and Classification 
24 Transductive Inference and SemiSupervised Learning 
25 A Discussion of SemiSupervised Learning and Transduction  References
 Notation and Symbols
 Contributors
 Index