Feedback Coding for Active Learning

by G. Canal, M. Bloch and C. Rozell

Abstract:

The iterative selection of examples for labeling in active machine learning is conceptually similar to feedback channel coding in information theory: in both tasks, the objective is to seek a minimal sequence of actions to encode information in the presence of noise. While this high-level overlap has been previously noted, there remain open questions on how to best formulate active learning as a communications system to leverage existing analysis and algorithms in feedback coding. In this work, we formally identify and leverage the structural commonalities between the two problems, including the characterization of encoder and noisy channel components, to design a new algorithm. Specifically, we develop an optimal transport-based feedback coding scheme called Approximate Posterior Matching (APM) for the task of active example selection and explore its application to Bayesian logistic regression, a popular model in active learning. We evaluate APM on a variety of datasets and demonstrate learning performance comparable to existing active learning methods, at a reduced computational cost. These results demonstrate the potential of directly deploying concepts from feedback channel coding to design efficient active learning strategies.

Reference:

Feedback Coding for Active Learning. G. Canal, M. Bloch and C. Rozell. In International Conference on Artificial Intelligence and Statistics (AISTATS), April 2021. (Acceptance rate 30%)

Bibtex Entry:

@InProceedings{canal.21,
  author = {Canal, G. and Bloch, M. and Rozell, C.},
  title = {Feedback Coding for Active Learning},
  year = 2021,
  month = apr,
  abstract = {The iterative selection of examples for labeling in active machine learning is conceptually similar to feedback channel coding in information theory: in both tasks, the objective is to seek a minimal sequence of actions to encode information in the presence of noise. While this high-level overlap has been previously noted, there remain open questions on how to best formulate active learning as a communications system to leverage existing analysis and algorithms in feedback coding. In this work, we formally identify and leverage the structural commonalities between the two problems, including the characterization of encoder and noisy channel components, to design a new algorithm. Specifically, we develop an optimal transport-based feedback coding scheme called Approximate Posterior Matching (APM) for the task of active example selection and explore its application to Bayesian logistic regression, a popular model in active learning. We evaluate APM on a variety of datasets and demonstrate learning performance comparable to existing active learning methods, at a reduced computational cost. These results demonstrate the potential of directly deploying concepts from feedback channel coding to design efficient active learning strategies.},
  booktitle = {International Conference on Artificial Intelligence and Statistics (AISTATS)},
  note = {(Acceptance rate 30\%)},
  address = {Virtual meeting},
  url = {https://arxiv.org/abs/2103.00654}
}