by B. Martin Urcelay, C.J. Rozell and M. Bloch
Abstract:
We revisit the framework of online machine teaching, a special case of active learning in which a teacher with full knowledge of a model attempts to train a learner by adaptively presenting examples. While online machine teaching example selection strategies are typically designed assuming omniscience, i.e., that the teacher has absolute knowledge of the learner state, we show that efficient machine teaching is possible even when the teacher is uncertain about the learner initialization. Specifically, we consider the case of learners that perform gradient descent on a quadratic loss to learn a linear classifier, and we propose an online machine teaching algorithm in which the teacher learns the learner state while simultaneously teaching the learner. We theoretically show that the learner's mean square error decreases exponentially with the number of examples, thus achieving performance similar to the omniscient case and outperforming two-stage strategies that first attempt to make the teacher omniscient before teaching. We empirically illustrate our approach in the context of a cross-lingual sentiment analysis problem.
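For intuition, here is a minimal Python sketch of the omniscient baseline that the abstract compares against (all variable names are illustrative; this is not the paper's uncertainty-aware algorithm). A teacher that knows the learner state w and the target classifier w_star can pick each example along the current error direction, so the gradient descent learner's squared error shrinks geometrically, matching the exponential decay described above.

import numpy as np

# Omniscient-teacher sketch (assumed setup, not the paper's algorithm):
# the learner minimizes the quadratic loss 0.5 * (<w, x> - y)^2 by
# gradient descent, so its update uses grad_w = (<w, x> - y) * x.
rng = np.random.default_rng(0)
d, eta, n_steps = 5, 0.5, 20

w_star = rng.standard_normal(d)  # target linear classifier
w = rng.standard_normal(d)       # learner state, known to the omniscient teacher

for t in range(n_steps):
    err = w - w_star
    norm = np.linalg.norm(err)
    if norm < 1e-12:             # learner has effectively converged
        break
    x = err / norm               # teaching example: unit vector along the error
    y = w_star @ x               # label consistent with the target model
    w = w - eta * (w @ x - y) * x  # learner's gradient step on the quadratic loss
    print(t, np.sum((w - w_star) ** 2))

With this choice of x and y, each step satisfies w - w_star -> (1 - eta) * (w - w_star), so the printed mean square error decays as (1 - eta)^(2t). The paper's contribution is to retain this kind of exponential decay even when the teacher does not know the learner's initialization.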
Reference:
Online Machine Teaching under Learner Uncertainty: Gradient Descent Learners of a Quadratic Loss. B. Martin Urcelay, C.J. Rozell and M. Bloch. SIAM Journal on Mathematics of Data Science (SIMODS), 7(3), pp. 884-905, July 2025.
BibTeX Entry:
@article{urcelay.23b,
  title = {Online Machine Teaching under Learner Uncertainty: Gradient Descent Learners of a Quadratic Loss},
  author = {Martin Urcelay, B. and Rozell, C.J. and Bloch, M.},
  year = {2025},
  volume = {7},
  number = {3},
  pages = {884--905},
  journal = {SIAM Journal on Mathematics of Data Science (SIMODS)},
  month = jul,
  abstract = {We revisit the framework of online machine teaching, a special case of active learning in which a teacher with full knowledge of a model attempts to train a learner by adaptively presenting examples. While online machine teaching example selection strategies are typically designed assuming omniscience, i.e., that the teacher has absolute knowledge of the learner state, we show that efficient machine teaching is possible even when the teacher is uncertain about the learner initialization. Specifically, we consider the case of learners that perform gradient descent on a quadratic loss to learn a linear classifier, and we propose an online machine teaching algorithm in which the teacher learns the learner state while simultaneously teaching the learner. We theoretically show that the learner's mean square error decreases exponentially with the number of examples, thus achieving performance similar to the omniscient case and outperforming two-stage strategies that first attempt to make the teacher omniscient before teaching. We empirically illustrate our approach in the context of a cross-lingual sentiment analysis problem.},
  doi = {10.1137/24M1657997},
  url = {https://epubs.siam.org/eprint/MT9BW9S4MWHWVVBQS6MD/full}
}