Teaching and Compressing for Low VC-Dimension

Publication:4604393




Abstract: In this work we study the quantitative relation between the VC-dimension and two other basic parameters related to learning and teaching: the quality of sample compression schemes and of teaching sets for classes of low VC-dimension. Let C be a binary concept class of size m and VC-dimension d. Prior to this work, the best known upper bound for both parameters was log(m), while the best lower bounds are linear in d. We present significantly better upper bounds on both, as follows. Set k = O(d · 2^d · log log |C|). We show that there always exists a concept c in C with a teaching set (i.e., a list of c-labeled examples uniquely identifying c in C) of size k. This problem was studied by Kuhlmann (1999). Our construction implies that the recursive teaching (RT) dimension of C is at most k as well. The RT-dimension was suggested by Zilles et al. and Doliwa et al. (2010); the same notion (under the name partial-ID width) was independently studied by Wigderson and Yehudayoff (2013). An upper bound on this parameter that depends only on d is known only for the very simple case d = 1, and the question is open even for d = 2. We also make small progress towards this seemingly modest goal. We further construct sample compression schemes of size k for C, with additional information of k log(k) bits. Roughly speaking, given any list of C-labeled examples of arbitrary length, we can retain only k labeled examples in a way that allows one to recover the labels of all other examples in the list, using k log(k) additional information bits. This problem was first suggested by Littlestone and Warmuth (1986).
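To make the teaching-set notion concrete, the following is a minimal brute-force sketch in Python (illustrative only, not the paper's construction, which is far more efficient): for a toy concept class of threshold functions on four points, it finds, for each concept c, a smallest list of c-labeled examples that uniquely identifies c within the class. The class, domain, and function names are all hypothetical choices made for this example.

```python
from itertools import combinations

# Toy domain and concept class: threshold functions on {0, 1, 2, 3}.
# Concept with threshold t labels x as 1 iff x >= t. This class has
# VC-dimension 1, so small teaching sets are expected.
DOMAIN = [0, 1, 2, 3]
CONCEPTS = [tuple(int(x >= t) for x in DOMAIN) for t in range(5)]

def consistent(concept, sample):
    """True iff `concept` agrees with every (index, label) pair in `sample`."""
    return all(concept[i] == y for i, y in sample)

def min_teaching_set(c, concepts, domain):
    """Smallest list of c-labeled examples identifying c within `concepts`.
    Exhaustive search over example subsets; feasible only for tiny classes."""
    for size in range(len(domain) + 1):
        for idxs in combinations(range(len(domain)), size):
            sample = [(i, c[i]) for i in idxs]
            survivors = [h for h in concepts if consistent(h, sample)]
            if survivors == [c]:
                return sample
    return None  # unreachable when c is in `concepts`

for c in CONCEPTS:
    print(c, "->", min_teaching_set(c, CONCEPTS, DOMAIN))
```

For this class, the extreme concepts (all-zeros, all-ones) are taught by a single example, while interior thresholds need two examples bracketing the threshold, matching the intuition that teaching-set size tracks how finely a concept must be pinned down inside the class.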


