Data Dependent Priors in PAC-Bayes Bounds

2010 
One of the central aims of Statistical Learning Theory is to bound the test set performance of classifiers trained with i.i.d. data. For Support Vector Machines, the tightest technique for assessing this so-called generalisation error is known as the PAC-Bayes theorem. The bound holds independently of the choice of prior, but better priors lead to sharper bounds. The priors leading to the tightest bounds to date are spherical Gaussian distributions whose means are determined from a separate subset of data. This paper gives another turn of the screw by introducing a further data dependence on the shape of the prior: the separate data set determines a direction along which the covariance matrix of the prior is stretched in order to sharpen the bound. In addition, we present a classification algorithm that aims at minimising the bound as a design criterion and whose generalisation can be easily analysed in terms of the new bound.
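For context, the PAC-Bayes theorem the abstract refers to is commonly stated in the following form (the paper's exact variant is not reproduced here, so this Langford-Seeger form is an assumption): with probability at least 1 − δ over an i.i.d. sample S of size m, simultaneously for all posteriors Q,

\[
\mathrm{kl}\!\big(\hat{e}_S(Q)\,\big\|\,e(Q)\big) \;\le\; \frac{\mathrm{KL}(Q\,\|\,P) + \ln\frac{m+1}{\delta}}{m},
\]

where \(\hat{e}_S(Q)\) and \(e(Q)\) are the empirical and true error rates of the Gibbs classifier drawn from Q, kl is the binary KL divergence, and the prior P may depend on any data disjoint from S, which is what licenses the data-dependent priors described above.

The term a better prior shrinks is KL(Q‖P). The sketch below is illustrative only, not the paper's code; the function name, the stretch factor tau, and the choice of direction u are assumptions. It computes this KL in closed form for a unit-variance Gaussian posterior and a Gaussian prior whose covariance is stretched by a factor τ along a unit direction u learned from the separate data split:

```python
# Illustrative sketch (not the paper's code): the KL(Q || P) term of a
# PAC-Bayes bound when the Gaussian prior is stretched along one direction.
import numpy as np

def kl_gaussian_stretched_prior(mu_q, mu_p, u, tau):
    """KL( N(mu_q, I) || N(mu_p, Sigma_p) ) in closed form, where
    Sigma_p = I + (tau**2 - 1) * u u^T stretches the prior by a factor
    tau along the unit direction u (e.g. learned on a held-out split)."""
    d = mu_q.shape[0]
    u = u / np.linalg.norm(u)               # ensure u is a unit vector
    delta = mu_p - mu_q
    # Sherman-Morrison: Sigma_p^{-1} = I - (1 - 1/tau^2) u u^T
    shrink = 1.0 - 1.0 / tau**2
    quad = delta @ delta - shrink * (delta @ u) ** 2  # delta^T Sigma^{-1} delta
    trace_inv = d - shrink                  # tr(Sigma_p^{-1})
    logdet = 2.0 * np.log(tau)              # log det Sigma_p = log tau^2
    return 0.5 * (trace_inv - d + quad + logdet)

# Toy usage with made-up vectors standing in for SVM weight vectors.
rng = np.random.default_rng(0)
mu_q = rng.normal(size=10)                  # posterior mean (trained SVM)
mu_p = mu_q + 0.1 * rng.normal(size=10)     # prior mean from held-out data
print(kl_gaussian_stretched_prior(mu_q, mu_p, u=mu_p, tau=3.0))
```

Setting τ > 1 buys a smaller quadratic penalty along u (and a smaller trace term) at the fixed cost of a log τ² increase in the log-determinant, so the stretch pays off when the posterior-prior offset concentrates along the held-out direction; this is the trade-off a shaped prior of this kind exploits.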