Abstract
Learning from label proportions is the term used for the learning paradigm where the training data is provided in groups (or “bags”), and only the label proportion for each bag is known. The objective is to learn a model to predict the class labels of individual instances. This paradigm presents very different applications, specially concerning anonymous data. Two different iterative strategies are proposed to deal with this type of problems, both based on optimising the class membership of the instances using the estimated pattern distribution per bag and the label proportions. Discriminant analysis is reformulated to deal with non-crisp class memberships. A thorough set of experiments is conducted to test: (1) the performance gap between these approaches and the fully supervised setting, (2) the potential advantages of optimising class memberships by our proposals, and (3) the influence of factors such as the bag size and the number of classes of the problem in the performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chen, S., Liu 0015, B., Qian, M., Zhang, C.: Kernel K-means based framework for aggregate outputs classification. In: ICDM Workshops, pp. 356–361. IEEE Computer Society (2009)
Fan, K., Zhang, H., Yan, S., Wang, L., Zhang, W., Feng, J.: Learning a generative classifier from label proportions. Neurocomputing 139, 47–55 (2014)
Hernández-González, J., Inza, I., Lozano, J.A.: Weak supervision and other non-standard classification problems: a taxonomy. Pattern Recognit. Lett. 69, 49–55 (2016)
Hernández-González, J., Iñza, I., Lozano, J.A.: Learning Bayesian network classifiers from label proportions. Pattern Recognit. 46(12), 3425–3440 (2013)
Musicant, D.R., Christensen, J.M., Olson, J.F.: Supervised learning by training on aggregate outputs. In: Proceedings of the 7th IEEE International Conference on Data Mining (ICDM), pp. 252–261
Quadrianto, N., Smola, A.J., Caetano, T.S., Le, Q.V.: Estimating labels from label proportions. J. Mach. Learn. Res. 10, 2349–2374 (2009)
Stolpe, M., Morik, K.: Learning from label proportions by optimizing cluster model selection. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds.) ECML PKDD 2011, Part III. LNCS, vol. 6913, pp. 349–364. Springer, Heidelberg (2011)
Yu, F.X., Liu, D., Kumar, S., Jebara, T., Chang, S.F.: \(\propto \)SVM for learning with label proportions. In: Proceedings of the 30th International Conference on Machine Learning, ICML, pp. 504–512 (2013)
Acknowledgment
This work was financed by the TIN2014-54583-C2-1-R project of the Spanish MINECO, by FEDER Funds and by the P11-TIC-7508 project of the Junta de Andalucía, Spain.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Pérez-Ortiz, M., Gutiérrez, P.A., Carbonero-Ruz, M., Hervás-Martínez, C. (2016). Learning from Label Proportions via an Iterative Weighting Scheme and Discriminant Analysis. In: Luaces , O., et al. Advances in Artificial Intelligence. CAEPIA 2016. Lecture Notes in Computer Science(), vol 9868. Springer, Cham. https://doi.org/10.1007/978-3-319-44636-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-44636-3_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44635-6
Online ISBN: 978-3-319-44636-3
eBook Packages: Computer ScienceComputer Science (R0)