Abstract
One of the major concerns when implementing a supervised artificial neural network solution to a classification or prediction problem is the network’s performance on unseen data. The phenomenon of the network overfitting the training data is well understood and widely reported in the literature. Most researchers recommend either a time-consuming ‘trial and error’ approach to selecting the optimal number of weights for the network, or starting with a large network and pruning it to an optimal size. Current pruning techniques based on approximations of the Hessian matrix of the error surface are computationally intensive and prone to severe approximation errors if a suitably minimal training error has not been achieved. We propose a novel and simple design heuristic for a three-layer multi-layer perceptron (MLP) based on an eigenvalue decomposition of the covariance matrix of the middle-layer outputs. This technique identifies the neurons that contribute to redundancy in the data passing through the network; such neurons act as additional effective network parameters and have a deleterious effect on the smoothness of the classifier surface. Because the technique identifies redundancy in the network data directly, it does not depend on training having reached a minimal error value at which the Levenberg-Marquardt approximation becomes valid. We report on simulations using the double-convex benchmark which show the utility of the proposed method.
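To make the heuristic concrete, the following is a minimal sketch (not the authors’ code) of how an eigenvalue decomposition of the hidden-layer output covariance matrix can indicate an effective hidden-layer size. The tanh activation, the 0.99 variance threshold, and the synthetic activations below are illustrative assumptions rather than values taken from the paper.

```python
import numpy as np

def effective_hidden_size(hidden_outputs, variance_fraction=0.99):
    """Count the eigenvalues of the hidden-output covariance matrix needed to
    capture `variance_fraction` of the total variance; the remaining hidden
    units are taken to be redundant."""
    # hidden_outputs: (n_samples, n_hidden) matrix of middle-layer activations
    cov = np.cov(hidden_outputs, rowvar=False)       # (n_hidden, n_hidden)
    eigvals = np.linalg.eigvalsh(cov)[::-1]          # eigenvalues, descending
    cumulative = np.cumsum(eigvals) / np.sum(eigvals)
    return int(np.searchsorted(cumulative, variance_fraction) + 1)

# Illustrative usage: an oversized hidden layer whose units are largely
# linear mixtures of a few underlying sources, and hence partly redundant.
rng = np.random.default_rng(0)
sources = rng.standard_normal((500, 4))              # 4 "true" sources
mixing = rng.standard_normal((4, 12))                # 12 hidden units
activations = np.tanh(sources @ mixing)
print(effective_hidden_size(activations))            # close to 4
```

The count returned by this sketch would serve as a guide for choosing (or pruning to) the hidden-layer size, without requiring training to have converged to a minimal error.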
Copyright information
© 1998 Springer-Verlag Wien
Cite this paper
Girolami, M. (1998). Principal Components Identify MLP Hidden Layer Size for Optimal Generalisation Performance. In: Artificial Neural Nets and Genetic Algorithms. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6492-1_9
DOI: https://doi.org/10.1007/978-3-7091-6492-1_9
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83087-1
Online ISBN: 978-3-7091-6492-1