Abstract
This paper aims at developing a data mining approach for classification rule representation and automated acquisition from numerical data with continuous attributes. The classification rules are crisp and described by ellipsoidal regions with different attributes for each individual rule. A regularization model trading off misclassification rate, recognition rate and generalization ability is presented and applied to rule refinement. A regularizing data mining algorithm is given, which includes self-organizing map network based clustering techniques, feature selection using breakpoint technique, rule initialization and optimization, classifier structure and usage. An Illustrative example demonstrates the applicability and potential of the proposed techniques for domains with continuous attributes.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
R. L. Kennedy, Y. Lee, B. V. Roy, C. D. Reed and R. P. Lippmann, Solvong Data Mining Problems Through Pattern Recognition. Prentice Hall, PTR, Unica Technologies, Inc., (1998)
S. Theodoridis and K. Koutroumbas: Pattern Recognition, Academic Press, (1999)
A.K. Jain, P. W. Duin, and J. Mao, Statistical pattern recognition: a review, IEEE Trans.On Pattern Analysis and Machine Intelligence, 5 (2000) 4–37
S. Sestito and T. S. Dillon, Automated Knowledge Acquisition. Australia: Prentice Hall, (1994)
T. Kohonen, Self-Organization and Associative Memory. Berlin: Springer-Verlag (1989)
R. P. Lippmann, Pattern classification using neural networks, IEEE Communications Magazine, (1989) 47–64
J. Y. Ching, A. K. C. Wong and K. C. C. Chan, Class-dependent discretization for inductive learning from continuous and mixed-mode data, IEEE Trans.On Pattern Analysis and Machine Intelligence, 7(1995) 641–651
P. K. Simpson, Fuzzy min-max neural networks-Part I: Classification, IEEE Trans. On Neural Networks, 5(1992) 776–786
X. Wu, Fuzzy interpretation of discretized intervals, IEEE Trans. On Fuzzy Systems, 6(1999)753–759
S. Mitra, R. K. De and S. K. Pal, Knowledge-based fuzzy MLP for classification and rule generation, IEEE Trans. On Neural Networks, 6(1997) 1338–1350
L. M. Fu, A neural-network model for learning domain rules based on its activation function characteristics, IEEE Trans. On Neural Networks, 5(1998) 787–795
J. Vesanto, and E. Alhoniemi, Clustering of the self-organizing map, IEEE Trans. On Neural Networks, Special Issue On Neural Networks for Data Mining and Knowledge Discovery, 3(2000) 586–600
H. Lu, R. Setion and H. Liu, Effective data mining using neural networks, IEEE Trans. On Knowledge and Data Engineering, 6(1996) 957–961
S. Abe, R. Thawonmas and Y. Kobayashi, Feature selection by analyzing class regions approximated by ellipsoids, IEEE Trans. On SMC-Part C: Applications and Reviews, 2(1998) 282–287
W. F. Bloomer, T. S. Dillon and M. Witten, Hybrid BRAINNE: Further developments in extracting symbolic disjunctive rules, Expert Systems with Applications, 3(1997) 163–168
D. H. Wang and T. S. Dillon, Extraction and optimization of classification rules for continuous or mixed mode data using neural nets, Proceedings of SPIE International Conference on Data Mining and Knowledge Discovery: Theory, Tools and Technology III, pp. 38–45, April 16-20, 2001, Orlando, Florida, USA.
H. Yang and T. S. Dillon, Convergence of self-organizing neural algorithm, Neural Networks, 5(1992) 485–493
G. G. Sutton and J. A. Reggia, Effects of normalization constraints on competitive learning, IEEE Trans. On Neural Networks, 3(1994) 502–504
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, D., Dillon, T., Chang, E. (2002). Trading off between Misclassification, Recognition and Generalization in Data Mining with Continuous Features. In: Hendtlass, T., Ali, M. (eds) Developments in Applied Artificial Intelligence. IEA/AIE 2002. Lecture Notes in Computer Science(), vol 2358. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48035-8_30
Download citation
DOI: https://doi.org/10.1007/3-540-48035-8_30
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43781-9
Online ISBN: 978-3-540-48035-8
eBook Packages: Springer Book Archive