Skip to main content

Locality Preserving Based K-Means Clustering

  • Conference paper
  • First Online:
Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques (IScIDE 2015)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9243))

  • 2821 Accesses

Abstract

K-Means is a powerful clustering method, in which the Euclidean distance is usually employed. In the paper, by following the basic idea of locality preserving projection (LPP), we first define locality preserving scatter matrix, then introduce a new Mahalanobis distance by using the defined matrix, finally propose a novel K-Means clustering algorithm based on the given Mahalanobis distance. Different from the traditional K-Means algorithm, the proposed method considers fully the intrinsic manifold structure of data. Experimental results show that the proposed method can achieve better clustering accuracy in contrast with the traditional K-Means algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Rokach, L.: A survey of clustering algorithms. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 269–298. Springer, Heidelberg (2010)

    Google Scholar 

  2. Xu, R., Wunsch II, D.: Survey of clustering algorithms. IEEE Trans. Neural Netw. 16(3), 645–678 (2005)

    Article  Google Scholar 

  3. Yu-chen, S., Yu-ying, Z., Hai-dong, M.: Research based on Euclid distance with weights of clustering method. Comput. Eng. Appl. 43(4), 179–180 (2007)

    Google Scholar 

  4. Alin, F., Shuhua, R.: K-Means clustering algorithm based on coefficient of variation. Comput. Eng. Appl. 48(35), 114–117 (2012)

    Google Scholar 

  5. Xianghus, W., Niu Shengjie, W., Chengou, W.: An improvement on estimating covariance matrix during cluster analysis using Mahalanobis distance. Appl. Stat. Manage. 30(2), 240–245 (2011)

    Google Scholar 

  6. Xiang, Z., Shitong, W.: Mahalanbbis distance based possibilistic clustering algorithm and its analysis. Data Acquisit. Process. 23(8), 86–88 (2011)

    Google Scholar 

  7. He, X.F., Niyogi, P.: Locality preserving projections. In: Proceedings of the Conference on Advances in Neural Information Processing Systems, pp. 585–591 (2003)

    Google Scholar 

  8. He, X.F., Yan, S.C., Hu, Y.X., Niyogi, P., Zhang, H.J.: Face recognition using Laplacian faces. IEEE Trans. Pattern Anal. Mach. Intell. 27(3), 328–340 (2005)

    Article  Google Scholar 

  9. Zhonghua, S., Yonghui, P., Shitong, W.: A supervised locality preserving projection algorithm for dimensionality reduction. Pattern Recog. Artif. Intell. 21(2), 232–239 (2008)

    Google Scholar 

  10. Chuanliang, C., Rongfang, B., Ping, G.: Combining LPP with PCA for Microarray Data Clustering. IEEE Congress on Evolutionary Computation, pp. 2081–2086 (2008)

    Google Scholar 

  11. Sun, X., Zhang, Q., Wang, Z.: Using LPP and LS-SVM for spam filtering. In: 2009 ISECS International Colloquium on Computing, Communication, Control, and Management, vol. 2, pp. 451–454, 8–9 August 2009

    Google Scholar 

  12. Melnykov, I., Melnykov, V.: On K-means algorithm with the use of Mahalanobis distances. Stat. Probab. Lett. 84(2014), 88–95 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  13. Kokiopoulou, E., Saad, Y.: Orthogonal neighborhood preserving projections a projectionbased dimensionality reduction technique. IEEE Trans. Pattern Anal. Mach. Intell. 29(12), 2143–2156 (2007)

    Article  Google Scholar 

  14. Cai, D., He, X., Han, J., Zhang, H.J.: Orthogonal laplacianfaces for face recognition. IEEE Trans. Image Process. 15(11), 3608–3614 (2006)

    Article  Google Scholar 

  15. Kakade, S.M., Shalev-Shwartz, S., Tewari, A.: Regularization Techniques for Learning with Matrices. Mach. Learn. Res. 13(1), 1865–1890 (2012)

    MathSciNet  MATH  Google Scholar 

  16. Blake, C., Merz, C.: UCI Repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html

  17. von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007)

    Article  MathSciNet  Google Scholar 

  18. Keysers, D.: Usps dataset (1994). http://wwwi6.informatik.rwth-aachen.de/keysers/usps.html

Download references

Acknowledgments

This work is supported by the Graduate Innovation Fund of Xihua University (Grant No.ycjj2014032), the Key Scientific Research Foundation of Sichuan Provincial Department of Education (Grant No.11ZA004) and the National Science Foundation of China (Grant No. 61103168, 61271413, 61472329).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaohuan Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, X., Wang, X., Tian, Y., Du, Y. (2015). Locality Preserving Based K-Means Clustering. In: He, X., et al. Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques. IScIDE 2015. Lecture Notes in Computer Science(), vol 9243. Springer, Cham. https://doi.org/10.1007/978-3-319-23862-3_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23862-3_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23861-6

  • Online ISBN: 978-3-319-23862-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics