Skip to main content

Research on Micro-Blog Information Perception and Mining Platform

  • Conference paper
  • First Online:
Advanced Technologies, Embedded and Multimedia for Human-centric Computing

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 260))

  • 1079 Accesses

Abstract

To predict the tendency of Micro-blog information dissemination, provide the early warning of the Internet emergencies, and contribute to the content security of micro-blog, the paper offers a platform for Micro-blog information perceiving and mining. This platform is an integration of Micro-blog data collection and processing module, topic detection and tracking module, user behavior analysis module, trend prediction module, etc. It could access and analyze micro-blog information automatically, leading a positive significance to grasp the emergencies on micro-blog. This paper puts forward methods based on the Latent Dirichlet Allocation (LDA) document clustering and hot topics prediction, which could analysis and predict the micro-blog data effectively, avoiding the problems in the traditional algorithm. Also, these methods have a higher accuracy for clustering and prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Han R (2010) The influence of micro-blogging on personal public participation. In: Proceeding(s) of 2010 IEEE 2nd symposium on web society, pp 615–618

    Google Scholar 

  2. Kang S, Zhang C (2010) Complexity research of massively micro-blogging based on human behaviors. In: Proceeding(s) of 2nd international workshop on database technology and applications, pp 1–4

    Google Scholar 

  3. Wang R, Jin Y (2010) An empirical study on the relationship between the followers’ number and influence of micro-blogging. In: Proceeding(s) of the international conference on e-business and e-Government, pp 2014–2017

    Google Scholar 

  4. Pouliquen B, Steinberger R et al (2004) Multilingual and cross-lingual news topic tracking. In: Proceeding(s) of the 20th international conference on computational linguistics, pp 23–27

    Google Scholar 

  5. Yang Y, Pierce T, Carbonell J (1998) A study on retrospective and on-line event detection. In: Proceeding(s) of the 21st annual international ACM SIGIR conference on research and development in information retrieval, pp 28–36

    Google Scholar 

  6. Jin H, Schwartz R, Wall F (1999) Topic tracking for radio, TV broadcast, and newswire. In: Proceeding(s) of the DARPA broadcast news workshop, pp 199–204

    Google Scholar 

  7. Pui G, Fung C, Yu JX, Lu H (2005) Parameter free bursty events detection in text streams. In: Proceeding(s) of the 31st international conference on very large data bases, pp 181–192

    Google Scholar 

  8. Zhu J, Xiong F, Piao D, Liu Y, Zhang Y (2011) Statistically modeling the effectiveness of disaster information in social media. In: Proceeding(s) of the IEEE global humanitarian technology conference, pp 431–436

    Google Scholar 

  9. Xiong F, Liu Y, Zhu J, Lian J, Zhang Y (2012) Hot post prediction in BBS forums based on multifactor fusion. J Convergence Inf Technol 7(12):129–137

    Article  Google Scholar 

  10. http://en.wikipedia.org/wiki/Latent_Dirichlet_allocation

  11. Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(4–5):993–1022

    MATH  Google Scholar 

  12. Farrahi K, Gatica-Perez (2011) Discovering routines from large-scale human locations using probabilistic topic models. ACM Trans Comput Logic 2(1):3

    Google Scholar 

  13. Can F, Ozkarahan EA (1990) Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases. ACM Trans Database Syst 15:483–517

    Article  Google Scholar 

  14. Zhou E, Zhong N, Li Y (2011) Hot topic detection in professional blogs. Active Media Technol 6890:141–152

    Article  Google Scholar 

  15. Zhang Z, Li Q (2011) QuestionHolic: hot topic discovery and trend analysis in community question answering systems. Expert Syst Appl 38(6):6848–6855

    Article  Google Scholar 

  16. Suykens JAK, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9(3):293–300

    Article  MathSciNet  Google Scholar 

  17. http://en.wikipedia.org/wiki/Bootstrap_aggregating

Download references

Acknowledgments

This work has been supported by the National Natural Science Foundation of China under Grant 61172072, 61271308, the Beijing Natural Science Foundation under Grant 4112045, the Research Fund for the Doctoral Program of Higher Education of China under Grant W11C100030, the Beijing Science and Technology Program under Grant Z121100000312024.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fei Xiong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media Dordrecht

About this paper

Cite this paper

Wang, X., Xiong, F., Liu, Y. (2014). Research on Micro-Blog Information Perception and Mining Platform. In: Huang, YM., Chao, HC., Deng, DJ., Park, J. (eds) Advanced Technologies, Embedded and Multimedia for Human-centric Computing. Lecture Notes in Electrical Engineering, vol 260. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-7262-5_85

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-7262-5_85

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-7261-8

  • Online ISBN: 978-94-007-7262-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics