Natural Language Analysis of Online Health Forums

Hasan, Abul; Levene, Mark; Weston, David J.

doi:10.1007/978-3-319-68765-0_11

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10584)

Included in the following conference series:

International Symposium on Intelligent Data Analysis

Abstract

Despite advances in concept extraction from free text, finding meaningful health-related information in online patient forums still poses a significant challenge. Here we demonstrate how structured information can be extracted from posts in such forums by forming relationships between a drug or treatment and a symptom or side effect, including the polarity (sentiment) expressed by the patient. In particular, a rule-based natural language processing (NLP) system is deployed, in which information across sentences is linked together through anaphora resolution. Our NLP relationship extraction system provides a strong baseline, achieving an $\text{F}_1$ score of over 80% in discovering the relationships present in the posts we analysed.
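
The kind of pipeline the abstract describes can be illustrated with a toy sketch. This is not the authors' system: the lexicons, the cue words, the "replace 'it' with the most recent drug" heuristic standing in for anaphora resolution, and the function names `extract_relations` and `f1_score` are all assumptions made for illustration only. The sketch extracts (drug, symptom, polarity) triples from a post and scores them against a hand-labelled gold set using $\text{F}_1$.

```python
import re

# Hypothetical toy lexicons; the abstract does not give the paper's vocabularies.
DRUGS = {"ibuprofen", "metformin"}
SYMPTOMS = {"headache", "nausea", "pain"}
POSITIVE_CUES = {"helped", "relieved", "eased"}
NEGATIVE_CUES = {"caused", "gave", "worsened"}
PRONOUNS = {"it"}


def extract_relations(post):
    """Return a set of (drug, symptom, polarity) triples found in a post."""
    relations = set()
    last_drug = None  # most recent drug mention, used as the antecedent of "it"
    for sentence in re.split(r"[.!?]+", post.lower()):
        tokens = re.findall(r"[a-z]+", sentence)
        # Naive stand-in for anaphora resolution: substitute the last drug
        # seen (possibly in an earlier sentence) for each pronoun.
        resolved = []
        for tok in tokens:
            if tok in DRUGS:
                last_drug = tok
            if tok in PRONOUNS and last_drug is not None:
                resolved.append(last_drug)
            else:
                resolved.append(tok)
        # One polarity per sentence, decided by the first matching cue class.
        if any(t in POSITIVE_CUES for t in resolved):
            polarity = "positive"
        elif any(t in NEGATIVE_CUES for t in resolved):
            polarity = "negative"
        else:
            continue  # no cue word, so no relation is asserted
        drugs = {t for t in resolved if t in DRUGS}
        symptoms = {t for t in resolved if t in SYMPTOMS}
        for drug in drugs:
            for symptom in symptoms:
                relations.add((drug, symptom, polarity))
    return relations


def f1_score(predicted, gold):
    """Harmonic mean of precision and recall over sets of triples."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    post = "I took ibuprofen for my headache. It relieved the pain. Later it gave me nausea."
    gold = {
        ("ibuprofen", "headache", "positive"),
        ("ibuprofen", "pain", "positive"),
        ("ibuprofen", "nausea", "negative"),
    }
    predicted = extract_relations(post)
    print(sorted(predicted))
    print(round(f1_score(predicted, gold), 2))
```

In this example the first sentence contains no cue word, so the headache relation in the gold set is missed; the pronoun substitution is what lets the second and third sentences be attributed to ibuprofen at all. A real system would need proper sentence splitting, a medical vocabulary, and a far more careful treatment of coreference and negation.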

Author information

Authors and Affiliations

Department of Computer Science and Information Systems, Birkbeck, University of London, London, WC1E 7HX, UK
Abul Hasan, Mark Levene & David J. Weston

Corresponding author

Correspondence to Abul Hasan.

Editor information

Editors and Affiliations

Imperial College London, London, United Kingdom
Niall Adams
Brunel University London, Uxbridge, United Kingdom
Allan Tucker
Birkbeck, University of London, London, United Kingdom
David Weston

About this paper

Cite this paper

Hasan, A., Levene, M., Weston, D.J. (2017). Natural Language Analysis of Online Health Forums. In: Adams, N., Tucker, A., Weston, D. (eds) Advances in Intelligent Data Analysis XVI. IDA 2017. Lecture Notes in Computer Science, vol 10584. Springer, Cham. https://doi.org/10.1007/978-3-319-68765-0_11

DOI: https://doi.org/10.1007/978-3-319-68765-0_11
Published: 04 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68764-3
Online ISBN: 978-3-319-68765-0
eBook Packages: Computer Science, Computer Science (R0)