Rich Ontology Extraction and Wikipedia Expansion Using Language Resources

Schönberg, Christian; Pree, Helmuth; Freitag, Burkhard

doi:10.1007/978-3-642-14246-8_17

Christian Schönberg²⁰,
Helmuth Pree²⁰ &
Burkhard Freitag²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6184))

Included in the following conference series:

International Conference on Web-Age Information Management

1739 Accesses

Abstract

Existing social collaboration projects contain a host of conceptual knowledge, but are often only sparsely structured and hardly machine-accessible. Using the well known Wikipedia as a showcase, we propose new and improved techniques for extracting ontology data from the wiki category structure. Applications like information extraction, data classification, or consistency checking require ontologies of very high quality and with a high number of relationships. We improve upon existing approaches by finding a host of additional relevant relationships between ontology classes, leveraging multi-lingual relations between categories and semantic relations between terms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Methodology for Creating a Community Corpus Using a Wikibase Knowledge Graph

Building Wikipedia Ontology with More Semi-structured Information Resources

Getting the Most Out of Wikidata: Semantic Technology Usage in Wikipedia’s Knowledge Graph

References

Nédellec, C., Nazarenko, A.: Ontology and Information Extraction: a necessary symbiosis. In: Buitelaar, P., Philipp Cimiano, B.M. (eds.) Ontology Design and Population, pp. 155–170. IOS Press, Amsterdam (2005)
Google Scholar
Marko, K., Schulz, S., Hahn, U.: MorphoSaurus - Design and evaluation of an interlingua-based, cross-language document retrieval engine for the medical domain. Methods of Information in Medicine 44(4), 537–545 (2005)
Google Scholar
Chang, Y.: Automatically constructing a domain ontology for document classification. In: Int. Conf. on Machine Learning and Cybernetics, vol. 4 (2007)
Google Scholar
Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: WWW 2007: Proc. of the 16th intern. conference on World Wide Web, pp. 697–706. ACM Press, New York (2007)
Chapter Google Scholar
Krötzsch, M., Vrandecic, D., Völkel, M., Haller, H., Studer, R.: Semantic Wikipedia. Journal of Web Semantics 5, 251–261 (2007)
Google Scholar
Miller, G.A.: WordNet – a lexical database for the English language (2006), http://wordnet.princeton.edu/
Rogers, J.E., Roberts, A., Solomon, W.D., van der Haring, E., Wroe, C.J., Zanstra, P.E., Rector, A.L.: GALEN ten years on: Tasks and supporting tools. In: Proceedings of MEDINFO 2001, pp. 256–260. IOS Press, Amsterdam (2001)
Google Scholar
Wu, F., Weld, D.S.: Automatically refining the Wikipedia infobox ontology. In: WWW 2008: Proceeding of the 17th international conference on World Wide Web, pp. 635–644. ACM, New York (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics and Mathematics, University of Passau, 94030, Passau, Germany
Christian Schönberg, Helmuth Pree & Burkhard Freitag

Authors

Christian Schönberg
View author publications
You can also search for this author in PubMed Google Scholar
Helmuth Pree
View author publications
You can also search for this author in PubMed Google Scholar
Burkhard Freitag
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
Lei Chen
Computer Department, Sichuan University, 610064, Chengdu, China
Changjie Tang
Department of Computer Science, Duke University, Box 90129, NC 27708-0129, Durham, USA
Jun Yang
College of Computer Science, Zhejiang University, 388 Yuhangtang Road, 310058, Hangzhou, China
Yunjun Gao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schönberg, C., Pree, H., Freitag, B. (2010). Rich Ontology Extraction and Wikipedia Expansion Using Language Resources. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds) Web-Age Information Management. WAIM 2010. Lecture Notes in Computer Science, vol 6184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14246-8_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-14246-8_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14245-1
Online ISBN: 978-3-642-14246-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics