Abstract
Existing social collaboration projects contain a wealth of conceptual knowledge, but are often only sparsely structured and hardly machine-accessible. Using the well-known Wikipedia as a showcase, we propose new and improved techniques for extracting ontology data from the wiki category structure. Applications such as information extraction, data classification, or consistency checking require ontologies of very high quality with a large number of relationships. We improve upon existing approaches by finding many additional relevant relationships between ontology classes, leveraging multilingual relations between categories and semantic relations between terms.
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schönberg, C., Pree, H., Freitag, B. (2010). Rich Ontology Extraction and Wikipedia Expansion Using Language Resources. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds) Web-Age Information Management. WAIM 2010. Lecture Notes in Computer Science, vol 6184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14246-8_17
DOI: https://doi.org/10.1007/978-3-642-14246-8_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14245-1
Online ISBN: 978-3-642-14246-8
eBook Packages: Computer Science, Computer Science (R0)