Abstract
Paraphrase rating is an important problem with very interesting applications in plagiarism detection, language translation, text summarization, question answering, web search and information retrieval. In this paper, we present an adaptive neuro-fuzzy inference system (ANFIS) based model for automatic rating of semantic equivalence of pairs of sentences. Using a corpus of human-judged sentence pairs, lexical similarity metrics are first computed. Then, a model is constructed for predicting the mean of the rates assigned by a number of human beings. The correlation with the actual ratings and the prediction errors are studied for individual metrics as well as the model output using a nonlinear logistic regression function. The experimental results showed that much higher correlations and low error rates can be achieved with the proposed method compared to those obtained with individual metrics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
STS MSR corpus, http://www.cs.york.ac.uk/semeval/task6/
Androutsopoulos, I., Malakasiotis, P.: A survey of paraphrasing and textual entailment methods. Journal of Artificial Intelligence Research 38(1), 135–187 (2010)
Bhagat, R., Hovy, E.: What is a paraphrase? Computational Linguistics 39(3), 462–472 (2013)
Cordeiro, J., Dias, G., Brazdil, P.: A metric for paraphrase detection. In: Proc. International Multi-Conference on Computing in the Global Information Technology (ICCGI) (2007)
Dolan, B., Quirk, C., Brockett, C.: Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources. In: Proc. 20th International Conference on Computational Linguistics (2004)
Finch, A., Hwang, Y.S., Sumita, E.: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. In: Proc. 3rd International Workshop on Paraphrasing, IWP 2005 (2005)
Hatzivassiloglou, V., Klavans, J.L., Eskin, E.: Detecting text similarity over short passages: Exploring linguistic feature combinations via machine learning. In: Proc. Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (1999)
Jang, J.S.: Anfis: Adaptive-network-based fuzzy inference system. IEEE Transactions on Systems, Man and Cybernetics 23(5/6), 665–685 (1993)
Kar, S., Das, S., Ghosh, P.K.: Applications of neuro fuzzy systems: A brief review and future outline. Applied Soft Computing 15, 243–259 (2014)
Madnani, N., Tetreault, J., Chodorow, M.: Re-examining machine translation metrics for paraphrase identification. In: Proc. Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2012)
Malakasiotis, P.: Paraphrase recognition using machine learning to combine similarity measures. In: Proc. of ACL-IJCNLP Student Research Workshop (2009)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press (2009)
Navarro, G.: A guided tour to approximate string matching. ACM Computing Surveys 33(1), 31–88 (2001)
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: A method for automatic evaluation of machine translation. In: Proc. 40th Annual Meeting on Association for Computational Linguistics (2002)
Pelan, A., Steinhaeuser, K., Chawla, N.V., de Alwis Pitts, D.A., Ganguly, A.R.: Empirical comparison of correlation measures and pruning levels in complex networks representing the global climate system. In: Proc. IEEE Symposium Series on Computational Intelligence and Data Mining (CIDM) (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
El-Alfy, ES.M. (2014). ANFIS-Based Model for Improved Paraphrase Rating Prediction. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds) Neural Information Processing. ICONIP 2014. Lecture Notes in Computer Science, vol 8834. Springer, Cham. https://doi.org/10.1007/978-3-319-12637-1_50
Download citation
DOI: https://doi.org/10.1007/978-3-319-12637-1_50
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12636-4
Online ISBN: 978-3-319-12637-1
eBook Packages: Computer ScienceComputer Science (R0)