
Correspondence Mining for the Identification of Relationships in Product Reviews

Mayra Ruano¹ and Javier Trejos²

¹ Backcountry, Inc., mayra.ruano@gmail.com
² CIMPA, University of Costa Rica, Costa Rica, javier.trejos@ucr.ac.cr

Abstract. We study customer product reviews published on the e-commerce
website Backcountry.com. We build on an existing natural language processing
framework, the “General Architecture for Text Engineering” (GATE), and apply
Correspondence Analysis to custom contingency tables. These contingency tables
are derived from the comments published by customers, with GATE used as a
filtering tool that selects appropriate words by means of specific grammatical
rules or regular expressions based on parts of speech.
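To make the table construction concrete, the following is a minimal sketch of how a product-by-word contingency table could be counted from review text. It does not use GATE: a hand-made adjective list stands in for the part-of-speech filter, and the reviews, product names, and the helper `build_contingency` are all hypothetical, invented for illustration only.

```python
import re
from collections import Counter

# Hypothetical toy data: (product, review text) pairs, invented for illustration.
REVIEWS = [
    ("jacket", "Warm and light, a great jacket."),
    ("jacket", "Light but not very warm."),
    ("boots", "Sturdy boots, heavy but warm."),
    ("tent", "Light tent, sturdy in wind."),
]

# Assumption: a fixed adjective list replaces the part-of-speech filter
# that a framework such as GATE would provide.
ADJECTIVES = {"warm", "light", "great", "sturdy", "heavy"}


def build_contingency(reviews, vocabulary):
    """Count how often each vocabulary word occurs in the reviews of each product."""
    counts = Counter()
    for product, text in reviews:
        # Lowercase the text and keep alphabetic tokens only.
        for token in re.findall(r"[a-z]+", text.lower()):
            if token in vocabulary:
                counts[(product, token)] += 1
    rows = sorted({product for product, _ in reviews})
    cols = sorted(vocabulary)
    table = [[counts[(r, c)] for c in cols] for r in rows]
    return rows, cols, table


rows, cols, table = build_contingency(REVIEWS, ADJECTIVES)
for name, counts_row in zip(rows, table):
    print(name, dict(zip(cols, counts_row)))
```

Each row of `table` is a product and each column a retained word, which is the shape of input that Correspondence Analysis expects.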
Our work focuses on two main case studies. The first consists of identifying
relationships between adjectives in customer reviews and the corresponding
products. The second looks for relationships between products and users’
perceptions of product size. We obtain a visual representation of customers’
perceptions and provide a better understanding of the relationship between
information derived from reviews and specific products.
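As an illustration of how a visual representation can be obtained from such a table, the sketch below runs a textbook Correspondence Analysis through the singular value decomposition of the standardized residuals. It is not the authors' implementation: the counts in `TABLE`, the labels, and the function name `correspondence_analysis` are assumptions made for the example, and NumPy is assumed to be available.

```python
import numpy as np

# Hypothetical product-by-adjective counts (rows: products, columns: adjectives).
PRODUCTS = ["boots", "jacket", "tent"]
ADJECTIVES = ["great", "heavy", "light", "sturdy", "warm"]
TABLE = [
    [0, 1, 0, 1, 1],
    [1, 0, 2, 0, 2],
    [0, 0, 1, 1, 0],
]


def correspondence_analysis(table, n_dims=2):
    """Return principal row and column coordinates and the principal inertias.

    Assumes every row and every column of the table has a positive total.
    """
    counts = np.asarray(table, dtype=float)
    P = counts / counts.sum()            # correspondence matrix
    r = P.sum(axis=1)                    # row masses
    c = P.sum(axis=0)                    # column masses
    expected = np.outer(r, c)            # independence model
    S = (P - expected) / np.sqrt(expected)   # standardized residuals
    U, sigma, Vt = np.linalg.svd(S, full_matrices=False)
    # Principal coordinates: scale singular vectors by the singular values
    # and divide by the square roots of the masses.
    row_coords = (U * sigma) / np.sqrt(r)[:, None]
    col_coords = (Vt.T * sigma) / np.sqrt(c)[:, None]
    inertia = sigma ** 2                 # principal inertias per axis
    return row_coords[:, :n_dims], col_coords[:, :n_dims], inertia


row_coords, col_coords, inertia = correspondence_analysis(TABLE)
for label, xy in zip(PRODUCTS, row_coords):
    print(label, np.round(xy, 3))
for label, xy in zip(ADJECTIVES, col_coords):
    print(label, np.round(xy, 3))
```

Plotting the two coordinate sets on the same axes gives the usual Correspondence Analysis map, in which products and words that lie in the same direction from the origin tend to co-occur more often than independence would predict.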

