Generating Recommendation Status of Electronic Products from Online Reviews

Abstract

The need for effective and efficient mining of online reviews cannot be overemphasized. This position is as a result of the overwhelmingly large number of reviews available online which makes it cumbersome for customers to read through all of them. Hence, the need for online web review mining system which will help customers as well as manufacturers read through a large number of reviews and provide a quick description and summary of the performance of the product. This will assist the customer make better and quick decision, and also help manufacturers improve their products and services. This paper describes a research work that focuses on mining the opinions expressed on some electronic products, providing ranks or ratings for the features, with the aim of summarizing them and making recommendations to potential customers for better online shopping. A technique is also proposed for scoring segments with infrequent features. The evaluation results using laptops demonstrate the effectiveness of these techniques.

Share and Cite:

Ojokoh, B. , Olayemi, O. and Adewale, O. (2013) Generating Recommendation Status of Electronic Products from Online Reviews. Intelligent Control and Automation, 4, 1-10. doi: 10.4236/ica.2013.41001.

1. Introduction

The rapid growth of the web has led to rapid expansion of e-commerce among other things. More customers are turning towards online shopping because it is convenient, reliable, and fast. In order to enhance customer shopping experience, it has become a common practice for online merchants to enable their customers to write reviews on products that they have purchased. Customer reviews of a product are generally considered more honest, unbiased and comprehensive than descriptions provided by the seller. In fact, review comments are one of the most powerful and expressive sources of user preferences. Furthermore, reviews written by other customers describe the usage experience and perspective of (non-expert) customers with similar needs. They give customers a voice, increase consumer confidence, enhance product visibility, and can dramatically increase sales [1,2].

With more and more users becoming comfortable with the Web, an increasing number of people are writing reviews. As a result, the number of reviews that a product receives grows rapidly [3]. Moreover, the consumer reviews are in free form text and consumers prefer to use natural language to express their opinion. It is difficult for a program to “understand” the text information and use these data. Many reviews are long, and as such, it is not an easy task for a potential customer to make a decision whether or not to purchase a product based on the reviews he reads. It is therefore a very important and challenging problem to mine these reviews and produce a summary of them and also propose a recommendation decision to the potential customer.

A host of research works have been proposed to profer solution to problems related to mining and summarizing customer reviews [4,5], which is called opinion mining or sentiment analysis. Opinion Mining has two main research directions, document level opinion mining and feature level opinion mining [6]. Document level mining involves determining the document’s polarity by calculating the average semantic orientation of extracted phrases. In feature level opinion mining, reviews are summarized and classified by extracting high frequency feature and opinion keywords. Feature-opinion pairs are identified by using a statistical approach or labeling approach and dependency grammar rules to handle different kinds of sentence structures. Generally, feature level opinion mining has greater precision over the document level and uses basically product features in analysis and evaluation. The main tasks present in most past and current research works are: to find product features that have been commented on by reviewers [7] and to decide whether the comments are positive, negative or neutral [3]. Nevertheless, it is important to discover how positive or negative the comments are and further, make a concise decision about the product (recommended or not) to the potential customer. This work makes a contribution by classifying the opinions about each feature, hence showing how negative or positive it is. It also proposes a technique to include comments on infrequent features into the recommendation decision for electronic products.

The remainder of this paper is organized as follows. Section 2 contains a summary of related works. We describe the proposed technique in Section 3. Section 4 contains the experiments and evaluation, while Section 5 concludes the work and presents proposed future research work.

2. Related Work

Opinion mining has been studied by many researchers in past years. The earliest research on opinion mining was on identifying opinion (or sentiment) bearing words. Hatzivassiloglou and McKeown [8] identified several linguistic rules that can be exploited to identify opinion words and their orientations from a large corpus. The work was applied, extended and improved in [9].

Another major development in the area of opinion mining is sentiment classification of product reviews at the document level [10]. The objective of this task is to classify each review document as expressing a positive or a negative sentiment about an object (such as a movie, camera, car).

Feature-based opinion mining and summarization have been proposed by a number of researchers. Hu & Liu [7] proposed a technique based on association rule mining to extract product features. The main idea is that people often use the same words when they comment on the same product features. Then frequent itemsets of nouns in reviews are likely to be product features while the infrequent ones are less likely to be product features. This work also introduced the idea of using opinion words to find additional (often infrequent) features. Liu et al. [9] improved upon Hu’s work by proposing a technique based on language pattern mining to identify product features from pros and cons in reviews in the form of short sentences. They also made an effort to extract implicit features. Moreover, [11] proposed feature extraction for capturing knowledge from product reviews. The output of Hu and Liu’s system served as input to their system, and the input was mapped to the user-defined taxonomy features hierarchy thereby eliminating redundancy and providing conceptual organization. To identify the expressions of opinions associated with features, Hu & Liu focused on adjacent adjectives that modify feature nouns or noun phrases. They used adjacent adjectives as opinion words that are associated with features.

Popescu and Etzioni [12] investigated the same problem. Their algorithm requires that the product class is known. The algorithm determines whether a noun/noun phrase is a feature by computing the pointwise mutual information (PMI) score between the phrase and class specific discrimination. This work first used part-whole patterns for feature mining, but it finds part-whole based features by searching the Web and querying the Web is time-consuming. Qiu et al. [13] proposed a double propagation method, which exploits certain syntactic relations of opinion words and features, and propagates through both opinion words and features iteratively. The extraction rules are designed based on different relations between opinion words and features, and among opinion words and features themselves. Dependency grammar was adopted to describe these relations.

Another related work, but using a bit different approach is that of [5]. They proposed a method to extract product features from user reviews and generate a review summary, using product specifications rather than resources like segmenter, POS tagger or parser. At the feature extraction stage, multiple specifications are clustered to extend the vocabulary of product features. Hierarchy structure information and unit of measurement information are mined from the specification to improve the accuracy of feature extraction. At summary generation stage, hierarchy information in specifications is used to provide a natural conceptual view of product features.

Another area is mining of comparative sentences. It basically consists of identifying what features and objects are compared and which objects are preferred by their authors (opinion holders) [14]. Another related work involving comparison is that of [15] that compare reviews of different products in one category to find the reputition of the target product. However, it does not summarize reviews, and it does not mine product features on which the reviewers have expressed their opinions. Although they do find some frequent phrases indicating reputations, these phrases may not be product features (for example, “doesn’t work”, “benchmark result” and “no problem(s)”).

Feng et al. [16] and Ojokoh and Kayode [17] also proposed relateds method to solve the problem of opinion mining and applied this for some electronic products from amazon. There are a few differences between the works and ours. The method for determing the opinion polarity and aggregation is different from ours. Moreover, [16] only stopped at classifying the aggregated opinion as negative or positive and adopted that for recommendation. Also, their work did not consider infrequent feature identification. This works proposes the adoption of a threshold value for recommendation based on experiments with amazon rated views.

3. The Proposed Method

Figure 1 gives the architectural overview of the proposed system. In summary, the system executes the following steps:

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] L. Zhang, B. Liu, S. H. Lim, S. H. and E. O’Brien-Strain, “Extracting and Ranking Product Features in Opinion Documents,” Proceedings of the 23rd International Conference on Computational Linguistics (COLING), Beijing, 23-27 August 2010, pp. 1462-1470.
[2] S. Aciar, “Mining Context Information from Consumer’s Reviews,” Proceedings of the 2nd Workshop on Contex-Aware Recommender Systems, Barcelona, 26 September 2010.
[3] X. Ding, B. Liu and P. S. Yu, “A Holistic Lexicon-Based Approach to Opinion Mining,” Proceedings of WSDM 08, California, Palo Alto, 11-12 February 2008, pp. 231-239. doi:10.1145/1341531.1341561
[4] G. Somprasertsri and P. Lalitrojwong, “Mining Feature-Opinion in Online Customer Reviews for Opinion Summarization,” Journal of Universal Computer Science, Vol. 16, No. 6, 2009, pp. 938-955.
[5] X. Meng and H. Wang, “Mining User Reviews: From Specification to Summarization,” Proceedings of the ACLIJCNLP 2009 Conference Short Papers, 2009, pp. 177-180. doi:10.3115/1667583.1667637
[6] W. Jin and H. H. Ho, “A Novel Lexicalized HMM-Based Learning Framework for Web Opinion Mining,” Proceedings of the 26th International Conference on Machine Learning, Montreal, 14-18 June 2009.
[7] M. Hu and B. Liu, “Mining and Summarizing Customer Reviews,” Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, 22-25 August 2004, pp. 168-177.
[8] V. Hatzivassiloglou and K. McKeown, “Predicting the Semantic Orientation of Adjectives,” Proceedings of the 35th Annual Meeting of the Association for Computational Lingusitics, Madrid, 7-10 July 1997, pp. 174-181. doi:10.3115/976909.979640
[9] B. Liu, M. Hu and J. Cheng, “Opinion Observer: Analyzing and Comparing Opinions on the Web. Proceedings of International World Wide Web Conference (WWW’05), New York, 2005, pp. 342-351. doi:10.1145/1060745.1060797
[10] J. Wiebe and E. Riloff, “Creating Subjective and Objective sentence classifiers from unannotated texts,” Proceedings of International Conference on Intelligent Text Processing and Computational Linguistics (CICLing’05), Mexico City, 13-19 February 2005, pp. 486-497. doi:10.1007/978-3-540-30586-6_53
[11] G. Carenini, R. T. Ng and E. Zwart, “Extracting Knowledge from Evaluative Text,” Proceedings of the Third International Conference on Knowledge Capture, Banff, 2-5 October 2005, pp. 11-18. doi:10.1145/1088622.1088626
[12] M. Popescu and O. Etzioni, “Extracting Product Features and Opinions from Reviews,” Proceedings of the Conference on Empirical Methods in Natural Language Processing EMNLP ‘05, Vancouver, 6-8 October 2005, pp. 339-346. doi:10.3115/1220575.1220618
[13] G. Qiu, B. Liu, J. Bu and C. Chen, “Expanding Domain Sentiment Lexicon through Double Propagation,” International Joint Conferences on Artificial Intelligence, Pasadena, 11-17 July 2009.
[14] G. Ganapathibhotla and B. Liu, “Identifying Preferred Entities in Comparative Sentences,” Proceedings of the 22nd International Conference on Computational Linguistics (COLING’08), Manchester, 2008.
[15] S. Morinaga, K. Yamanishi, K. Tateishi and T. Fukushima, “Mining Product Reputations on the Web,” Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, 23-26 July 2002.
[16] S. Feng, M. Zhang and Y. Zhang, “Recommended or Not Recommended? Review Classification through Opinion Extraction,” Proceedings of APWEB Conference, Busan, 6-8 April 2010.
[17] B. A. Ojokoh and O. Kayode, “A Feature-Opinion Extraction Approach to Opinion Mining,” Journal of Web Engineering, Vol. 11, No. 1, 2012, pp. 051-063.
[18] http://reviewcenter.com
[19] Stanford Parser. http://nlp.stanford.edu/software/lex-parser.shtml
[20] P. D. Turney, “Thumbs up or Thumbs down? Semantic Orientation Applied to Unsupervised Classification of Reviews,” Proceedings of the Conference of the Association of Computational Lingustics, Howard, 24 August-1 September 2002, pp. 417-424.
[21] J. Wiebe, R. Bruce and T. O’Hara, “Development and Use of a Gold Standard Data Set for Subjectivity Classifications,” Proceedings of the Conference of the Association of Computational Lingustics, College Park, 20-26 June 1999.
[22] R. Bruce and J. Wiebe, “Recognizing Subjectivity: A Case Study of Manual Tagging,” Natural Language Engineering, Vol. 5, No. 2, 2000 pp. 187-205. doi:10.1017/S1351324999002181
[23] Amazon Web Service. http://www.aws.amazon.com

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.