Abstract: Recommendation systems are a major reason for the success of most social media companies and
e-commerce sites. Recommending items to users is an interesting and challenging task: it helps to generate
revenue, to increase the number of users, and to reduce the time spent searching for a particular item. A
recommendation system builds user interest and eventually increases the popularity of a site. The huge number
of items (products, users, movies, songs, hotels, etc.) and their feature sets makes it hard to predict
accurately which items suit a user. It is important to keep all historic data of the user, as well as all
information about the items, to generate recommendations. In this paper, the personality of the user is
combined with the most popular recommendation techniques, collaborative filtering (CF) and content-based
filtering (CB), and applied to the Amazon review data set. In the first module, the personality of the user
is calculated by applying the Big Five model to the user's Twitter account. In the second module,
collaborative filtering is used to generate recommendations based on the historic information of the user,
whereas in the third module, content-based filtering is used to generate recommendations based on the feature
set of the item. The Pearson correlation algorithm is applied in both modules and rankings are generated.
Finally, the union of both vector spaces is taken as the final recommendation.
Keywords: Recommendation System; Collaborative Filtering (CF); Content Based Filtering (CB); Similarity Measure; Social Media
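As a rough illustration of the last two steps named in the abstract, the sketch below computes a Pearson correlation between two users' ratings and then takes the union of two ranked lists. The function names, the example ratings, and the rule of keeping first-seen order when merging are assumptions made for illustration; the paper does not specify how the union is ordered.

```python
import math

def pearson(ratings_a, ratings_b):
    """Pearson correlation over the items that both users have rated."""
    common = sorted(set(ratings_a) & set(ratings_b))
    n = len(common)
    if n < 2:
        return 0.0  # not enough shared items to correlate
    a = [ratings_a[i] for i in common]
    b = [ratings_b[i] for i in common]
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant rating vector has no defined correlation
    return cov / math.sqrt(var_a * var_b)

def union_of_rankings(cf_ranked, cb_ranked):
    """Union of two ranked lists; first-seen order is an assumed tie rule."""
    seen, merged = set(), []
    for item in cf_ranked + cb_ranked:
        if item not in seen:
            seen.add(item)
            merged.append(item)
    return merged

# Hypothetical ratings keyed by item id.
alice = {"book1": 5, "book2": 3, "book3": 1}
bob = {"book1": 4, "book2": 3, "book3": 2}
print(pearson(alice, bob))
print(union_of_rankings(["i1", "i2"], ["i2", "i3"]))
```

Users with no overlap in rated items get a correlation of 0.0 here, which is one simple way to handle the sparse rating data the paper works with.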
__________________________________________________*****_________________________________________________
According to the user's ratings of items, the user is classified into the positive or negative category.
To compensate for best-selling items, the algorithm multiplies the vector components by the inverse
frequency 'IFT', which makes less well-known items much more relevant. For almost all customers this
vector is extremely sparse, because there are many unknown ratings. The density of the rating matrix is
often < 1%. The task of the recommender system is to predict the user's rating for an item the user has
not yet rated [1].

Euclidean distance:
This is the most widely used measure of the proximity between users. Euclidean distance is also known
simply as distance. The Euclidean distance between two points is the length of the straight-line path
connecting them. It is well suited to dense and continuous data. By the Pythagorean theorem, the distance
between two points X and Y is given by

$$d(X, Y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}$$

where x_i and y_i are the i-th coordinates of X and Y and n is the number of dimensions.
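The Euclidean distance described above can be sketched in a few lines. This is a minimal illustration, assuming each user is represented as an equal-length list of ratings for the same items; the function name and the example ratings are hypothetical and not taken from the paper.

```python
import math

def euclidean_distance(x, y):
    """Straight-line distance between two equal-length numeric vectors."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

# Hypothetical ratings of two users for the same three items.
user_x = [5.0, 3.0, 1.0]
user_y = [2.0, 3.0, 5.0]
print(euclidean_distance(user_x, user_y))  # 5.0
```

A smaller distance means the two users rated the items more alike. Because the measure needs a value in every position, it fits dense data, as noted above.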
Fig. 1. System Architecture

The user's social behavior is used to create the personalized profile. The personality of the user is
calculated using the Twitter API. If the user does not have a Twitter account, or does not want to share
their Twitter information, the personality is calculated using the Big Five model.

2) Data Format:
reviewerID: A3UTQPQPM4TQO0,
asin: 0000013714,
reviewerName: betty burnett,
helpful: [0, 0],
reviewText: We have many of the old, old issue. But the number had depleted. There were not enough
books to allow us to use them regularly. With the additional supply the books will be used more often.
They arre a good old standby for gospel singing.,
overall: 5.0,
summary: I was disappointed that you would only allow me to purchase 4 when your inventory showed that you had 14
IJRITCC | June 2016, Available @ http://www.ijritcc.org
________________________________________________________________________________________________________
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 4 Issue: 6 349 - 354
_______________________________________________________________________________________________
available.,
unixReviewTime: 1374883200,
reviewTime: 07 27, 2013

Fig. 2. Data format of the Amazon review dataset

Where, similarity distance measures are as follows.
product/productId: asin, e.g. amazon.com/dp/B00006HAXW
product/title: title of the product
product/price: price of the product
review/userId: id of the user, e.g. A1RSDE90N6RSZF
review/profileName: name of the user
review/helpfulness: fraction of users who found the review helpful
review/score: rating of the product
review/time: time of the review (unix time)
review/summary: review summary
review/text: text of the review

3) Data Cleaning:
Data cleaning is the key to extracting meaningful associations between the features extracted from the
sources mentioned above. Using Python and the Amazon API, the required data is obtained by category and
then grouped into the number of products reviewed by the same user and the number of reviewers who
reviewed the same product. The Watson API helps clean the Twitter data through various pre-processing
methods.

4) A Big Five Personality Questionnaire:
Here is the list of questions used to predict the personality of a user with no Twitter account. The user
gives one response to each question. The five options are:
Agree strongly
Agree a little
Neither agree nor disagree
Disagree a little
Disagree strongly
From the answers to the questionnaire, the Big Five module calculates the personality of the user.
The formula to calculate the five factors of personality is:
Openness = 5R + 10 (add the reverse of the response to the 5th question and the response to the 10th question)
Conscientiousness = 3R + 8
Extraversion = 1R + 6
Agreeableness = 7R + 2
Neuroticism = 4R + 9
(R = item is reverse scored.)

V. RELATED WORK
Many popular recommendation systems use the collaborative filtering and content-based filtering
techniques. Memory-based collaborative filtering is easy to implement and widely used by many systems,
but it has certain problems: it has limited scalability for large datasets, it works poorly on sparse
data, and its processing time is very high. The model-based approach uses machine learning and
mathematical concepts, which increases the performance of the system.

Recently, users are no longer considered as a single entity when finding out their interests. Instead,
studies show that even though users are multifaceted, their choices can be predicted with the help of
social networks such as Twitter and Facebook. A user can have similar tastes to another person, and can
even depend on the choices of friends and family in the network. Various giants in the software industry,
such as Amazon, have also started using trust networks to recommend items to the user. This solves the
sparsity problem of the dataset.

VI. CONCLUSION
The architecture for PRec is highly modular and enables using various algorithms under the business
knowledge layer. We have designed an interface for entering business rules that can be used for explicit
user feedback.
For the proof of concept, the web system is implemented using the Django framework in Python.
Recommendations based on personality are an effective way of providing dynamism. Moreover, providing
user feedback gives the user a sense of control over what is being shown to them. The advantage of this
model is that, since general algorithms are used, the recommender system is scalable and can easily be
integrated with any system.
Though the system comes with many advantages, a few improvements can still be made to it. Using
multiprocessing modules, all the cores of the hardware can be used successfully.
Despite these shortcomings, a personality-based recommender system is an effective way of giving
recommendations to users.

REFERENCES
[1] Yeyun Gong, Qi Zhang, Xuyang Sun, Xuanjing Huang, "Who Will You @?," Shanghai Key Laboratory of Data
Science, School of Computer Science, Fudan University, CIKM'15, October 19-23, 2015, Melbourne,
Australia. 2015 ACM. ISBN 978-1-4503-3794-6/15/10 ...$15.00. DOI: http://dx.doi.org/10.1145/2806416.2806458.
[2] Peng Cui, Zhiyu Wang and Zhou Su, "What Videos Are Similar with You? Learning a Common Attributed
Representation for Video Recommendation," Department of Computer Science and Technology, Tsinghua
University, Beijing, China, cuip@tsinghua.edu.cn, zy-wang08@mails.tsinghua.edu.cn, suhmily@gmail.com,
MM'14, November 3-7, 2014, Orlando, Florida, USA. Copyright 2014 ACM 978-1-4503-3063-3/14/11 $15.00.
http://dx.doi.org/10.1145/2647868.265494
[3] Bo-Wen Zhang, Bin Geng, Xu-Cheng Yin, Xiao-Ping Cui, Jiao Qu, Fang Zhou, Li Song, Hong-Wei Hao,
"Social Book Search Reranking with Generalized Content-Based Filtering," Department of Computer Science
and Technology, School of Computer and Communication Engineering, University of Science and Technology
Beijing, Beijing 100083, China. Copyright 2014 ACM 978-1-4503-2598-1/14/11 ...$15.00.
[4] Hernan Badenes, Mateo N. Bengualid, Jilin Chen, Liang Gou, Eben Haber, Jalal Mahmud, Jeffrey W
Nichols, Aditya Pal, Jerald Schoudt, Barton A Smith, Ying Xuan, Huahai Yang, Michelle X. Zhou, "System U:
Automatically Deriving Personality Traits from Social Media for People Recommendation," IBM, Buenos
Aires, Argentina, RecSys'14, October 6-10, 2014, Foster City, Silicon Valley, CA, USA.
[5] Fatemeh Vahedian, "Weighted Hybrid Recommendation for Heterogeneous Networks," Center for Web
Intelligence, DePaul University, Chicago, IL 60604, fvahedia@cs.depaul.edu, RecSys'14, October 6-10,
2014, Foster City, Silicon Valley, CA, USA. ACM 978-1-4503-2668-1/14/10 ...$15.00.
[6] International Journal of Information Technology and Knowledge Management, July-December 2012, Volume 5, No.
2, pp. 443-445, "Algorithm for Research Paper Recommendation System," Megha Jain.
[7] Ido Guy, Sigalit Ur, Inbal Ronen, Adam Perer, Michal Jacovi, "Do You Want to Know? Recommending
Strangers in the Enterprise," CSCW 2011, March 19-23, 2011, Hangzhou, China. Copyright 2011 ACM
978-1-4503-0556-3/11/03 ...$10.00.
[8] Zhejiang Key Laboratory of Service Robot: Combining Social Media Information and Music Content,
College of Computer Science, Zhejiang University, MM'10, October 25-29, 2010, Firenze, Italy. Copyright
2010 ACM 978-1-60558-933-6/10/10 ...$10.00
[9] Nitin Agarwal, Ehtesham Haque, Huan Liu, and Lance Parsons, "Research Paper Recommender Systems: A
Subspace Clustering Approach," Arizona State University, Tempe AZ 85281, USA. Supported by grants from
Prop 301 (No. ECR A601) and CEINT 2004.
[10] Greg Linden, Brent Smith, and Jeremy York, Amazon.com, "Amazon.com Recommendations: Item-to-Item
Collaborative Filtering," January-February 2003, published by the IEEE Computer Society.
[11] Jun Wang, Arjen P. de Vries, Marcel J.T. Reinders, "Unifying User-based and Item-based
Collaborative Filtering Approaches by Similarity Fusion," Information and Communication Theory Group,
Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology,
Mekelweg 4, 2628 CD Delft, T@tudelft.nl, arjen@acm.org
[12] Ido Guy, Naama Zwerdling, Inbal Ronen, David Carmel, Erel Uziel, "Social Media Recommendation
based on People and Tags," IBM Research Lab, Haifa 31905, Israel, {ido,naamaz,inbal,carmel,erelu}@il.ibm.com
[13] Pasquale Lops, Marco de Gemmis and Giovanni Semeraro, "Content-based Recommender Systems: State of
the Art and Trend," Department of Computer Science, University of Bari Aldo Moro, Via E. Orabona, 4, Bari
(Italy), lops@di.uniba.it
[14] Marco Tulio Ribeiro and Nivio Ziviani, "Multi objective Pareto-Efficient Approaches for
Recommender Systems," Computer Science Department, Universidade Federal de Minas Gerais, and Zunnit
Technologies, Brazil; Edleno Silva de Moura, Computer Science Department.
[15] Zhe Zhao, Zhiyuan Cheng, Lichan Hong, Ed H. Chi, "Improving User Topic Interest Profiles by
Behavior Factorization," Department of EECS, University of Michigan, Ann Arbor, MI, USA; Google Inc,
Mountain View, CA, USA, zhezhao@umich.edu, zcheng@google.com, lichan@google.com, edchi@google.com
[16] Hui Li, Dingming Wu, Nikos Mamoulis, "A Revisit to Social Network-Based Recommender Systems,"
Department of Computer Science, The University of Hong Kong, Hong Kong, {hli2,dmwu,nikos}@cs.hku.hk
[17] Prem Melville and Vikas Sindhwani, "Recommender Systems," IBM T.J. Watson Research Center,
Yorktown Heights, NY 10598, {pmelvil,vsindhw}@us.ibm.com
[18] Acknowledgements: slides are adapted from the lecture notes of the book "Recommender Systems: An
Introduction," Dietmar Jannach, Markus Zanker, Alexander Felfernig, and Gerhard Friedrich. Sinno Jialin
Pan, Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore
[19] J. Ben Schafer, Ph.D., "The Application of Data-Mining to Recommender Systems," University of
Northern Iowa
[20] Bela Gipp, Jöran Beel, Christian Hentschel, "Scienstein: A Research Paper Recommender System,"
Otto-von-Guericke University, Dept. of Computer Science, Magdeburg, Germany; Fraunhofer Institute for
Telecommunications, Berlin, Germany, Bela@Gipp.com, Joeran@Beel.org, christian.hentschel@hhi.fraunhofer.de
[21] Aaditeshwar Seth and Jie Zhang, "A Social Network Based Approach to Personalized Recommendation of
Participatory Media Content," School of Computer Science, University of Waterloo, ON, Canada, WorldBank 2002.
[22] Kwanghee Hong, Hocheol Jeon, Changho Jeon, "Personalized Research Paper Recommendation System
using Keyword Extraction Based on User Profile," Department of Computer Science & Engineering, Hanyang
University, Korea, khhong@cse.hanyang.ac.kr; Agency for Defense Development, Seoul, Korea,
hochuls@chollian.net; Department of Computer Science & Engineering, Hanyang University, Korea,
chj5193@hanyang.ac.kr