Project Team:
Patil Rahul Reddy (16001A0501)
Mummadi Ruthwick Reddy (16001A0550)
Muppidi Snigdha (16001A0554)
INTRODUCTION
Sentiment analysis is a logical evaluation of people’s opinions and
emotions. It is currently an active research area in Natural Language
Processing (NLP) and Text Mining.
Sentiment analysis is used to keep the spread of false news in check,
to remove abusive content, to understand customer experience,
and to monitor social media.
The number of social media users is increasing daily, so the
need for sentiment analysis cannot be overemphasized.
WHY?
Many people use social media nowadays, and abusive content is on the
rise as a result. A growing number of people are affected by it.
To handle these kinds of scenarios, an application that can detect
abusive content in text data needs to be developed.
SCOPE:
This model works only with emotion-oriented information-seeking systems.
This model works only with text data, not with multimedia data.
WORKING OF SENTIMENT ANALYSIS
IMPLEMENTATION:
INPUT
Text data for training and testing
Data Cleaning
Tokenization
Abbreviation Treatment
Stop Words Removal
Bad-words Synonyms Mapping
Punctuation Removal
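The data cleaning steps named above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the lookup tables (ABBREVIATIONS, STOP_WORDS, BAD_WORD_VARIANTS), their entries, the function name clean, and the order in which the steps run are all assumptions made for the example.

```python
import string

# Hypothetical lookup tables; the slides do not list the actual entries.
ABBREVIATIONS = {"u": "you", "r": "are", "idk": "i do not know"}
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "of", "and", "i", "do"}
BAD_WORD_VARIANTS = {"stup1d": "stupid", "id10t": "idiot", "l0ser": "loser"}


def clean(text):
    """Return the cleaned tokens of one comment (assumed step order)."""
    # Tokenization: lowercase and split on whitespace.
    tokens = text.lower().split()
    cleaned = []
    for token in tokens:
        # Punctuation removal: strip leading and trailing punctuation only,
        # so in-word obfuscation such as "stup1d" stays recognizable.
        token = token.strip(string.punctuation)
        if not token:
            continue
        # Bad-words synonyms mapping: normalize known variants.
        token = BAD_WORD_VARIANTS.get(token, token)
        # Abbreviation treatment: expand chat shorthand into full words.
        expanded = ABBREVIATIONS.get(token, token).split()
        # Stop words removal.
        cleaned.extend(word for word in expanded if word not in STOP_WORDS)
    return cleaned


print(clean("U r such a stup1d l0ser!!!"))
```

Mapping variants to a canonical form before feature extraction means an obfuscated spelling and the plain word end up as the same feature, which is presumably the purpose of the mapping step.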
APPLICATION FLOW
TF-IDF Transformation
Modelling
Classification
RESULT
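To make the TF-IDF Transformation step in the flow above concrete, here is a small sketch that computes TF-IDF weights for already-cleaned tokens. It assumes the textbook definition (term frequency times log of inverse document frequency); the slides do not say which variant the project used, and library implementations typically add smoothing and normalization. The function name tf_idf and the sample documents are illustrative only.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Compute textbook TF-IDF weights for a list of tokenized documents."""
    n_docs = len(documents)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({
            # tf = share of the document taken by the term,
            # idf = log(total documents / documents containing the term).
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


docs = [["you", "stupid"], ["you", "nice"]]
for doc_weights in tf_idf(docs):
    print(doc_weights)
```

A term that appears in every document (here "you") gets weight zero, while terms that distinguish one comment from another receive a positive weight. These weights are what the Modelling and Classification steps would consume as features.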
INSIGHTS:
53% of comments that contain abusive words are not actually abusive.
In one of every five comments, variants of abusive words are used to insult
rather than the abusive words themselves.
Typing errors are a common part of chat, but the model penalizes them heavily
when they resemble abusive words.
SUMMARY:
The model has an accuracy of 91.2% on training data and 81% on
cross-validation.
Logistic Regression was found to be the most suitable model in
comparison with the commonly used Naïve Bayes and SVM.
1500 relevant features were selected using the chi-square test.
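The combination summarized above (TF-IDF features, chi-square feature selection, Logistic Regression, cross-validation) could be wired together along these lines. This is a sketch assuming scikit-learn, which the slides do not name. The toy comments, the labels, the build_model helper, and the cv=2 setting are placeholders, and the scores this toy run produces say nothing about the reported results.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

# Toy placeholder data (not the project's dataset); 1 = abusive, 0 = not.
comments = [
    "you are a stupid idiot", "what a stupid loser", "shut up you idiot",
    "you are a pathetic loser", "thanks for the helpful answer",
    "have a nice day", "great work on the project", "this is a nice idea",
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]


def build_model(k="all"):
    """Hypothetical helper: TF-IDF -> chi-square selection -> classifier."""
    return make_pipeline(
        TfidfVectorizer(),
        # The slides report k=1500; the toy vocabulary is far smaller,
        # so this sketch keeps all features by default.
        SelectKBest(chi2, k=k),
        LogisticRegression(max_iter=1000),
    )


model = build_model()
# Cross-validated accuracy; feature selection is refit inside each fold.
scores = cross_val_score(model, comments, labels, cv=2)
# Training accuracy, measured on the same data the model was fit on.
model.fit(comments, labels)
train_accuracy = model.score(comments, labels)
print(scores.mean(), train_accuracy)
```

Keeping the vectorizer and the feature selector inside the pipeline means they are fit only on each training fold, so the cross-validation score is not inflated by information from the held-out comments. A gap between training and cross-validation accuracy, like the one reported above, is what this kind of comparison is meant to reveal.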