Raphael Vasquez, Molly Eskelson,
Kabrina Ramnath, Omar Trejo
The dataset comprises of three main tables:
- The code in this project is written in Python 3.6.6 :: Anaconda custom (64-bit). The following additional libraries have been used:
- nltk for the Vader Sentiment Analyzer. The Vader lexicon has been downloaded nltk.downloader.download('vader_lexicon')
- wordcloud to generate wordclouds from the text of the reviews
-from nltk.corpus import stopwords # stopwords to detect language
- nltk.downloader.download('vader_lexicon')
-from nltk import word_tokenize
-from nltk.tokenize import RegexpTokenizer
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
from gensim.corpora.dictionary import Dictionary
from gensim.models.tfidfmodel import TfidfModel
from gensim.models.ldamodel import LdaModel
What we did- used text analysis to understand specific customer's reviews and to understand the root cause of customer complainants.
After cleaning up the data. we ran a sentiment analyzer over each comment a reviewer. We then separated those scores at .25 with less the .25 being negative reviews and above .25 being positive reviews.
Then we ran a topic model over the two sets separately in the hope to gage some insight on what the overall talking points between the negative and positive reviews. But we didn’t do a good enough job filtering out the stop words. So other languages, numbers, punctuation , and stem words (*awesome', awwwwesome', 'awesomesause') made it through.
If we had more time it would have been better for us to filter all words that weren’t adjectives or nouns in the comments. We already had the context of the words due to polarity. At that point we would be looking at what the cause is.
The Vader Sentiment Analyzer is due C.J. Hutto and Eric Gilbert fro the paper "VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text", available here
Melbourne Reviews-IBM Cloud Data Services/Economy & Business
          
 Inside Airbnb:http://creativecommons.org/publicdomain/zero/1.0
Melbourne Listings-IBM Cloud Data Services/Economy & Business
          Inside Airbnb:http://creativecommons.org/publicdomain/zero/1.0
 Melbourne Calendar-IBM Cloud Data Services/Economy & Business
Inside Airbnb:http://creativecommons.org/publicdomain/zero/1.0
 Sydney Reviews-IBM Cloud Data Services/Economy & Business 
Inside Airbnb:http://creativecommons.org/publicdomain/zero/1.0
Sydney Listings-IBM Cloud Data Services/Economy & Business 
          Inside Airbnb:http://creativecommons.org/publicdomain/zero/1.0   
Sydney Calendar-IBM Cloud Data Services/Economy & Business 
Inside Airbnb:http://creativecommons.org/publicdomain/zero/1.0