NLP With Hotel Review dataset
£20-250 GBP
Paid on delivery
PLEASE DO THIS IN PYTHON! BUDGET IS £40 PLEASE! Use the Pandas, NLTK and scikit-learn libraries.
I need several machine learning models that correctly label the sentiment behind hotel reviews. This is a fairly large project which needs to be completed by Saturday evening, preferably Friday evening. The data to be analysed and interpreted is provided as a zip file attachment.
The target column is the "rating" column which is a binary column denoting good ratings as 1 and bad ones as 0.
Modeling
1. a. Fit a logistic regression model to this data with the solver set to lbfgs. Record the accuracy score on the test set.
b. What are the 20 words most predictive of a good review (from the positive review column)? What are the 20 words most predictive of a bad review (from the negative review column)? Use the regression coefficients to answer this.
c. Reduce the dimensionality of the dataset using PCA. What is the relationship between the number of dimensions and run-time for a logistic regression?
2. Employ a K-Nearest Neighbour classifier on this dataset:
a. Fit a KNN model to this data. State the accuracy score on the test set.
b. Reduce the number of observations (data points) in the dataset. Briefly describe the relationship between the number of observations and run-time for KNN.
c. Find an optimal value for K in the KNN algorithm. Please split the dataset into train and validation sets when doing so.
d. Is there any issue with splitting the data into train and validation sets after performing vectorization?
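The KNN tasks above could be approached roughly as follows. This is a sketch on invented toy text, not the real dataset. It splits the raw text before vectorizing, so the vocabulary is learned from the training portion only, and then picks K by validation accuracy. The candidate K values and the 60/20/20 split are assumptions.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Toy stand-in for the review text and the binary rating column (an assumption).
good_adjectives = ["great", "clean", "friendly", "lovely", "comfortable"]
bad_adjectives = ["dirty", "rude", "noisy", "small", "broken"]
nouns = ["room", "staff", "bed", "breakfast"]
reviews = [f"{adj} {noun}" for adj in good_adjectives + bad_adjectives for noun in nouns]
ratings = [1] * 20 + [0] * 20

# Split the raw text first: 60% train, 20% validation, 20% test.
text_rest, text_test, y_rest, y_test = train_test_split(
    reviews, ratings, test_size=0.2, stratify=ratings, random_state=0
)
text_train, text_val, y_train, y_val = train_test_split(
    text_rest, y_rest, test_size=0.25, stratify=y_rest, random_state=0
)

# Fit the vocabulary on the training text only, then reuse it for the other splits.
# Vectorizing before splitting would let validation/test words leak into the features.
vectorizer = CountVectorizer()
X_train = vectorizer.fit_transform(text_train)
X_val = vectorizer.transform(text_val)
X_test = vectorizer.transform(text_test)

# Choose K by validation accuracy; the test set plays no part in this choice.
val_scores = {}
for k in (1, 3, 5, 7):
    knn = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
    val_scores[k] = knn.score(X_val, y_val)
best_k = max(val_scores, key=val_scores.get)

# Report test accuracy once, for the chosen K.
final_knn = KNeighborsClassifier(n_neighbors=best_k).fit(X_train, y_train)
test_accuracy = final_knn.score(X_test, y_test)
```

For the run-time question, the same fit-and-score step can be wrapped in a timer and repeated on progressively smaller subsamples of the training rows.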
3. Next, employ a Decision Tree classifier on this dataset:
a. Fit a decision tree model to this data. State the accuracy score on the test set.
b. Use the dataset (or a subsample) to find an optimal value for the maximum depth of the decision tree. Again, split the dataset into train and validation sets.
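A possible shape for the depth search is sketched below. It uses synthetic numeric data from `make_classification` as a stand-in for the vectorized reviews, and the depth range 1 to 10 is an arbitrary assumption. Each depth is scored on the validation set, and the test set is scored only once at the end.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the vectorized review features (an assumption).
X, y = make_classification(n_samples=600, n_features=20, n_informative=5, random_state=0)

X_rest, X_test, y_rest, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.25, stratify=y_rest, random_state=0
)

# Score every candidate depth on the validation set.
val_scores = {}
for depth in range(1, 11):
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0)
    tree.fit(X_train, y_train)
    val_scores[depth] = tree.score(X_val, y_val)
best_depth = max(val_scores, key=val_scores.get)

# Refit on train + validation with the chosen depth, then score the test set once.
final_tree = DecisionTreeClassifier(max_depth=best_depth, random_state=0)
final_tree.fit(X_rest, y_rest)
test_accuracy = final_tree.score(X_test, y_test)
```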
4. What is the purpose of the validation set, i.e., how is it different from the test set?
5. Re-run a decision tree or logistic regression on the data:
a. Perform a 5-fold cross validation to optimize the hyperparameters of the model.
b. What does the confusion matrix look like for the best model on the test set?
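For the cross-validation step, one option is scikit-learn's `GridSearchCV` with `cv=5`. The sketch below uses synthetic data and tunes only the regularization strength `C` of a logistic regression; both the data and the grid values are assumptions made for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in for the vectorized review features (an assumption).
X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# 5-fold cross-validation over a small, arbitrary grid of C values.
param_grid = {"C": [0.01, 0.1, 1, 10]}
search = GridSearchCV(
    LogisticRegression(solver="lbfgs", max_iter=1000),
    param_grid,
    cv=5,
    scoring="accuracy",
)
search.fit(X_train, y_train)

# GridSearchCV refits the best model on all training data by default,
# so search.predict uses the best hyperparameters.
# Rows are true labels, columns are predicted labels: [[TN, FP], [FN, TP]].
cm = confusion_matrix(y_test, search.predict(X_test))
```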
6. Create one new feature (any reasonable feature is acceptable):
a. Explain your new feature and why you think it will improve accuracy.
b. Run the model from part 5 again and re-optimize the hyperparameters. Did the accuracy score of the best model on the test set improve after adding the new feature?
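One candidate feature, offered only as an example, is the share of a review's words that appear in its positive part. The sketch assumes the word-count columns are named as in the column summary in this posting (the actual file may spell them differently); the function name and the neutral fallback of 0.5 for reviews with no words are hypothetical choices.

```python
import pandas as pd

# Toy rows; column names follow the posting's summary and may differ in the real file.
toy = pd.DataFrame(
    {
        "ReviewTotalPositiveWordCounts": [12, 0, 5, 0],
        "ReviewTotalNegativeWordCounts": [0, 20, 5, 0],
    }
)


def add_positive_word_share(df):
    """Add positive_word_share = positive words / (positive + negative words)."""
    total = df["ReviewTotalPositiveWordCounts"] + df["ReviewTotalNegativeWordCounts"]
    # Where the total is 0, mask it to NaN so the division yields NaN instead of an error.
    share = df["ReviewTotalPositiveWordCounts"] / total.where(total > 0)
    # Reviews with no words at all get a neutral 0.5 (an arbitrary assumption).
    return df.assign(positive_word_share=share.fillna(0.5))


with_feature = add_positive_word_share(toy)
```

The new column can then be appended to the feature matrix before repeating the cross-validated search.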
---
Some information on the columns follows. However, the data has been changed a little (e.g. a rating column was added), so this summary may not be fully accurate.
Hotel_Address: Address of hotel.
Review_Date: Date when reviewer posted the corresponding review.
Average_Score: Average Score of the hotel, calculated based on the latest comment in the last year.
Hotel_Name: Name of Hotel
Reviewer_Nationality: Nationality of Reviewer
Negative_Review: Negative review the reviewer gave to the hotel. If the reviewer gave no negative review, the value is 'No Negative'.
ReviewTotalNegativeWordCounts: Total number of words in the negative review.
Positive_Review: Positive review the reviewer gave to the hotel. If the reviewer gave no positive review, the value is 'No Positive'.
ReviewTotalPositiveWordCounts: Total number of words in the positive review.
Reviewer_Score: Score the reviewer has given to the hotel, based on his/her experience
TotalNumberofReviewsReviewerHasGiven: Number of reviews the reviewer has given in the past.
TotalNumberof_Reviews: Total number of valid reviews the hotel has.
Tags: Tags reviewer gave the hotel.
dayssincereview: Duration between the review date and scrape date.
AdditionalNumberof_Scoring: Some guests only gave a score for the service rather than writing a review. This number indicates how many valid scores without a review there are.
lat: Latitude of the hotel
lng: Longitude of the hotel
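Because the placeholder strings 'No Negative' and 'No Positive' described above would otherwise be vectorized as ordinary words, one may want to blank them before building text features. A minimal sketch on toy rows follows; the helper name `blank_placeholders` is hypothetical, and stripping whitespace first is an assumption about how the real file is formatted.

```python
import pandas as pd

# Toy rows using the column names from the summary above.
toy = pd.DataFrame(
    {
        "Positive_Review": [" Great location ", "No Positive", "Friendly staff"],
        "Negative_Review": ["No Negative", " Room was dirty ", "No Negative"],
    }
)


def blank_placeholders(df):
    """Replace the 'No Positive' / 'No Negative' placeholders with empty strings."""
    return df.assign(
        # Series.replace with a scalar only matches whole cell values, not substrings.
        Positive_Review=df["Positive_Review"].str.strip().replace("No Positive", ""),
        Negative_Review=df["Negative_Review"].str.strip().replace("No Negative", ""),
    )


cleaned = blank_placeholders(toy)
```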
Project ID: #34101175
About the project
Awarded to:
Hi, I can help you with this project as I have worked on Sentiment Analysis and SMS Classification projects before as well. I'll provide you the Python and will use the KNN, decision tree, logistic algorithms for predict…
24 freelancers are bidding on average £132 for this job
Hi, I hope you are doing fine. I have almost 10 years of experience in machine learning algorithms. I can implement various types of artificial intelligence algorithms including yours with Matlab, Python, etc. I hav…
Hi, I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp…
Hi, I am sure that I can do this job. I'm an artificial intelligence engineer experienced in NLP using Python programming. I have the hardware required to finish the project fast (workstation & GPU). I have accomplished…
Hello, I have 4 years of experience in machine learning and have implemented basic to advanced ML and DL models like LR, SVM, K-NN, XGBoost, Random Forest, CNN, RNN, LSTM, Bi-LSTM, etc. I do your work very quickly and efficien…
Hello, hope you're doing well. I am an experienced AI engineer with 20 years of experience in Machine Learning and Deep Learning, specialized in machine learning models like K-Nearest Neighbour, Decision Tree, Suppor…
I have thoroughly gone through your project description. I'm an expert and I can help you. Kindly send me a message. I'm a senior engineer with rich experience in Machine Learning (ML), Data Analytics, Data Analysis. I…
Hello, I am placing a bid on your project "NLP With Hotel Review dataset", which is similar to one of the projects I have worked on previously. Being a data scientist with working experience in NLP, this can be achi…
Hello Sir, I am an expert-level Excel user with top-rated completed projects. I can do this within a short time, and I can also provide previous samples to give you a better understanding of me and my skills. When do you need…
Hello there, I checked your data. I will use Pandas, NLTK and scikit-learn libraries for your project. Because of your complete details, everything is clear. Contact me over chat to talk more about the details. I h…
Hi, I am a senior software developer with 7 years of experience. I have worked on various projects related to Python, machine learning, deep learning, data science and R programming. My priority is to complete the assi…
Hello client, I have read and understood your project details; I will provide quality, plagiarism-free, and satisfactory work. I am always concerned about working for a cheaper cost to my…
Hello Dear Client, after thoroughly reading your project description I have clearly understood it and I would like to work with you. I am skilled and experienced in the named skills. I consider your project doable as…
Hi, I'm glad you posted your NLP With Hotel Review dataset project. I have read your requirement carefully. I'm interested in your project. I have more than 5 years of experience with it. I will provide you high-qua…
I am interested in your project; however, I need to have a look at your existing model architecture and the data you are using. I have working knowledge of: 1. Object detection algorithms like RCNN, Fast-RCNN, YOLO, SSD…