🕵️ Yelp Review Fraud Detection: Machine Learning for Trust & Safety

INF2179: Machine Learning Executive Project

This repository contains the machine learning pipeline and strategic executive report developed to detect fraudulent and deceptive reviews on Yelp. The core objective was to build a predictive model that protects platform integrity and ensures a trustworthy user experience.

🎯 Business Impact & Product Strategy

Fake reviews erode user trust and degrade the core value proposition of review-based platforms. This project moved beyond simple accuracy metrics to focus on actionable Trust & Safety interventions.

Strategic Threat Mitigation: Developed an NLP-driven classification strategy to identify deceptive language patterns, protecting the platform from astroturfing and coordinated review manipulation.
Executive Translation: Synthesized complex NLP and model evaluation metrics into an Executive Report, providing clear product recommendations for integrating the model into a live moderation queue.
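
As an illustration of what integrating the model into a live moderation queue could look like, here is a minimal sketch. It assumes the classifier outputs a fake-review probability between 0 and 1. The thresholds, action names, and the `route_review` and `build_queue` helpers are hypothetical and are not taken from the Executive Report.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    review_id: str
    score: float   # model's estimated probability that the review is fake
    action: str    # "publish", "queue_for_review", or "hold_for_review"


def route_review(review_id, score, review_threshold=0.5, hold_threshold=0.9):
    """Map a model score to a moderation action (thresholds are illustrative)."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be a probability in [0, 1]")
    if score >= hold_threshold:
        action = "hold_for_review"    # hidden until a moderator decides
    elif score >= review_threshold:
        action = "queue_for_review"   # stays visible, but a human checks it
    else:
        action = "publish"            # no intervention
    return Decision(review_id, score, action)


def build_queue(scored_reviews):
    """Return the flagged reviews, highest score first, for moderators."""
    decisions = [route_review(rid, score) for rid, score in scored_reviews]
    flagged = [d for d in decisions if d.action != "publish"]
    return sorted(flagged, key=lambda d: d.score, reverse=True)


queue = build_queue([("r1", 0.12), ("r2", 0.95), ("r3", 0.61)])
for decision in queue:
    print(decision.review_id, decision.action)
```

Keeping a human in the loop for the middle band is one way to limit the cost of false positives. In practice the two thresholds would be chosen from validation data rather than fixed by hand.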

🧠 Technical Methodology

The project required an end-to-end data science approach spanning text mining, natural language processing, and supervised learning.

Natural Language Processing (NLP): Conducted extensive text pre-processing, tokenization, and sentiment analysis to engineer features from raw unstructured review text.
Predictive Modeling: Trained, evaluated, and tuned classification models (including Logistic Regression and ensemble methods) to isolate the linguistic signatures of fake reviews.
Model Evaluation: Prioritized the precision/recall trade-off so that legitimate user reviews were rarely flagged by mistake (minimizing false positives for a frictionless UX).
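
The steps above can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the notebook's actual code: the feature set, the `lexical_features` and `pick_threshold` helpers, and the 0.9 precision target are assumptions chosen for the example. It shows simple lexical feature extraction and a threshold choice that favors precision.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def lexical_features(text):
    """Compute a few simple lexical features (an assumed, illustrative set)."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    n = len(tokens)
    return {
        "n_tokens": n,
        "type_token_ratio": len(counts) / n if n else 0.0,
        "exclamations": text.count("!"),
        "first_person": sum(counts[w] for w in ("i", "me", "my", "we", "our")),
    }


def precision_recall(y_true, scores, threshold):
    """Precision and recall when reviews with score >= threshold are flagged."""
    tp = fp = fn = 0
    for label, score in zip(y_true, scores):
        flagged = score >= threshold
        if flagged and label == 1:
            tp += 1
        elif flagged and label == 0:
            fp += 1
        elif not flagged and label == 1:
            fn += 1
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def pick_threshold(y_true, scores, min_precision=0.9):
    """Among thresholds meeting the precision target, keep the best recall."""
    best = None
    for threshold in sorted(set(scores)):
        p, r = precision_recall(y_true, scores, threshold)
        if p >= min_precision and (best is None or r > best[2]):
            best = (threshold, p, r)
    return best  # (threshold, precision, recall), or None if target is unmet


# Toy labels (1 = fake) and model scores, made up for this example.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
print(lexical_features("I love it! I LOVE it!"))
print(pick_threshold(y_true, scores))
```

Fixing a minimum precision first and then maximizing recall mirrors the stated priority: catch as many fake reviews as possible without flagging legitimate ones.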

📂 Repository Contents

INF2179_Group 1 Executive Report_Eng_Hega_Irfan-1.pdf: The strategic business report outlining the problem space, methodology, and final product recommendations.
INF2179 - ML Yelp Fake Reviews.pdf: The technical presentation deck detailing the NLP pipeline and model performance.
INF2179_Eng_Hega_Irfan-1.ipynb: The core Python data analysis, NLP, and machine learning pipeline.
new_data_train.csv, new_data_test.csv: The training and test data files.
