🚀 Spam Detection System

A full-stack application that detects Spam / Smishing / Offensive content using Machine Learning. The system includes:

🧠 ML Model (Python)
⚡ Python API (Flask / FastAPI)
🌐 Node.js Backend
💻 React Web App
📱 React Native Mobile App (Android & iOS)

📌 Project Architecture

User Input (Web / Mobile)
        ↓
React / React Native UI
        ↓
Node.js Backend (API Gateway)
        ↓
Python ML API (Model Inference)
        ↓
Prediction (Spam / Ham / Offensive)

System Stability & Environment Fixes

This update addresses critical runtime issues that prevented the system from executing in the local development environment:

Security Policy Compliance: Migrated the project to a directory with appropriate execution permissions to resolve DLL load errors.
Model Loading Error: Corrected file path references to ensure the ML models are properly detected at runtime.
API Stability: Fixed 500 Internal Server Error by correctly serializing NumPy model outputs to JSON.

For a detailed breakdown, please refer to the recently merged Pull Request.

🧠 Machine Learning Model

📊 Dataset

CSV format:
- text / message
- label (spam / ham / offensive)

⚙️ Algorithms Used

Logistic Regression
Naive Bayes
Linear SVM (Best Accuracy)

📈 Performance

Accuracy: ~97–98%
Metrics:
- Precision
- Recall
- F1-score
- Confusion Matrix

🏋️ Model Training (Python)

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC
import pickle

# Load dataset
X = df['text']
y = df['label']

# Vectorization
vectorizer = TfidfVectorizer()
X_vec = vectorizer.fit_transform(X)

# Model
model = LinearSVC()
model.fit(X_vec, y)

# Save
pickle.dump(model, open("model.pkl", "wb"))
pickle.dump(vectorizer, open("vectorizer.pkl", "wb"))

🐍 Python API (Flask)

📦 Install Dependencies

pip install flask scikit-learn

🚀 API Code

from flask import Flask, request, jsonify
import pickle

app = Flask(__name__)

model = pickle.load(open("model.pkl", "rb"))
vectorizer = pickle.load(open("vectorizer.pkl", "rb"))

@app.route('/predict', methods=['POST'])
def predict():
    data = request.json['text']
    vec = vectorizer.transform([data])
    prediction = model.predict(vec)[0]
    return jsonify({"result": prediction})

if __name__ == "__main__":
    app.run(port=5000)

🌐 Node.js Backend

📦 Install

npm install express axios cors

⚙️ Server Code

const express = require("express");
const axios = require("axios");
const cors = require("cors");

const app = express();
app.use(cors());
app.use(express.json());

app.post("/predict", async (req, res) => {
  try {
    const response = await axios.post("http://localhost:5000/predict", {
      text: req.body.text,
    });
    res.json(response.data);
  } catch (err) {
    res.status(500).send("Error");
  }
});

app.listen(3000, () => console.log("Node server running"));

💻 React Frontend

📦 Setup

npm create vite@latest
npm install axios

⚛️ Example Component

import { useState } from "react";
import axios from "axios";

function App() {
  const [text, setText] = useState("");
  const [result, setResult] = useState("");

  const handlePredict = async () => {
    const res = await axios.post("http://localhost:3000/predict", { text });
    setResult(res.data.result);
  };

  return (
    <div>
      <h1>Spam Detection</h1>
      <input onChange={(e) => setText(e.target.value)} />
      <button onClick={handlePredict}>Check</button>
      <p>{result}</p>
    </div>
  );
}

export default App;

📱 React Native App (Android & iOS)

📦 Setup

npx create-expo-app
npm install axios

📲 Example Code

import { useState } from "react";
import { View, Text, TextInput, Button } from "react-native";
import axios from "axios";

export default function App() {
  const [text, setText] = useState("");
  const [result, setResult] = useState("");

  const predict = async () => {
    const res = await axios.post("http://YOUR_IP:3000/predict", { text });
    setResult(res.data.result);
  };

  return (
    <View>
      <Text>Spam Detection</Text>
      <TextInput onChangeText={setText} />
      <Button title="Check" onPress={predict} />
      <Text>{result}</Text>
    </View>
  );
}

🗄️ Email Classification Database

A MySQL-based system to store and manage classified email records.

Database Setup

mysql -u root -p < backend/schema.sql

API Endpoints

Method	Endpoint	Description
POST	`/api/emails/`	Insert new email record
PATCH	`/api/emails/{id}/mark`	Mark as spam or legitimate
GET	`/api/emails/spam`	Retrieve all spam emails
GET	`/api/emails/legitimate`	Retrieve all legitimate emails
GET	`/api/emails/count/spam`	Count total spam emails
GET	`/api/emails/count/legitimate`	Count total legitimate emails

Export Endpoints

Method	Endpoint	Description
GET	`/api/emails/export?format=csv`	Download all emails as CSV
GET	`/api/emails/export?format=pdf`	Download all emails as PDF report

CSV Export Format

email_id, subject, sender, is_spam, timestamp 1, Win a FREE iPhone!!!, promo@spam.com, Spam, 2024-01-01 10:00:00 2, Team standup at 10am, manager@company.com, Legitimate, 2024-01-01 11:00:00

PDF Report Includes

Summary: total emails, spam count, legitimate count
Full table with all email records

Email Record Fields

email_id — Auto-generated unique ID
subject — Email subject
sender — Sender email address
is_spam — Boolean spam status
timestamp — Record creation time

🔁 User Feedback Loop

After every prediction, the web app asks "Was this prediction correct?":

✅ Yes — records the prediction as confirmed correct
❌ No — shows a dropdown to pick the correct label (ham, spam, smishing), then submits the correction

How it flows

React Widget → POST /feedback (Node backend) → POST /feedback (Flask ML API) → feedback_store.csv

`POST /feedback`

Available on both the Node backend (/feedback, requires authentication) and the Flask ML API (/feedback).

Request body:

{
  "text": "Congratulations! You won a free prize, click here",
  "predicted_label": "ham",
  "correct_label": "spam"
}

Responses:

201 — {"message": "Feedback recorded. Thank you!"}
400 — {"error": "Invalid feedback data"} if text is empty or correct_label is not one of ham, spam, smishing

Feedback is appended to backend/feedback_store.csv (gitignored) with columns:

Column	Description
`text`	The original input text
`predicted_label`	What the model predicted
`correct_label`	What the user said it should be
`submitted_at`	UTC timestamp

Retraining the model

Once enough feedback has accumulated, run:

cd backend
python retrain.py

This merges feedback_store.csv with the original training dataset (DATASET_PATH, default dataset.csv), retrains the TF-IDF vectorizer, LinearSVC model and label encoder, and overwrites linear_svm_model.pkl, tfidf_vectorizer.pkl and label_encoder.pkl.

🔐 Features

✅ Spam / Smishing Detection
✅ Offensive Content Classification
✅ Real-time Prediction API
✅ Cross-platform (Web + Mobile)
✅ Scalable Architecture

🛠 Tech Stack

Python (ML + API)
Scikit-learn
Flask
Node.js
Express
React
React Native
Axios

📌 Future Improvements

Use Deep Learning (LSTM / BERT / CLIP)
Multilingual Support
More accuracy and advanced model
Include Email predicton perfectly and add mobile numbers also to track
Include Url prediction perfectly to check url is safe or not

🐳 Running with Docker

Prerequisites

Docker installed
Docker Compose installed

Docker Hub Images

Pre-built images are available — no build step required:

Service	Docker Hub
Flask ML API	rudra2006/spam-ml-api
Node.js Backend	rudra2006/spam-node-backend
React Frontend	rudra2006/spam-frontend

Quick Start (New Users — No Clone Needed)

Images are pre-built on Docker Hub. Just download the compose file and run:

curl -O https://raw.githubusercontent.com/Userunknown84/Spam-Detection-System/main/docker-compose.yml
docker-compose up

Docker will automatically pull all 3 images. No build step, no clone required.

Quick Start (From Source)

git clone https://github.com/Userunknown84/Spam-Detection-System.git
cd Spam-Detection-System
docker-compose up --build

Service	URL
React Frontend	http://localhost
Node.js Backend	http://localhost:3000
Flask ML API	http://localhost:5000

Stop all containers

docker-compose down

Architecture in Docker

Browser → nginx (port 80) → node-backend (port 3000) → ml-api (port 5000)

ml-api: Python Flask service that loads the SVM model and serves /predict
node-backend: Node.js API gateway forwarding requests to ml-api
frontend: React app built with Vite, served via nginx; nginx proxies /predict to node-backend

👨‍💻 Author

Aditya Sharma

⭐ Contribute

Feel free to fork, improve and contribute to this project!

📜 License

This project is open-source and available under the MIT License.

You are free to use, modify, and distribute this project for personal or commercial use, provided that proper credit is given.

For more details, see the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
backend		backend
frontend		frontend
spamdetection		spamdetection
.DS_Store		.DS_Store
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
PROJECT_GOAL.md		PROJECT_GOAL.md
README.md		README.md
docker-compose.yml		docker-compose.yml
generate_vectorizer.py		generate_vectorizer.py
label_encoder.pkl		label_encoder.pkl
linear_svm_model.pkl		linear_svm_model.pkl
tfidf_vectorizer.pkl		tfidf_vectorizer.pkl

Folders and files

Latest commit

History

Repository files navigation

🚀 Spam Detection System

📌 Project Architecture

System Stability & Environment Fixes

🧠 Machine Learning Model

📊 Dataset

⚙️ Algorithms Used

📈 Performance

🏋️ Model Training (Python)

🐍 Python API (Flask)

📦 Install Dependencies

🚀 API Code

🌐 Node.js Backend

📦 Install

⚙️ Server Code

💻 React Frontend

📦 Setup

⚛️ Example Component

📱 React Native App (Android & iOS)

📦 Setup

📲 Example Code

🗄️ Email Classification Database

Database Setup

API Endpoints

Export Endpoints

CSV Export Format

PDF Report Includes

Email Record Fields

🔁 User Feedback Loop

How it flows

POST /feedback

Retraining the model

🔐 Features

🛠 Tech Stack

📌 Future Improvements

🐳 Running with Docker

Prerequisites

Docker Hub Images

Quick Start (New Users — No Clone Needed)

Quick Start (From Source)

Stop all containers

Architecture in Docker

👨‍💻 Author

⭐ Contribute

📜 License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`POST /feedback`

Packages