🚀 SpamShield AI — NLP Based Spam/Ham Classifier

An advanced NLP-based Spam Detection System built using Python, Machine Learning, TF-IDF, Threshold Tuning, and Flask with a modern futuristic UI inspired by AI applications.

📌 Project Overview

SpamShield AI classifies email/messages into:

✅ Ham (Safe Message)
🚨 Spam (Suspicious/Fraudulent Message)

This project focuses not only on prediction accuracy but also on:

Precision & Recall optimization
Threshold tuning
False positive reduction
Real-world spam detection behavior
Professional AI-style UI

🧠 Workflow of the Project

1️⃣ Dataset Collection

Used Spam/Ham dataset from Kaggle
Performed analysis on real-world SMS/email messages

2️⃣ Text Preprocessing

Performed complete NLP preprocessing pipeline:

Lowercasing
Contractions handling
Tokenization
Stopword removal
Lemmatization
Punctuation removal

Libraries used:

nltk
contractions
string

3️⃣ Feature Engineering

Used:

TF-IDF Vectorization

to convert text into numerical features.

4️⃣ Model Training

Multiple Machine Learning models were trained and compared.

Examples:

Logistic Regression
Naive Bayes
Random Forest
SVM
KNN

5️⃣ Model Evaluation

Compared models using:

Accuracy
Precision
Recall
F1-Score
Confusion Matrix
Precision-Recall Curve

Main focus was reducing false positives.

6️⃣ Hyperparameter Tuning

Performed tuning to improve model performance.

Optimized:

Threshold values
Precision/Recall balance
Spam sensitivity

7️⃣ Threshold Tuning

Instead of using default threshold:

0.6 - 0.7

custom threshold tuning was applied after analyzing:

Precision Recall Curve
Confusion Matrix
Prediction probabilities

Final threshold improved real-world spam classification.

8️⃣ Final Model Selection

After comparing all models:

✅ Best performing model was selected
✅ Exported as:

best_model.pkl

Used later in Flask application.

🌐 Flask Web Application

The trained model was integrated with Flask.

Features:

AI-style futuristic UI
Dark/Light mode
Real-time message analysis
Spam/Ham result card
Reset button
Responsive design

🎨 UI Design

The UI was inspired by modern AI interfaces and customized further for this project.

Main improvements done:

Futuristic glassmorphism effect
AI-themed dark mode
Clean result animations
Professional user experience

📸 Screenshots

🌙 Light Mode

☀️ Dark Mode

🚨 Spam Detection Example

✅ Ham Detection Example

📂 Project Structure

Spam_Ham_NLP_Project/
│
├── static/
│   ├── style.css
│   └── script.js
│
├── templates/
│   └── index.html
│
├── app.py
├── best_model.pkl
├── requirements.txt
├── Spam_ham_2.ipynb
└── README.md

🛠️ Technologies Used

Backend

Python
Flask

Machine Learning

Scikit-learn
TF-IDF
NLP

Frontend

HTML
CSS
JavaScript

NLP Libraries

NLTK
contractions

📸 Screenshots

🌙 Dark Mode

Modern AI Dashboard
Spam/Ham analysis card
Professional UI

☀️ Light Mode

Clean minimal design
Responsive layout

🚨 Spam Detection Example

Fraud/spam email classified successfully

✅ Ham Detection Example

Safe message classified correctly

⚙️ Installation

Clone Repository

git clone

Install Dependencies

pip install -r requirements.txt

Run Flask App

python app.py

📊 Future Improvements

Email phishing detection
Deep Learning models
BERT/Transformers integration
User authentication
API deployment
Live email scanning

👨‍💻 Author

ROHIT SHARMA

Passionate about:

Data Science
Machine Learning
NLP
AI-based applications

⭐ Project Highlights

✅ End-to-End NLP Project
✅ Real-world Spam Detection
✅ Threshold Tuning
✅ Precision/Recall Optimization
✅ Flask Deployment
✅ Professional AI UI
✅ Complete ML Workflow Included

🔥 Final Note

This project was built not just for accuracy, but to understand:

how NLP pipelines work,
how models behave in real-world scenarios,
and how threshold tuning can improve practical spam detection systems. =======

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
screenshots		screenshots
static		static
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py

Folders and files

Latest commit

History

Repository files navigation

🚀 SpamShield AI — NLP Based Spam/Ham Classifier

📌 Project Overview

🧠 Workflow of the Project

1️⃣ Dataset Collection

2️⃣ Text Preprocessing

3️⃣ Feature Engineering

4️⃣ Model Training

5️⃣ Model Evaluation

6️⃣ Hyperparameter Tuning

7️⃣ Threshold Tuning

8️⃣ Final Model Selection

🌐 Flask Web Application

🎨 UI Design

📸 Screenshots

🌙 Light Mode

☀️ Dark Mode

🚨 Spam Detection Example

✅ Ham Detection Example

📂 Project Structure

🛠️ Technologies Used

Backend

Machine Learning

Frontend

NLP Libraries

📸 Screenshots

🌙 Dark Mode

☀️ Light Mode

🚨 Spam Detection Example

✅ Ham Detection Example

⚙️ Installation

Clone Repository

Install Dependencies

Run Flask App

📊 Future Improvements

👨‍💻 Author

ROHIT SHARMA

⭐ Project Highlights

🔥 Final Note

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages