CreditXplain - Explainable Credit Scoring System

Status: Production Ready | Version: 1.0 | Last Updated: March 27, 2026

Deployed Links

Frontend: https://creditxplain-client.vercel.app/
Backend API: https://creditxplain-server.onrender.com
ML Service: https://creditxplain-ml.onrender.com

Quick Navigation

Executive Summary

CreditXplain is a production-ready fintech application that makes credit scoring transparent and fair. Instead of black-box lending decisions, every application includes:

Explainable Scores - 7-factor breakdown showing what drove the decision
What-If Scenarios - Shows applicants how to improve their score
Fair Lending - Built-in bias detection aligned with compliance requirements
Real-World Ready - Automatically maps 40+ banking field name variations
Hybrid Resilience - Continues operating with fallback if ML service fails
Professional APIs - 11 RESTful endpoints with JWT authentication

Project Structure

creditxplain/
├── client/                          # React frontend (Vite + Tailwind)
│   ├── src/
│   │   ├── components/              # React components
│   │   ├── pages/                   # Page routes
│   │   ├── utils/api.js             # API client with auth interceptor
│   │   └── context/                 # Context providers
│   └── package.json
│
├── server/                          # Node.js/Express backend
│   ├── controllers/                 # Business logic
│   ├── routes/                      # API endpoints
│   ├── models/                      # MongoDB schemas
│   ├── middleware/                  # Auth, error handling
│   ├── utils/mlEngine.js            # Hybrid ML scoring
│   └── package.json
│
├── ml/                              # Python FastAPI ML service
│   ├── app.py                       # FastAPI application
│   ├── model.py                     # ML model architecture
│   ├── train.py                     # Model training pipeline
│   ├── train_real.py                # Real dataset trainer
│   ├── requirements.txt
│   └── artifacts/credit_model.joblib
│
├── assets/                          # NEW: Professional documentation
│   ├── documentation/               # Complete project docs
│   │   ├── README.md                # Documentation hub
│   │   ├── 01-PROJECT-OVERVIEW.md   # Vision & objectives
│   │   ├── 02-TECHNICAL-ARCHITECTURE.md
│   │   ├── 03-API-REFERENCE.md
│   │   ├── 04-ML-FRAMEWORK.md
│   │   ├── 05-DATABASE-DESIGN.md
│   │   ├── 06-SECURITY-IMPLEMENTATION.md
│   │   ├── 07-FEATURE-GUIDE.md
│   │   ├── 08-DEPLOYMENT-GUIDE.md
│   │   ├── 09-EXAMINER-GUIDE.md
│   │   └── 10-BEST-PRACTICES.md
│   │
│   └── testing/                     # Testing suite & QA
│       ├── README.md                # Testing guide
│       ├── smoke-tests/
│       │   ├── QUICK_TEST.js        # 5-minute endpoint check
│       │   └── TEST_SUITE.js        # 15-minute feature validation
│       └── feature-tests/
│           ├── DEMO_TEST.js         # Full feature demo
│           └── TEST_ALL_FEATURES.js # Comprehensive testing
│
├── .gitignore
├── .env.example
└── README.md                        # This file

Quick Start (5 Minutes)

Prerequisites

Node.js 18+
Python 3.9+
MongoDB (Atlas or local)

1. Backend Setup

cd server
npm install
# Create .env with MONGO_URI, JWT_SECRET, etc.
# Copy from server/.env.example
npm run dev
# Runs on http://localhost:5000

2. Frontend Setup

cd client
npm install
npm run dev
# Runs on http://localhost:5173

3. ML Service Setup (Optional but Recommended)

cd ml
python -m venv .venv
.\. venv\Scripts\activate
pip install -r requirements.txt
python train.py
uvicorn app:app --reload
# Runs on http://localhost:8000

4. Test It

# Run smoke tests
node assets/testing/smoke-tests/QUICK_TEST.js

# Or run full demo
node assets/testing/feature-tests/DEMO_TEST.js

Core Features

1. Smart Credit Scoring Engine

Hybrid approach: ML service first, JavaScript fallback
Sub-second response times for single applications
Bulk processing: Upload CSV/XLSX files with up to 200 rows
Four-tier risk stratification: approved, conditional, review, declined

2. Automatic Data Column Mapping (40+ Aliases)

Income: NETMONTHLYINCOME, annual_income, salary
Employment: Time_With_Curr_Empr, employment_years
Credit History: cibil_score, credit_score (auto-scale 300-900 to 0-10)
Delinquency: Tot_Missed_Pmnt, max_recent_level_of_deliq
Trade Lines: CC_TL, Home_TL, PL_TL (auto-detection)

Intelligent field derivation when fields are missing:

Credit History: Calculated from delinquency and missed payments
Monthly Expenses: Estimated as 45% of income
Loan Amount: Inferred from income ratio
Savings: Estimated as 8% of annual income

3. Explainable Decision Framework

Feature-level contributions showing factor influence
Plain-language justifications for non-technical users
What-if scenarios: Interactive score improvement simulation
Seven key drivers displayed: income, credit history, employment, debts, etc.

4. User Authentication and Security

JWT token-based stateless authentication
bcrypt password hashing with salt
Application-level data isolation
24-hour token expiry

5. Decision History and Analytics

Persistent application records with audit trail
User statistics dashboard (approvals, denials, average scores)
Historical tracking of all submissions

6. Fairness and Bias Monitoring

Demographic parity metrics (gender, marital status, education)
Impact ratio calculation for protected vs. unprotected groups
Disparate impact analysis
Score distribution visualization
Recommendation bias detection

7. Report Generation

PDF export of complete application assessment
Score, decision, explanations, what-if scenarios
Professional formatting with company branding areas

8. Bulk File Processing

Supported formats: CSV and XLSX
Automatic column parsing and data type detection
Preview results: First 25 predictions returned
Error reporting: Skipped rows logged with specific reasons

API Endpoints

Authentication (/api/auth)

Endpoint	Method	Auth	Purpose
/register	POST	None	User registration
/login	POST	None	Login, returns JWT
/me	GET	Bearer	User profile

Credit Applications (/api/credit)

Endpoint	Method	Auth	Purpose
/apply	POST	Bearer	Manual application
/apply-upload	POST	Bearer	Bulk CSV/XLSX upload
/history	GET	Bearer	Application history
/:id	GET	Bearer	Single application
/stats	GET	Bearer	User statistics
/bias-report	GET	Bearer	Fairness metrics

Reports (/api/reports)

Endpoint	Method	Auth	Purpose
/pdf/:id	GET	Bearer	Download PDF report

Data Models

User Schema

{
  _id: ObjectId,
  email: String (unique, lowercase),
  name: String,
  passwordHash: String (bcrypt),
  role: String (user or admin),
  createdAt: Date,
  updatedAt: Date
}

Application Schema

{
  _id: ObjectId,
  userId: ObjectId,
  applicantData: {
    age: Number,
    income: Number (annual),
    employmentYears: Number,
    loanAmount: Number,
    existingDebts: Number,
    creditHistory: Number (0-10),
    numberOfDependents: Number,
    monthlyExpenses: Number,
    savingsBalance: Number,
    educationLevel: String,
    maritalStatus: String,
    homeOwnership: String,
    loanPurpose: String,
    debtToIncomeRatio: Number (derived),
    loanToIncomeRatio: Number (derived),
    savingsToIncomeRatio: Number (derived),
    netMonthlyCashflow: Number (derived)
  },
  result: {
    creditScore: Number (300-900),
    decision: String (approved|conditional|review|declined),
    riskLevel: String (low|medium|high|very_high),
    probability: Number (0-1),
    explanations: [String],
    recommendation: String,
    interestRateRange: String,
    maxApprovedAmount: Number
  },
  whatIfScenarios: Array,
  createdAt: Date,
  updatedAt: Date
}

Machine Learning Components

Model Architecture

Two-Model Ensemble:

Gradient Boosting Classifier (primary risk model)
- 100 estimators with optimal depth
- Input: 13 numeric + 4 categorical features
- Output: Default probability
- Performance: ROC-AUC > 0.85 on CIBIL datasets
Logistic Regression (explainability model)
- Coefficient-based feature importance
- Enables interpretation of individual decisions
- Fast inference for real-time explanations

Training Pipeline

Multi-model comparison and automatic selection:

python train_real.py --data dataset.csv --target defaultRisk --column-map mapping.json

Compares Logistic Regression, Random Forest, and Gradient Boosting. Selects best by ROC-AUC score.

Feature Engineering

Derived metrics calculated at runtime:

Debt-to-Income Ratio: (existingDebts + loanAmount) / income
Loan-to-Income Ratio: loanAmount / income
Savings-to-Income Ratio: savingsBalance / income
Net Monthly Cashflow: (income/12) - monthlyExpenses - (existingDebts/12)

Installation and Setup

Prerequisites

Node.js 16+
Python 3.9+
MongoDB Atlas or local MongoDB
Git

Backend Setup

cd creditxplain/server
npm install
cp .env.example .env
npm run dev

Runs on http://localhost:5000

Frontend Setup

cd creditxplain/client
npm install
npm run dev

Runs on http://localhost:5173

ML Service Setup (Recommended)

cd creditxplain/ml
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
python train.py
uvicorn app:app --host 0.0.0.0 --port 8000 --reload

Runs on http://localhost:8000

Environment Configuration

Backend (.env):

MONGODB_URI=mongodb+srv://username:password@cluster.mongodb.net/creditxplain
JWT_SECRET=your-secure-random-string-min-32-chars
JWT_EXPIRY=24h
ML_SERVICE_URL=http://localhost:8000
NODE_ENV=development
PORT=5000
CORS_ORIGIN=http://localhost:5173

Frontend (.env):

VITE_API_BASE_URL=http://localhost:5000

Demo and Testing

Test User Accounts

Account 1:

Email: demo@example.com
Password: Demo@123456

Account 2:

Email: test@example.com
Password: Test@123456

Manual Application Testing

Login with demo account
Navigate to "New Application"
Fill 4-step form with applicant details
View instant score and decision
Click "View Explanation" for what-if scenarios
Download PDF report

Bulk Upload Testing

Prepare CSV/XLSX with banking columns (NETMONTHLYINCOME, MARITALSTATUS, etc.)
Login to application
Click "Upload & Predict"
Select file, submit
View results summary (processed, successful, preview)

Bias Dashboard Testing

Submit 5+ applications with varying demographics
Navigate to "Bias Dashboard"
View approval rates by gender, marital status, education
Observe fairness metrics and disparate impact

Project Structure

creditxplain/
├── client/                      # Frontend React application
│   ├── src/
│   │   ├── components/          # Reusable React components
│   │   ├── pages/               # Page components
│   │   ├── context/             # Context providers
│   │   ├── utils/               # HTTP client
│   │   └── App.jsx, main.jsx
│   ├── package.json
│   └── vite.config.js
│
├── server/                      # Backend Node.js/Express
│   ├── controllers/             # Business logic
│   ├── routes/                  # API endpoints
│   ├── models/                  # MongoDB schemas
│   ├── middleware/              # Auth, error handling
│   ├── utils/                   # ML engine, normalizers
│   ├── server.js
│   └── package.json
│
├── ml/                          # Python ML microservice
│   ├── app.py                   # FastAPI service
│   ├── model.py                 # Model architecture
│   ├── train.py                 # Initial training
│   ├── train_real.py            # Real dataset trainer
│   ├── requirements.txt
│   └── artifacts/
│       └── credit_model.joblib
│
└── README.md

Key Achievements

Real-world data integration with 40+ banking field aliases
99% success rate on 100-row CIBIL dataset
Hybrid resilience with automatic fallback mechanism
Feature-level explainability for all decisions
JWT authentication with bcrypt password security
Built-in demographic parity and fairness metrics
Support for CSV/XLSX bulk processing (up to 200 rows)

Performance Metrics

Single application scoring: Sub-500ms
Bulk processing (100 rows): 30-60 seconds
API response times: P95 < 2 seconds
Database queries: P99 < 100ms
Model serialization: ~5MB

Troubleshooting

Issue	Solution
ML service connection refused	Ensure Python service running on port 8000
MongoDB connection error	Verify MONGODB_URI and IP whitelist in Atlas
CORS errors	Verify CORS_ORIGIN matches frontend URL
JWT token expired	User must login again
CSV upload fails	Ensure file has required columns, size < 10MB

Contact and Support

For technical questions or issues, contact the project maintainers.

Repository: GitHub
Last Updated: March 27, 2026

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
assets		assets
client		client
ml		ml
server		server
.gitignore		.gitignore
README.md		README.md
render.yaml		render.yaml

Folders and files

Latest commit

History

Repository files navigation

CreditXplain - Explainable Credit Scoring System

Deployed Links

Quick Navigation

For Interviews & Presentations

For Developers

For Deployment

Executive Summary

Project Structure

Quick Start (5 Minutes)

Prerequisites

1. Backend Setup

2. Frontend Setup

3. ML Service Setup (Optional but Recommended)

4. Test It

Core Features

1. Smart Credit Scoring Engine

2. Automatic Data Column Mapping (40+ Aliases)

3. Explainable Decision Framework

4. User Authentication and Security

5. Decision History and Analytics

6. Fairness and Bias Monitoring

7. Report Generation

8. Bulk File Processing

API Endpoints

Authentication (/api/auth)

Credit Applications (/api/credit)

Reports (/api/reports)

Data Models

User Schema

Application Schema

Machine Learning Components

Model Architecture

Training Pipeline

Feature Engineering

Installation and Setup

Prerequisites

Backend Setup

Frontend Setup

ML Service Setup (Recommended)

Environment Configuration

Demo and Testing

Test User Accounts

Manual Application Testing

Bulk Upload Testing

Bias Dashboard Testing

Project Structure

Key Achievements

Performance Metrics

Troubleshooting

Contact and Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages