🧬 PRRSV ORF5 Evolutionary Intelligence System (EII Platform)

🚀 Live System

📊 Live Dashboard

👉 https://prrsv-orf5.aiconceptlimited.com.ng/

🔍 Overview

The PRRSV ORF5 Evolutionary Intelligence System is an automated genomic surveillance platform that monitors the evolutionary dynamics of PRRSV (Porcine Reproductive and Respiratory Syndrome Virus) using ORF5 gene sequences. It is intended for research, surveillance, and exploratory evolutionary analysis.

The system integrates multiple biologically meaningful signals into a unified metric called the Evolutionary Intelligence Index (EII), enabling real-time tracking of viral evolution.


🚀 Key Features

  • 🔄 Automated sequence retrieval from NCBI

  • 🧪 Quality control and filtering of sequences

  • 🧬 Multiple sequence alignment using MAFFT

  • 🔗 Codon-aware sequence alignment

  • 📊 Multi-signal evolutionary analysis:

    • Epitope Drift (Shannon entropy)
    • Selection Pressure (codon variability)
    • Glycosylation Dynamics (N-X-S/T motif detection)
    • Phylogenetic Instability (pairwise sequence divergence)
    • Distance Outliers (anomaly detection)
  • 🗄️ MySQL database integration

  • 📈 Real-time dashboard (Dash + Plotly)
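
Two of the signals listed above, epitope drift and glycosylation dynamics, can be sketched with the standard library alone. This is an illustration, not the repository's implementation: the function names are hypothetical, gaps are assumed to be written as "-", and the sequon pattern assumes the canonical rule that X in N-X-S/T is any residue except proline.

```python
import math
import re
from collections import Counter


def column_entropy(column: str) -> float:
    """Shannon entropy (in bits) of one alignment column, ignoring gaps."""
    residues = [c for c in column.upper() if c != "-"]
    if not residues:
        return 0.0
    total = len(residues)
    counts = Counter(residues)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropy(seqs: list[str]) -> list[float]:
    """Per-position entropy for a list of equal-length aligned sequences."""
    return [column_entropy("".join(col)) for col in zip(*seqs)]


# Lookahead so that overlapping sequons are all reported.
# Assumption: X may be any residue except proline (P).
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(protein: str) -> list[int]:
    """0-based start positions of N-X-S/T motifs in the ungapped protein."""
    ungapped = protein.upper().replace("-", "")
    return [m.start() for m in SEQUON.finditer(ungapped)]
```

A fully conserved column scores 0 bits, and a column split evenly between two residues scores 1 bit. Positions returned by `sequon_positions` refer to the ungapped sequence, so they need remapping before being compared across an alignment.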


🧠 Evolutionary Intelligence Index (EII)

The EII is computed as:

EII = mean(all normalized evolutionary signals) × 100

This produces a single score (0–100) representing the evolutionary activity of PRRSV.
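
The formula above can be sketched as follows. The README does not say how each signal is normalized, so this sketch assumes every signal has already been scaled to the range 0 to 1; the function name, signal keys, and example values are hypothetical.

```python
def compute_eii(signals: dict[str, float]) -> float:
    """Mean of normalized signals, scaled to 0-100.

    Assumption: each value is already normalized to [0, 1].
    """
    if not signals:
        raise ValueError("at least one signal is required")
    for name, value in signals.items():
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"signal {name!r} is not normalized: {value}")
    return 100.0 * sum(signals.values()) / len(signals)


# Hypothetical values for illustration only, not real results.
example = {
    "epitope_drift": 0.42,
    "selection_pressure": 0.18,
    "glycosylation_dynamics": 0.30,
    "phylogenetic_instability": 0.25,
    "distance_outliers": 0.10,
}
print(round(compute_eii(example), 1))
```

Because the index is an unweighted mean, every signal contributes equally; a single saturated signal can raise the score by at most 100 divided by the number of signals.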


⚙️ System Architecture

Pipeline workflow:

  1. Fetch sequences from NCBI
  2. Perform quality control
  3. Align sequences (MAFFT)
  4. Perform codon alignment
  5. Compute evolutionary signals
  6. Store results in database
  7. Visualize via dashboard
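
The workflow does not say how step 4 derives the codon alignment. One common approach, shown here purely as an assumption, is to align the translated proteins and then thread each original coding sequence back through the gapped protein, three nucleotides per residue. The function name is hypothetical.

```python
def thread_codons(aligned_protein: str, nucleotides: str) -> str:
    """Project a gapped protein sequence onto its ungapped coding sequence.

    Each residue consumes one codon; each protein gap becomes '---'.
    Leftover codons (for example a trailing stop codon) are dropped.
    """
    usable = len(nucleotides) - len(nucleotides) % 3
    codons = [nucleotides[i:i + 3] for i in range(0, usable, 3)]
    out = []
    next_codon = 0
    for residue in aligned_protein:
        if residue == "-":
            out.append("---")
            continue
        if next_codon >= len(codons):
            raise ValueError("protein has more residues than available codons")
        out.append(codons[next_codon])
        next_codon += 1
    return "".join(out)
```

For example, `thread_codons("M-K", "ATGAAA")` yields `"ATG---AAA"`, so gaps always fall on codon boundaries and the reading frame is preserved for the codon-level signals in step 5.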

📁 Project Structure

prrsv_eii/
│
├── pipeline/              # Core processing pipeline
├── dashboard/             # Visualization dashboard
├── data/                  # Generated data (ignored in Git)
├── logs/                  # Pipeline logs
├── docs/                  # Screenshots / assets
├── requirements.txt
├── README.md
├── .gitignore
└── LICENSE

🔧 Installation & Setup

1. Clone the repository

git clone https://github.com/agarba360-beep/prrsv-eii.git
cd prrsv-eii

2. Create virtual environment

python3 -m venv venv
source venv/bin/activate

3. Install dependencies

pip install -r requirements.txt

4. Configure database

Create a .env file:

DB_HOST=localhost
DB_USER=prrsv_admin
DB_PASSWORD=your_password
DB_NAME=prrsv_genomics
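
How the pipeline reads these settings is not shown here. As a minimal sketch, assuming simple KEY=VALUE lines like the ones above, the file can be parsed with the standard library and checked for missing keys before any database connection is attempted. The function name `load_db_settings` is hypothetical.

```python
import os
from pathlib import Path

REQUIRED_KEYS = ("DB_HOST", "DB_USER", "DB_PASSWORD", "DB_NAME")


def load_db_settings(path: str = ".env") -> dict[str, str]:
    """Read database settings from a .env file, with real environment
    variables taking precedence over values in the file."""
    values: dict[str, str] = {}
    env_file = Path(path)
    if env_file.exists():
        for raw in env_file.read_text().splitlines():
            line = raw.strip()
            # Skip blanks, comments, and lines without an assignment.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    for key in REQUIRED_KEYS:
        if key in os.environ:
            values[key] = os.environ[key]
    missing = [key for key in REQUIRED_KEYS if not values.get(key)]
    if missing:
        raise KeyError(f"missing database settings: {', '.join(missing)}")
    return {key: values[key] for key in REQUIRED_KEYS}
```

Failing early on a missing key gives a clearer error than a connection failure later in the pipeline. The .env file holds credentials and should stay out of version control.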

▢️ Running the System

Run full pipeline

python pipeline/run_pipeline_full.py

Launch dashboard

python dashboard/dashboard_readonly.py

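Open in browser:

https://prrsv-orf5.aiconceptlimited.com.ng/ (hosted instance)

A local run is served at the address the script prints on startup; Dash defaults to http://127.0.0.1:8050 unless the script overrides it.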


🔍 System Inspection (VERY IMPORTANT)

Check raw sequences

SELECT COUNT(*) FROM sequences_raw;
SELECT accession, LENGTH(sequence) FROM sequences_raw LIMIT 5;

Check QC results

SELECT COUNT(*) FROM qc_sequences;
SELECT * FROM qc_sequences LIMIT 5;

Check EII history

SELECT * FROM eii_index ORDER BY created_at DESC LIMIT 10;

Check evolutionary signals

SELECT * FROM eii_signals;

Check latest snapshot

SELECT * FROM eii_latest;

Inspect alignment files

head data/alignments/orf5_aligned.fasta
head data/alignments/orf5_codon_aligned.fasta
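
Beyond eyeballing the first lines with head, a short script can confirm that an alignment file is structurally sound. This is a sketch with hypothetical function names; it assumes standard FASTA, and that a valid codon alignment has a length divisible by three.

```python
def read_fasta(path: str) -> dict[str, str]:
    """Parse a FASTA file into {record_id: sequence}."""
    chunks: dict[str, list[str]] = {}
    name = None
    with open(path) as handle:
        for raw in handle:
            line = raw.strip()
            if not line:
                continue
            if line.startswith(">"):
                parts = line[1:].split()
                # Fall back to a positional name if the header is empty.
                name = parts[0] if parts else f"seq{len(chunks) + 1}"
                chunks[name] = []
            elif name is not None:
                chunks[name].append(line)
    return {key: "".join(value) for key, value in chunks.items()}


def check_alignment(records: dict[str, str], codon: bool = False) -> list[str]:
    """Return a list of problems; an empty list means the checks passed."""
    problems = []
    if not records:
        problems.append("no sequences found")
    lengths = {len(seq) for seq in records.values()}
    if len(lengths) > 1:
        problems.append(f"sequences have differing lengths: {sorted(lengths)}")
    if codon and any(length % 3 for length in lengths):
        problems.append("alignment length is not a multiple of 3")
    return problems
```

Calling `check_alignment(read_fasta("data/alignments/orf5_codon_aligned.fasta"), codon=True)` would then flag ragged rows or a broken reading frame before the signal computations run.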

📊 Dashboard Features

  • Real-time EII monitoring
  • Signal composition visualization
  • Evolutionary trend analysis
  • Risk and impact classification

🧪 Scientific Notes

  • Epitope Drift → Shannon entropy across alignment positions
  • Selection Pressure → codon-level variation proxy
  • Glycosylation Dynamics → N-X-S/T motif detection
  • Phylogenetic Instability → mean pairwise Hamming distance
  • Distance Outliers → extreme divergence detection
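
The last two signals can be sketched as follows. The Hamming distance is normalized by alignment length so it stays between 0 and 1. The notes above do not define "extreme divergence", so the z-score rule and its threshold are assumptions, as are the function names.

```python
from itertools import combinations
from statistics import mean, pstdev


def hamming(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Gap characters are compared like any other symbol.
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def mean_pairwise_distance(seqs: list[str]) -> float:
    """Mean normalized Hamming distance over all sequence pairs."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        return 0.0
    return mean(hamming(a, b) for a, b in pairs)


def distance_outliers(seqs: list[str], z_threshold: float = 2.0) -> list[int]:
    """Indices of sequences whose mean distance to all others is unusually high.

    Assumption: 'unusually high' means a z-score above z_threshold.
    """
    if len(seqs) < 3:
        return []
    per_seq = [
        mean(hamming(s, t) for j, t in enumerate(seqs) if j != i)
        for i, s in enumerate(seqs)
    ]
    mu, sigma = mean(per_seq), pstdev(per_seq)
    if sigma == 0:
        return []
    return [i for i, d in enumerate(per_seq) if (d - mu) / sigma > z_threshold]
```

Computing all pairs is quadratic in the number of sequences, which is workable for a single-gene dataset but worth keeping in mind as the collection grows.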

🀝 Contributions

Contributions are welcome:

  • Improve biological accuracy (e.g., dN/dS models)
  • Add phylogenetic tree inference
  • Optimize performance
  • Enhance dashboard features

📌 Future Improvements

  • Integration with HyPhy / PAML for true selection analysis
  • Phylogenetic tree visualization
  • Geographic and temporal analysis
  • API development
  • Cloud deployment

👨‍💻 Author

Developed by AI Concepts Limited, Nigeria, for advanced virology research and genomic intelligence.


⚠️ Disclaimer

This system is intended for research purposes only. Not for clinical or diagnostic use.


📜 License

MIT License
