🧬 NeoCheck

Neoantigen HLA Compatibility Checker — v1.0

Analyze cancer mutations and HLA types to identify personalized immunotherapy opportunities. NeoCheck integrates four public biomedical databases to surface relevant epitopes, T-cell evidence, clinical trials, and publications for any gene–mutation–HLA combination.

Overview

NeoCheck is a Streamlit web application designed for researchers investigating neoantigen-based immunotherapy. Given a cancer mutation and a patient's HLA typing, it:

Searches for known epitopes harboring that mutation
Identifies which epitopes are restricted to the patient's HLA alleles
Summarizes the supporting evidence (T-cell assays, TCR sequences, MHC ligand data, crystal structures)
Finds relevant clinical trials currently recruiting
Retrieves recent publications from the literature

No API keys are required — all data sources are publicly accessible.

Data Sources

Source	What it provides	URL
CEDAR (IEDB)	Epitope structures, T-cell assays, TCR sequences, MHC ligand assays, PDB structures	cedar.iedb.org
IMGT/HLA (EBI)	HLA allele validation, accession numbers, metadata, citations	ebi.ac.uk/ipd/imgt/hla
ClinicalTrials.gov	Recruiting immunotherapy and neoantigen vaccine trials	clinicaltrials.gov
PubMed	Recent publications on the mutation and neoantigen context	pubmed.ncbi.nlm.nih.gov

Quick Start

Prerequisites

Python 3.10 or higher
pip

Install and Run

# Clone the repository
git clone <your-repo-url>
cd neocheck

# Install dependencies
pip install -r requirements.txt

# Launch the app
streamlit run app.py

The app will open in your browser at http://localhost:8501.

Project Structure

neocheck/
├── app.py                          # Main Streamlit application
├── config.py                       # API URLs, gene/mutation references, constants
├── requirements.txt                # Python dependencies
├── .streamlit/
│   └── config.toml                 # Streamlit theme configuration
├── clients/                        # REST API client modules
│   ├── cedar_client.py             # CEDAR epitope database (PostgREST)
│   ├── imgt_client.py              # IMGT/HLA allele database (EBI REST)
│   ├── clinicaltrials_client.py    # ClinicalTrials.gov v2 API
│   └── pubmed_client.py            # PubMed E-utilities (ESearch + EFetch)
├── analyzers/                      # Analysis orchestration
│   ├── epitope_analyzer.py         # Epitope search, HLA matching, evidence gathering
│   ├── hla_analyzer.py             # HLA allele validation via IMGT
│   ├── trial_analyzer.py           # Clinical trial search and filtering
│   └── publication_analyzer.py     # PubMed literature search
└── utils/                          # Shared utilities
    ├── scoring.py                  # Epitope ranking and evidence summarization
    ├── validators.py               # Input validation (mutations, HLA alleles)
    └── formatters.py               # Export formatting (JSON, CSV, HTML report)

Usage

Enter a mutation — Type any gene name (e.g., KRAS, BRAF, TP53, EGFR) and mutation (e.g., G12D, V600E). Common mutations are suggested for well-known oncogenes.
Enter HLA alleles — Provide at least one HLA allele. The app supports HLA-A, HLA-B, and HLA-C loci. Accepted formats include A*02:01, HLA-A*02:01, or A*02:01:01:01.
Click "Run Analysis" — The app queries all four databases with a progress indicator.
Review results:
- Summary metrics — Epitope count, T-cell assays, clinical trials, publications
- Top Epitopes — Detailed cards with evidence summaries, TCR sequences, and assay data
- All Epitopes — Sortable table view with descriptive summaries
- Clinical Trials — Expandable cards with status, phase, enrollment, and links
- Publications — Titles, authors, abstracts, and PubMed links
- HLA Information — IMGT validation status and allele metadata
Export — Download results as JSON, CSV, or a standalone HTML report.
AI Analysis (optional) — Scroll to the AI section, enter your Anthropic API key, and click "Run AI Analysis". Claude will use MCP tools to investigate your results in depth and provide a clinical interpretation.

Quick Examples

The app includes pre-configured example buttons:

KRAS G12D + HLA-A*02:01 / HLA-A*11:01
BRAF V600E + HLA-A*02:01 / HLA-A*03:01

Features (v1)

Free-text gene and mutation input (not restricted to a preset list)
HLA-A, HLA-B, and HLA-C allele support
Epitope search with neoantigen filtering via CEDAR
HLA-matched epitope identification and ranking
Descriptive evidence summaries for each epitope
T-cell assay and TCR sequence details for top epitopes
HLA allele validation against IMGT/HLA
Clinical trial search filtered by cancer type
PubMed literature search
Advanced options (cancer type filter, neoantigen-only toggle, result limits)
Export to JSON, CSV, and standalone HTML report
Aggressive caching to minimize redundant API calls
AI-powered analysis via Claude with MCP tool access to all 4 databases
Research disclaimer

AI Analysis Setup

The AI tab uses the Anthropic Messages API to send your results to Claude, which can then call MCP tools to investigate further. This requires:

1. Anthropic API Key

Get an API key from console.anthropic.com:

Create an account (or sign in)
Go to API Keys and create a new key
Add billing — the API is pay-per-token

Important: A Claude Max/Pro subscription (claude.ai) does not include API access. The API is a separate product with separate billing.

You can provide the key in two ways:

Environment variable (recommended): export ANTHROPIC_API_KEY=sk-ant-...
UI input: Enter it directly in the AI Analysis section (stored in session only, never persisted)

2. Build the MCP Servers

The AI tab launches 4 MCP servers as subprocesses. They must be built first:

# From the NeoCheck project root (parent of neocheck/)

# CEDAR (Node.js)
cd CEDARMCP && npm install && npm run build && cd ..

# IMGT/HLA (Node.js)
cd imgt-hla-mcp && npm install && npm run build && cd ..

# ClinicalTrials.gov (requires bun — install from bun.sh)
cd clinicaltrialsgov-mcp-server && bun install && bun run build && cd ..

# PubMed (Python)
cd pubmedmcp && pip install -e . && cd ..

Prerequisites:

Node.js 20+ (node --version)
Bun (bun --version) — install from bun.sh
Python 3.12+

3. Run

# Optional: set API key as env var
export ANTHROPIC_API_KEY=sk-ant-...

cd neocheck
streamlit run app.py

Run a standard analysis first, then scroll down to the AI section and click "Run AI Analysis".

Coming Soon

Docker containerization — One-command deployment with Docker
HLA Class II support — DRB1, DQB1, and other Class II loci
Batch analysis — Analyze multiple mutations in a single run
Interactive visualizations — Plotly charts for epitope comparisons and evidence landscapes
Streaming AI responses — Real-time display as Claude generates its analysis

Deployment

Streamlit Community Cloud (simplest)

Push your code to a GitHub repository
Go to share.streamlit.io
Connect your repository and set the main file path to app.py
Deploy — no server configuration needed

Docker

Create a Dockerfile in the project root:

FROM python:3.12-slim

WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY . .

EXPOSE 8501

HEALTHCHECK CMD curl --fail http://localhost:8501/_stcore/health || exit 1

ENTRYPOINT ["streamlit", "run", "app.py", "--server.port=8501", "--server.address=0.0.0.0"]

Then build and run:

docker build -t neocheck .
docker run -p 8501:8501 neocheck

Generic Server

For a production deployment behind a reverse proxy (e.g., nginx):

# Install dependencies in a virtual environment
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# Run with nohup or a process manager (systemd, supervisor)
streamlit run app.py --server.port=8501 --server.address=0.0.0.0 --server.headless=true

Point your reverse proxy to http://localhost:8501 and configure WebSocket support for Streamlit's live connection.

Dependencies

Package	Purpose
`streamlit` >= 1.32.0	Web application framework
`pandas` >= 2.0.0	Data manipulation and table display
`plotly` >= 5.18.0	Interactive visualizations (used in future features)
`requests` >= 2.31.0	HTTP client for all API calls
`anthropic` >= 0.40.0	Claude AI API client (for AI Analysis tab)

Disclaimer

This tool is for research purposes only. Results should not be used for clinical decision-making without professional medical review. Always consult qualified healthcare professionals for patient care decisions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧬 NeoCheck

Overview

Data Sources

Quick Start

Prerequisites

Install and Run

Project Structure

Usage

Quick Examples

Features (v1)

AI Analysis Setup

1. Anthropic API Key

2. Build the MCP Servers

3. Run

Coming Soon

Deployment

Streamlit Community Cloud (simplest)

Docker

Generic Server

Dependencies

Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.streamlit		.streamlit
analyzers		analyzers
clients		clients
tests		tests
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
app.py		app.py
config.py		config.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🧬 NeoCheck

Overview

Data Sources

Quick Start

Prerequisites

Install and Run

Project Structure

Usage

Quick Examples

Features (v1)

AI Analysis Setup

1. Anthropic API Key

2. Build the MCP Servers

3. Run

Coming Soon

Deployment

Streamlit Community Cloud (simplest)

Docker

Generic Server

Dependencies

Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages