Skip to content

Latest commit

Β 

History

History
127 lines (84 loc) Β· 3.41 KB

File metadata and controls

127 lines (84 loc) Β· 3.41 KB

πŸ”¬ Integrated Analysis of Bulk and Single-Cell Transcriptomics in Pancreatic Cancer

This repository accompanies the study:

Integrative Analysis of Bulk and Single-Cell Transcriptomics Reveals Two Novel Genes Associated with Poor Prognosis in Pancreatic Cancer

πŸ§ͺ This study is currently under review. Results and interpretations will be available upon publication.


πŸ“š Table of Contents


πŸ§ͺ Overview

This project integrates single-cell and bulk RNA-seq datasets to identify molecular drivers of pancreatic ductal adenocarcinoma (PDAC). Two genes, RNF149 and MBOAT7, were identified through comparative expression in ductal cells and analyzed further in bulk TCGA data to understand their role in immune and lipid metabolic remodeling in PDAC.


πŸ”¬ Flow of Analysis

  1. Single-cell RNA-seq (GSE155698): Clustering, ductal cell annotation, DEA between tumor and normal ductal cells.
  2. Gene selection: RNF149 (upregulated) and MBOAT7 (downregulated) from single-cell analysis.
  3. Bulk RNA-seq (TCGA-PAAD): DEA based on RNF149/MBOAT7 expression levels.
  4. Pathway & interactome analysis: KEGG, Reactome, GO, GSEA, and protein interaction analysis.

🧭 Workflow Diagram

πŸ“Œ Coming Soon: Full visual pipeline diagram.


πŸ“ Repository Structure

docs/
β”œβ”€β”€ scripts/
β”‚   β”œβ”€β”€ scRNA/
β”‚   β”‚   └── scRNA analysis.R
β”‚   β”œβ”€β”€ bulk/
β”‚   β”‚   β”œβ”€β”€ Downloading data (TCGA).R
β”‚   β”‚   β”œβ”€β”€ Preprocessing + DEA (RNF149).R
β”‚   β”‚   └── Preprocessing + DEA (MBOAT7).R
β”‚   └── enrichment/
β”‚       β”œβ”€β”€ RNF149 PATHWAY ENRICHMENT (KEGG + REACTOME ).R
β”‚       β”œβ”€β”€ MBOAT7 Pathway Enrichment (Kegg + Reactome).R
β”‚       β”œβ”€β”€ RNF149-INTACT-Multi- INTERACTORS-DEGS-GO-ENRICHMENT.R
β”‚       └── Ranked_GSEA.r

πŸ“‚ Data Sources

  • πŸ”Έ Single-cell RNA-seq: GSE155698 (PDAC and adjacent normal tissue)
  • πŸ”Έ Bulk RNA-seq: TCGA-PAAD (via TCGAbiolinks)

βš™οΈ Setup and Requirements

Install required R packages:

if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

# CRAN packages
install.packages(c(
  "tidyverse", "data.table", "ggplot2", "pheatmap", "cowplot"
))

# Bioconductor packages
BiocManager::install(c(
  "Seurat", "DESeq2", "org.Hs.eg.db", "AnnotationDbi", 
  "clusterProfiler", "ReactomePA", "DOSE", "msigdbr", 
  "TCGAbiolinks", "SummarizedExperiment", "fgsea"
))

πŸ“Š Outputs

  • πŸ”¬ Single-cell: UMAP plots, DEGs between ductal tumor vs. normal cells.
  • 🧬 Bulk RNA-seq: DEGs based on RNF149/MBOAT7 expression, volcano plots.
  • 🧠 Enrichment: GO/KEGG/Reactome results for significant genes.
  • πŸ“ˆ GSEA: Ranked enrichment analysis.

πŸ‘¨β€πŸ’» Authors

Assem K. Elsherif

Nourine Mamdouh Sabry Abdelfattah

Sajda Hussien Salah Tahoun

Sondos Ameen El-Sayed Mohammed Awad

Moaz Mohamed ElShiekh


🧠 Keywords

Pancreatic Cancer Β· Bulk RNA-seq Β· Single-cell RNA-seq Β· RNF149 Β· MBOAT7 Β· TCGA Β· Seurat Β· DESeq2 Β· Enrichment


πŸ“œ License

This project is licensed under the MIT License.