Pharmalyze: Enterprise Revenue Operations Engine

Project Overview

Pharmalyze is an end-to-end data engineering and business intelligence solution designed for the pharmaceutical industry. It transforms fragmented, flat-file sales data into a scalable Star Schema architecture, enabling high-performance regional analysis and sales representative tracking.

Business Case

Pharmaceutical firms often struggle with "Data Silos" where sales data is trapped in wide-format Excel sheets, making it impossible to perform granular regional or category-level analysis.

The Solution: This project implements a centralized Data Warehouse logic that:

Normalizes drug categories for scalable reporting.
Simulates Enterprise Volume (84k+ records scaled from real-world data).
Automates KPI Tracking for Sales Representatives and Regional Managers.

Technical Architecture

The system follows a modern OLAP (Online Analytical Processing) design:

Ingestion Layer: Python/Pandas handles initial cleanup of heterogeneous Excel/CSV sources.
Storage & Compute: DuckDB serves as the analytical engine, utilizing columnar storage for sub-second query execution on multi-million row datasets.
Data Modeling: A Star Schema design separates quantitative 'Facts' from descriptive 'Dimensions'.
Visualization: A Streamlit dashboard provides real-time interactivity powered by SQL Window Functions and CTEs.

Data Model (Star Schema)

To ensure high performance, the data is structured as follows:

Fact Table (fact_sales_scaled): Contains transaction-level data (Date, Units Sold, Foreign Keys).
Dimension Tables: * dim_geography: Mapping IDs to Cities and Regions (North, West, etc.).
- dim_reps: Sales personnel details and performance tiers.
- dim_products: Drug classifications (ATC codes) and pricing.

How to Run

Clone the repo: git clone https://github.com/YOUR_USERNAME/RevStream-Analytics.git
Install dependencies: pip install -r requirements.txt
Initialize Database: python scripts/ingest_data.py
Launch Dashboard: streamlit run app/main.py

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.devcontainer		.devcontainer
app		app
data/raw		data/raw
scripts		scripts
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pharmalyze: Enterprise Revenue Operations Engine

Project Overview

Business Case

Technical Architecture

Data Model (Star Schema)

How to Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Pharmalyze: Enterprise Revenue Operations Engine

Project Overview

Business Case

Technical Architecture

Data Model (Star Schema)

How to Run

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages