Skip to content
View SidharthKriplani's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report SidharthKriplani

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SidharthKriplani/README.md

Hey, I'm Sidharth 👋

Senior Data Scientist · Decision Systems · AI Evaluation · Experimentation

Most DS teams measure things. I build the systems that decide what to measure, whether to trust it, and what to do about it.

4+ years shipping ML and AI evaluation systems across credit risk, experimentation platforms, RAG pipelines, and KPI governance. 5 open-source PyPI libraries · 3 production-simulated ML platforms · 1,000+ member data community · Bengaluru, IN

LinkedIn · PyPI


🧠 Three Pillars

🔵 Decision Systems

ML systems for credit risk, feature governance, and business-critical decisions

riskframe_platform · metriclens · featureleakagelens

🟣 AI Evaluation & Observability

RAG evaluation, golden-set testing, version-safe AI deployment, LLM guardrails

devpulse_platform · goldensetauditor · docingestqa

🟢 Experimentation & Metrics

A/B testing infrastructure, CUPED, SRM detection, KPI governance, metric decomposition

metasignal_platform · trialcheck · pulserank_platform


🚀 What I Ship

  • ML decision systems — credit risk scoring with SHAP explainability, threshold optimization, calibration, drift monitoring, and FastAPI serving
  • Experimentation infrastructure — SRM detection, CUPED variance reduction, mSPRT sequential testing, guardrail-first decisioning, and audit-ready readouts
  • AI evaluation pipelines — RAG golden-set auditing, version-safe retrieval, wrong-version-answer-rate controls, and LLM quality gates
  • Metric decomposition — splitting any metric movement into mix effects, rate effects, and cross terms — available on PyPI as metriclens

🏆 Highlighted Projects

riskframe_platform — End-to-end credit risk decisioning platform. XGBoost/LightGBM challenger system, Optuna HPO, SHAP explainability, PSI drift detection, fairness checks, FastAPI serving, and model card documentation.

metasignal_platform — Production-simulated experimentation intelligence platform. SRM detection, CUPED, mSPRT, A/A calibration, guardrail-first decisioning, KPI dictionary, SQL metric contracts, streaming observability.

devpulse_platform — Version-safe RAG + agentic migration orchestration. Hybrid retrieval, deterministic conflict detection, SAFE/RISKY/BLOCKED verdicts, wrong-version-answer-rate controls, evidence-backed summaries.

metriclenspip install metriclens · DataFrame-native metric decomposition. Splits any metric movement into mix shift, rate shift, and cross term. JSON/Markdown/HTML outputs, quality gates.

goldensetauditorpip install goldensetauditor · LLM/RAG evaluation dataset auditor. Catches conflicting labels, duplicate prompts, weak reference answers, and ambiguous questions before they corrupt your evals.

trialcheckpip install trialcheck · A/B experiment readout auditor. Checks for SRM, peeking risk, practical significance, guardrail movement, and pre-period imbalance. Zero dependencies. PASS/WARN/FAIL reports.


🛠 Tech Stack

Python SQL PySpark FastAPI Docker Databricks scikit-learn XGBoost SHAP RAG LLM Evaluation CUPED A/B Testing


📫 Let's Connect

LinkedIn PyPI Email


💼 Currently: Open to Senior DS · Applied AI · Experimentation · Decision Science roles · Bengaluru

Pinned Loading

  1. metasignal_platform metasignal_platform Public

    Production-simulated experimentation, metrics intelligence, CUPED/guardrail decisioning, and streaming observability platform.

    Python

  2. metriclens metriclens Public

    DataFrame-native metric movement decomposition — mix shift, rate shift, cross term. pip install metriclens

    Python

  3. devpulse_platform devpulse_platform Public

    Production-simulated RAG + agentic migration intelligence platform with version-safe retrieval, conflict detection, repo-aware risk analysis, patch simulation, and evidence dashboard.

    Python

  4. riskframe_platform riskframe_platform Public

    End-to-end credit decisioning platform: XGBoost + LightGBM challenger, Optuna HPO, SHAP, PSI drift, fairness, FastAPI

    Python

  5. docingestqa docingestqa Public

    Pre-indexing QA auditor for RAG document ingestion pipelines. 11 deterministic checks for missing pages, OCR noise, duplicates, encoding corruption, and poor split boundaries.

    Python

  6. pulserank_platform pulserank_platform Public

    Production-simulated marketplace ranking system: IPS bias correction, hybrid candidate generation, exposure governance, delayed attribution, offline A/B simulation, 33 evidence artifacts.

    Python