CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

pgschema is a CLI tool that brings Terraform-style declarative schema migration workflow to PostgreSQL. It provides a dump/edit/plan/apply workflow for database schema changes:

Dump: Extract current schema in a developer-friendly format with support for all common objects
Edit: Modify schema files to represent desired state
Plan: Compare desired state with current database and generate migration plan
Apply: Execute the migration with safety features like concurrent change detection, transaction-adaptive execution, and lock timeout control

The tool is written in Go 1.24.0 (toolchain go1.24.7) and uses:

Cobra for CLI commands
embedded-postgres v1.33.0 for plan command (temporary instances) and testing (no Docker required)
pgx/v5 v5.7.5 for database connections
BurntSushi/toml for TOML parsing (ignore config)
Supports PostgreSQL versions 14-18

Key differentiators:

Comprehensive Postgres support for virtually all schema-level objects
State-based Terraform-like workflow (no migration history table)
Schema-level focus for single-schema apps to multi-tenant architectures
No shadow database required - works directly with schema files and target database

Skills

This project includes specialized skills for development workflows. Use these skills when working on pgschema:

pg_dump Reference - Consult PostgreSQL's pg_dump implementation for system catalog queries and schema extraction patterns
PostgreSQL Syntax Reference - Understand PostgreSQL's parser and grammar (gram.y) for SQL syntax and DDL structure
Validate with Database - Connect to live PostgreSQL to validate assumptions, compare pg_dump vs pgschema, and query system catalogs
Run Tests - Execute automated Go tests to validate diff logic, plan generation, and dump functionality
Fix Bug - Fix a bug from a GitHub issue using TDD (analyze, create reproducing test, implement fix, verify)
Refactor Pass - Perform a refactor pass focused on simplicity after recent changes

Skills are located in .claude/skills/ and provide detailed workflows for common development tasks.

Quick Start Commands

Build

# Standard build
go build -o pgschema .

# Build with version info (used in CI/Docker)
go build -ldflags="-w -s -X github.com/pgplex/pgschema/cmd.GitCommit=... -X 'github.com/pgplex/pgschema/cmd.BuildDate=...'" -o pgschema .

Testing

For detailed testing workflows, see the Run Tests skill (.claude/skills/run_tests/SKILL.md).

# Quick validation - diff tests only (fast, no database)
PGSCHEMA_TEST_FILTER="create_trigger/" go test -v ./internal/diff -run TestDiffFromFiles

# Full integration test
PGSCHEMA_TEST_FILTER="create_trigger/" go test -v ./cmd -run TestPlanAndApply

# All tests
go test -v ./...

Database Validation

For interactive database validation, see the Validate with Database skill (.claude/skills/validate_db/SKILL.md).

Connection details are in .env (configure as needed):

PGHOST=localhost
PGPORT=5432
PGDATABASE=<your_database>
PGUSER=postgres
PGPASSWORD=<your_password>
PGAPPNAME=pgschema

Architecture

Core Components

CLI Commands (cmd/):

dump/ - Schema extraction from live database
plan/ - Migration planning by comparing schemas
apply/ - Migration execution with safety checks
util/ - Shared utilities (connection, env, ignore file loading, SQL logging)
root.go - Main CLI setup with Cobra
migrate_integration_test.go, schema_integration_test.go, include_integration_test.go, ignore_integration_test.go - Integration test suites

Core Packages:

ir/ - Intermediate Representation (IR) package
- Schema objects (tables, indexes, functions, procedures, triggers, policies, etc.)
- Database inspector using pgx (queries pg_catalog for schema extraction)
- Schema normalizer
- Identifier quoting utilities
- Ignore configuration for filtering database objects (ignore.go)
- queries/ subdirectory with sqlc-generated code for type-safe SQL queries
- Note: Parser removed in favor of embedded-postgres approach

Internal Packages (internal/):

diff/ - Schema comparison and migration DDL generation
plan/ - Migration plan structures, execution, and plan rewriting (rewrite.go)
dump/ - Schema dump formatting and output
fingerprint/ - Schema fingerprinting and comparison for change detection
include/ - Include file processing for modular schemas
postgres/ - Database provider implementations (embedded and external)
- desired_state.go - DesiredStateProvider interface
- embedded.go - Embedded PostgreSQL implementation
- external.go - External database implementation
color/ - Terminal output colorization
logger/ - Structured logging using slog
version/ - Version information

PostgreSQL Reference Sources (internal/):

gram.y - Local copy of PostgreSQL's Yacc/Bison grammar for SQL syntax reference
scan.l - Local copy of PostgreSQL's Flex lexer for tokenization reference

Test Utilities (testutil/):

postgres.go - Shared test utilities for setting up embedded PostgreSQL
skip_list.go - PostgreSQL version-specific test skip lists

Key Architecture Patterns

Schema Representation: Uses an Intermediate Representation (IR) to normalize schema objects from database introspection. Both desired state (from user SQL files) and current state (from target database) are extracted by inspecting PostgreSQL databases.

Embedded Postgres for Desired State: The plan command spins up a temporary embedded PostgreSQL instance (by default) or connects to an external database (if --plan-host is provided), applies the user's SQL files to it, then inspects that database to get the desired state IR. This ensures both desired and current states come from the same source (database inspection), eliminating parser/inspector format differences. External database support is useful for environments where embedded postgres has limitations (e.g., ARM architectures, containerized environments).

Migration Planning: The diff package compares IR representations to generate a sequence of migration steps with proper dependency ordering (topological sort).

Database Integration: Uses pgx/v5 for database connections and embedded-postgres (v1.33.0) for both the plan command (temporary instances) and integration testing (no Docker required).

Inspector-Only Approach: Both desired state (from user SQL files) and current state (from target database) are obtained through database inspection. The plan command spins up an embedded PostgreSQL instance, applies user SQL files, then inspects it to get the desired state IR. This eliminates the need for SQL parsing and ensures consistency.

External Database for Plan Generation: As an alternative to embedded postgres, users can provide an external PostgreSQL database using --plan-host flags or PGSCHEMA_PLAN_* environment variables. The external database approach:

Creates temporary schemas with timestamp suffixes (e.g., pgschema_tmp_20251030_154501_123456789)
Validates major version compatibility with target database (exact match required)
Cleans up temporary schemas after use (best effort)
Useful for environments where embedded postgres has limitations (ARM architectures, containerized environments)

Common Development Workflows

Adding New Schema Object Support

Add IR representation in ir/ir.go
Add database introspection logic in ir/inspector.go (consult pg_dump Reference skill for system catalog queries)
Add diff logic in internal/diff/
Add test cases in testdata/diff/create_[object_type]/ (see Run Tests skill)
Validate with live database (see Validate with Database skill)

Note: Parser logic is no longer needed - both desired and current states come from database inspection.

Debugging Schema Extraction

Consult pg_dump Reference skill to understand correct system catalog queries
Use Validate with Database skill to test queries against live PostgreSQL
Compare pg_dump output with pgschema output using workflows in Validate with Database skill

Testing Changes

Use Run Tests skill for comprehensive testing workflows
Start with fast diff tests: PGSCHEMA_TEST_FILTER="..." go test -v ./internal/diff -run TestDiffFromFiles
Run integration tests: PGSCHEMA_TEST_FILTER="..." go test -v ./cmd -run TestPlanAndApply
Validate with live database using Validate with Database skill

Supported Schema Objects

The tool supports comprehensive PostgreSQL schema objects (see ir/ir.go for complete data structures):

Tables: Columns, identity columns, generated columns, partitioning, LIKE clauses
Constraints: Primary keys, foreign keys, unique constraints, check constraints, NOT VALID
Indexes: Regular, unique, partial, functional/expression indexes, concurrent creation
Views: Regular views and materialized views with indexes
Functions: Parameters, return types, volatility, STRICT, SECURITY DEFINER
Procedures: Parameters, signatures, language
Triggers: Timing, events, levels, WHEN conditions, constraint triggers, REFERENCING OLD/NEW TABLE
Sequences: Start, increment, min/max, cycle, cache, owned by tracking
Types: Enum, composite, domain types with constraints
Policies: Row-level security with commands, roles, USING/WITH CHECK expressions
Aggregates: Custom aggregates with transition and final functions
Privileges: GRANT/REVOKE for tables, functions, sequences, types
Default Privileges: ALTER DEFAULT PRIVILEGES for grantor-level access control
Comments: On all supported object types

Environment Variables

Target database connection: PGHOST, PGPORT, PGUSER, PGPASSWORD, PGDATABASE
Plan database connection (optional - for external database instead of embedded postgres):
- PGSCHEMA_PLAN_HOST - If set, uses external database for plan generation
- PGSCHEMA_PLAN_PORT - Plan database port (default: 5432)
- PGSCHEMA_PLAN_DB - Plan database name (required if PGSCHEMA_PLAN_HOST is set)
- PGSCHEMA_PLAN_USER - Plan database user (required if PGSCHEMA_PLAN_HOST is set)
- PGSCHEMA_PLAN_PASSWORD - Plan database password
Environment files: .env - automatically loaded by main.go
Test filtering: PGSCHEMA_TEST_FILTER - run specific test cases (e.g., "create_table/" or "create_table/add_column")
Postgres version: PGSCHEMA_POSTGRES_VERSION - test against specific versions (14, 15, 16, 17, 18)

Important Implementation Notes

Trigger Features:

Full support for WHEN conditions using pg_get_expr(t.tgqual, t.tgrelid, false) from pg_catalog.pg_trigger
Constraint triggers with deferrable options
REFERENCING OLD TABLE / NEW TABLE for statement-level triggers

Online Migration Support:

CREATE INDEX CONCURRENTLY for non-blocking index creation
ALTER TABLE ... ADD CONSTRAINT ... NOT VALID for online constraint addition
Proper transaction handling - some operations must run outside transactions

pgschema Directives:

Special SQL comments control behavior: --pgschema-lock-timeout, --pgschema-no-transaction
Handled in cmd/apply/directive.go

Reference Implementations:

PostgreSQL's pg_dump serves as reference for system catalog queries (see pg_dump Reference skill)
PostgreSQL's gram.y defines canonical SQL syntax (see PostgreSQL Syntax Reference skill)
Local copies of PostgreSQL parser sources are available at internal/gram.y and internal/scan.l for quick reference

Key Files Reference

Entry Point & CLI:

main.go - Entry point, loads .env and calls cmd.Execute()
cmd/root.go - Root CLI with global flags

IR Package (./ir):

ir/ir.go - Core IR data structures for all schema objects
ir/inspector.go - Database introspection using pgx (queries pg_catalog)
ir/normalize.go - Schema normalization (version-specific differences, type mappings)
ir/quote.go - Identifier quoting utilities
ir/ignore.go - IgnoreConfig for filtering database objects with glob patterns
ir/queries/ - sqlc-generated code for type-safe SQL queries (queries.sql, queries.sql.go, models.sql.go, dml.sql.go, sqlc.yaml)

Diff Package (internal/diff/):

diff.go - Main diff logic, schema comparison
table.go, column.go, constraint.go, index.go, trigger.go, view.go, function.go, procedure.go, sequence.go, type.go, policy.go - Object-specific diff operations
privilege.go, column_privilege.go, default_privilege.go - Permission management (GRANT/REVOKE, column-level privileges)
collector.go - SQL collection with context
header.go - Dump header generation
topological.go - Dependency sorting for migration ordering

Testing:

cmd/migrate_integration_test.go - Main integration test suite (TestPlanAndApply)
cmd/schema_integration_test.go - Schema-level integration tests
cmd/include_integration_test.go - Include file processing tests
cmd/ignore_integration_test.go - Ignore configuration tests
testdata/diff/ - 150+ diff test cases covering all schema object types
testdata/dump/ - 18 dump test suites (schema examples and issue-specific regressions)
testdata/include/ - Include file tests (domains, functions, materialized views, procedures, sequences, tables, types, views)
See Run Tests skill for complete testing workflows

Test Data Structure

Diff Tests (`testdata/diff/`)

Tests are organized by object type (150+ test cases):

comment/ (11), create_domain/ (5), create_function/ (8), create_index/ (2)
create_materialized_view/ (3), create_policy/ (10), create_procedure/ (3), create_sequence/ (3)
create_table/ (37), create_trigger/ (7), create_type/ (3), create_view/ (4)
default_privilege/ (9), privilege/ (13)
dependency/ (13), online/ (14), migrate/ (5)

Each test case contains: old.sql (starting state), new.sql (desired state), diff.sql (expected migration DDL), plus plan.json, plan.sql, plan.txt (plan output formats)

Dump Tests (`testdata/dump/`)

18 test suites with manifest.json, raw.sql, pgdump.sql, pgschema.sql:

Schema examples: bytebase/, employee/, sakila/, tenant/
Issue regressions: issue_125_*, issue_133_*, issue_183_*, issue_191_*, issue_252_*, issue_275_*, issue_307_*, issue_318_*, issue_320_*, issue_78_*, issue_80_*, issue_82_*, issue_83_*

Include Tests (`testdata/include/`)

8 categories: domains/, functions/, materialized_views/, procedures/, sequences/, tables/, types/, views/

For detailed test workflows, filtering, and regeneration, see the Run Tests skill.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Skills

Quick Start Commands

Build

Testing

Database Validation

Architecture

Core Components

Key Architecture Patterns

Common Development Workflows

Adding New Schema Object Support

Debugging Schema Extraction

Testing Changes

Supported Schema Objects

Environment Variables

Important Implementation Notes

Key Files Reference

Test Data Structure

Diff Tests (`testdata/diff/`)

Dump Tests (`testdata/dump/`)

Include Tests (`testdata/include/`)

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Skills

Quick Start Commands

Build

Testing

Database Validation

Architecture

Core Components

Key Architecture Patterns

Common Development Workflows

Adding New Schema Object Support

Debugging Schema Extraction

Testing Changes

Supported Schema Objects

Environment Variables

Important Implementation Notes

Key Files Reference

Test Data Structure

Diff Tests (testdata/diff/)

Dump Tests (testdata/dump/)

Include Tests (testdata/include/)

Diff Tests (`testdata/diff/`)

Dump Tests (`testdata/dump/`)

Include Tests (`testdata/include/`)