Add Databricks integration and to_databricks operator documentation#161

Draft
mavam wants to merge 16 commits into main from topic/databricks-integration

mavam commented Jan 16, 2026

Summary

Adds comprehensive documentation for the Databricks Unity Catalog integration:

  • Integration overview page explaining three ingestion methods (delta, iceberg, external_delta) with cost/governance trade-offs
  • to_databricks operator reference with detailed parameter documentation
  • Authentication setup guide with SQL permissions
  • Write-time optimization patterns and examples for different use cases
  • Decision matrix to help users choose the right ingestion method

Notes

  • Diagram file (databricks-basic.excalidraw) referenced in the integration page is pending and will be added as a follow-up
  • All linting checks pass
  • Integration added to sidebar under "Data Tools" section

Test Plan

  • Build succeeds: bun run build
  • Documentation renders correctly in dev server
  • All links resolve correctly
  • Sidebar entry appears in correct alphabetical position

🤖 Generated with Claude Code

github-actions bot added the reference (Reference documentation), integration (Integration documentation), and site (Site infrastructure) labels on Jan 16, 2026

github-actions bot commented Jan 16, 2026

📦 Preview  ·  View →  ·  🟢 Live

Auto-updates on push

mavam force-pushed the topic/databricks-integration branch 7 times, most recently from fc3aa68 to 05c833f, on January 16, 2026 08:07
mavam and others added 15 commits January 28, 2026 12:24
Adds comprehensive documentation for Databricks Unity Catalog integration,
including the to_databricks operator reference. Covers three ingestion methods
(delta, iceberg, external_delta) with cost and governance trade-offs, authentication
setup, and optimized write patterns. Integration guide provides decision matrix and
examples for common use cases.

Note: databricks-basic.excalidraw diagram referenced in integration page is pending
and will be added in a follow-up commit.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Replace Snappy with Zstd (our default compression)
- Use from_file instead of load_file/read_* pattern

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Remove external links from See Also section, use Op component for
operator reference.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Remove method parameter and planned methods (iceberg, external-delta) to
focus on the current delta-based implementation. Make warehouse_id required
since it's needed for COPY INTO. Update partition_by/sort_by types to
list<field> and show explicit partition column creation in examples.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
mavam force-pushed the topic/databricks-integration branch from 05c833f to 80ab689 on January 28, 2026 11:24
