[fix][docs] Update Debezium Postgres connector docs to v2.x properties and resolve schema mismatch #1137
Merged
lhotari merged 2 commits into apache:main on May 4, 2026
lhotari (Member) approved these changes on May 4, 2026 and left a comment:
Thanks for the contribution @Praveenkumar76
Fixes apache/pulsar#23662
Motivation
The documented Debezium PostgreSQL CDC example currently relies on outdated Debezium 1.x property names (such as `database.server.name`, `schema.whitelist`, and `table.whitelist`). When users copy and paste these into modern Pulsar environments using Debezium 2.x, validation fails.
Additionally, the documentation does not specify the correct converters. Without explicitly defined string converters, users experience silent data drops caused by `KeyValue` schema mismatches, which prevent messages from being published to the Pulsar topic.
Modifications
Updated the PostgreSQL Debezium source configuration examples (both JSON and YAML) in the documentation to reflect Debezium 2.x standards:
- Replaced `database.server.name` with `topic.prefix`.
- Replaced `schema.whitelist` with `schema.include.list`.
- Replaced `table.whitelist` with `table.include.list`.
- Added `key.converter` and `value.converter`, set to `org.apache.kafka.connect.storage.StringConverter`, to prevent silent data drops due to schema mismatches.
- Updated the `localrun` CLI command example to include `--destination-topic-name` to ensure proper routing.
Verifying this change
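Before running the connector, a configuration can also be checked against the renames listed under Modifications. The following is a minimal sketch of such a check and is not part of this PR: `RENAMED_KEYS`, `migrate_debezium_config`, and `legacy_keys` are hypothetical names, and the mapping covers only the three properties named above, so other Debezium 1.x properties would need their own entries.

```python
# Hypothetical helper for illustration; not shipped with Pulsar or Debezium.

# Debezium 1.x property name -> Debezium 2.x property name,
# limited to the three renames described in this PR.
RENAMED_KEYS = {
    "database.server.name": "topic.prefix",
    "schema.whitelist": "schema.include.list",
    "table.whitelist": "table.include.list",
}

STRING_CONVERTER = "org.apache.kafka.connect.storage.StringConverter"


def migrate_debezium_config(old):
    """Return a copy of `old` with 1.x keys renamed and string converters set."""
    new = {}
    for key, value in old.items():
        # Keys without a known rename are copied through unchanged.
        new[RENAMED_KEYS.get(key, key)] = value
    # Only fill in converters the caller has not already chosen.
    new.setdefault("key.converter", STRING_CONVERTER)
    new.setdefault("value.converter", STRING_CONVERTER)
    return new


def legacy_keys(config):
    """List any Debezium 1.x property names still present in `config`."""
    return sorted(key for key in config if key in RENAMED_KEYS)


if __name__ == "__main__":
    old_config = {
        "database.hostname": "localhost",
        "database.server.name": "dbserver1",
        "schema.whitelist": "public",
        "table.whitelist": "public.users",
    }
    print("legacy keys before:", legacy_keys(old_config))
    print("migrated config:", migrate_debezium_config(old_config))
```

An empty result from `legacy_keys` only means that none of these three 1.x names remain; it does not validate the configuration against the connector itself.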
This documentation change was verified manually using the updated configurations with:
- `pulsar-io-debezium-postgres-4.x.x.nar`
Validation steps:
- Enabled logical replication in PostgreSQL (`wal_level=logical`).
- Ran the `localrun` command using the newly updated configuration properties.
- Performed `INSERT`, `UPDATE`, and `DELETE` operations on the source table.
- Consumed messages from `persistent://public/default/dbserver1.public.users`.
Example consumer output successfully captured CDC events:
A Quick Reminder for Your File Edits
Just to make sure your actual markdown edits match this PR description, double-check that the JSON block in the docs file (`docs/io-debezium-source.md` or similar) looks exactly like this now:
```json
{
  "database.hostname": "localhost",
  "database.port": "5432",
  "database.user": "postgres",
  "database.password": "changeme",
  "database.dbname": "postgres",
  "topic.prefix": "dbserver1",
  "plugin.name": "pgoutput",
  "schema.include.list": "public",
  "table.include.list": "public.users",
  "key.converter": "org.apache.kafka.connect.storage.StringConverter",
  "value.converter": "org.apache.kafka.connect.storage.StringConverter",
  "database.history.pulsar.service.url": "pulsar://127.0.0.1:6650"
}
```
Does this pull request potentially affect one of the following parts?