Auto-delete rows using Kafka?

Posted 2025-02-06 06:18:49

I have an in-memory database and I'm using Kafka + JdbcSinkConnector to sync a downstream Postgres database with the in-memory database. The in-memory database is for efficient computations and Postgres is for querying. In development, I frequently destroy and recreate the in-memory database. Each time, I also recreate the Kafka sink connectors.

If new rows were added or existing rows were changed in the in-memory database, I think JdbcSinkConnector is able to sync Postgres with the new data. However, if rows were deleted, JdbcSinkConnector doesn't delete the rows in Postgres.

Is it possible for JdbcSinkConnector to check which of the rows in the downstream database are no longer in the upstream database, then delete them? If not, I'd have to destroy the downstream database every time I update the upstream database.

Config:

{
  'connector.class': 'io.confluent.connect.jdbc.JdbcSinkConnector',
  'dialect.name': 'PostgreSqlDatabaseDialect',
  'key.converter': 'io.confluent.connect.avro.AvroConverter',
  'key.converter.schema.registry.url': `http://schema-registry:${process.env.SCHEMA_REGISTRY_PORT}`,
  'value.converter': 'io.confluent.connect.avro.AvroConverter',
  'value.converter.schema.registry.url': `http://schema-registry:${process.env.SCHEMA_REGISTRY_PORT}`,
  'insert.mode': 'upsert',
  'delete.enabled': 'true',
  'auto.create': 'true',
  'auto.evolve': 'false',
  'errors.retry.timeout': -1,
  'connection.url': `jdbc:postgresql://${process.env.INTERNAL_DOCKER_HOST}:${process.env.PG_PORT}/${process.env.PG_DB}`,
  'connection.user': process.env.PG_USER,
  'connection.password': process.env.PG_PASS,
  'pk.mode': 'record_key',
}
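
For reference, a minimal sketch of how a config object like this can be (re)created as a connector through the Kafka Connect REST API (PUT /connectors/{name}/config creates or updates the connector). The Connect URL and the connector name 'pg-sink' are assumptions for illustration, not taken from the question:

// Sketch: create or update the sink connector via Kafka Connect's REST API.
// CONNECT_URL and the connector name 'pg-sink' are illustrative assumptions.
const CONNECT_URL = process.env.CONNECT_URL ?? 'http://localhost:8083';

const config = {
  'connector.class': 'io.confluent.connect.jdbc.JdbcSinkConnector',
  // ... same key/value pairs as the config above ...
  'pk.mode': 'record_key',
};

async function upsertConnector(name, connectorConfig) {
  // PUT /connectors/{name}/config creates the connector if it doesn't exist,
  // or updates its configuration if it does.
  const res = await fetch(`${CONNECT_URL}/connectors/${name}/config`, {
    method: 'PUT',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(connectorConfig),
  });
  if (!res.ok) throw new Error(`Connect API returned ${res.status}: ${await res.text()}`);
  return res.json();
}

upsertConnector('pg-sink', config).then(console.log).catch(console.error);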

Comments (1)

沩ん囻菔务 2025-02-13 06:18:49

"...possible for JdbcSinkConnector to check which of the rows in the downstream database are no longer in the upstream database"

It's fully decoupled from any knowledge of the upstream system, so no.

"I'd have to destroy the downstream database every time I update the upstream database."

Truncate, don't destroy/drop, unless your upstream database schema changes. You'd need some external notification event to trigger that; it could be a REST API call, not necessarily a Kafka event.
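
As a rough illustration of that suggestion, here is a sketch that truncates the downstream tables with the node-postgres ('pg') client when such a reset notification arrives. The table names and the trigger mechanism are assumptions, not part of the question:

// Sketch: truncate the downstream tables instead of dropping the database.
// Assumes the 'pg' (node-postgres) client; table names are hypothetical.
const { Client } = require('pg');

async function truncateDownstream(tables) {
  const client = new Client({
    host: process.env.INTERNAL_DOCKER_HOST,
    port: process.env.PG_PORT,
    database: process.env.PG_DB,
    user: process.env.PG_USER,
    password: process.env.PG_PASS,
  });
  await client.connect();
  try {
    // TRUNCATE clears the data but keeps the schema that auto.create built.
    await client.query(`TRUNCATE TABLE ${tables.join(', ')}`);
  } finally {
    await client.end();
  }
}

// Call this from whatever signals the reset, e.g. a small REST endpoint:
// truncateDownstream(['my_table_a', 'my_table_b']);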


It's unclear why you need to use Materialize when Kafka Streams KTables would also give you fast storage backed by topics, and they support writing the tombstone events that the JDBC sink needs in order to delete data.
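
To make the tombstone point concrete, here is a minimal sketch of producing one (an Avro-encoded key with a null value) so that, with delete.enabled=true and pk.mode=record_key, the sink deletes the matching Postgres row. It assumes the kafkajs and @kafkajs/confluent-schema-registry clients; the topic name, key shape, and broker address are made up for illustration:

// Sketch: emit a tombstone (key + null value) so the JDBC sink deletes the row.
// kafkajs and @kafkajs/confluent-schema-registry are assumed; the topic and
// key shape below are illustrative, not taken from the question.
const { Kafka } = require('kafkajs');
const { SchemaRegistry } = require('@kafkajs/confluent-schema-registry');

const kafka = new Kafka({ brokers: [process.env.KAFKA_BROKER ?? 'localhost:9092'] });
const registry = new SchemaRegistry({
  host: `http://schema-registry:${process.env.SCHEMA_REGISTRY_PORT}`,
});

async function deleteRow(topic, key) {
  const producer = kafka.producer();
  await producer.connect();
  // The key must be Avro-encoded with the same key schema the source uses,
  // otherwise the sink can't match it against pk.mode=record_key.
  const keySchemaId = await registry.getLatestSchemaId(`${topic}-key`);
  const encodedKey = await registry.encode(keySchemaId, key);
  // A null value is the tombstone that delete.enabled=true turns into a DELETE.
  await producer.send({ topic, messages: [{ key: encodedKey, value: null }] });
  await producer.disconnect();
}

// deleteRow('my_table', { id: 42 });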
