Commit 08e8df8

feat: Add Debezium-style Snapshot for Initial Data Capture (#9)
* feat: add snapshot updates and handle it
* feat: bump go-pq-cdc
* chore: fix security gates pipeline problem
* feat: snapshot only mode handle gracefully
* feat: bump go-pq-cdc
* chore: docs added
* chore: add example and docs for snapshot
* chore: fix lint
1 parent 7524d20 commit 08e8df8

File tree

13 files changed: +687 −217 lines

.github/workflows/build.yml

Lines changed: 1 addition & 0 deletions

```diff
@@ -48,3 +48,4 @@ jobs:
       actions: read
       contents: read
       security-events: write
+    secrets: inherit
```

README.md

Lines changed: 76 additions & 1 deletion

```diff
@@ -10,6 +10,7 @@ Kafka connector for [go-pq-cdc](https://github.com/Trendyol/go-pq-cdc).
 
 - **Optimized for Speed and Efficiency**: Minimal resource consumption and faster processing, designed to handle high-throughput data replication.
 - **Real-Time Data Streaming**: Streams data directly from PostgreSQL to Kafka, ensuring up-to-date synchronization across systems.
+- **Initial Snapshot Support**: Capture existing data before starting CDC, ensuring downstream systems receive both historical and real-time data.
 - **Automatic Failover**: In the event of a failure, `go-pq-cdc-kafka` can quickly recover and resume data replication.
 - **Concurrency**: Built with Go's concurrency model (goroutines and channels), ensuring lightweight and highly performant parallel operations.
```

````diff
@@ -38,8 +39,63 @@ The `go-pq-cdc-kafka` ensures high availability with passive/active modes for Po
 - **Passive Mode**: If the replication slot becomes inactive, it automatically captures the slot and resumes data streaming. Additionally, other deployments monitor the slot’s status, ensuring redundancy and failover capabilities.
 
-This architecture guarantees minimal downtime and continuous data synchronization, even in the event of failure. Additionally, Gos faster cold starts provide quicker recovery times compared to Debezium, further minimizing potential downtime.
+This architecture guarantees minimal downtime and continuous data synchronization, even in the event of failure. Additionally, Go's faster cold starts provide quicker recovery times compared to Debezium, further minimizing potential downtime.
 
+## 📸 NEW: Snapshot Feature
+
+**Capture existing data before starting CDC!** The snapshot feature enables initial data synchronization, ensuring downstream systems (Kafka) receive both historical and real-time data.
+
+**Key Highlights:**
+
+- **Zero Data Loss**: Consistent point-in-time snapshot using PostgreSQL's `pg_export_snapshot()`
+- **Chunk-Based Processing**: Memory-efficient processing of large tables
+- **Multi-Instance Support**: Parallel processing across multiple instances for faster snapshots
+- **Crash Recovery**: Automatic resume from failures with chunk-level tracking
+- **No Duplicates**: Seamless transition from snapshot to CDC mode
+- **Flexible Modes**: Choose between `initial`, `never`, or `snapshot_only` based on your needs
+
+### Snapshot Modes
+
+| Mode            | Description                                                           | Use Case                                          |
+|-----------------|-----------------------------------------------------------------------|---------------------------------------------------|
+| `initial`       | Takes a snapshot only if no previous snapshot exists, then starts CDC | First-time setup with existing data               |
+| `never`         | Skips the snapshot entirely and starts CDC immediately                | New tables, or when historical data is not needed |
+| `snapshot_only` | Takes a snapshot and exits (no CDC, no replication slot required)     | One-time data migration or backfill               |
+
+### How It Works
+
+1. **Snapshot Phase**: Captures existing data in chunks for memory efficiency
+2. **Consistent Point**: Uses PostgreSQL's `pg_export_snapshot()` to ensure data consistency
+3. **CDC Phase**: Seamlessly transitions to real-time change data capture
+4. **No Gaps**: Ensures all changes made during the snapshot are captured via CDC
+
+### Identifying Snapshot vs CDC Messages
+
+Your handler function can distinguish between snapshot and CDC messages:
+
+```go
+func Handler(msg *cdc.Message) []gokafka.Message {
+	// Check if this is a snapshot message (historical data)
+	if msg.Type.IsSnapshot() {
+		// Handle snapshot data; key and data are built from msg elsewhere
+		return []gokafka.Message{{
+			Headers: []gokafka.Header{
+				{Key: "operation", Value: []byte("SNAPSHOT")},
+				{Key: "source", Value: []byte("initial-snapshot")},
+			},
+			Key:   key,
+			Value: data,
+		}}
+	}
+
+	// Handle real-time CDC operations
+	if msg.Type.IsInsert() { /* ... */ }
+	if msg.Type.IsUpdate() { /* ... */ }
+	if msg.Type.IsDelete() { /* ... */ }
+	return nil
+}
+```
+
+For detailed configuration and usage, see the [snapshot example](./example/snapshot).
 
 ## Usage
````
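The chunk-based processing described in the README addition above can be illustrated with a small, pure helper: given a table's row count and the configured `cdc.snapshot.chunkSize`, it computes the row ranges that snapshot workers would claim. This is only a sketch of the idea; `planChunks` and the `chunk` type are hypothetical names, not part of the go-pq-cdc API.

```go
package main

import "fmt"

// chunk is a half-open row range [Start, End) that a snapshot worker claims.
// Hypothetical type for illustration; go-pq-cdc's real bookkeeping differs.
type chunk struct {
	Start, End int64
}

// planChunks splits rowCount rows into chunks of at most chunkSize rows,
// mirroring the "chunk-based processing" idea described above.
func planChunks(rowCount, chunkSize int64) []chunk {
	var chunks []chunk
	for start := int64(0); start < rowCount; start += chunkSize {
		end := start + chunkSize
		if end > rowCount {
			end = rowCount
		}
		chunks = append(chunks, chunk{Start: start, End: end})
	}
	return chunks
}

func main() {
	// 20,000 rows with the default chunkSize of 8,000 yields 3 chunks.
	for _, c := range planChunks(20000, 8000) {
		fmt.Printf("chunk [%d, %d)\n", c.Start, c.End)
	}
}
```

Smaller chunks bound per-chunk memory and make crash recovery cheaper (less work is redone when a stale chunk is reclaimed), at the cost of more chunk-tracking overhead.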

```diff
@@ -159,6 +215,7 @@ func Handler(msg *cdc.Message) []gokafka.Message {
 ## Examples
 
 * [Simple](./example/simple)
+* [Snapshot](./example/snapshot)
 
 ## Configuration
 
```
```diff
@@ -182,6 +239,13 @@ func Handler(msg *cdc.Message) []gokafka.Message {
 | `cdc.slot.createIfNotExists` | bool | no | - | Create the replication slot if it does not exist; otherwise a `replication slot is not exists` error is returned. | |
 | `cdc.slot.name` | string | yes | - | Set the logical replication slot name | Should be unique and descriptive. |
 | `cdc.slot.slotActivityCheckerInterval` | int | yes | 1000 | Set the slot activity check interval time in milliseconds | Specify as an integer value in milliseconds (e.g., `1000` for 1 second). |
+| `cdc.snapshot.enabled` | bool | no | false | Enable the initial snapshot feature | When enabled, captures existing data before starting CDC. |
+| `cdc.snapshot.mode` | string | no | never | Snapshot mode: `initial`, `never`, or `snapshot_only` | **initial:** Take a snapshot only if no previous snapshot exists, then start CDC. <br> **never:** Skip the snapshot, start CDC immediately. <br> **snapshot_only:** Take a snapshot and exit (no CDC). |
+| `cdc.snapshot.chunkSize` | int64 | no | 8000 | Number of rows per chunk during the snapshot | Adjust based on table size. Larger chunks mean fewer chunks but more memory per chunk. |
+| `cdc.snapshot.claimTimeout` | time.Duration | no | 30s | Timeout to reclaim stale chunks | If a worker doesn't send a heartbeat for this duration, its chunk is reclaimed by another worker. |
+| `cdc.snapshot.heartbeatInterval` | time.Duration | no | 5s | Interval for worker heartbeat updates | Workers send a heartbeat at this interval to indicate they're processing a chunk. |
+| `cdc.snapshot.instanceId` | string | no | auto | Custom instance identifier (optional) | Auto-generated as hostname-pid if not specified. Useful for tracking workers in multi-instance scenarios. |
+| `cdc.snapshot.tables` | []Table | no* | - | Tables to snapshot (required for `snapshot_only` mode, optional for `initial` mode) | **snapshot_only:** Must be specified here (independent from the publication). <br> **initial:** If specified, must be a subset of the publication tables; if not specified, all publication tables are snapshotted. |
 | `kafka.tableTopicMapping` | map[string]string | yes | - | Mapping of PostgreSQL table events to Kafka topics | Maps table names to Kafka topics. |
 | `kafka.brokers` | []string | yes | - | Broker IP and port information | |
 | `kafka.producerBatchSize` | integer | no | 2000 | Maximum message count per batch; if exceeded, a flush is triggered. | |
```
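Putting the snapshot keys from the table above together, a minimal snapshot section of the connector configuration might look like the following sketch. The exact file layout depends on how you build the `config.Connector` in your project; the values shown are the documented defaults plus an enabled `initial` mode.

```yaml
cdc:
  snapshot:
    enabled: true          # capture existing data before CDC starts
    mode: initial          # initial | never | snapshot_only
    chunkSize: 8000        # rows per chunk (default)
    claimTimeout: 30s      # reclaim a chunk if its worker stops heartbeating
    heartbeatInterval: 5s  # how often workers report progress on a chunk
    # instanceId: worker-1 # optional; auto-generated as hostname-pid
```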
```diff
@@ -223,6 +287,17 @@ the `/metrics` endpoint.
 | go_pq_cdc_kafka_write_total | The total number of successful write operations to Kafka. | slot_name, host, topic_name | Counter |
 | go_pq_cdc_kafka_err_total | The total number of unsuccessful write operations to Kafka. | slot_name, host, topic_name | Counter |
 
+### Snapshot Metrics
+
+| Metric Name                         | Description                                                                           | Labels          | Value Type |
+|-------------------------------------|---------------------------------------------------------------------------------------|-----------------|------------|
+| go_pq_cdc_snapshot_in_progress      | Indicates whether a snapshot is currently in progress (1 for active, 0 for inactive). | slot_name, host | Gauge      |
+| go_pq_cdc_snapshot_total_tables     | Total number of tables to snapshot.                                                   | slot_name, host | Gauge      |
+| go_pq_cdc_snapshot_total_chunks     | Total number of chunks to process across all tables.                                  | slot_name, host | Gauge      |
+| go_pq_cdc_snapshot_completed_chunks | Number of chunks completed in the snapshot.                                           | slot_name, host | Gauge      |
+| go_pq_cdc_snapshot_total_rows       | Total number of rows read during the snapshot.                                        | slot_name, host | Counter    |
+| go_pq_cdc_snapshot_duration_seconds | Duration of the last snapshot operation in seconds.                                   | slot_name, host | Gauge      |
+
 You can also use all CDC-related metrics explained [here](https://github.com/Trendyol/go-pq-cdc#exposed-metrics).
 All CDC-related metrics are injected automatically, so you don't need to do anything.
```
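The chunk gauges above combine naturally into a progress ratio. For example, this illustrative Prometheus query (not shipped with the project) reports per-slot snapshot completion as a fraction between 0 and 1:

```promql
go_pq_cdc_snapshot_completed_chunks / go_pq_cdc_snapshot_total_chunks
```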

connector.go

Lines changed: 18 additions & 0 deletions

```diff
@@ -78,6 +78,22 @@ func NewConnector(ctx context.Context, config config.Connector, handler Handler,
 }
 
 func (c *connector) Start(ctx context.Context) {
+	// Snapshot-only mode: different flow since upstream CDC exits immediately
+	if c.cfg.CDC.IsSnapshotOnlyMode() {
+		logger.Info("starting snapshot-only mode")
+		logger.Info("bulk process started")
+		c.producer.StartBatch()
+
+		// Signal ready immediately since there's no CDC to wait for
+		c.readyCh <- struct{}{}
+
+		// Start CDC synchronously - it will execute the snapshot and return
+		c.cdc.Start(ctx)
+		logger.Info("snapshot-only mode completed")
+		return
+	}
+
+	// Normal CDC mode: async flow
 	go func() {
 		logger.Info("waiting for connector start...")
 		if err := c.cdc.WaitUntilReady(ctx); err != nil {
@@ -119,6 +135,8 @@ func (c *connector) listener(ctx *replication.ListenerContext) {
 		msg = NewUpdateMessage(m)
 	case *format.Delete:
 		msg = NewDeleteMessage(m)
+	case *format.Snapshot:
+		msg = NewSnapshotMessage(m)
 	default:
 		return
 	}
```
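The snapshot-only branch added above signals readiness before running synchronously, whereas normal mode defers the signal until CDC is up. A stripped-down, self-contained sketch of that control flow is shown below; `start`, `readyCh`, and `work` are illustrative names, not the connector's actual API.

```go
package main

import "fmt"

// start mimics the connector's Start: in snapshot-only mode it signals
// readiness up front, runs the work synchronously, and returns.
func start(snapshotOnly bool, readyCh chan struct{}, work func()) {
	if snapshotOnly {
		readyCh <- struct{}{} // nothing to wait for: ready immediately
		work()                // runs the snapshot, then we fall through
		fmt.Println("snapshot-only mode completed")
		return
	}
	// Normal mode: readiness is signaled asynchronously once CDC is up.
	go func() {
		readyCh <- struct{}{}
	}()
	work()
}

func main() {
	readyCh := make(chan struct{}, 1) // buffered so the signal never blocks
	start(true, readyCh, func() { fmt.Println("snapshot running") })
	<-readyCh // the ready signal was sent before the work ran
}
```

The synchronous path matters because in `snapshot_only` mode the process is expected to exit once the snapshot finishes; there is no long-lived replication stream to supervise.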

example/simple/docker-compose.yml

Lines changed: 12 additions & 42 deletions

```diff
@@ -9,7 +9,17 @@ services:
       POSTGRES_PASSWORD: "cdc_pass"
       POSTGRES_DB: "cdc_db"
       POSTGRES_HOST_AUTH_METHOD: trust
-    network_mode: "host"
+    volumes:
+      - postgres_data:/var/lib/postgresql/data
+    ports:
+      - "5432:5432"
+    networks:
+      - cdc_net
+    healthcheck:
+      test: ["CMD-SHELL", "pg_isready -U cdc_user -d cdc_db"]
+      interval: 10s
+      timeout: 5s
+      retries: 5
 
   ##### KAFKA ######
   redpanda:
@@ -57,57 +67,17 @@ services:
       adminApi:
         enabled: true
         urls: ["http://redpanda:9644"]
-      connect:
-        enabled: true
-        clusters:
-          - name: local-connect-cluster
-            url: http://connect:8083
     ports:
       - 8085:8080
     networks:
       - cdc_net
     depends_on:
      - redpanda
 
-  connect:
-    image: docker.redpanda.com/redpandadata/connectors:latest
-    hostname: connect
-    container_name: connect
-    networks:
-      - cdc_net
-    # platform: 'linux/amd64'
-    depends_on:
-      - redpanda
-    ports:
-      - "8083:8083"
-    environment:
-      CONNECT_CONFIGURATION: |
-        key.converter=org.apache.kafka.connect.converters.ByteArrayConverter
-        value.converter=org.apache.kafka.connect.converters.ByteArrayConverter
-        group.id=connectors-cluster
-        offset.storage.topic=_internal_connectors_offsets
-        config.storage.topic=_internal_connectors_configs
-        status.storage.topic=_internal_connectors_status
-        config.storage.replication.factor=-1
-        offset.storage.replication.factor=-1
-        status.storage.replication.factor=-1
-        offset.flush.interval.ms=1000
-        producer.linger.ms=50
-        producer.batch.size=131072
-      CONNECT_BOOTSTRAP_SERVERS: redpanda:9092
-      CONNECT_GC_LOG_ENABLED: "false"
-      CONNECT_HEAP_OPTS: -Xms512M -Xmx512M
-      CONNECT_LOG_LEVEL: info
-
 volumes:
-  couchbase_data: null
   postgres_data: null
-  zookeeper_data: null
-  zookeeper_log: null
-  kafka_data: null
   redpanda_data: null
-  kibana1_data: null
-  es01_data: null
+
 networks:
   cdc_net:
     driver: bridge
```

example/simple/go.mod

Lines changed: 0 additions & 38 deletions
This file was deleted.
