How to choose Kafka transaction id for several applications, hosted in Kubernetes?

7/24/2019

I have a classic microservice architecture. So, there are differ applications. Each application may have 1..N instances. The system is deployed to Kubernetes. So, we have many differ PODs, which can start and stop in any time.

I want to implement read-process-write pattern, so I need Kafka transactions.

To configure transactions, I need to set some transaction id for each Kafka producer. (Actually, I need transaction-id-prefix, because of I use Spring for my applications, and it has such API). These IDs have to be the same, after application is restarted.

So, how to choose Kafka transaction id for several applications, hosted in Kubernetes?

-- Denis

apache-kafka

java

kafka-producer-api

kubernetes

spring-kafka

Similar Questions

Changing image of kubernetes job

Using k8s secret in httpHeaders for livenessProbe

How to pass environmental variables in envconsul config file?

how does kubernetes guarantee reliability of kube proxy and kubelet?

No external DNS requests are being resolved inside Kubernetes cluster

How to access container environment variables with kubernetes env?

1 Answer

7/24/2019

If the consumer starts the transaction (read-process-write) then the transaction id prefix must be the same for all instances of the same app (so that zombie fencing works correctly after a rebalance). The actual transaction id used is <prefix><group>.<topic>.<partition>.

If you have multiple apps, they should have unique prefixes (although if they consume from different topics, they will be unique anyway).

For producer-only transactions, the prefix must be unique in each instance (to prevent kafka fencing the producers).

-- Gary Russell

Source: StackOverflow