Running Kafka Connect in distributed mode?

I have a total of 3 VMs (CloudVPS). Each of them has Java and Confluent Open Source installed. On VM1 I am running 3 Splunk sink connector processes, each reading from a different topic and listening on a different port. I posted a JSON configuration to each of them using REST calls.

Since I am running in distributed mode, I want to take advantage of the other 2 VMs as well. Can anyone tell me what I need to do to add the other 2 VMs to those 3 processes and achieve parallel processing?

Haily answered 6/3, 2018 at 4:52 Comment(0)

You just need to run Kafka Connect in distributed mode on all three VMs. Follow the instructions here, and make sure you give every worker the same group.id, which identifies them as members of the same cluster (and thus eligible to have connector tasks shared out across them). More config details for distributed mode here.
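
As a rough illustration of the point about group.id, here is a minimal sketch that renders one worker properties file per VM. The host names (vm1, vm2, vm3), the broker address, the group.id value, the topic names and the converter choice are all assumptions made for the example, not values from the question. The property keys themselves are standard Kafka Connect worker settings.

```python
# Sketch: build one distributed-worker config per VM.
# All values below are placeholders; only the property keys are real.
BROKERS = "broker1:9092"          # assumed broker address
VMS = ["vm1", "vm2", "vm3"]       # assumed host names of the three VMs

# Settings that must be identical on every worker in the same Connect cluster.
SHARED = {
    "bootstrap.servers": BROKERS,
    "group.id": "splunk-connect-cluster",        # assumed name; must match on all workers
    "config.storage.topic": "connect-configs",   # assumed topic names; must also match
    "offset.storage.topic": "connect-offsets",
    "status.storage.topic": "connect-status",
    # Assumed converter; keep whatever the existing workers already use.
    "key.converter": "org.apache.kafka.connect.json.JsonConverter",
    "value.converter": "org.apache.kafka.connect.json.JsonConverter",
}


def worker_properties(host: str) -> str:
    """Return the properties-file text for the worker running on `host`."""
    props = dict(SHARED)
    # The only per-host setting in this sketch: the address other workers
    # use to reach this worker's REST API.
    props["rest.advertised.host.name"] = host
    return "\n".join(f"{key}={value}" for key, value in props.items()) + "\n"


# Map each VM to the text of its worker properties file.
configs = {host: worker_properties(host) for host in VMS}
```

Each VM would then start a single worker with its own file, for example with the connect-distributed script that ships with Kafka and Confluent Platform. Once the workers have joined the same cluster, a connector posted to any one worker's REST API is visible to all of them, and its tasks (up to the connector's tasks.max setting) can be spread across the VMs.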


Prairial answered 6/3, 2018 at 11:24 Comment(2)
Do you have a similar document or Helm charts for Kafka Connect distributed mode on a Kubernetes cluster? Peninsula
Does the group.id property still exist in Apache Kafka? Acropetal

© 2022 - 2024 — McMap. All rights reserved.