spark-kafka-integration Questions
2
Solved
I'm running Spark version 2.3.0.2.6.5.1175-1 with Python 3.
6.8 on Ambari. While submitting the application I get the following logs in stderr
22/06/15 12:29:31 INFO StateStoreCoordinatorRef: Regi...
Closefisted asked 15/6, 2022 at 7:52
1
Solved
I'm having problem understanding how to connect Kafka and PySpark.
I have kafka installation on Windows 10 with topic nicely streaming data.
I've installed pyspark which runs properly-I'm able to c...
Kilimanjaro asked 16/12, 2021 at 6:12
2
Solved
I understand that using Kafka's own offset tracking instead of other methods (like checkpointing) is problematic for streaming jobs.
However I just want to run a Spark batch job every day, reading ...
Cason asked 27/1, 2021 at 21:16
4
Solved
I want to use Spark Structured Streaming to read from a secure kafka. This means that I will need to force a specific group.id. However, as is stated in the documentation this is not possible.
Stil...
Thermograph asked 26/3, 2019 at 10:52
1
Solved
Based on the introduction in Spark 3.0, https://spark.apache.org/docs/latest/structured-streaming-kafka-integration.html. It should be possible to set "kafka.group.id" to track the offset...
Laxity asked 22/9, 2020 at 5:10
1
Solved
Kafka with spark-streaming throws an error:
from pyspark.streaming.kafka import KafkaUtils ImportError: No module named kafka
I have already setup a kafka broker and a working spark environment wi...
Flytrap asked 19/5, 2020 at 13:4
1
I was going through the Spark structured streaming - Kafka integration guide here.
It is told at this link that
enable.auto.commit: Kafka source doesn’t commit any offset.
So how do I manual...
Ragout asked 13/6, 2018 at 19:1
1
© 2022 - 2024 — McMap. All rights reserved.