【发布时间】:2020-06-11 12:55:59
【问题描述】:
我们有一个 Spark Java 应用程序,它从数据库中读取数据并在 Kafka 上发布消息。当我们使用以下参数在 Windows 命令行上本地执行作业时,它按预期工作:
bin/spark-submit -class com.data.ingestion.DataIngestion --jars local:///opt/spark/jars/spark-sql-kafka-0-10_2.11-2.3.0.jar local:///opt/spark/jars/data-ingestion-1.0-SNAPSHOT.jar
spark-submit --packages org.apache.spark:spark-sql-kafka-0-10_2.11:2.3.0 --class com.data.ingestion.DataIngestion data-ingestion-1.0-SNAPSHOT.jar
同样,当尝试使用 k8s master 运行命令时
bin/spark-submit --master k8s://https://172.16.3.105:8443 --deploy-mode cluster --conf spark.kubernetes.container.image=localhost:5000/spark-example:0.2 --class com.data.ingestion.DataIngestion --jars local:///opt/spark/jars/spark-sql-kafka-0-10_2.11-2.3.0.jar local:///opt/spark/jars/data-ingestion-1.0-SNAPSHOT.jar
它给出以下错误:
Exception in thread "main" java.util.ServiceConfigurationError:
org.apache.spark.sql.sources.DataSourceRegister: Provider
org.apache.spark.sql.kafka010.KafkaSourceProvider could not be instantiated
【问题讨论】:
-
您使用的是什么数据库?听起来你可以改用 Debezium
标签: java apache-spark kubernetes apache-kafka