spark-sql-kafka-0-8

License: Apache License, Version 2.0
GroupId: com.github.harbby
ArtifactId: spark-sql-kafka-0-8
Last Version: 1.0.1
Type: jar
Description: spark-sql-kafka-0-8
Project URL: https://github.com/harbby/spark-sql-kafka-0-8
Source Code Management: https://github.com/harbby/spark-sql-kafka-0-8

Download spark-sql-kafka-0-8

How to add to project

Maven:

<!-- https://jarcasting.com/artifacts/com.github.harbby/spark-sql-kafka-0-8/ -->
<dependency>
    <groupId>com.github.harbby</groupId>
    <artifactId>spark-sql-kafka-0-8</artifactId>
    <version>1.0.1</version>
</dependency>

Gradle (Groovy DSL):

// https://jarcasting.com/artifacts/com.github.harbby/spark-sql-kafka-0-8/
implementation 'com.github.harbby:spark-sql-kafka-0-8:1.0.1'

Gradle (Kotlin DSL):

// https://jarcasting.com/artifacts/com.github.harbby/spark-sql-kafka-0-8/
implementation("com.github.harbby:spark-sql-kafka-0-8:1.0.1")

Buildr:

'com.github.harbby:spark-sql-kafka-0-8:jar:1.0.1'

Ivy:

<dependency org="com.github.harbby" name="spark-sql-kafka-0-8" rev="1.0.1">
  <artifact name="spark-sql-kafka-0-8" type="jar" />
</dependency>

Grape:

@Grapes(
  @Grab(group='com.github.harbby', module='spark-sql-kafka-0-8', version='1.0.1')
)

SBT:

libraryDependencies += "com.github.harbby" % "spark-sql-kafka-0-8" % "1.0.1"

Leiningen:

[com.github.harbby/spark-sql-kafka-0-8 "1.0.1"]

Dependencies

compile (1)

Group / Artifact Type Version
org.apache.spark : spark-streaming-kafka-0-8_2.11 jar 2.4.2

test (1)

Group / Artifact Type Version
junit : junit jar 4.12

Project Modules

There are no modules declared in this project.

spark-sql-kafka-0-8

A Kafka source for Spark Structured Streaming.

Supports Kafka 0.8.2.1+ and Kafka 0.9.

License

Apache License, Version 2.0 (http://www.apache.org/licenses/LICENSE-2.0)

Maven

<dependency>
  <groupId>com.github.harbby</groupId>
  <artifactId>spark-sql-kafka-0-8</artifactId>
  <version>1.0.0</version>
</dependency>

Limitations

  • requires Spark 2.3+
  • the query must use a continuous trigger: writeStream.trigger(Trigger.Continuous(...))

Use

  • create
import org.apache.spark.sql.DataFrame

val sparkSession = ...

val kafka: DataFrame = sparkSession.readStream
    .format("kafka08")
    .option("topics", "topic1,topic2")
    .option("bootstrap.servers", "broker1:9092,broker2:9092")
    .option("group.id", "test1")
    .option("auto.offset.reset", "largest")  // largest or smallest
    .option("zookeeper.connect", "zk1:2181,zk2:2181")
    .option("auto.commit.enable", "true")
    .option("auto.commit.interval.ms", "5000")
    .load()
  • schema
kafka.printSchema()

root
 |-- _key: binary (nullable = true)
 |-- _message: binary (nullable = true)
 |-- _topic: string (nullable = false)
 |-- _partition: integer (nullable = false)
 |-- _offset: long (nullable = false)
  • sink
    dataFrame.writeStream
         .trigger(Trigger.Continuous(Duration(90, TimeUnit.SECONDS))) // a continuous trigger is required
         ...
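
For reference, below is a minimal end-to-end sketch that ties the steps above together: it reads from the kafka08 source, casts the binary _key and _message columns to strings, and writes to the console sink with the required continuous trigger. The broker, ZooKeeper, and topic names are placeholders, and the console sink and the 90-second interval are illustrative assumptions, not part of this project's documentation.

import java.util.concurrent.TimeUnit
import scala.concurrent.duration.Duration
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.streaming.Trigger

val spark = SparkSession.builder()
  .appName("kafka08-example")
  .master("local[*]") // placeholder; use your cluster's master
  .getOrCreate()

// Read from the kafka08 source (addresses and topic are placeholders).
val kafka = spark.readStream
  .format("kafka08")
  .option("topics", "topic1")
  .option("bootstrap.servers", "broker1:9092")
  .option("zookeeper.connect", "zk1:2181")
  .option("group.id", "example")
  .load()

// _key and _message arrive as binary; cast them to strings before use.
val messages = kafka.selectExpr(
  "CAST(_key AS STRING) AS key",
  "CAST(_message AS STRING) AS value",
  "_topic", "_partition", "_offset")

// Continuous processing mode is mandatory for this source.
val query = messages.writeStream
  .format("console")
  .trigger(Trigger.Continuous(Duration(90, TimeUnit.SECONDS)))
  .start()

query.awaitTermination()

Note that Spark's continuous processing mode only supports map-like operations (select, selectExpr, filter and the like), so any aggregation would have to happen downstream.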

Versions

Version
1.0.1
1.0.0
1.0.0-alpha3
1.0.0-alpha2
1.0.0-alpha1