It is a thin wrapper around the BigQuery Java client library that enables streaming writes of Spark DStreams to BigQuery tables.
It is quite lightweight, with the BigQuery Java client as its only dependency:

```scala
libraryDependencies += "com.google.cloud" % "google-cloud-bigquery" % "1.103.0"
```
The main interface to the wrapper is the `BigQueryWriter` object, which can be used as follows:
```scala
// Convert each message into a TableRow to be streamed into BigQuery.
def toTableRow(s: String): TableRow = {
  val num = s.toInt
  val row = new TableRow()
  row.put("number", num.asInstanceOf[Object])
  row
}

// Describe the destination table (and its schema) for each message.
def toTableMetadata(s: String): BigQueryTableMetadata = {
  val t = TableId.of("<DATASET>", "<TABLE-NAME>")
  BigQueryTableMetadata(t, SchemaUtils.convertTableRowToBQSchema(toTableRow(s)), false, Nil)
}

// messages is a DStream[String]; each record is converted and written.
BigQueryWriter.write[String](messages, toTableRow, toTableMetadata)
```
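For context, `messages` above is a `DStream[String]`. A minimal sketch of wiring the writer into a Spark Streaming app might look like the following; the socket source, host, port, and app name are hypothetical, and only `BigQueryWriter.write` comes from this library:

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

val conf = new SparkConf().setAppName("bq-writer-example")
val ssc  = new StreamingContext(conf, Seconds(10))

// Hypothetical source: each line arriving on the socket is expected to be a number.
val messages = ssc.socketTextStream("localhost", 9999)

// Stream every micro-batch into BigQuery using the converters defined above.
BigQueryWriter.write[String](messages, toTableRow, toTableMetadata)

ssc.start()
ssc.awaitTermination()
```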
The function `toTableMetadata` can be used to choose the destination table on a per-row basis. This design is inspired by Apache Beam's DynamicDestinations.
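For example, a converter could route each record to a different table depending on its content. The dataset and table names below are hypothetical; `BigQueryTableMetadata` is used with the same signature as in the example above:

```scala
// Route even and odd numbers to separate tables (hypothetical dataset/table names).
def toTableMetadata(s: String): BigQueryTableMetadata = {
  val tableName = if (s.toInt % 2 == 0) "even_numbers" else "odd_numbers"
  val t = TableId.of("my_dataset", tableName)
  BigQueryTableMetadata(t, SchemaUtils.convertTableRowToBQSchema(toTableRow(s)), false, Nil)
}
```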
Since we don't have a central JFrog or Sonatype repository, the only way to consume this library is to publish it to a local Maven/Ivy repository.
Clone the repo and publish it to your local Ivy repository:

```
sbt publishLocal
```
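If the consuming app builds with Maven rather than sbt, you may need to publish to the local Maven repository instead, which sbt's `publishM2` task does:

```
sbt publishM2
```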
After this, add the following dependency to your Spark app to use the library.
Maven:

```xml
<dependency>
  <groupId>com.tokopedia</groupId>
  <artifactId>spark-bigquery-writer_2.12</artifactId>
  <version>0.2</version>
</dependency>
```

sbt:

```scala
libraryDependencies += "com.tokopedia" %% "spark-bigquery-writer" % "0.2"
```
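For an sbt-based Spark app, the full dependency block might look like the sketch below; the Spark version is an assumption, and only the `spark-bigquery-writer` coordinate comes from this README:

```scala
// Hypothetical build.sbt fragment for a consuming Spark Streaming app.
libraryDependencies ++= Seq(
  // Spark is typically "provided" by the cluster at runtime; version is an assumption.
  "org.apache.spark" %% "spark-streaming"        % "2.4.5" % "provided",
  "com.tokopedia"    %% "spark-bigquery-writer"  % "0.2"
)
```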