Spark
Configure oleander with Apache Spark
Overview
For Spark, the OpenLineage integration is published to Maven. A comprehensive guide on how to configure and use the OpenLineage integration with Apache Spark can be found at OpenLineage documentation.
Getting Started
In this tutorial, you’ll be running a simple self-contained Spark application with Iceberg locally via Docker Compose.
We’ll use an existing example iceberg table demo.nyc.taxis
used in the Iceberg quickstart guide for Spark.
Our team is based in SF, so naturally, we’ll be using demo.sf.waymo
.
Clone
Clone our tutorial repository and navigate to the spark
directory:
Requirements
Oleander API Key
Go to https://oleander.dev
, then copy your API key.
In the step below, replace [OLEANDER-API-KEY]
with your API key before running the Spark job.
Running the Spark Job
Next, start up the docker containers with:
Then, run:
Finally, go to https://oleander.dev
to view the OpenLineage events emitted by your Spark job.
Was this page helpful?