Spark
Run your PySpark applications on managed Spark infrastructure. Upload scripts, execute jobs, and automatically capture lineage metadata for complete observability of your data transformations.

Bring your own catalog & compute
Prefer to run your own Iceberg catalog and Spark compute? Use our CDK example to stand up S3 Tables with EMR Serverless and integrate with oleander-managed workflows. View the CDK reference on GitHub →

Lake
Run SQL queries, sync data, and collaborate on both public and private tables. Every operation automatically captures lineage metadata for observability. The lake is compatible with the same datasets used by oleander-managed Spark.

Observability
Get comprehensive insights into your data pipelines through integrations with Spark (oleander-managed or not), Airflow, dbt, and Flink. Visualize lineage, track dependencies, and monitor execution. Learn more about observability →

Educational resources
Explore our free tools to help you work with data and OpenLineage:
- Parquet viewer - View and inspect Parquet files directly in your browser
- OpenLineage validator - Validate OpenLineage facets and ensure your metadata conforms to the specification
- OpenLineage graph - Visualize and explore OpenLineage lineage graphs interactively
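
To make the validator's job concrete, here is a minimal sketch of what an OpenLineage run event looks like and a basic check for its required top-level fields. It is an illustration only, not how the oleander validator works: the helper names `build_run_event` and `missing_fields` are hypothetical, the producer URL is a placeholder, and the spec version in `schemaURL` is an assumption. A real validator checks the full JSON Schema, including facets.

```python
import json
import uuid
from datetime import datetime, timezone

# Top-level fields the OpenLineage spec requires on a run event.
REQUIRED_FIELDS = ("eventTime", "producer", "schemaURL", "run", "job")


def build_run_event(job_namespace, job_name, event_type="COMPLETE"):
    """Build a minimal run event as a plain dict (hypothetical helper)."""
    return {
        "eventType": event_type,
        "eventTime": datetime.now(timezone.utc).isoformat(),
        # Placeholder producer URI; replace with your own.
        "producer": "https://example.com/hypothetical-producer",
        # Assumed spec version; use the one your tooling targets.
        "schemaURL": "https://openlineage.io/spec/2-0-2/OpenLineage.json#/$defs/RunEvent",
        "run": {"runId": str(uuid.uuid4())},
        "job": {"namespace": job_namespace, "name": job_name},
        "inputs": [],
        "outputs": [],
    }


def missing_fields(event):
    """Return the required fields that are absent from the event."""
    missing = [field for field in REQUIRED_FIELDS if field not in event]
    run = event.get("run")
    if isinstance(run, dict) and "runId" not in run:
        missing.append("run.runId")
    job = event.get("job")
    if isinstance(job, dict):
        for key in ("namespace", "name"):
            if key not in job:
                missing.append("job." + key)
    return missing


if __name__ == "__main__":
    event = build_run_event("demo", "daily_orders")
    print(json.dumps(event, indent=2))
    print("missing:", missing_fields(event))
```

The resulting JSON can be pasted into a validator to see how required fields and facets are checked against the specification.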


