Lightweight pipelines are in the beta phase of development and need to be turned on for your enrollment. Functionality may change during active development. Contact your Palantir Support to enable.
If you are unfamiliar with creating pipelines in Pipeline Builder, review the documentation on how to create a batch pipeline in Pipeline Builder before proceeding.
Pipeline Builder now supports lightweight pipelines, which can provide faster execution for batch and incremental pipelines. Pipeline Builder's lightweight pipelines use a backend powered by DataFusion ↗, an open-source query engine written in Rust ↗. Compared to traditional Spark-based pipelines, lightweight pipelines can substantially accelerate compute processes for small to medium-sized datasets.
Lightweight pipelines are specifically engineered to optimize build times and execute low-latency operations efficiently. In particular, "quick" pipelines that run in under 15 minutes will benefit most from lightweight configuration.
We encourage you to experiment with different pipeline configurations to improve performance. You can explore the capabilities of lightweight pipelines by testing them on a branch or making a copy of an existing pipeline to compare lightweight performance with your original configuration.
You can convert between lightweight and standard batch pipelines, and vice versa, by following the steps below. This conversion can be reversed at any time by repeating the process and selecting the desired options.
To convert a lightweight pipeline to a batch pipeline, go to Settings and select Convert to Batch pipeline.
Lightweight pipelines do not currently support the same set of transforms and expressions as standard batch pipelines. Most notably, unsupported transforms and expressions include LLM features, media set operations, and split nodes.
Due to the differences between lightweight and batch pipelines, you should always verify results using Preview or by examining build outputs.
Most supported expressions in lightweight pipelines will behave as their batch equivalents. Known limitations in lightweight pipelines include:
NULL
value.<
, >
, ==
, etc.pow
overflow returns NULL
instead of inf
.TimestampToString
, DateToString
, StringToTimestamp
, and StringToDate
.