
transforms.api.IncrementalTransformInput

class transforms.api.IncrementalTransformInput(tinput, prev_txrid=None, batch_end_txrid=None)

TransformInput with added functionality for incremental computation.

property batch_incremental_configuration

The configuration for an incremental input that will be read in batches.

  • Type: BatchIncrementalConfiguration

History

  • Added in version 1.7.0.

property branch

The branch of the dataset.

property column_descriptions

The column descriptions of the dataset.

  • Type: Dict[str, str]

property column_typeclasses

The column typeclasses of the dataset.

  • Type: Dict[str, str]

dataframe(mode='added')

Return a pyspark.sql.DataFrame for the given read mode.

Only the current, previous, and added read modes are supported.

  • Parameters: mode (str , optional) – The read mode, one of current, previous, or added. Defaults to added.
  • Returns: The DataFrame for the dataset.
  • Return type: DataFrame
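Because transforms.api is only importable inside a Foundry build, the snippet below is a runnable stand-in (the class and all row values are hypothetical, and plain tuples take the place of pyspark Rows) that mimics the read-mode semantics described above: previous is the view as of the last build, current is the view now, and added is the difference between the two.

```python
class StubIncrementalInput:
    """Hypothetical stand-in for IncrementalTransformInput.dataframe().

    Rows are plain tuples instead of pyspark Rows; the real class reads
    the corresponding Spark views from the underlying dataset.
    """

    def __init__(self, previous_rows, current_rows):
        self._previous = previous_rows
        self._current = current_rows

    def dataframe(self, mode="added"):
        if mode == "current":
            return list(self._current)
        if mode == "previous":
            return list(self._previous)
        if mode == "added":
            # Rows appended since the last build: current minus previous.
            return [r for r in self._current if r not in self._previous]
        raise ValueError(f"read mode not supported here: {mode!r}")


inp = StubIncrementalInput(previous_rows=[("a", 1), ("b", 2)],
                           current_rows=[("a", 1), ("b", 2), ("c", 3)])
print(inp.dataframe())  # default mode is 'added' → [('c', 3)]
```

In a real incremental transform, processing only the added view is what keeps the build cost proportional to new data rather than to the full dataset.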

property end_transaction_rid

The ending transaction of the input dataset.

filesystem(mode='added')

Construct a FileSystem object for reading from FoundryFS for the given read mode.

Only the current, previous, and added read modes are supported.

  • Parameters: mode (str , optional) – The read mode, one of current, previous, or added. Defaults to added.
  • Returns: A filesystem object for the given view.
  • Return type: FileSystem
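filesystem(mode='added') applies the same read modes at the file level: conceptually, the added view contains the files committed in transactions after the previous build's end transaction, up to the current end transaction. The runnable sketch below models that bookkeeping with made-up transaction IDs and file names; it is an illustration of the semantics, not the real FileSystem implementation.

```python
# Each transaction commits some files; the incremental window runs from
# just after the previous end transaction to the current end transaction.
# All IDs and file names here are hypothetical.
transactions = [
    ("txn-1", ["part-000.parquet"]),
    ("txn-2", ["part-001.parquet"]),
    ("txn-3", ["part-002.parquet", "part-003.parquet"]),
]


def ls(mode="added", prev_end="txn-2"):
    """Stand-in for FileSystem.ls() under each read mode."""
    previous_files = []   # files visible as of the last build
    added_files = []      # files committed since the last build
    in_previous_view = True
    for txn, files in transactions:
        (previous_files if in_previous_view else added_files).extend(files)
        if txn == prev_end:
            in_previous_view = False
    if mode == "previous":
        return previous_files
    if mode == "added":
        return added_files
    if mode == "current":
        return previous_files + added_files
    raise ValueError(f"unsupported mode: {mode!r}")


print(ls("added"))  # files committed after txn-2
```

Reading only the added files is the file-level analogue of dataframe(mode='added'): a build resumes from where the previous transaction range ended instead of re-listing the whole dataset.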

pandas()

  • Returns: A pandas dataframe containing the full view of the dataset.
  • Return type: pandas.DataFrame

property path

The Compass path of the dataset.

property rid

The resource identifier of the dataset.

property start_transaction_rid

The starting transaction of the input dataset.