transforms.api.TableTransformInput

class transforms.api.TableTransformInput(rid, branch, table_dfreader)

The input object passed into transform objects at runtime for virtual table inputs. Mimics the TransformInput API.
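
As a minimal, hedged sketch of how such an input is consumed (the input RID and output path are hypothetical placeholders; the decorator pattern is the standard transforms.api one):

    from transforms.api import transform, Input, Output

    @transform(
        source_table=Input("ri.foundry.main.dataset.example"),  # hypothetical virtual table RID
        out=Output("/Project/example/output"),                  # hypothetical output path
    )
    def compute(source_table, out):
        # At runtime, source_table is a TableTransformInput because the input
        # is backed by a virtual table; it mimics the TransformInput API.
        out.write_dataframe(source_table.dataframe())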

property batch_incremental_configuration

The configuration for an incremental input that will be read in batches.

  • Type: BatchIncrementalConfiguration

property branch

The branch of the dataset.

property catalog

Returns the name of the table’s Spark catalog, intended for use in Spark procedures, if supported by the underlying table type.

  • Returns: The name of the table’s catalog.
  • Return type: str ↗

  • Throws: ValueError – If the underlying table type does not expose a Spark catalog.
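
A hedged illustration (not from the source) of guarding against table types that do not expose a catalog; ctx is assumed to be a transform context providing a Spark session:

    def compute(ctx, source_table, out):
        try:
            catalog = source_table.catalog
            # For example, run a Spark statement scoped to the table's catalog.
            ctx.spark_session.sql(f"SHOW TABLES IN {catalog}").show()
        except ValueError:
            # The underlying table type does not expose a Spark catalog.
            pass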

property column_descriptions

The column descriptions of the dataset.

  • Type: Dict[str, str]

property column_typeclasses

The column typeclasses of the dataset.

  • Type: Dict[str, str]
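
A short illustrative sketch of reading this column metadata together with the column descriptions above:

    def compute(source_table, out):
        # Print each column's description and typeclasses, where present.
        for column, description in source_table.column_descriptions.items():
            typeclasses = source_table.column_typeclasses.get(column)
            print(column, description, typeclasses)
        out.write_dataframe(source_table.dataframe())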

dataframe(options=None)

Return a pyspark.sql.DataFrame containing the full view of the table.

  • Parameters: options (dict, optional) – Additional Spark read options to pass when reading the table.
  • Returns: The DataFrame for the table.
  • Return type: pyspark.sql.DataFrame ↗
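
For example (a sketch; which read options are honored depends on the underlying table format, so "mergeSchema" is an assumption):

    # Read the full view, passing an extra Spark read option through.
    df = source_table.dataframe(options={"mergeSchema": "true"})
    df.printSchema()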

property end_transaction_rid

The ending transaction of the input dataset.

filesystem()

Construct a FileSystem object for reading from FoundryFS.
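
A hedged sketch of file-level access, following the usual transforms FileSystem pattern (the CSV layout is assumed):

    fs = source_table.filesystem()
    for status in fs.ls(glob="*.csv"):  # assumes CSV files exist in the view
        with fs.open(status.path) as f:
            header = f.readline()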

property identifier

Returns the fully-qualified, catalog-prefixed, Spark V2 identifier of the table, if supported by the underlying table type.

  • Returns: The fully-qualified identifier of the table.
  • Return type: str ↗

  • Throws: ValueError – If the underlying table type does not expose a Spark V2 identifier.
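
A hedged example of using the identifier with Spark SQL, assuming a transform context ctx with a Spark session:

    try:
        df = ctx.spark_session.sql(f"SELECT * FROM {source_table.identifier}")
    except ValueError:
        # No Spark V2 identifier exposed; fall back to the standard read path.
        df = source_table.dataframe()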

pandas()

Return a pandas dataframe containing the full view of the dataset.

  • Returns: The pandas dataframe for the table.
  • Return type: pandas.DataFrame ↗
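
A brief sketch; note that this materializes the full view in driver memory, so it is best suited to small tables (that caveat is general pandas-conversion advice, not stated by the source):

    pdf = source_table.pandas()
    print(pdf.shape)
    print(pdf.head())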

property path

The Compass path of the dataset.

property rid

The resource identifier of the dataset.

property start_transaction_rid

The starting transaction of the input dataset.