
Spark SQL

The Spark SQL connector is a Palantir-provided driver for Spark SQL.

To create a new Spark SQL source, follow the standard setup flow for Palantir-provided drivers, then use the sections below for Spark SQL-specific configuration and networking. For the complete property reference, see the official Spark SQL driver documentation.

Configuration

The table below lists the mandatory and recommended connection properties.

Property	Required?	Description	Default
`AuthScheme`	Mandatory	The authentication scheme to use. Accepted values are Plain, LDAP, NOSASL, and Kerberos.	`Plain`
`Server`	Mandatory	The host name or IP address of the server hosting the Spark SQL database.	`{serverAddress}`
`UseSSL`	Mandatory	Specifies whether to use SSL encryption when connecting to Hive.	`TRUE`
`Database`	Recommended	The name of the Spark SQL database.	(none)
`Password`	Recommended	The password used to authenticate with Spark SQL.	(none)
`Port`	Recommended	The port for the Spark SQL database.	`10000`
`TransportMode`	Recommended	The transport mode used to communicate with the Hive server. Accepted values are BINARY and HTTP.	`BINARY`
`User`	Recommended	The username used to authenticate with Spark SQL.	(none)
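
As a rough illustration of how the table above fits together, the following Python sketch merges user-supplied values with the listed defaults and checks the mandatory properties and accepted values. The helper name `resolve_properties`, the exact-match comparison of accepted values, and the use of a plain dictionary are assumptions made for this example; they are not part of Foundry or of the driver.

```python
# Illustrative only: a hypothetical pre-flight check for Spark SQL source
# properties, based on the defaults and accepted values in the table above.

MANDATORY = ("AuthScheme", "Server", "UseSSL")

# Defaults taken from the table. Server has only a placeholder default,
# so it must always be supplied by the caller.
DEFAULTS = {
    "AuthScheme": "Plain",
    "UseSSL": "TRUE",
    "Port": "10000",
    "TransportMode": "BINARY",
}

# Assumption: values are compared exactly as written in the table.
ACCEPTED = {
    "AuthScheme": {"Plain", "LDAP", "NOSASL", "Kerberos"},
    "TransportMode": {"BINARY", "HTTP"},
}


def resolve_properties(overrides):
    """Merge overrides onto the defaults and validate the result."""
    props = {**DEFAULTS, **overrides}

    missing = [name for name in MANDATORY if not props.get(name)]
    if missing:
        raise ValueError(f"Missing mandatory properties: {', '.join(missing)}")

    for name, allowed in ACCEPTED.items():
        if props[name] not in allowed:
            raise ValueError(
                f"{name}={props[name]!r} is not one of {sorted(allowed)}"
            )
    return props


if __name__ == "__main__":
    # "spark.example.internal" is a made-up host name.
    print(resolve_properties({"Server": "spark.example.internal", "User": "svc"}))
```

Supplying only `Server` is enough to pass the check, because `AuthScheme` and `UseSSL` fall back to their defaults; an unsupported `AuthScheme` or `TransportMode` raises a `ValueError`.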

Networking

The table below lists the domains that the source must be able to reach in order to run successfully.

For each domain, add a corresponding egress policy. If the source is hosted on-premises and not directly reachable from Foundry, use an agent proxy egress policy instead; the agent host itself must also be able to reach the listed domains. See using an agent as a proxy for details.

Domain	Required
<Server>:<Port>	Always. Taken from the `Server` and `Port` connection properties; `Port` defaults to `10000`
<KerberosKDC>:88	If `AuthScheme=Kerberos`
<KerberosServiceKDC>:88	If `AuthScheme=Kerberos` and Kerberos topology uses multiple realms
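
The rules in the table above can be sketched as a small Python function that derives the `host:port` pairs needing an egress policy from a set of connection properties. The function name `required_egress` and the `multi_realm` flag are hypothetical names introduced for this example, and the host names in the usage lines are invented; the sketch only restates the table and does not create any policy.

```python
# Illustrative only: list the endpoints that need an egress policy,
# following the conditions in the networking table above.

KERBEROS_PORT = 88
DEFAULT_PORT = "10000"


def required_egress(props, multi_realm=False):
    """Return the host:port strings the source must be able to reach.

    props       -- dict of connection properties (Server, Port, AuthScheme, ...)
    multi_realm -- hypothetical flag: True if the Kerberos topology uses
                   multiple realms
    """
    # <Server>:<Port> is always required; Port falls back to 10000.
    endpoints = [f"{props['Server']}:{props.get('Port', DEFAULT_PORT)}"]

    if props.get("AuthScheme") == "Kerberos":
        # <KerberosKDC>:88 is required whenever Kerberos is used.
        endpoints.append(f"{props['KerberosKDC']}:{KERBEROS_PORT}")
        if multi_realm:
            # <KerberosServiceKDC>:88 is required only with multiple realms.
            endpoints.append(f"{props['KerberosServiceKDC']}:{KERBEROS_PORT}")
    return endpoints


if __name__ == "__main__":
    # Made-up hosts for demonstration.
    print(required_egress({"Server": "spark.example.internal"}))
    print(
        required_egress(
            {
                "Server": "spark.example.internal",
                "Port": "10001",
                "AuthScheme": "Kerberos",
                "KerberosKDC": "kdc.example.internal",
                "KerberosServiceKDC": "svc-kdc.example.internal",
            },
            multi_realm=True,
        )
    )
```

When an agent proxy egress policy is used, the same list applies to the agent host, which must be able to reach each endpoint.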