Spark SQL

The Spark SQL connector is a Palantir-provided driver for Spark SQL.

To create a new Spark SQL source, follow the standard setup flow for Palantir-provided drivers, then use the sections below for Spark SQL-specific configuration and networking. For the complete property reference, see the official Spark SQL driver documentation ↗.

Configuration

The properties below are mandatory or recommended.

PropertyRequired?DescriptionDefault
AuthSchemeMandatoryThe authentication scheme used. Accepted entries are Plain, LDAP, NOSASL, and Kerberos.Plain
ServerMandatoryThe host name or IP address of the server hosting the SparkSQL database.{serverAddress}
UseSSLMandatorySpecifies whether to use SSL Encryption when connecting to Hive.TRUE
DatabaseRecommendedThe name of the SparkSQL database.
PasswordRecommendedThe password used to authenticate with SparkSQL.
PortRecommendedThe port for the SparkSQL database.10000
TransportModeRecommendedThe transport mode to use to communicate with the Hive server. Accepted entries are BINARY and HTTP.BINARY
UserRecommendedThe username used to authenticate with SparkSQL.

Networking

The table below lists the domains that the source needs to be able to access in order to successfully run.

For each domain, add a corresponding egress policy. If the source is hosted on-premises and not directly reachable from Foundry, use an agent proxy egress policy instead; the agent host itself must also be able to reach the listed domains. See using an agent as a proxy for details.

DomainRequired
<Server>:<Port>Always. Server and Port connection properties; default Port=10000
<KerberosKDC>:88If AuthScheme=Kerberos
<KerberosServiceKDC>:88If AuthScheme=Kerberos and Kerberos topology uses multiple realms