As your workflow increases in complexity, it can become more difficult to ensure that every step is working as intended. Setting up observability tooling is an important step in creating a stable production workflow and detecting issues before they affect users.
The Palantir platform provides built-in observability tools that give you visibility into the status and health of many resource types. These tools can help you debug issues during development, as well as monitor pipeline and workflow stability in production.
You can view logs from the third-party libraries used to run your code (such as Kafka when using streams) as well as logs emitted by your own code. Logs are available on several types of jobs:
Log exporting may not be available in your enrollment. Contact Palantir Support for more information.
To support processing beyond what the in-platform tools currently offer, you can create a stream in a folder you specify that contains all telemetry for an organization, including logs, metrics, and traces. Data in this stream can be analyzed using Foundry’s suite of data analysis tools or exported to third-party systems.
Learn more about exporting logs to a stream on the Configure logging page.
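As a rough illustration of the kind of downstream analysis an exported telemetry stream makes possible, the sketch below summarizes a handful of records in plain Python. The record shape (`type`, `level`, `resource`, and so on) and the sample values are assumptions made for this example; the actual schema of the exported stream may differ, so check it before adapting the code.

```python
from collections import Counter
from typing import Iterable, Mapping

# Hypothetical telemetry records. The field names and values here are
# assumptions for illustration, not the documented export schema.
SAMPLE_RECORDS = [
    {"type": "log", "level": "INFO", "resource": "stream-a", "message": "started"},
    {"type": "log", "level": "ERROR", "resource": "stream-a", "message": "timeout"},
    {"type": "metric", "resource": "stream-a", "name": "lag", "value": 12.0},
    {"type": "log", "level": "ERROR", "resource": "module-b", "message": "crash"},
    {"type": "trace", "resource": "module-b", "span": "handle_request"},
]


def count_by_type(records: Iterable[Mapping]) -> Counter:
    """Count how many records of each telemetry type are present."""
    return Counter(record["type"] for record in records)


def error_logs_by_resource(records: Iterable[Mapping]) -> Counter:
    """Count ERROR-level log records per resource."""
    return Counter(
        record["resource"]
        for record in records
        if record.get("type") == "log" and record.get("level") == "ERROR"
    )


if __name__ == "__main__":
    print(count_by_type(SAMPLE_RECORDS))
    print(error_logs_by_resource(SAMPLE_RECORDS))
```

Grouping by resource first is one reasonable starting point, since it shows which streams or compute modules produce the most errors before you look at individual messages.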
Metrics for long-running compute workloads allow you to monitor the health and stability of a stream or compute module over time.
Access metrics in the Metrics tab of both streams and compute modules.
You can trace the execution of actions, functions, and other AIP tools using AIP trace views.
Monitors in Foundry can alert you to a wide variety of preconfigured events and metric thresholds for Foundry resources.
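To make the idea of a metric threshold concrete, here is a minimal conceptual sketch of one common rule shape: raise an alert only when several consecutive samples exceed a limit, which avoids alerting on a single spike. This is not how Foundry monitors are implemented or configured; the function name `breaches_threshold` and its parameters are hypothetical and exist only for this example.

```python
from typing import Sequence


def breaches_threshold(
    samples: Sequence[float], threshold: float, consecutive: int = 3
) -> bool:
    """Return True if `consecutive` samples in a row are above `threshold`.

    Illustrative only: a hypothetical rule, not a Foundry API.
    """
    run = 0  # length of the current streak of samples above the threshold
    for value in samples:
        run = run + 1 if value > threshold else 0
        if run >= consecutive:
            return True
    return False


if __name__ == "__main__":
    # Three samples in a row (5, 6, 7) exceed the threshold of 4.
    print(breaches_threshold([1, 5, 6, 7, 2], threshold=4))
    # Spikes are isolated, so the streak never reaches three.
    print(breaches_threshold([5, 1, 6, 1, 7], threshold=4))
```

Requiring a streak trades a little detection latency for fewer noisy alerts; a streak length of 1 reduces the rule to a simple per-sample check.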
There are two ways to configure monitors in Foundry:
You can subscribe directly to health checks to receive a notification when an alert is raised.
You can also create a monitoring view to group and review alerts from both health checks and monitoring rules.
Alerts from this monitoring view can be sent in the following ways: