Announcements

REMINDER: Sign up for the Foundry Newsletter to receive a summary of new products, features, and improvements across the platform directly to your inbox. For more information on how to subscribe, see the Foundry Newsletter and Product Feedback channels announcement.

Share your thoughts about these announcements in our Developer Community Forum ↗.


Spreadsheets are now supported as a media set schema type

Date published: 2025-10-02

Spreadsheet media sets are now generally available, allowing you to upload, preview, and process spreadsheet (XLSX) files directly within Foundry's media sets and enabling powerful LLM-driven workflows with tabular data that was previously difficult to handle.

Organizations frequently need to archive and process data from various poorly defined sources like manufacturing quotes, progress reports, and status updates that come in spreadsheet format. Until now, media sets did not support previews for spreadsheets, and tools for converting spreadsheets to datasets were not suitable for the workflows.

What are spreadsheet media sets?

Spreadsheet media sets allow you to work with tabular data designed for human consumption that is difficult to automate using traditional programming methods. The primary format supported is XLSX (Excel) files.

Spreadsheet media sets are ideal for processing unstructured spreadsheets in scenarios such as:

  • Files with significant formatting differences between versions
  • Spreadsheets where the structure is not known ahead of time (including email attachments, ad-hoc reports, and third-party vendors)
  • Storing and displaying source data alongside processed datasets
  • Supporting LLM-driven extraction and analysis workflows

Spreadsheet media sets are also an excellent way to maintain your original source of truth for referencing from downstream transformations or ingestions.

Key capabilities

  • Upload and preview: Upload XLSX files to media sets and view interactive previews that render spreadsheet content directly in Foundry. The preview provides a familiar tabular view of your data without requiring file downloads.

A preview of spreadsheet content uploaded to a media set.

A preview of spreadsheet content uploaded to a media set.

  • Text extraction for LLM processing: Extract spreadsheet content as JSON for use in LLM-powered workflows. This enables intelligent processing of tabular data that might have inconsistent formatting or meaningful layout structure such as merged cells.
  • Workshop integration: Spreadsheet media sets are fully integrated with Workshop, allowing you to preview spreadsheets directly in your workflow, view and create annotations, and scroll through content seamlessly.
  • Pipeline Builder support: Use Pipeline Builder expressions to extract and transform spreadsheet data within your pipelines, making it easy to incorporate spreadsheet processing into your workflows.
  • Python transforms in Code Workspaces: Perform advanced transformations in Code Workspaces using the transforms-media package.

What's next?

In upcoming releases, we plan to enhance spreadsheet media sets with additional Workshop annotation features, enhanced formatting extraction, more options for text extraction, and improved support for edge cases and embedded data.

Your feedback matters

We want to hear about your experience with spreadsheet media sets and welcome your feedback. Share your thoughts with Palantir Support channels, or on our Developer Community ↗ using the media-sets tag ↗.