AIP Document Intelligence is in the beta phase of development and may not be available on your enrollment. Functionality may change during active development. To enable AIP Document Intelligence, navigate to Application access in Control Panel.
AIP Document Intelligence is Foundry's entry point for all document extraction workflows. You can use AIP Document Intelligence to open a media set of enterprise documents, quickly execute different state-of-the-art document extraction strategies, and retrieve evaluation results of the quality, speed, and token cost of such strategies. With a single click, you can then deploy the configured extraction strategy as a Python transform to a batch pipeline over a media set.
AIP Document Intelligence features several extraction capabilities alongside a simplified interface for efficient walk-up use:
AIP Document Intelligence follows a testing workflow where users import a media set, select a configuration, and iterate on their strategy by observing results and evaluations until satisfaction before deploying. Follow the steps below to get started:
Upload a media set: From the application landing page, choose to select a media set from your available files, or upload a new media set.
Select a configuration: Once the media set opens, open the Configuration tab to set up the extraction method you want to use. Choose between traditional or generative AI methods, enable preprocessing if desired, and customize prompts to use with a VLM. Select Save to remember your configuration choice in the future.
Execute strategy on a media set: After configuring the extraction method, select Run in the top right corner of the Configuration tab.
Preview extraction results: After running the extraction, navigate to the Extraction result tab to view the output based on the chosen strategy.
Evaluate extraction results: From the results tab, select Evaluate results to use an LLM to evaluate the extraction results. Continue to test steps 3 through 5 until you are satisfied with the extraction results and evaluation.
Deploy extraction strategy: Once you decide on a final extraction strategy, open the Deployment tab, choose the saved configuration from the dropdown menu, and select Create transform repository. This will prompt you to choose a name and location for a new Python transforms repository and open a template in Code Repositories. To use the transform repository, specify the output dataset and start the build.