Getting Started with Ingestion Configuration
This guide introduces you to configuring data ingestion in the Zeta Alpha platform. You'll learn about the different types of connectors available, how to set them up, and best practices for managing your data ingestion workflows.
Overview
Data ingestion is the process of importing and indexing content from various sources into the Zeta Alpha platform. The platform provides two main approaches to data ingestion:
- Pre-built Connectors: Ready-to-use connectors for popular enterprise systems like SharePoint, Google Drive, Box, and others
- Custom Connectors: Flexible connectors that allow you to ingest data from your own sources via the Ingestion API
Both approaches allow you to configure field mappings, access controls, and processing workflows tailored to your organization's needs.
Prerequisites
Before you begin configuring data ingestion, ensure you have:
- Access to the Zeta Alpha Platform UI - You'll need administrative access to configure connectors
- A tenant created - Your organization's Zeta Alpha tenant must be set up
- An index created - You'll need at least one index to receive ingested documents
- Appropriate credentials - Depending on your connector type, you may need authentication credentials such as API tokens, OAuth credentials, or service account keys. See the specific connector guide for detailed requirements.
Key Concepts
Connectors
A connector is a configuration that defines how the Zeta Alpha platform should retrieve and process documents from a specific source. Each connector includes:
- Source credentials: Authentication details for accessing the data source
- Field mappings: Rules for mapping source fields to index fields
- Crawl configuration: Filters and patterns that determine what content to ingest
- Processing workflows: Steps for transforming and enriching documents
Content Sources
A content source is an instance of a connector attached to a specific index. You can create multiple content sources using the same connector type, each with different configurations. For example, you might have separate content sources for different SharePoint sites or Google Drive folders.
Field Mappings
Field mappings define how fields from your source documents correspond to fields in your Zeta Alpha index. This allows you to:
- Map source-specific field names to standardized index fields
- Transform nested data structures
- Combine multiple source fields into a single index field
Workflows
Workflows define the sequence of processing steps that documents go through during ingestion. Common workflow steps include:
- Text extraction from various file formats
- Metadata extraction using AI agents
- Field validation and transformation
- Indexing
Next Steps
Choose a guide based on your data source:
For Enterprise Systems
- Create a SharePoint Connector - If your organization uses Microsoft SharePoint or OneDrive
- Create a Google Drive Connector - If your organization uses Google Workspace
- Create a Box Connector - If your organization uses Box
For Custom Data Sources
- Create a Custom Connector - If you want to ingest data from a custom source or via the API
For Advanced Features
- Extract Metadata using AI Agents - Add AI-powered metadata extraction to your workflow
- Create Document Edits - Learn how to manually override document metadata
Getting Help
If you encounter issues or have questions:
- Review the connector-specific guides linked above
- Check the API Reference documentation for detailed endpoint information
- Contact Zeta Alpha support for assistance with complex configurations