> ## Documentation Index
> Fetch the complete documentation index at: https://www.integrate.io/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# What is Integrate.io ELT & CDC

> Integrate.io ELT & CDC replicates data from databases and SaaS platforms to Snowflake, Redshift, BigQuery, and S3 using change data capture.

## What is Integrate.io ELT & CDC

Integrate.io ELT & CDC is a managed data replication platform that moves data from transactional databases and SaaS applications to cloud data warehouses and storage. It uses change data capture (CDC) to replicate data in near real-time, keeping your analytics warehouse in sync with your source systems without manual ETL scripting.

## How it works

ELT & CDC connects to your source databases and applications, performs an initial full sync of your selected tables, then continuously captures changes (inserts, updates, and deletes) as they happen. Data flows through a staging layer and lands in your destination in a query-ready format.

The pipeline handles schema changes automatically. When columns are added or modified at the source, those changes propagate to the destination without manual intervention. See [How Integrate.io ELT & CDC Handles Schema Changes](/cdc/how-integrateio-elt-cdc-handles-schema-changes) for details.

## Supported sources

ELT & CDC supports two categories of sources:

**Database sources with log-based CDC.** PostgreSQL, MySQL, Microsoft SQL Server, and Oracle. These use native database replication logs (WAL for PostgreSQL, binlog for MySQL, transaction log for SQL Server) to capture changes with minimal impact on the source database. Supported across AWS RDS, Aurora, Azure, Google Cloud SQL, and self-hosted deployments.

**SaaS and API sources.** 40+ connectors including Salesforce, Shopify, HubSpot, Stripe, Google Ads, Facebook Ads, and more. These use API-based polling to sync data on a scheduled basis.

See the full list in [Sources](/cdc/category/sources).

## Supported destinations

Replicated data can be loaded to:

* **Amazon Redshift**, with optional custom S3 staging bucket
* **Snowflake**, with key-pair or password authentication
* **Google BigQuery**, direct load
* **Amazon S3**, Avro file output

See [Destinations](/cdc/category/destinations) for setup guides.

## Key features

| Feature                | Description                                                                                                               |
| :--------------------- | :------------------------------------------------------------------------------------------------------------------------ |
| Initial sync           | Full historical load with parallel chunking for large tables                                                              |
| Continuous sync        | Near real-time change capture after initial sync completes                                                                |
| History mode           | SCD Type 2 tracking that preserves previous record versions. See [History Mode](/cdc/history-mode-scd-type-2)             |
| Auto sync new tables   | Automatically detects and syncs newly created tables at the source. See [Auto Sync](/cdc/auto-sync-new-tables-feature)    |
| Schema change handling | Propagates column additions and type changes to the destination                                                           |
| Custom primary keys    | Override default primary keys for tables without them. See [Custom Primary Key](/cdc/custom-primary-key-feature)          |
| Data observability     | Built-in monitoring with replication delay metrics and alerts. See [Data Observability](/cdc/category/data-observability) |
| SSH tunnels            | Connect to databases behind firewalls. See [SSH Tunnel](/cdc/ssh-tunnel)                                                  |
| PrivateLink            | AWS PrivateLink support for MySQL and PostgreSQL                                                                          |

## Security

All data is encrypted in transit using TLS. Connections to databases behind firewalls can use [SSH tunnels](/cdc/ssh-tunnel) or [AWS PrivateLink](/cdc/privatelink-set-up). See the [IP allowlist](/cdc/ip-list) for Integrate.io's egress IP addresses that need to be whitelisted in your firewall.

## Related

<CardGroup cols={2}>
  <Card title="Initial Sync Process" icon="arrow-right" href="/cdc/initial-sync-process" horizontal />

  <Card title="ELT & CDC Features and Requirements" icon="arrow-right" href="/cdc/elt-cdc-features-and-requirements" horizontal />

  <Card title="History Mode" icon="arrow-right" href="/cdc/history-mode-scd-type-2" horizontal />

  <Card title="Schema Changes" icon="arrow-right" href="/cdc/how-integrateio-elt-cdc-handles-schema-changes" horizontal />
</CardGroup>
