Skip to main content

What is Integrate.io ELT & CDC

Integrate.io ELT & CDC is a managed data replication platform that moves data from transactional databases and SaaS applications to cloud data warehouses and storage. It uses change data capture (CDC) to replicate data in near real-time, keeping your analytics warehouse in sync with your source systems without manual ETL scripting.

How it works

ELT & CDC connects to your source databases and applications, performs an initial full sync of your selected tables, then continuously captures changes (inserts, updates, and deletes) as they happen. Data flows through a staging layer and lands in your destination in a query-ready format. The pipeline handles schema changes automatically. When columns are added or modified at the source, those changes propagate to the destination without manual intervention. See How Integrate.io ELT & CDC Handles Schema Changes for details.

Supported sources

ELT & CDC supports two categories of sources: Database sources with log-based CDC — PostgreSQL, MySQL, Microsoft SQL Server, and Oracle. These use native database replication logs (WAL for PostgreSQL, binlog for MySQL, transaction log for SQL Server) to capture changes with minimal impact on the source database. Supported across AWS RDS, Aurora, Azure, Google Cloud SQL, and self-hosted deployments. SaaS and API sources — 40+ connectors including Salesforce, Shopify, HubSpot, Stripe, Google Ads, Facebook Ads, and more. These use API-based polling to sync data on a scheduled basis. See the full list in Sources.

Supported destinations

Replicated data can be loaded to:
  • Amazon Redshift — with optional custom S3 staging bucket
  • Snowflake — with key-pair or password authentication
  • Google BigQuery — direct load
  • Amazon S3 — Avro file output
See Destinations for setup guides.

Key features

FeatureDescription
Initial syncFull historical load with parallel chunking for large tables
Continuous syncNear real-time change capture after initial sync completes
History modeSCD Type 2 tracking that preserves previous record versions. See History Mode
Auto sync new tablesAutomatically detects and syncs newly created tables at the source. See Auto Sync
Schema change handlingPropagates column additions and type changes to the destination
Custom primary keysOverride default primary keys for tables without them. See Custom Primary Key
Data observabilityBuilt-in monitoring with replication delay metrics and alerts. See Data Observability
SSH tunnelsConnect to databases behind firewalls. See SSH Tunnel
PrivateLinkAWS PrivateLink support for MySQL and PostgreSQL

Security

All data is encrypted in transit using TLS. Connections to databases behind firewalls can use SSH tunnels or AWS PrivateLink. See the IP allowlist for Integrate.io’s egress IP addresses that need to be whitelisted in your firewall.

Initial Sync Process

ELT & CDC Features and Requirements

History Mode

Schema Changes

Last modified on April 15, 2026