Datastream
Serverless and easy-to-use change data capture and replication service.
- Access to streaming, low-latency data from MySQL, PostgreSQL, AlloyDB, and Oracle databases
- Near real-time analytics in BigQuery with Datastream for BigQuery
- Easy-to-use setup with built-in secure connectivity for faster time to value
- Serverless platform that automatically scales, with no resources to provision or manage
- Secure foundation for database synchronization and event-driven architectures
Benefits
Replicate and synchronize data with minimal latency
Synchronize data across heterogeneous databases, storage systems, and applications reliably, with low latency, while minimizing impact on source performance.
Scale up or down with a serverless architecture
Get up and running fast with a serverless and easy-to-use service that seamlessly scales up or down, and has no infrastructure to manage.
Unmatched flexibility with Google Cloud services
Connect and integrate data across your organization with the best of Google Cloud services like BigQuery, Cloud Spanner, Dataflow, and Data Fusion.
Key features
Datastream's differentiated approach
Streaming data from relational databases
Datastream reads every change (insert, update, and delete) from your MySQL, PostgreSQL, AlloyDB, and Oracle databases and delivers it to BigQuery, Cloud SQL, Cloud Storage, and Cloud Spanner. Agentless and Google-native, it reliably streams every event as it happens.
Normalized data types across sources
Datastream normalizes every event’s data type from the source database’s type into a unified Datastream type. This lossless data type normalization across sources means easier downstream processing in a source-agnostic way, regardless of where the data originated.
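To make the idea of type normalization concrete, here is a minimal sketch of mapping source-specific column types onto a shared set of type names. The mapping table, the unified names (`string`, `decimal`), and the `normalize_type` helper are illustrative assumptions; they are not Datastream's actual unified types or API.

```python
# Hypothetical mapping from (source database, source type) to a unified type.
# These entries are illustrative only, not Datastream's real type table.
UNIFIED_TYPES = {
    ("oracle", "varchar2"): "string",
    ("mysql", "varchar"): "string",
    ("postgresql", "text"): "string",
    ("oracle", "number"): "decimal",
    ("mysql", "decimal"): "decimal",
    ("postgresql", "numeric"): "decimal",
}


def normalize_type(source: str, source_type: str) -> str:
    """Return the unified type name for a source-specific column type."""
    key = (source.lower(), source_type.lower())
    if key not in UNIFIED_TYPES:
        raise ValueError(f"no unified type for {source_type!r} on {source!r}")
    return UNIFIED_TYPES[key]


# Columns from different sources resolve to the same unified type, so
# downstream code can branch on one name instead of three.
oracle_amount = normalize_type("Oracle", "NUMBER")
postgres_amount = normalize_type("PostgreSQL", "numeric")
print(oracle_amount, postgres_amount)
```

With a table like this, downstream processing only needs to handle the unified names, whichever database the event came from.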
Schema drift resolution
As source schemas change, Datastream allows for fast and seamless schema drift resolution. On every schema change, Datastream rotates files by creating a new file in the destination bucket. Original source data types are one API call away with an up-to-date, versioned Schema Registry.
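The file-rotation behavior described above can be pictured with a small in-memory sketch. It assumes each event is a dictionary of column values and treats the set of column names as the schema; the `rotate_on_schema_change` helper is hypothetical and does not reflect how Datastream detects schema changes or names its output files.

```python
from typing import Any


def rotate_on_schema_change(events: list[dict[str, Any]]) -> list[list[dict[str, Any]]]:
    """Group events into 'files', starting a new file whenever the schema changes.

    The schema is approximated here by the sorted tuple of column names.
    """
    files: list[list[dict[str, Any]]] = []
    current_schema = None
    for event in events:
        schema = tuple(sorted(event))
        if schema != current_schema:
            # Schema drift detected: close the current file and open a new one.
            files.append([])
            current_schema = schema
        files[-1].append(event)
    return files


events = [
    {"id": 1, "name": "Ada"},
    {"id": 2, "name": "Linus"},
    {"id": 3, "name": "Grace", "email": "grace@example.com"},  # new column
]
files = rotate_on_schema_change(events)
print([len(f) for f in files])
```

Because each file holds events of exactly one schema, a reader can parse a whole file with a single column layout.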
Secure by design
Datastream supports multiple secure, private connectivity methods to protect data in transit. In addition, data is encrypted in transit and at rest so you can rest easy knowing your data is protected as it streams.
"As we’re modernizing our applications, we're replicating data reliably out of Oracle and into Cloud SQL for PostgreSQL with Datastream which has resulted in minimal downtime to our application."
Shawn O’Rourke, Manager of Technology, MLB
What's new
News and events about Datastream
Documentation
Find resources and documentation for Datastream
Datastream overview
Learn how Datastream helps you replicate and synchronize data across heterogeneous databases, storage systems, and applications.
Datastream quickstart using the Cloud Console
Learn how to use the Google Cloud Console as a visual interface to start streaming data.
Configure Datastream using the API
Learn how to use the API to configure Datastream to transfer data from a source Oracle database into Cloud Storage.
How to migrate from Oracle to Cloud SQL for PostgreSQL
Use Google’s Datastream-based, open-source toolkit to migrate from Oracle to Cloud SQL for PostgreSQL.
Replicate data into BigQuery with Datastream and Dataflow
Learn how to combine Datastream with Dataflow templates to replicate data from a relational database.
How to replicate data from an Oracle database into BigQuery
Watch this video to learn how to replicate data in real time from Oracle into BigQuery using Data Fusion’s replication accelerator that's integrated with Datastream.
Use cases
Continuous change data capture: replicate every event as it happens
Change data capture integrates data by reading change events (inserts, updates, and deletes) from source databases and writing them to a data destination, where downstream systems can act on them. Datastream supports change streams from MySQL, PostgreSQL, AlloyDB, and Oracle databases into BigQuery, Cloud SQL, Cloud Storage, and Cloud Spanner, enabling real-time analytics, database replication, and other use cases. Additional sources and destinations are coming in the future.
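As a rough illustration of what a destination does with change events, the sketch below replays inserts, updates, and deletes against an in-memory replica keyed by primary key. The event shape (`op`, `key`, `row`) and the `apply_change` helper are assumptions made for this example; they are not Datastream's event format or client API.

```python
from typing import Any


def apply_change(replica: dict[Any, dict[str, Any]], event: dict[str, Any]) -> None:
    """Apply one change event to a replica that maps primary key to row."""
    op = event["op"]
    key = event["key"]
    if op in ("insert", "update"):
        # Upsert: overlay the changed columns on any existing row.
        replica[key] = {**replica.get(key, {}), **event["row"]}
    elif op == "delete":
        replica.pop(key, None)
    else:
        raise ValueError(f"unknown operation: {op!r}")


# Hypothetical change stream, in the order the source committed the changes.
events = [
    {"op": "insert", "key": 1, "row": {"name": "Ada", "city": "London"}},
    {"op": "insert", "key": 2, "row": {"name": "Linus", "city": "Helsinki"}},
    {"op": "update", "key": 1, "row": {"city": "Paris"}},
    {"op": "delete", "key": 2},
]

replica: dict[Any, dict[str, Any]] = {}
for event in events:
    apply_change(replica, event)

print(replica)  # {1: {'name': 'Ada', 'city': 'Paris'}}
```

Applying events in commit order is what keeps the replica consistent with the source; replaying the same stream out of order could resurrect a deleted row.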
Pricing
Datastream pricing details
Datastream pricing is calculated based upon actual monthly data processed. Additional pricing details are available on the Datastream pricing page.
Additional resources like Cloud Storage, Dataflow, and BigQuery are billed according to each service's own pricing.
ORACLE® is a registered trademark of Oracle Corporation.