Sync data from Amazon Aurora to Redshift
3

6

I am trying to set up a sync between AWS Aurora and Redshift. What is the best way to achieve this?

Possible ways to sync:

  1. Query the table to find new rows (I am only doing inserts, so updates don't matter), export them to a flat file in an S3 bucket, and use the Redshift COPY command to load them into Redshift.

  2. Use a Python publisher with Boto3 to publish changes to a Kinesis stream, then consume that stream in Firehose, which can copy directly into Redshift.

  3. Use Kinesis Agent to detect changes in the binlog (is it possible to detect binlog changes with Kinesis Agent?), publish them to Firehose, and copy from there into Redshift.
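For illustration, option 1 could be sketched roughly as below. This is only a minimal sketch under assumptions: the table has a monotonically increasing `id` column to use as a high-water mark, and the table name, S3 URI and IAM role ARN are placeholders. The helper names are made up for this example; only the `COPY ... FROM ... IAM_ROLE ... FORMAT AS CSV` statement is actual Redshift syntax.

```python
import csv
import io


def build_incremental_query(table, id_column):
    # Select only rows inserted since the last sync. The %s placeholder is the
    # parameter style used by common MySQL drivers; the last synced id is
    # passed separately when the query is executed.
    return f"SELECT * FROM {table} WHERE {id_column} > %s ORDER BY {id_column}"


def rows_to_csv(rows):
    # Serialize the fetched rows into CSV text that can be uploaded to S3.
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


def next_watermark(rows, last_id, id_index=0):
    # The highest id seen becomes the starting point of the next run;
    # if no new rows were found, the watermark stays where it was.
    return max((row[id_index] for row in rows), default=last_id)


def build_copy_statement(table, s3_uri, iam_role_arn):
    # Redshift COPY statement that loads the exported CSV file.
    return (
        f"COPY {table} FROM '{s3_uri}' "
        f"IAM_ROLE '{iam_role_arn}' FORMAT AS CSV;"
    )
```

The upload itself (for example with Boto3's `put_object`) and running the COPY statement against Redshift are left out here; the watermark would need to be stored somewhere durable between runs.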

I haven't explored AWS Data Pipeline yet.
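The publisher in option 2 might look something like the following. Again just a sketch: it assumes each new row is available as a dict with an `id` field, and the function names are invented for the example. `put_record` with `StreamName`, `Data` and `PartitionKey` is the Boto3 Kinesis client call; everything else is an assumption.

```python
import json


def build_record(row, partition_key_field="id"):
    # Turn one inserted row (a dict) into put_record keyword arguments.
    # A trailing newline keeps records separated once Firehose batches them.
    return {
        "Data": (json.dumps(row) + "\n").encode("utf-8"),
        "PartitionKey": str(row[partition_key_field]),
    }


def publish_rows(kinesis_client, stream_name, rows):
    # kinesis_client is expected to be boto3.client("kinesis"); it is passed
    # in so the function can be exercised without AWS credentials.
    count = 0
    for row in rows:
        kinesis_client.put_record(StreamName=stream_name, **build_record(row))
        count += 1
    return count
```

Usage would be along the lines of `publish_rows(boto3.client("kinesis"), "my-stream", new_rows)`, where the stream name is a placeholder.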

Bullnose answered 16/6, 2017 at 21:53 Comment(1)
aws.amazon.com/dms - Semipermeable
7

As pointed out by @Mark B, the AWS Database Migration Service can migrate data between databases. This can be done as a one-off exercise, or it can run continuously, keeping two databases in sync.

The documentation shows that Amazon Aurora can be a source and Amazon Redshift can be a target.
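As a rough sketch of what the continuous variant could look like with Boto3: the snippet below builds the parameters for a DMS replication task that does a full load followed by ongoing change capture. It assumes the source endpoint, target endpoint and replication instance already exist; the ARNs, task name and schema name are placeholders, and the helper functions are made up for this example.

```python
import json


def build_table_mappings(schema, table="%"):
    # DMS selection rule: include every table matching schema/table
    # ("%" acts as a wildcard).
    return json.dumps({
        "rules": [{
            "rule-type": "selection",
            "rule-id": "1",
            "rule-name": "1",
            "object-locator": {"schema-name": schema, "table-name": table},
            "rule-action": "include",
        }]
    })


def build_task_params(task_id, source_arn, target_arn, instance_arn, schema):
    # "full-load-and-cdc" copies existing data first, then keeps replicating
    # ongoing changes, which is the "keep in sync" mode.
    return {
        "ReplicationTaskIdentifier": task_id,
        "SourceEndpointArn": source_arn,
        "TargetEndpointArn": target_arn,
        "ReplicationInstanceArn": instance_arn,
        "MigrationType": "full-load-and-cdc",
        "TableMappings": build_table_mappings(schema),
    }


def create_task(params):
    # Imported lazily so the helpers above work without boto3 installed.
    import boto3
    return boto3.client("dms").create_replication_task(**params)
```

Using `"full-load"` as the migration type instead would give the one-off copy.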

Sceptic answered 17/6, 2017 at 4:41 Comment(1)
Amazon Aurora can be a source only if it's MySQL-compatible. Aurora Postgres and Redshift would be the ideal match - Jaunty
2

AWS has just announced this new feature: Amazon Aurora zero-ETL integration with Amazon Redshift

This natively provides near real-time (within seconds) synchronization from Aurora to Redshift.

Faretheewell answered 5/12, 2022 at 18:44 Comment(0)
0

You can also use federated queries: https://docs.aws.amazon.com/redshift/latest/dg/federated-overview.html

Copalm answered 15/12, 2020 at 16:18 Comment(0)