AWS Database Blog
Migrate Azure SQL Database to Amazon Aurora using Azure Data Sync Agent and AWS DMS
Increasingly, customers are looking to break free from legacy databases such as Oracle and Microsoft SQL Server and move to cloud-native, open-source database engines running on AWS. One of the preferred destinations for this data is Amazon Aurora. In this post, we walk through migrating a Microsoft Azure SQL database to an Amazon Aurora PostgreSQL-Compatible Edition or Amazon Aurora MySQL-Compatible Edition cluster using AWS Database Migration Service (AWS DMS) and Azure Data Sync with a minimal migration window.
Our solution uses the Azure Data Sync Agent together with AWS services to migrate Azure SQL databases to Aurora PostgreSQL with low downtime. The solution also includes a deployment option using AWS CloudFormation, which automates the deployment of AWS DMS and a Multi-AZ Aurora PostgreSQL instance. The same strategy works for migrating Azure SQL databases to either the Aurora PostgreSQL or Aurora MySQL engine. For an option using CloudBasic, see Migrating SQL Server databases from Microsoft Azure to AWS in near-real time with CloudBasic.
Cost of the solution
The pricing is based on the AWS Pricing Calculator, which provides an estimate of your AWS fees (not including any taxes that might apply). Your actual fees depend on a variety of factors, including the actual usage of AWS services.
The estimated monthly cost of running this environment with On-Demand pricing is $411.54, which works out to roughly $0.57 per hour. For this post, all our services are in the US East (N. Virginia) Region. Let’s break down the costs further:
- Our Aurora PostgreSQL-compatible database (a 1-node cluster) has 100 GB of storage, instance type db.r5.large, and On-Demand pricing, totalling $222.75.
- We use Amazon Elastic Compute Cloud (Amazon EC2) as a staging SQL Server instance with the Data Sync Agent. It uses Windows Server, On-Demand instances, 65 GB of storage, and instance type m5.large, for a monthly total of $143.74.
- We use Amazon Virtual Private Cloud (Amazon VPC) to create a NAT Gateway, with a monthly cost of $37.35.
- We also use a NAT Elastic IP address with two EC2 instances and 1 Elastic IP per instance, for a monthly cost of $7.30.
- Lastly, we use AWS Secrets Manager to store an Aurora secret. Our number of API calls is 30 per month, and the average duration of the secret is 30 days. The monthly cost is $0.40.
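These line items add up to the monthly estimate: $222.75 + $143.74 + $37.35 + $7.30 + $0.40 = $411.54.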
Prerequisites
This post assumes that you already have an existing Azure SQL database. You need the connection information for this database, including the DNS endpoint, username, and password. The username and password provided must have adequate permissions to access the data in your Azure SQL database.
Also, you need to have already converted the SQL Server schema to Aurora PostgreSQL using the AWS Schema Conversion Tool (AWS SCT). AWS SCT is an excellent tool for migrating schemas with minimal effort. For more details, see Converting SQL Server to PostgreSQL.
Additionally, make sure the following AWS Identity and Access Management (IAM) roles are available in the account where you’re deploying the CloudFormation template:
- dms-vpc-role
- dms-cloudwatch-logs-role
If these roles don’t exist, create them before you deploy the CloudFormation template.
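A minimal sketch for creating them, assuming Python with boto3 and credentials that are allowed to create IAM roles; it attaches the standard AWS managed policies that AWS DMS expects on these roles:

```python
import json
import boto3

iam = boto3.client("iam")

# Trust policy that lets AWS DMS assume the roles
trust_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {"Service": "dms.amazonaws.com"},
            "Action": "sts:AssumeRole",
        }
    ],
}

# dms-vpc-role: lets AWS DMS manage network interfaces in your VPC
iam.create_role(
    RoleName="dms-vpc-role",
    AssumeRolePolicyDocument=json.dumps(trust_policy),
)
iam.attach_role_policy(
    RoleName="dms-vpc-role",
    PolicyArn="arn:aws:iam::aws:policy/service-role/AmazonDMSVPCManagementRole",
)

# dms-cloudwatch-logs-role: lets AWS DMS publish task logs to CloudWatch
iam.create_role(
    RoleName="dms-cloudwatch-logs-role",
    AssumeRolePolicyDocument=json.dumps(trust_policy),
)
iam.attach_role_policy(
    RoleName="dms-cloudwatch-logs-role",
    PolicyArn="arn:aws:iam::aws:policy/service-role/AmazonDMSCloudWatchLogsRole",
)
```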
Solution overview
Data migration from an Azure SQL database to an Aurora cluster is a staged migration process, in which we first pull the data from the Azure SQL database using the Data Sync Agent to a local SQL Server instance hosted on Amazon EC2 as a staging environment. From this point, AWS DMS moves the data from SQL Server on Amazon EC2 to an Aurora cluster using change data capture (CDC) as illustrated in the following diagram.
In this migration, we connect the Azure SQL database to a staging SQL Server instance deployed on Amazon EC2 using the Data Sync Agent, which starts the data transfer from the Azure SQL database to the EC2 instance. Then we configure and start the AWS DMS replication task to transfer the data from SQL Server on Amazon EC2 to the Aurora PostgreSQL cluster.
Create the sync job at the Azure SQL database source
To create your sync job at the source database (Azure SQL), complete the following steps:
- Log in to your Azure portal and navigate to the Azure SQL database you want to migrate to Aurora.
- On the database page, choose Sync to other databases.
- For Sync Group Name, enter a name (for example, AzureAuroraSyncGroup).
- Turn Automatic Sync to ON.
- For Sync Frequency, enter a replication frequency. You can schedule replication as per your requirements; for this post, we use a 30-second frequency.
- For Conflict Resolution, choose Hub win.
- Choose OK.
- Under Hub Database, enter a username and password for the Data Sync Agent (the same as your Azure SQL database credentials). You use these credentials in a later step.
- For Membership Database, choose Add an On-Premises Database.
- In the Sync Gateway agent section, select Create a new agent.
- For Agent name, enter a name.
- Create and generate an agent key.
You use this key to register the agent in the Amazon EC2 staging environment, and again if you later reregister or reinstall the agent.
Build the infrastructure on AWS
To build your infrastructure, complete the following steps:
- Sign in to the AWS Management Console and deploy the CloudFormation template.
AWS CloudFormation deploys SQL Server with the Data Sync Agent installed on Amazon EC2 (our staging environment). It also deploys an AWS DMS instance and an Aurora PostgreSQL cluster.
- Provide values for the stack parameters as shown in the following table.
| Parameter | Value | Populated | Purpose |
| --- | --- | --- | --- |
| AgentKey | Nothing | Internal | Internal AWS CloudFormation working. |
| AmiID | /aws/service/ami-windows-latest/Windows_Server-2019-English-Full-SQL_2017_Standard | Internal | Pulls the latest AMI for Microsoft SQL Server 2017 Standard edition. |
| AzureSQLDBCIDR | 0.0.0.0/0 | User | Azure SQL database IP address for inbound connectivity from Azure to the staging SQL Server on Amazon EC2. |
| AzureSyncStageDBName | AzureSyncStageDB | Internal | Staging database on the Amazon EC2 SQL Server. |
| EnvironmentType | Prod | Internal | – |
| KeyPairName | KeyPairName | User | Select a key pair that you have access to. This key pair is used to connect to the staging Amazon EC2 SQL Server. |
| MsSqlPort | 1433 | Internal | Staging SQL Server port to allow communication between the AWS DMS instance and the staging SQL Server. |
| PostgresPort | 5432 | Internal | PostgreSQL port to allow communication between the AWS DMS instance and the Aurora PostgreSQL cluster. |
| Pramas | Yes | Internal | – |
| RdpPort | 3389 | Internal | Allows access to the staging SQL Server security group from your remote IP. |
| ReplicationInstanceAllocatedStorage | 50 | Internal | Storage for the AWS DMS replication instance. |
| ReplicationInstanceClass | dms.c5.large | Internal | The AWS DMS replication instance class. You can choose another class as needed for your use case. |
| StagingSQLPassword | **** | User | Staging SQL Server password. |
| deployCluster | Yes | – | – |
| rdpAccessCIDR | 0.0.0.0/0 | User | Your remote IP to access the staging SQL Server over RDP. |

- After the CloudFormation template is successfully deployed, go to the Outputs tab on the stack details page.
The following table summarizes your outputs.

| Key | Value | Description |
| --- | --- | --- |
| AuroraCluster | aurora-cluster | Aurora cluster identifier |
| DBPassword | xxxxxx | Staging SQL Server user password |
| KeyPair | xxxxx | Name of the key pair in use |
| SQLServerElasticIP | xxxxxxx | Elastic IP assigned to Amazon EC2 |
| SQLServerPrivateDNS | xxxxxx | Private DNS of the EC2 instance |
| SQLServerPublicDNS | xxxxxx | Public DNS of the EC2 instance |
| clusterEndpoint | xxxxxx | Aurora cluster endpoint |
| clusterName | xxxxxx | Cluster name |
| secretArn | xxxxxx | Database credentials secret ARN |
| vpcId | xxxxxx | Environment VPC |

- Note the value for the staging environment instance ID.
- Get the Elastic IP for this instance and add it to the Azure SQL database firewall rules to allow inbound connections.
- Log in to the Amazon EC2 staging environment.
- Log in to the local SQL Server using SSMS to check whether the AzureSyncStageDB database exists (a scripted version of these checks is sketched after this list).
- Check whether the SQL Server login azuresqlagent exists.
- Test the azuresqlagent login on the local SQL Server using the password from the CloudFormation stack outputs (the DBPassword value).
- Configure the Data Sync Agent with the agent key along with the Azure SQL database username and password you captured earlier.
- Configure the SQL data target as the local instance by choosing Register and logging in as azuresqlagent.
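If you prefer to script the verification, the following sketch (assuming Python with boto3 and pyodbc, the ODBC Driver 17 for SQL Server on the client, and the stack name AzureDataSyncAurora used later in this post) reads the stack outputs and confirms the staging database and login exist:

```python
import boto3
import pyodbc

STACK_NAME = "AzureDataSyncAurora"  # the name you gave the CloudFormation stack

# Read the stack outputs (Elastic IP, staging SQL Server password, and so on)
cfn = boto3.client("cloudformation", region_name="us-east-1")
stack = cfn.describe_stacks(StackName=STACK_NAME)["Stacks"][0]
outputs = {o["OutputKey"]: o["OutputValue"] for o in stack["Outputs"]}

server = outputs["SQLServerElasticIP"]   # add this IP to the Azure SQL firewall rules
password = outputs["DBPassword"]         # staging SQL Server password

# Connect to the staging SQL Server and confirm the Data Sync objects exist
conn = pyodbc.connect(
    "DRIVER={ODBC Driver 17 for SQL Server};"
    f"SERVER={server},1433;DATABASE=master;UID=azuresqlagent;PWD={password}"
)
cur = conn.cursor()

cur.execute("SELECT name FROM sys.databases WHERE name = 'AzureSyncStageDB'")
print("AzureSyncStageDB exists:", cur.fetchone() is not None)

cur.execute("SELECT name FROM sys.server_principals WHERE name = 'azuresqlagent'")
print("azuresqlagent login exists:", cur.fetchone() is not None)
```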
Sync the data from the Azure SQL database to the staging environment
For this step, return to the Azure portal session you had running when you created the sync job.
- Choose Add on-prem database.
- Select Existing agents.
- Choose the agent you registered earlier.
- For Sync Member Name, enter a unique name.
- For On-premises databases connected to this agent, choose your database.
- For Sync Directions, choose From the Hub.
- For Select a database, choose Hub Database.
- Select all the tables you want to replicate (every table you select must have a primary key; a quick check for missing keys is sketched after these steps).
- Choose Save.
A success message should appear in the portal and data should start flowing to your AWS staging database hosted on Amazon EC2.
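Because Azure SQL Data Sync only replicates tables with primary keys, it can be worth listing any tables that lack one before you build the sync group. The following sketch assumes Python with pyodbc and the ODBC Driver 17 for SQL Server; the server, database, and credentials are placeholders for your Azure SQL connection details:

```python
import pyodbc

# Placeholders; use your Azure SQL server endpoint, database, and credentials
conn = pyodbc.connect(
    "DRIVER={ODBC Driver 17 for SQL Server};"
    "SERVER=yourserver.database.windows.net,1433;"
    "DATABASE=yourdatabase;UID=youruser;PWD=yourpassword"
)
cur = conn.cursor()

# List user tables without a primary key; add a key or exclude them from the sync group
cur.execute(
    """
    SELECT s.name AS schema_name, t.name AS table_name
    FROM sys.tables t
    JOIN sys.schemas s ON s.schema_id = t.schema_id
    WHERE NOT EXISTS (
        SELECT 1 FROM sys.indexes i
        WHERE i.object_id = t.object_id AND i.is_primary_key = 1
    )
    """
)
for schema_name, table_name in cur.fetchall():
    print(f"No primary key: {schema_name}.{table_name}")
```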
Start the sync to Aurora via AWS DMS
For this step, return to the Outputs tab of the AzureDataSyncAurora stack and get the AWS DMS instance details. We then open the AWS DMS console and confirm that AWS CloudFormation deployed the following components:
- Replication instance
- Source endpoint
- Target endpoint
- Replication task, which performs a full load followed by ongoing replication (CDC)
- On the AWS DMS console, choose Endpoints in the navigation pane.
- On the Endpoints page, confirm the source endpoint (SQL Server pointing to the staging environment) and target endpoint (Aurora PostgreSQL pointing to the new Aurora cluster) are active.
- In the navigation pane, choose Database migration tasks.
- Verify that the source and target endpoints are pointing at the respective endpoints noted in the stack outputs.
- On the Actions menu, choose Restart/Resume to start the job.
- Verify the full load is complete and the job has moved to ongoing replication mode.
Now that the job is in ongoing replication, your data changes from Azure to Aurora PostgreSQL flow automatically (as per the sync schedule you set up for the Data Sync Agent).
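If you want to check or start the task programmatically instead of through the console, the following sketch assumes Python with boto3 and a hypothetical task identifier, azure-to-aurora-task; substitute the identifier of the task your stack created:

```python
import boto3

dms = boto3.client("dms", region_name="us-east-1")

# Find the replication task that the CloudFormation stack created
tasks = dms.describe_replication_tasks(
    Filters=[{"Name": "replication-task-id", "Values": ["azure-to-aurora-task"]}]
)["ReplicationTasks"]
task = tasks[0]
print(task["ReplicationTaskIdentifier"], task["Status"], task["MigrationType"])

# Start or resume the task; equivalent to Restart/Resume on the console
if task["Status"] in ("ready", "stopped"):
    dms.start_replication_task(
        ReplicationTaskArn=task["ReplicationTaskArn"],
        StartReplicationTaskType="resume-processing"
        if task["Status"] == "stopped"
        else "start-replication",
    )
```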
Cut over
You’re now ready to cut over to Aurora PostgreSQL and test your application.
- Stop the application traffic to the Azure SQL database.
- Wait about 30 seconds to allow in-flight transactions to catch up to the staging environment.
- Stop the Azure SQL Data Sync Agent.
- Allow the migration task to catch up to the final changes from the Azure SQL database to Aurora PostgreSQL (as per the sync schedule you specified).
- Stop and remove the AWS DMS migration task.
- Point your application to Aurora PostgreSQL.
You’re now live with Aurora PostgreSQL.
Clean up the resources
Now that the tables have been successfully migrated from the Azure SQL database to Aurora, it’s time to clean up by deleting your migration task, replication instance, and EC2 instance. If you prefer to script the cleanup, a sketch follows these steps.
- On the AWS DMS console, choose Database migration tasks.
- Select your migration task and on the Actions menu, choose Stop.
- On the Actions menu, choose Delete.
- Choose Delete again to confirm.
- In the navigation pane, choose Replication instances.
- Select your replication instance and on the Actions menu, choose Delete.
- On the Amazon EC2 console, stop and delete your instance.
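The following sketch performs the same cleanup with Python and boto3; the ARNs and instance ID are placeholders for the values from your AWS DMS console and CloudFormation stack outputs:

```python
import boto3

dms = boto3.client("dms", region_name="us-east-1")
ec2 = boto3.client("ec2", region_name="us-east-1")

# Placeholders; copy the real values from the DMS console and stack outputs
task_arn = "arn:aws:dms:us-east-1:123456789012:task:EXAMPLE"
replication_instance_arn = "arn:aws:dms:us-east-1:123456789012:rep:EXAMPLE"
staging_instance_id = "i-0123456789abcdef0"

# Stop the running migration task, then delete it once it reports 'stopped'
dms.stop_replication_task(ReplicationTaskArn=task_arn)
waiter = dms.get_waiter("replication_task_stopped")
waiter.wait(Filters=[{"Name": "replication-task-arn", "Values": [task_arn]}])
dms.delete_replication_task(ReplicationTaskArn=task_arn)

# Delete the replication instance and terminate the staging EC2 instance
dms.delete_replication_instance(ReplicationInstanceArn=replication_instance_arn)
ec2.terminate_instances(InstanceIds=[staging_instance_id])
```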
Advantages of this solution
This solution uses native tools from Azure and AWS services, which means you have support for all components of the solution.
It provides near-real-time replication with CDC, which gives you the option to cut over from Azure to AWS with minimal downtime.
The AWS service deployment is fully automated, so you can start the migration quickly and reuse the CloudFormation template for future enhancements if you choose to continue with the same platform.
Conclusion
In this post, I showed you how to combine Azure Data Sync and AWS DMS to migrate your Azure SQL database to Aurora with minimal downtime. The step-by-step guide helps you get hands-on experience migrating an Azure SQL database to Aurora PostgreSQL and Aurora MySQL using the Azure Data Sync Agent along with AWS SCT and AWS DMS.
If you’re looking to modernize SQL Server or Oracle databases running on premises, refer to the following:
- Migrating Autodesk’s mission-critical database from Microsoft SQL Server to Amazon Aurora MySQL
- Best practices for migrating an Oracle database to Amazon RDS PostgreSQL or Amazon Aurora PostgreSQL: Source database considerations for the Oracle and AWS DMS CDC environment
Leave your feedback in the comments so we can continue to improve upon this solution.
About the authors
Asif Mujawar is a Senior Specialist Database Solutions Architect based in London. He is a database professional with broad expertise and experience across industry verticals, where he is seen as a trusted advisor in the database space. In his role, he helps customers unlock opportunities by facilitating their cloud migration journey, with a focus on modernisation and democratisation of data.