[AWS] DataSync

DataSync is an online data transfer service that automates transferring large amounts of data to and from AWS storage services over the internet or AWS Direct Connect (DX) in a simple way.

Features

  • DataSync can copy data between on-premise data storages (Network File System (NFS), Server Message Block (SMB) file servers, or AWS Snowcone) and AWS storages (S3 buckets, EFS file systems, and Amazon FSx for Windows).
  • DataSync can even be used to move data from EFS to EFS.
  • You can use DataSync to archive cold data by sending them directly to S3 Glacier or S3 Glacier Deep Dive.
  • Replication can be done hourly, daily, or weekly.

Steps to Use DataSync

  1. Deploy an agent
  2. Create a data transfer task
  3. Start the transfer and monitor it in the console or CloudWatch

A DataSync agent is a VM that is owned by a user, and is used to read and write data. The agent cannot encrypt or decrypt data.


Comparisons with Other Services

Data Migration Decision

  • Offline
    • AWS Snowball Family
  • Online
    • Database
      • AWS DMS (Data Migration Service)
    • File
      • DataSync
    • Virtual machines
      • VMware Cloud

Services

  • DataSync vs. AWS Snowball Edge
    • DataSync is for online data transfer, and Snowball Edge is for offline transfer.
  • DataSync vs. Storage Gateway
    • Use AWS DataSync to migrate existing data to Amazon S3, and then use the File Gateway to retain access to the migrated data and perform ongoing updates from your on-premises file-based applications.

Leave a Comment

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s