[AWS] DataSync

DataSync is an online data transfer service that automates transferring large amounts of data to and from AWS storage services over the internet or AWS Direct Connect (DX) in a simple way.


Features

  • Copy data between on-premise data storage and AWS storage.
    • On-premise: Network File System (NFS), Server Message Block (SMB) file servers, or AWS Snowcone
    • AWS: S3 buckets, EFS file systems, and Amazon FSx for Windows
  • Can even be used to move data from EFS to EFS.
  • Archive cold data by sending them directly to S3 Glacier or S3 Glacier Deep Dive.
  • Replication can be done hourly, daily, or weekly.

Steps to Use DataSync

  1. Deploy an agent
    • A DataSync agent is a VM that is owned by a user, and is used to read and write data. The agent cannot encrypt or decrypt data.
  2. Create a data transfer task
  3. Start the transfer and monitor it in the console or CloudWatch

Comparisons with Other Services

Data Migration Decision

  • Offline
    • AWS Snowball Family
  • Online
    • Database
      • AWS DMS (Data Migration Service)
    • File
      • DataSync
    • Virtual machines
      • VMware Cloud

Services

  • DataSync vs. AWS Snowball Edge
    • DataSync is for online data transfer, and Snowball Edge is for offline transfer.
  • DataSync vs. Storage Gateway
    • Use AWS DataSync to migrate existing data to Amazon S3, and then use the File Gateway to retain access to the migrated data and perform ongoing updates from your on-premises file-based applications.

Leave a Comment