Seamless Data Transition: Exploring MongoDB Cluster To Cluster Sync

Achieve real-time data consistency with MongoDB Cluster-to-Cluster Sync. Discover how MongoSync enables seamless data synchronization for disaster recovery, global availability, and smooth migrations.

Rexcily March 12, 2025

In large-scale deployments, syncing data between MongoDB clusters across regions or environments becomes critical. Whether for disaster recovery, migrations, or high availability, Cluster-to-Cluster Sync ensures data consistency between clusters. In this blog, we will explore how cluster-to-cluster synchronization works in MongoDB, the architecture, and practical steps for setting it up.

What Is Cluster-to-Cluster Sync in MongoDB?

Cluster-to-cluster synchronization keeps two separate MongoDB clusters in step by continuously copying data from a source cluster to a destination cluster, so the same data is available across geographic locations or cloud providers. It is available for clusters running MongoDB 6.0 or later and is driven by the mongosync utility, which supports disaster recovery, hybrid cloud setups, and high-availability architectures.

Architecture Overview

The architecture for MongoDB cluster-to-cluster synchronization typically involves the following components:

Source Cluster: The MongoDB cluster from which data is being synchronized.
Destination Cluster: The target MongoDB cluster where the data will be synchronized.
Mongosync: The tool that performs the synchronization. It connects to both clusters and keeps the data transfer consistent and efficient.

Key Architectural Elements:

Sharding Support: Mongosync supports both sharded clusters and replica sets, allowing flexible scaling.
High Availability: Synchronization works seamlessly with MongoDB replica sets to ensure redundancy and fault tolerance.
Incremental Data Transfer: After the initial full sync, Mongosync applies changes incrementally, reading them from a change stream (which is backed by the oplog) on the source cluster.

How Cluster-to-Cluster Sync Works

MongoDB Cluster-to-Cluster Sync keeps a destination cluster in sync with a source cluster so that both hold the same data. Here’s a simple explanation of how it works:

Connecting Clusters: Mongosync connects to your source cluster (where the data currently is) and the destination cluster (where the data will go). After connecting, it waits in the IDLE state, ready to start syncing.
Starting the Sync: When the sync begins, Mongosync copies existing data from the source to the destination, partitioning large collections for efficient copying.
Change Event Application: Once the initial copy is complete, Mongosync captures ongoing changes (inserts, updates, deletes) on the source using a change stream and applies them to the destination so that both clusters remain synchronized in near real time.
Pausing and Resuming Sync: Syncing can be paused and resumed without affecting ongoing operations. When paused, Mongosync enters the PAUSED state; when resumed, it continues from where it left off.
Finalizing the Sync: Once the destination has caught up, issuing a commit ensures that all data is up to date, and Mongosync transitions to the COMMITTED state.
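
The lifecycle described above can be read as a small state machine. The sketch below is illustrative only: the function name next_state is hypothetical, the action names mirror the API endpoints used later in this post, and the model is deliberately simplified to the four states this post mentions.

```shell
#!/bin/sh
# next_state CURRENT ACTION
# Prints the state that follows CURRENT after ACTION, or returns 1 when the
# transition is not part of the lifecycle described in this post.
next_state() {
  case "$1:$2" in
    IDLE:start)     echo RUNNING ;;
    RUNNING:pause)  echo PAUSED ;;
    PAUSED:resume)  echo RUNNING ;;
    RUNNING:commit) echo COMMITTED ;;
    *)              return 1 ;;
  esac
}
```

For example, next_state PAUSED resume prints RUNNING, while next_state IDLE commit fails because a sync has to be started before it can be committed.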

Advantages of Cluster-to-Cluster Sync

Disaster Recovery: Cluster-to-cluster sync enables organizations to maintain a backup cluster, ready to take over if the primary cluster fails.
Data Migration: Synchronizing between clusters allows for seamless data migration, especially when upgrading hardware or moving to a new data center or cloud environment.
Cross-Region Replication: Ensures data availability across different geographic regions, providing low-latency access to users worldwide.
No Downtime: The synchronization process, including the full and incremental syncs, happens in real time without causing any downtime to the running clusters.

Setting Up Cluster-to-Cluster Sync Using MongoSync:

In this section, we will focus on configuring Cluster-to-Cluster Sync using mongosync. There are three options available for synchronization:

Connect Two Atlas Clusters
Connect Two Self-Managed Clusters
Connect a Self-Managed Cluster to Atlas

In this blog, we will concentrate on connecting two self-managed clusters.

Stay tuned for upcoming blogs, where we will cover the other options for cluster synchronization, including connecting two Atlas clusters and connecting a self-managed cluster to Atlas.

Installing Cluster-to-Cluster Sync Tool

Step 1: Download the Tool

You can obtain the Cluster-to-Cluster Sync tool (Mongosync) as a .tgz tarball from the official MongoDB website. Make sure to download the correct version based on your operating system.

Step 2: Installation Instructions

Once downloaded, follow the detailed installation steps outlined in the MongoDB Mongosync Installation Guide. It includes platform-specific instructions to ensure a smooth installation process.

Initializing Mongosync

There are two common ways to initialize Mongosync, offering flexibility based on your preferences: via command-line or configuration file. Let’s explore both methods.

Method 1: Command-Line Initialization

You can initialize Mongosync directly from the command line using the following command:

mongosync \
  --cluster0 "mongodb://mafadmin:maf313@172.17.0.13:27017,172.17.0.13:27018" \
  --cluster1 "mongodb://mafadmin:maf313@172.17.0.14:27017,172.17.0.14:27018"

Method 2: Configuration File Initialization

Alternatively, you can configure Mongosync using a configuration file for more control. Here is an example configuration file:

cat /etc/mongosync.conf

cluster0: "mongodb://mafadmin:maf313@172.17.0.13:27017,172.17.0.13:27018"
cluster1: "mongodb://mafadmin:maf313@172.17.0.14:27017,172.17.0.14:27018"
logPath: "/var/log/mongosync"

Once the config file is ready, initialize Mongosync with the following command:

mongosync --config /etc/mongosync.conf
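
If you would rather generate the configuration file from a script than edit it by hand, something like the following may help. It is a minimal sketch: the function name write_mongosync_conf and its argument order are hypothetical, and only the three keys shown in this post (cluster0, cluster1, logPath) are written.

```shell
#!/bin/sh
# write_mongosync_conf FILE SOURCE_URI DEST_URI LOG_PATH
# Writes a mongosync configuration file with the three settings used in this
# post. The umask makes a newly created file readable by its owner only,
# because the connection strings contain credentials.
write_mongosync_conf() {
  conf_file=$1
  (
    umask 077
    cat > "$conf_file" <<EOF
cluster0: "$2"
cluster1: "$3"
logPath: "$4"
EOF
  )
}
```

The connection strings can then come from environment variables or a secrets store instead of being typed into the file directly.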

You can verify if Mongosync is running correctly by checking the log file:

tail -f /var/log/mongosync/mongosync.log

Look for entries similar to this to confirm initialization:

{"time":"2024-10-16T05:10:19.115377Z","level":"info","message":"Running webserver."}
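
The same check can be scripted, for instance in a startup script that should not continue until Mongosync is up. This is a sketch under the assumption that the log line looks like the one above; the function name webserver_started is hypothetical.

```shell
#!/bin/sh
# webserver_started LOG_FILE
# Succeeds (exit status 0) if LOG_FILE contains the "Running webserver."
# message that this post uses as the sign of a successful initialization.
webserver_started() {
  grep -q '"message":"Running webserver."' "$1"
}
```

A typical use would be: webserver_started /var/log/mongosync/mongosync.log && echo "mongosync is up".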

Starting Data Synchronization

To start syncing data between the clusters, use the following curl command to initiate the process via Mongosync’s API:

curl localhost:27182/api/v1/start -X POST \
 --data '{ "source": "cluster0", "destination": "cluster1" }'

If successful, you should see:

{"success":true}

Monitoring Sync Progress

You can monitor the sync process by querying the current status using this command:

curl localhost:27182/api/v1/progress -X GET

The response will provide detailed insights, including whether Mongosync is actively syncing:

{"progress":{"state":"RUNNING","canCommit":true,"canWrite":false,"info":"change event application","lagTimeSeconds":0,"collectionCopy":{"estimatedTotalBytes":3953412,"estimatedCopiedBytes":3953412}}}
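
When monitoring from a script, it is often enough to pull a single value out of this response. The helper below is a rough sketch, not a JSON parser: it assumes the compact, single-line format shown above and only handles scalar fields. The name progress_field is hypothetical.

```shell
#!/bin/sh
# progress_field JSON KEY
# Prints the first scalar value (string, number, or boolean) stored under
# KEY in a compact progress response. Text-based matching, not real JSON
# parsing, so it assumes no whitespace around the colon.
progress_field() {
  printf '%s\n' "$1" \
    | grep -oE "\"$2\":(\"[^\"]*\"|[^,}]*)" \
    | head -n 1 \
    | sed -e 's/^"[^"]*"://' -e 's/^"//' -e 's/"$//'
}
```

For example, progress_field "$(curl -s localhost:27182/api/v1/progress)" state would print the current state. If jq is installed, it is a more robust choice for the same job.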

Pausing Data Synchronization

There might be situations where you need to pause the synchronization process. Use the following command to pause Mongosync:

curl localhost:27182/api/v1/pause -X POST --data '{ }'

If successful, the request returns the following response and Mongosync transitions to the PAUSED state:

{"success":true}

Tip: If you plan to pause for an extended period, consider increasing the size of the source cluster’s oplog to prevent issues during resumption.
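
One way to act on this tip is MongoDB's replSetResizeOplog administrative command, which takes the new size in megabytes and is run against each replica set member of the source cluster. The sketch below only builds the mongosh expression; the helper name resize_oplog_js, the SOURCE_MEMBER_URI variable, and the 16000 MB figure are illustrative assumptions, not recommendations.

```shell
#!/bin/sh
# resize_oplog_js SIZE_MB
# Prints the mongosh expression that resizes the oplog to SIZE_MB megabytes.
resize_oplog_js() {
  printf 'db.adminCommand({ replSetResizeOplog: 1, size: Double(%s) })\n' "$1"
}

# Example invocation (hypothetical URI variable, run once per member):
#   mongosh "$SOURCE_MEMBER_URI" --eval "$(resize_oplog_js 16000)"
```

How large the oplog needs to be depends on the write volume on the source and on how long the pause is expected to last.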

Resuming Data Synchronization

To resume the synchronization process after a pause, execute the following command:

curl localhost:27182/api/v1/resume -X POST --data '{ }'

A successful response will indicate that the sync has resumed:

{"success":true}
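
The start, pause, resume, and commit requests in this post all follow the same pattern, so they can be wrapped in one small function. This is a sketch for convenience only: the function name mongosync_post and the MONGOSYNC_URL and CURL variables are hypothetical additions, while the endpoint paths and the default port 27182 are the ones used throughout this post.

```shell
#!/bin/sh
# mongosync_post ENDPOINT [JSON_BODY]
# Sends a POST request to a Mongosync API endpoint such as start, pause,
# resume, or commit. The body defaults to an empty JSON object.
# MONGOSYNC_URL overrides the address; CURL overrides the HTTP client.
mongosync_post() {
  body=${2:-}
  if [ -z "$body" ]; then
    body='{ }'
  fi
  "${CURL:-curl}" -s -X POST \
    --data "$body" \
    "${MONGOSYNC_URL:-localhost:27182}/api/v1/$1"
}
```

With this in place, mongosync_post pause and mongosync_post resume are equivalent to the two curl commands above.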

Committing Data Synchronization

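Once the destination has caught up with the source, commit the sync to finalize it and ensure that everything is properly applied to the destination cluster. If you are cutting over to the destination, stop application writes to the source cluster first so that no new changes arrive while the commit runs.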

Step 1: Verify Sync Status

Before committing, verify that the synchronization is ready for a commit:

curl localhost:27182/api/v1/progress -X GET

Check the canCommit flag in the response:

{"progress":{"canCommit":true,"info":"change event application"}}
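
In an automated cutover, this check is usually repeated until the flag turns true. The loop below is a minimal sketch of that idea: the function names fetch_progress and wait_until_committable are hypothetical, and the match on "canCommit":true assumes the compact response format shown above.

```shell
#!/bin/sh
# fetch_progress prints the raw progress response. It is a separate function
# so it can be replaced, for example when testing without a live Mongosync.
fetch_progress() {
  curl -s localhost:27182/api/v1/progress -X GET
}

# wait_until_committable MAX_ATTEMPTS [DELAY_SECONDS]
# Polls the progress endpoint until canCommit is true. Returns 0 when the
# sync is ready to commit and 1 if MAX_ATTEMPTS polls pass without success.
wait_until_committable() {
  attempts=$1
  delay=${2:-5}
  while [ "$attempts" -gt 0 ]; do
    case "$(fetch_progress)" in
      *'"canCommit":true'*) return 0 ;;
    esac
    attempts=$((attempts - 1))
    if [ "$attempts" -gt 0 ]; then
      sleep "$delay"
    fi
  done
  return 1
}
```

For example, wait_until_committable 60 10 would poll every 10 seconds for up to 60 attempts before giving up.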

Step 2: Commit the Synchronization

Once you’ve verified that the sync is ready, issue the commit request:

curl localhost:27182/api/v1/commit -X POST --data '{ }'

A successful commit will return:

{"success":true}

By following these steps, you can smoothly install, initialize, monitor, and manage Mongosync for your Cluster-to-Cluster synchronization needs, ensuring efficient data replication between MongoDB clusters.

At Mafiree, we simplify MongoDB Cluster-to-Cluster Sync, ensuring smooth, secure, and hassle-free data migration, disaster recovery, and cross-region replication. With our in-depth experience, we offer end-to-end services tailored to your business needs from initial consultation to full implementation and ongoing support.

Ready to elevate your database infrastructure? Let Mafiree handle the complexities so you can focus on growing your business. Contact us today and let’s start building a seamless data future together!

Author Bio

Rexcily

Rexcily Rijo is a MongoDB Certified DBA at Mafiree, with deep expertise in MongoDB database administration, performance optimization, and scalable data solutions. He writes practical, hands-on technical content focused on MongoDB internals, best practices for database operations, TTL and index strategies, and real-world techniques that help engineers and DBAs manage and optimize modern document databases.