I have to create DR of a 50TB+ TiDB cluster #66514
Replies: 3 comments
-
|
Original author: TiDB-AI-Assistant Here is the detailed Disaster Recovery (DR) plan for a 50TB+ TiDB cluster: Creating a comprehensive Disaster Recovery (DR) plan for a 50TB+ TiDB cluster involves several key components, including backup strategies, incremental backup options, replication using TiCDC, and leveraging cloud storage solutions. Here’s a detailed plan: 1. Backup StrategiesFull Backup
Alternative Backup Methods
2. Incremental Backup Options
3. Replication with TiCDC
4. Cloud Storage Solutions
Performance and Storage Impact Minimization
By implementing these strategies, you can ensure a robust disaster recovery plan for your TiDB cluster, minimizing downtime and data loss in the event of a disaster. For further details, you can explore the provided links to the official documentation. |
Beta Was this translation helpful? Give feedback.
-
|
Original author: Hazel @prateekrckstr Could you please let us know how long your current backup process takes? If the GC is stuck for a day, it generally doesn’t affect performance too much. However, if your cluster is already running at its limits, you might consider scaling it up. After scaling, you can try the backup again. |
Beta Was this translation helpful? Give feedback.
-
|
Original author: prateekrckstr @Hazel We haven’t taken backup of this cluster yet hence unsure about backup ETA. The main challenge is that even though the backup is completed I can only enable the GC back only after the restore of the replica server is completed and TiCDC is in sync. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Original author: prateekrckstr
Original time: 2025-01-13 10:13:15.864032
Original ID/Slug: 10365 / i-have-to-create-dr-of-a-50tb-tidb-cluster
Application environment:
Production
TiDB version: TiDB-v4.0.14
Problem:
I have to take backup of 50TB+ TiDB backup and restore it on a new TiDB cluster as a part of our DR Strategy. To take backup using BR we have make sure tidb_gc_enable=FALSE or we have to set it to a high value so that garbage collection gets enabled after TiCDC catches up. But if we disable this then space and performance for production is compromised. Is there any other methodology that we can adopt as backing up and restoring such a huge dataset will take a long time and hampering performance and increased Storage Usage doesn’t look feasible.
Beta Was this translation helpful? Give feedback.
All reactions