I have a 9-node Apache Cassandra cluster (version 3.11.6) in a single DC, running on AWS EC2 instances.
Each EC2 instance is an r5.2xlarge with a 1 TB gp2 SSD volume.
The customer runs batch jobs to load data into the Cassandra cluster: there are four batch runs per day, and each runs for about 4 hours.
The customer has asked the DBA team for a solid backup and restore/recovery solution, with the following requirements:
1) Take a backup just before each batch run starts; if the batch run fails, restore the database from that most recent backup. Before the batch runs, the application gateway is put into blackout so that no application traffic arrives during the data load; the gateway is reopened to applications only after the batch job completes successfully.
2) The customer doesn't want a second DC to serve as DR/backup; they are always looking for a cost-saving solution.
3) What are the best backup and recovery methods or tools for the above use case? They need to be fast and simple to use.
4) The customer may not want to take EC2 volume-level backups and push them to S3.
Is a Cassandra snapshot still the best option, or is there another open-source tool for Apache Cassandra that suits the above use cases?
Snapshot backups consume little space, are easy to take, and are local to the node, but recovering/restoring from them needs automation and expertise.
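
For reference, the snapshot-based flow I have in mind could be sketched roughly as below. This is only a minimal sketch under assumptions: the tag format, the batch ID, the keyspace name, the table directory name, and the data directory `/var/lib/cassandra/data` are all hypothetical placeholders, and the helpers only build the `nodetool` command lines and paths rather than running anything. Since `nodetool snapshot` acts on the local node only, the same commands would have to be issued on each of the 9 nodes.

```python
from datetime import datetime, timezone
from pathlib import Path
from typing import List


def snapshot_tag(batch_id: str, now: datetime) -> str:
    """Build a sortable snapshot tag (hypothetical naming scheme)."""
    return f"prebatch_{batch_id}_{now.strftime('%Y%m%dT%H%M%SZ')}"


def snapshot_cmd(tag: str, keyspace: str) -> List[str]:
    """Command to take a tagged snapshot of one keyspace on the local node."""
    return ["nodetool", "snapshot", "-t", tag, keyspace]


def clear_cmd(tag: str, keyspace: str) -> List[str]:
    """Command to drop the snapshot once the batch has succeeded."""
    return ["nodetool", "clearsnapshot", "-t", tag, keyspace]


def refresh_cmd(keyspace: str, table: str) -> List[str]:
    """Command to load SSTables that were copied back into the table directory."""
    return ["nodetool", "refresh", keyspace, table]


def snapshot_dir(data_dir: Path, keyspace: str, table_dir: str, tag: str) -> Path:
    """Where the snapshot's SSTables live on disk for one table.

    table_dir is the on-disk directory name, which carries the table ID
    suffix (for example 'events-<id>'), not just the table name.
    """
    return data_dir / keyspace / table_dir / "snapshots" / tag


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    tag = snapshot_tag("batch1", datetime.now(timezone.utc))
    print(" ".join(snapshot_cmd(tag, "my_keyspace")))
    print(snapshot_dir(Path("/var/lib/cassandra/data"), "my_keyspace", "events-<id>", tag))
    print(" ".join(refresh_cmd("my_keyspace", "events")))
    print(" ".join(clear_cmd(tag, "my_keyspace")))
```

On a failed batch, the restore step I am assuming is: with the gateway still in blackout, clear the current data for the affected tables, copy the files from the snapshot directory back into the table directory on every node, and then run the refresh command (or restart the node). That per-node copy-back is the part that needs the automation.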