have cassandra cluster where CPU usage is between 10-20% normally with occcassional spikes to 40-50%. I am storing time series data ( which can come at times out of order in some delta time typical IOT use case)
Currently I have configured 45 days as my interval for time window compaction.
As of now at end of 45 days my data gets compacted. Currently this process takes about 3-4 hrs for SStable of 70-80GB (with parition size of 10-20MB)
As load and amount of data ingested increases SStable size also will increse what are options I can use to deal with these long compaction taking place ? What is ideal way to handle end of time window compaction gracefully ? Is there any config that I can push this to happen on weekends instead (When load is very less) ?
Also does having smaller parition size eg. 5 MB Partition size 80GB SStable v/s 20MB parition 80GB SStable would there be significant difference in compaction times?