question

tecles_192911 avatar image
tecles_192911 asked Erick Ramirez edited

Are SSTables tracked in Cassandra?

From today's Q and A session, 08/28/2020, I understood that SSTables can be removed even with the cluster running. Did I get it right? That is really cool for time series data.

So, there is no metadata tracking SSTables in Cassandra?

cassandratwcstimeseries
10 |1000

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.

1 Answer

Erick Ramirez avatar image
Erick Ramirez answered Erick Ramirez edited

Yes, Cassandra keeps track of the SSTables it manages. I feel that you've taken Patrick McFadin's comments out of context so allow me to explain.

Cassandra keeps track of all SSTables it owns. You will see on startup in the debug.log that C* opens all the files so it knows what data is on disk including the partition index, partition summaries, etc.

For the purposes of time series data, we recommend you use TimeWindowCompactionStrategy (TWCS) with a TTL on the data. When all the data in an SSTable (time window) is fully-expired, all TWCS does is drop (delete) it from the filesystem. Cheers!

2 comments Share
10 |1000

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.

tecles_192911 avatar image tecles_192911 commented ·

Awesome! That is great!

Thank you.

0 Likes 0 ·
Erick Ramirez avatar image Erick Ramirez ♦♦ tecles_192911 commented ·

Good to hear. Cheers!

0 Likes 0 ·