question

shiji.cherian_137128 avatar image
shiji.cherian_137128 asked Erick Ramirez answered

what is the data size per node supported in apache cassandra(3.11.3 and later)?

Have gone through datastax docs where the recommended capacity is 2TB disk size per node for DSE 6.7, however, are the same recommendations applicable for apache cassandra too?


if the data storage requirement is 100TB; can it be split between 5 nodes having 20TB each?

or does it have to be 50 nodes having 2TB each?

hardwarecapacity
10 |1000

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.

1 Answer

Erick Ramirez avatar image
Erick Ramirez answered

@shiji.cherian_137128 There are optimisations in DSE which are not available in OSS Apache Cassandra so your experience may vary.

I would suggest sticking with the 500GB to 1TB density per node if your data is mostly hot (meaning it is updated regularly and accessed constantly). Only go beyond to 2-4 TB per node IF your data is cold. Cold data means:

  • no longer getting updated (or almost never)
  • stored for archive purposes only
  • not getting accessed often, even rarely

The same caveats apply for dense nodes -- repairs are problematic, recovery takes a long time, bootstrap/decommission tasks take days. Cheers!

Share
10 |1000

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.