arunkolluri07_188529 asked arunkolluri07_188529 commented

Can I get the partition size using tablehistograms?

I have an 18-node Cassandra cluster with a large-partition problem. My question is: can I run tablehistograms on every node for the table and average the results to get the partition size?

cassandra

steve.lacerda answered

I'm not sure I follow the question. Partition sizes are specific to each node because each node owns different token ranges. Averaging the per-node means would only give you an overall cluster mean. If that's what you want, then yes, but I'm not sure what that gains you. If you want to find large partitions, look in the logs for "writing large partition" or use a tool such as sstablemetadata to see the partition sizes.
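As a rough illustration of the log-search suggestion, here is a minimal Python sketch that filters log lines for that warning. The exact wording and layout of the message differ between Cassandra versions, so the size pattern below (`(<N> bytes`) is an assumption and may not match your logs; the function name is made up for this example.

```python
import re

# Phrase quoted in the answer above; matched case-insensitively because
# capitalisation may differ between versions.
PHRASE = "writing large partition"

# ASSUMPTION: some versions print the size as "(<N> bytes". Others may use a
# different layout, in which case the size is reported as None.
SIZE_RE = re.compile(r"\((\d+) bytes")


def find_large_partition_warnings(lines):
    """Return a list of (size_in_bytes_or_None, line) for matching log lines."""
    hits = []
    for line in lines:
        if PHRASE in line.lower():
            match = SIZE_RE.search(line)
            size = int(match.group(1)) if match else None
            hits.append((size, line.rstrip("\n")))
    return hits
```

You would feed it the lines of each node's log file (for example `find_large_partition_warnings(open(path))`, where `path` points at that node's system log) and review the hits node by node.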


Erick Ramirez answered arunkolluri07_188529 commented

By definition, histograms are approximate distributions of data. If you averaged the partition sizes from all the histogram outputs, the best you could get is an approximate average partition size; it would not tell you the size of any particular partition.
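To make the averaging point concrete, here is a small Python sketch with made-up numbers (the per-node means and partition counts are hypothetical, not taken from any real cluster). It compares a plain average of per-node means with one weighted by each node's partition count; neither figure identifies the largest partitions.

```python
def unweighted_mean(node_means):
    """Plain average of the per-node mean partition sizes."""
    return sum(node_means) / len(node_means)


def weighted_mean(node_means, partition_counts):
    """Average of per-node means, weighted by each node's partition count."""
    total = sum(partition_counts)
    return sum(m * c for m, c in zip(node_means, partition_counts)) / total


# HYPOTHETICAL values: two nodes with many small partitions and one node
# holding a handful of large ones.
means_bytes = [1_000, 1_000, 50_000]
partition_counts = [1_000_000, 1_000_000, 10]

print(unweighted_mean(means_bytes))
print(weighted_mean(means_bytes, partition_counts))
```

With these inputs the two averages differ by more than an order of magnitude, which shows how little a cluster-wide average says about individual large partitions.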

In any case, I'm not sure what outcome you're trying to achieve. If you tell us what problem you're trying to solve, we might be able to give you a better answer. Cheers!


arunkolluri07_188529 commented

[Converted to a comment since it's not an answer]