I am reading a table from cassandra , and I am facing delay in read request. My observation is that there is very big partition in cassandra but when I am running cfstats command it returns
I am reading a table from cassandra , and I am facing delay in read request. My observation is that there is very big partition in cassandra but when I am running cfstats command it returns
There isn't an easy way of finding the largest partition in a table. That functionality requires a full table scan and that operation isn't allowed in Cassandra.
Have a look at cassandra-sstable-tools. I haven't tried it myself but you might find something there that meets your needs. Cheers!
In addition to what Erick said, you might also look into Spark partition sizes in Spark UI. You won't get exact Cassandra partition sizes there, as Spark partition usually contains many Cassandra partitions.
6 People are following this question.
DataStax Enterprise is powered by the best distribution of Apache Cassandra ™
© 2022 DataStax, Titan, and TitanDB are registered trademarks of DataStax, Inc. and its subsidiaries in the United States and/or other countries.
Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.
Privacy Policy Terms of Use