Which 3 of the data structures below are supported by an SSTable?
- Bloom Filter
- Partition table
- Partition Index
- Partition Summary
The Bloom filter is a probabilistic data structure that helps determine whether an SSTable is likely to contain data for a partition: it can report false positives, but never false negatives. A serialised Bloom filter of the partition keys is stored in the *-Filter.db component of each SSTable and is read into off-heap memory.
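To illustrate the idea, here is a minimal toy sketch of a Bloom filter in Python. It is not Cassandra's implementation: the class name `ToyBloomFilter`, the sizes, and the use of SHA-256 for hashing are all assumptions made for this example.

```python
import hashlib


class ToyBloomFilter:
    """Toy Bloom filter; sizes and hashing scheme are illustrative assumptions."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for readability

    def _positions(self, key):
        # Derive several bit positions from one digest (double hashing).
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big")
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(self.bits[pos] for pos in self._positions(key))


bloom = ToyBloomFilter()
for partition_key in ["user:1", "user:2", "user:3"]:
    bloom.add(partition_key)
```

A key that was added always reports `True`. A key that was never added usually reports `False`, but may occasionally report `True`, which is why the filter only lets a read skip an SSTable and never confirms that the data is there.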
The partition index contains an index of the partition keys in an SSTable and their corresponding offsets (pointers to their positions in the SSTable). The partition index is stored on disk in the *-Index.db component of each SSTable and is cached in off-heap memory as the partition key cache.
The partition summary is a sample (subset) of the partition index used to optimise reads. The partition summary is stored on disk in the *-Summary.db component of each SSTable and is read into off-heap memory.
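As a rough sketch of how a sampled summary narrows a search of the index, consider the Python below. The keys, offsets, the sampling interval `SAMPLE_EVERY`, and the function `find_offset` are all hypothetical, and the entries are ordered by plain string comparison purely to keep the example simple.

```python
from bisect import bisect_right

# Hypothetical partition index: (partition_key, offset), sorted by key.
index = [
    ("apple", 0), ("banana", 120), ("cherry", 260), ("date", 410),
    ("elder", 530), ("fig", 700), ("grape", 845),
]

SAMPLE_EVERY = 3  # assumed sampling interval for this sketch

# The summary keeps every Nth index entry plus its position in the index.
summary = [(index[i][0], i) for i in range(0, len(index), SAMPLE_EVERY)]
summary_keys = [key for key, _ in summary]


def find_offset(key):
    """Use the summary to pick a small index range, then scan only that range."""
    slot = bisect_right(summary_keys, key) - 1
    if slot < 0:
        return None  # key sorts before the first indexed key
    start = summary[slot][1]
    for candidate, offset in index[start:start + SAMPLE_EVERY]:
        if candidate == key:
            return offset
    return None
```

Here `find_offset("date")` consults the three-entry summary first and then scans at most `SAMPLE_EVERY` index entries, rather than searching the whole index.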
"Partition table" is not an SSTable data structure.
These structures are discussed in more detail in How data is read in Cassandra. Cheers!
@ashok.dcosta_187920 Cassandra processes data at several stages on the read path to discover where the data is stored, starting with the data in the memtable and finishing with SSTables:
Some of the stages checked are the memtable, Bloom filter, partition key cache, and partition summary.
For more information, please refer to Cassandra Read Path.
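The order of those stages can be sketched very roughly as below. This is a simplified illustration, not Cassandra's code: the function `read_partition`, the dictionary layout, and the sample data are hypothetical, a plain Python set stands in for the Bloom filter, and a dictionary lookup stands in for the summary and index search.

```python
def read_partition(key, memtable, sstables):
    """Return (stages visited, data fragments found) for one partition key."""
    trace, fragments = ["memtable"], []
    if key in memtable:
        fragments.append(memtable[key])
    for sstable in sstables:
        name = sstable["name"]
        trace.append(f"{name}: bloom filter")
        if key not in sstable["bloom"]:  # a set stands in for the Bloom filter
            continue  # definitely not in this SSTable, so skip it
        trace.append(f"{name}: key cache")
        offset = sstable["key_cache"].get(key)
        if offset is None:
            trace.append(f"{name}: summary + index")
            offset = sstable["index"].get(key)
            if offset is None:
                continue  # the filter gave a false positive
            sstable["key_cache"][key] = offset  # remember for the next read
        fragments.append(sstable["data"][offset])
    return trace, fragments


memtable = {"k1": "v1-new"}
sstables = [
    {"name": "sstable-1", "bloom": {"k1", "k2"}, "key_cache": {},
     "index": {"k1": 0, "k2": 1}, "data": ["v1-old", "v2"]},
    {"name": "sstable-2", "bloom": {"k3"}, "key_cache": {},
     "index": {"k3": 0}, "data": ["v3"]},
]

first_trace, first_fragments = read_partition("k1", memtable, sstables)
second_trace, _ = read_partition("k1", memtable, sstables)
```

In this sketch the second read of `k1` finds its offset in the key cache and skips the summary and index step, and `sstable-2` is skipped after the filter check on both reads.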
This is an interview question I was asked. Does this mean the data structures supported by an SSTable are the Bloom filter, partition index and partition summary? Are these the correct answers?
A friendly note to let you know that I've converted your post to a comment since it's not an "answer". Cheers!