Hi,
I was reading the documentation about SAI index and the default is 10 per table and 100 per cluster.
Is there a negative impact if I increase it to 30 per table and 300 per cluster? I couldn’t find any reference about this subject.
Thank you.
Hi,
I was reading the documentation about SAI index and the default is 10 per table and 100 per cluster.
Is there a negative impact if I increase it to 30 per table and 300 per cluster? I couldn’t find any reference about this subject.
Thank you.
We picked conservative defaults for Storage Attached Indexing (SAI) limits because we wanted users to avoid being surprised by unfavourable cluster performance if they decide to go overboard and index dozens of columns in lots of tables.
To answer your question directly, too many indexes can affect the performance of a database regardless of the database you use. To be clear, this isn't limited to Cassandra -- the performance hit of too many indexes applies to any NoSQL or relational databases.
I don't have readily available data to show you but I would posit that the write performance of a table degrades linearly as the number of indexes grows. I would even hypothesise that in some instances (in variations of data model, access pattern, data types), there's an inflection point where the degradation gets worse than linear.
The reality is that too many indexes on a table amplifies the writes and affects the performance for inserts and deletes. As most things, YMMV.
It is difficult to guess how it will affect your application because there are lots of moving parts -- use case, data model, access patterns, hardware configuration, network configuration, etc. Only you can determine how your cluster will perform based on extensive production-like testing. Cheers!
6 People are following this question.
DataStax Enterprise is powered by the best distribution of Apache Cassandra ™
© 2022 DataStax, Titan, and TitanDB are registered trademarks of DataStax, Inc. and its subsidiaries in the United States and/or other countries.
Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.
Privacy Policy Terms of Use