We get asked this question a bit so I wanted to use this opportunity to provide some reference information for running Apache Cassandra or DataStax Enterprise on Nutanix.
Bringing together the Apache Cassandra experts from the community and DataStax.
Want to learn? Have a question? Want to share your expertise? You are in the right place!
Not sure where to begin? Getting Started
The Nutanix Enterprise Cloud has an elastic, scalable architecture that builds upon a distributed storage fabric. While all disks in a hypervisor host are local PCIe bus attached. They are also made globally accessible within a storage pool, by storing all filesystem metadata in a distributed (Cassandra) KV store.
When designing storage for your Datastax instances, always bear in mind the requirement for high outstanding IO (OIO). Usually, this is achieved by striping the data across a collection of vdisks. We provide the most commonly used example below, with LVM2:
$ echo $DISKS /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf /dev/sdg $ sudo pvcreate /dev/sd[b-g] $ sudo vgcreate dstx-data /dev/sd[b-g] $ sudo lvcreate -l 100%FREE -i 6 -n dstx-data vol01
The default stripe width of 64k is normally fine. If your workload requirements mean you need to change this from the default, then use :
$ sudo lvcreate -l 100%FREE -i 6 -l 1024 -n dstx-data vol01
- this sets the stripe width to 1024k (1Mb), as an example.
You can read more best practice ideas for running Datastax DSE/Cassandra on Nutanix by reading the following freely available guide created by Nutanix Solutions Engineering :
Feel free to send feedback and help us to improve on the current revision.
6 People are following this question.