Bringing together the Apache Cassandra experts from the community and DataStax.

Want to learn? Have a question? Want to share your expertise? You are in the right place!

Not sure where to begin? Getting Started

 

question

Valuser avatar image
Valuser asked ·

Do I need to run sstableloader in each of the target nodes?

Hi, I have a 3 node cluster . I took a backup snapshot from each of these 3 nodes. I want to restore the above snapshot to a 2 node (n1, n2) target cluster.

I am currenlty logged in to one of the target cluster nodes (n1). I am running sstableloader from this instance to restore the above said 3 node cluster snapshot folder.

Is it enough to run sstableloader in one of the target cluster nodes?(here n1). I mean will sstableloader understand the other node(n2) also belong in the target cluster ring and distribute accordingly.

Or do i need to run sstableloader in each of the target cluster nodes?

sstableloader
10 |1000 characters needed characters left characters exceeded

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.

1 Answer

Erick Ramirez avatar image
Erick Ramirez answered ·

As I explained in your other question, the Cassandra bulk loader utility sstableloader loads table snapshots to a cluster by streaming relevant parts of SSTables to destination nodes. In this context, "relevant parts" means the data which belongs in the token range(s) owned by the destination nodes.

This means that any data you restore is streamed to the relevant token range owners in the cluster.

For what it's worth, you just happen to be running the sstableloader utility on "node 1". You can run multiple instances of the utility on other servers which are not part of the cluster for optimum performance. Cheers!

Share
10 |1000 characters needed characters left characters exceeded

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.