Hi all,
Is there a supported way to convert snapshot sstables to csv? sstables are taken from DSE 6.8 snapshot.
We have a huge table - 40TB of data on a 14 nodes cluster and need to export it to csv atomically. That should be a point in time export.
Using dsbulk is not an option:
- takes ages to export 40TB of data
- export is not consistent. we need a point in time export - we need the data exactly as it was when the export started.
We'd like to use something like https://github.com/jberragan/spark-cassandra-bulkreader for reading snapshot files, but it doesn't work with DSE 6.8 'bti' SSTables.
Any suggestions how to accomplish this?