Can spark-cassandra-connector ensure data locality in a Spark cluster which runs on Apache YARN cluster mode?
Bringing together the Apache Cassandra experts from the community and DataStax.
Want to learn? Have a question? Want to share your expertise? You are in the right place!
Not sure where to begin? Getting Started
@srlsooriyagoda_185665 Similar to question 2323 you asked a couple of weeks ago, data locality can only be achieved when the Spark task is running on the same servers as the Cassandra nodes. If the Spark cluster and the Cassandra cluster are two distinct (separate clusters) then there is no data that is local to Spark. Cheers!
5 People are following this question.