Does Cassandra support storage of Japanese characters - Kanji & Katakana ? It requires UTF-16 i believe and could not find any supporting documents. Reaching out to wider community to see how others are handling.
Does Cassandra support storage of Japanese characters - Kanji & Katakana ? It requires UTF-16 i believe and could not find any supporting documents. Reaching out to wider community to see how others are handling.
The Unicode standard supports over ninety thousand CJK characters (Chinese, Japanese, Korean) including both Katakana and Hiragana [1].
All these CJK characters are supported regardless of the encoding used so it applies to UTF-8, UTF-16 or UTF-32.
The CQL data types text
and varchar
are both UTF-8 strings so choosing between the two types is a matter of personal preference since they're the same.
For a complete list, see CQL data types. Cheers!
[1] "Unicode 13.0 UCD: PropList.txt", (2019-11-27)
5 People are following this question.
DataStax Enterprise is powered by the best distribution of Apache Cassandra ™
© 2023 DataStax, Titan, and TitanDB are registered trademarks of DataStax, Inc. and its subsidiaries in the United States and/or other countries.
Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.
Privacy Policy Terms of Use