Bringing together the Apache Cassandra experts from the community and DataStax.

Sarannyaa asked ·

Does Cassandra support storage of Japanese characters - Kanji & Katakana?

Does Cassandra support storage of Japanese characters, specifically Kanji and Katakana? I believe this requires UTF-16, but I could not find any supporting documentation. I am reaching out to the wider community to see how others are handling this.

cql

1 Answer

Erick Ramirez answered ·

The Unicode standard encodes over ninety thousand CJK (Chinese, Japanese, Korean) ideographs, which cover Kanji, and it also has dedicated blocks for Katakana and Hiragana [1].

All of these characters can be represented in any Unicode encoding, whether UTF-8, UTF-16 or UTF-32, so UTF-16 is not required.

The CQL data types text and varchar are both UTF-8 strings so choosing between the two types is a matter of personal preference since they're the same.
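If you want to convince yourself before touching a cluster, here is a minimal Python sketch (not Cassandra-specific, standard library only) that checks whether Japanese strings survive a round trip through UTF-8, the encoding used by text and varchar columns. The sample strings and the helper name `utf8_round_trip` are made up for illustration.

```python
# Illustrative sample strings only: Kanji, Katakana, and Hiragana.
samples = {
    "kanji": "漢字",
    "katakana": "カタカナ",
    "hiragana": "ひらがな",
}


def utf8_round_trip(value: str) -> bool:
    """Return True if the string is unchanged after encoding to UTF-8 and decoding back."""
    return value.encode("utf-8").decode("utf-8") == value


for name, value in samples.items():
    encoded = value.encode("utf-8")
    # Each of these characters takes 3 bytes in UTF-8.
    print(f"{name}: {value} -> {len(value)} code points, "
          f"{len(encoded)} UTF-8 bytes, round trip ok: {utf8_round_trip(value)}")
```

Every sample should report a successful round trip, which is the property that matters for a UTF-8 column. Note that these characters take 3 bytes each in UTF-8, so the stored size is larger than the character count suggests.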

For a complete list, see CQL data types. Cheers!

[1] "Unicode 13.0 UCD: PropList.txt", (2019-11-27)
