Bringing together the Apache Cassandra experts from the community and DataStax.

Want to learn? Have a question? Want to share your expertise? You are in the right place!

Not sure where to begin? Getting Started

 

question

Sarannyaa avatar image
Sarannyaa asked Erick Ramirez answered

Does Cassandra support storage of Japanese characters - Kanji & Katakana?

Does Cassandra support storage of Japanese characters - Kanji & Katakana ? It requires UTF-16 i believe and could not find any supporting documents. Reaching out to wider community to see how others are handling.

cql
10 |1000

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.

1 Answer

Erick Ramirez avatar image
Erick Ramirez answered

The Unicode standard supports over ninety thousand CJK characters (Chinese, Japanese, Korean) including both Katakana and Hiragana [1].

All these CJK characters are supported regardless of the encoding used so it applies to UTF-8, UTF-16 or UTF-32.

The CQL data types text and varchar are both UTF-8 strings so choosing between the two types is a matter of personal preference since they're the same.

For a complete list, see CQL data types. Cheers!

[1] "Unicode 13.0 UCD: PropList.txt", (2019-11-27)

Share
10 |1000

Up to 8 attachments (including images) can be used with a maximum of 1.0 MiB each and 10.0 MiB total.