FAQ - What is the default character encoding for Aerospike

The Aerospike Knowledge Base has moved to https://support.aerospike.com. Content on https://discuss.aerospike.com is being migrated to either https://support.aerospike.com or https://docs.aerospike.com. Maintenance on articles stored in this repository ceased on December 31st 2022 and this article may be stale. If you have any questions, please do not hesitate to raise a case via https://support.aerospike.com.

FAQ - What is the default character encoding for Aerospike?

Detail

On a relational database, for string fields, there is the concept of character encoding which defines the manner in which characters are stored within the database. The encoding can refer to a specific code page or the encoding of Unicode used. What is the default encoding for an Aerospike database?

Answer

The Aerospike cluster stores strings, by default, as UTF-8 but the conversion is handled within the client library when data is converted into Aerospike internal datatypes. The key difference to a more traditional database system is that, whereas in a more traditional system the client and server encodings must match, within Aerospike the client library handles any necessary conversion implicitly. In effect the encoding of the database is simply an internal implementation detail. The key information is what the client does with that data and this is left to the user to define according to their needs.

Notes

  • A full reference of datatypes supported by the Aerospike server can be found here.
  • Data type handling within client libraries is described here.
  • UTF-8 encoding of Unicode described in detail here.

Keywords

AEROSPIKE CODE PAGE ENCODING DEFAULT UTF-8

Timestamp

April 2020