Aerospike schema: which one is the better design?

Mangat_Modi · March 10, 2016, 5:11am

We have database requirements as follows:-

Huge number of records: 10-100 millions per set.
Huge number of bins: around 100 bins per set
Some point queries need to be run within milliseconds.

Now which one of the following two schema design philosophy will be better for Aerospike?

Have one set of each type with all possible bins (in hundreds). But would it degrade Aerospike performance?
Categorize bins and have multiple sets with each set have around 10 bins max. But this means redundant keys in each set. High space complexity and hard to combine data for same key from different bins.

TimF · March 17, 2016, 4:04pm

That really depends on a number of factors. How big are the records (combination of all the bins)? When you read a record in Aerospike it will read the whole record on the server and return just the selected bins to the client, but if it’s a very large record and you’re returning only a small set of the data, this may result in a performance penalty as you’re reading too much data from the SSDs. Obviously if your data is memory based this isn’t an issue.

If the bins are small (say integers or short strings) and the total record size small, or you need the whole record in one hit, you won’t get too much performance penalty with the first approach.

Personally, I would set up your cluster to use the first approach and use the Aerospike Benchmark Tool to performance test it and see if it meets your requirements.

Mangat_Modi · March 30, 2016, 6:13am

@TimF

Thanks you for the answer. It cleared some concepts for me. However we have decided to go with design-2 because of the limitation with key expiry. Aerospike doesn’t support bin level expiry instead whole record will be expired.

sumitsrv · May 25, 2017, 9:28am

Didn’t the multiple key things bring in any disadvantage to your system?

Topic		Replies	Views
I'm wondering if Aerospike can handle this database Use Cases	1	1910	September 11, 2015
How well suited is Aerospike for this niche use case Data Modeling	6	2426	June 30, 2016
Map in a single bin VS one bin per key of map Data Types	1	2610	April 24, 2015
Optimized way for querying using aerospike client for javay How Developers Are Using Aerospike query , primarykey , java , client	1	1469	September 25, 2020
Data modeling for expiration Data Modeling pk	1	2063	May 22, 2015

Aerospike schema: which one is the better design?

Related topics