UDF and secondary indexes

HimanshuKumarSingh · December 9, 2015, 4:45am

Hi,

I have implemented UDFs to implement queries that require multiple predicate filters.

My question is - do I need to create secondary indexes on the bins that I want to filter upon? I know that if I don’t create a secondary index on these bins, the UDF search still succeeds. But will the fetch be faster if I create secondary indexes on these bins?

With Best Regards, Himanshu

rbotzer · December 9, 2015, 5:54am

Hi. A query with no predicate turns into a scan. For a stream UDF to work fastest you want to shed as much unneeded data as possible up front, meaning that you want to first apply the predicate that returns the smallest subset of records in the set. Those records are then passed to the UDF and can be filtered further.

In order for such a query predicate to work you’ll need a secondary index built on the bin which it operates over.
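To make the two stages concrete, here is a rough sketch in plain Python. It is a toy model only, not Aerospike client or UDF code: the records, the bin names (`city`, `age`, `status`), and the helpers `build_index`, `query`, and `stream_udf` are all hypothetical. It assumes a single equality predicate on one indexed bin, with the remaining conditions checked inside the stream function on bins that have no index.

```python
from collections import defaultdict

# Toy records: each dict stands in for one record's bins (hypothetical data).
RECORDS = [
    {"id": 1, "city": "Pune", "age": 31, "status": "active"},
    {"id": 2, "city": "Pune", "age": 17, "status": "active"},
    {"id": 3, "city": "Delhi", "age": 45, "status": "inactive"},
    {"id": 4, "city": "Pune", "age": 52, "status": "inactive"},
    {"id": 5, "city": "Delhi", "age": 29, "status": "active"},
]


def build_index(records, bin_name):
    """Map each value of one bin to the records holding it (stand-in for a secondary index)."""
    index = defaultdict(list)
    for rec in records:
        index[rec[bin_name]].append(rec)
    return index


def query(records, index=None, equals=None):
    """Return the records that feed the stream, plus how many records were read."""
    if index is None or equals is None:
        # No predicate: every record is read, i.e. the query becomes a scan.
        return list(records), len(records)
    matched = index.get(equals, [])
    return matched, len(matched)


def stream_udf(stream, city=None):
    """filter() then map(): reads 'age' and 'status', neither of which is indexed."""
    filtered = (
        r for r in stream
        if (city is None or r["city"] == city)
        and r["age"] >= 18
        and r["status"] == "active"
    )
    return [r["id"] for r in filtered]


city_index = build_index(RECORDS, "city")

# Path 1: no predicate, so the stream function has to do all the filtering.
scan_stream, scan_touched = query(RECORDS)
scan_result = stream_udf(scan_stream, city="Pune")

# Path 2: the indexed predicate narrows the stream before the function runs.
idx_stream, idx_touched = query(RECORDS, city_index, "Pune")
idx_result = stream_udf(idx_stream)

print(scan_result, scan_touched)
print(idx_result, idx_touched)
```

Both paths return the same ids, but the scan path reads all 5 records while the indexed path hands only 3 to the stream function. In this model only `city` carries an index; `age` and `status` are read directly from each record that reaches the function.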

HimanshuKumarSingh · December 9, 2015, 6:08am

Thanks for the quick reply.

Do you mean that if my UDF reads the values of 5 bins, I will have to create secondary indexes on all 5 of those bins?

Let me add specifics:

I find this text in the Aerospike docs: "Indexed MapReduce: One of the main differences from other systems is that the aggregation is done against an index - essentially a WHERE clause. By filtering against an index, performance can be very high."

My questions are -

do I have to create secondary indexes on the bins used in the filter() function?
do I have to create secondary indexes on the bins used in the map() function?

-himanshu
