The ingest will depend on which toolchain you’re using.
We have a repo with some hadoop and thus spark connectors, and some basic operations on it. This includes some spark analytics examples on aerospike.
This tooling will also allow you to easily get data from Aerospike to HDFS / Hbase / Hadoop, as well as to run MapReduce jobs on aerospike data without “ingest”.
A guy named Sasha published a nice Spark RDD example for Aerospike.
We have published a Storm client integration. It creates both spouts and bolts that read and write from Aerospike.
We have published an example real-time recommendation engine as a stand alone example.
A gentleman was doing some predictive Caltrans / traffic analytics with a great tool called Dato (was Graphlab) and Aerospike but I can’t find his DevWeek talk online
We have not done integration with R clients. I’m fond of the R language for similar small-data processing (limited to in-memory quick jobs), but we haven’t done an integration. As we have a C client and a Java client, both open source, I would expect that anyone who wanted to port/publish the connector would have a reasonable time.
Let me know what tooling you’re using, and perhaps I can be more specific.