Using Spark connector for pyspark inside Apache Zeppelin

spark

#1

Hello everyone!

I want to use the aerospark connector for some advanced data science tasks in Apache Zeppelin notebooks with the pyspark interpreter. Everything finally seems to be working; at least I can import the Aerospike client in the Scala environment of Spark without any exceptions. But I cannot find any examples of querying Aerospike from pyspark; do you have any? How can I create a proper SQLContext with pyspark using the driver?

Thanks


#2

I have found out how to connect to AS:

%spark.pyspark
conn = sqlc.read.format("com.aerospike.spark.sql") \
    .option("aerospike.seedhost", AS_HOST) \
    .option("aerospike.port", AS_PORT) \
    .option("aerospike.namespace", NAMESPACE) \
    .option("aerospike.set", SET_NAME) \
    .load()

How do I query AS now? :slight_smile:


#3

Yes, sorry for the silly question :slight_smile: