About the Spark category

Mnemaudsyne · March 6, 2015, 2:53am

Apache Spark is a top-level project of the Apache Foundation. In one deployment configuration, Spark can run in tandem with or instead of Hadoop databases, since it can read from Hadoop Distributed File Systems (HDFS), and can work with YARN (Yet Another Resource Negotiator) or Apache Mesos. In a second configuration, Spark can run on a standalone mode, either SQL or NoSQL, and integrate with, for instance, Aerospike.

Apache Spark has four main modules:

Spark Streaming
Machine Learning (MLlib)
Spark SQL
GraphX

Please use this forum to discuss aspects of working with Apache Spark in your architecture, or topics of interest to the Apache Spark community.

Topic		Replies	Views
Aerospark: an open-source Spark connector for Aerospike’s NoSQL database Spark	2	3110	October 21, 2016
Apache Spark — How are you using it? Spark	0	3298	January 9, 2015
Aerospike, Spark and Java Spark	6	3015	July 10, 2019
About the Hadoop category Hadoop	0	1617	December 9, 2014
Recommendations for integrating AS 3.x server with Spark 2.x Tools spark	4	788	August 24, 2022

About the Spark category

Related topics