Apache Spark
▻http://spark.apache.org/faq.html
#Spark is a fast and powerful engine for processing Hadoop data. It runs in #Hadoop clusters through Hadoop YARN or Spark’s standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both general data processing (similar to #MapReduce) and new workloads like streaming, interactive queries, and machine learning.