Fil

Fil @fil 13/09/2014

Apache Spark
▻http://spark.apache.org/faq.html
#Spark is a fast and powerful engine for processing Hadoop data. It runs in #Hadoop clusters through Hadoop YARN or Spark’s standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both general data processing (similar to #MapReduce) and new workloads like streaming, interactive queries, and machine learning.
#big_data #parallel

Fil @fil

Écrire un commentaire