Build a #mapreduce flow in #elixir
▻https://hackernoon.com/build-a-mapreduce-flow-in-elixir-f97c317e457e?source=rss----3a8144eabfe3
Giving the Elephant Some Elixir. MapReduce is a common Big Data pattern for analyzing a data set concurrently. This tutorial will introduce you to Elixir and the principles behind Hadoop. We will build the equivalent of Hello World in MapReduce, which is a word count program. Map and Reduce are also common higher-order functions in functional programming. Map takes a list and an anonymous function (lambda) as arguments, applies the function to each element of the list, and returns a new list containing the lambda's output for each element. Reduce takes the same arguments plus, in Elixir, an accumulator, and returns a single accumulated value instead of a list. Elixir is a great language to learn (...)
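To make the excerpt's description of map and reduce concrete, here is a minimal word-count sketch. It is written in Python rather than Elixir and is not taken from the linked tutorial; the names map_phase, reduce_phase, and the sample lines are hypothetical, chosen only for illustration.

```python
from collections import Counter
from functools import reduce

# map: apply a lambda to each element and get back a new list
squares = list(map(lambda x: x * x, [1, 2, 3]))

# reduce: same arguments plus an initial accumulator (0), returns one value
total = reduce(lambda acc, x: acc + x, [1, 2, 3], 0)


def map_phase(line):
    """Map step (hypothetical helper): emit a (word, 1) pair per word."""
    return [(word.lower(), 1) for word in line.split()]


def reduce_phase(counts, pair):
    """Reduce step (hypothetical helper): fold one pair into the accumulator."""
    word, n = pair
    counts[word] += n
    return counts


# Sample input, invented for this sketch
lines = ["the quick brown fox", "the lazy dog", "The end"]

# Each line is mapped independently, which is what makes this step easy
# to run concurrently in a real MapReduce system.
mapped = list(map(map_phase, lines))

# Flatten the per-line lists into a single list of (word, 1) pairs
pairs = [pair for chunk in mapped for pair in chunk]

# Fold all pairs into a single Counter, starting from an empty accumulator
word_counts = reduce(reduce_phase, pairs, Counter())
```

The sketch runs in a single process; frameworks such as Hadoop distribute the map step across machines and group pairs by key before reducing.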
Apache Spark
▻http://spark.apache.org/faq.html
#Spark is a fast and powerful engine for processing Hadoop data. It runs in #Hadoop clusters through Hadoop YARN or Spark’s standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both general data processing (similar to #MapReduce) and new workloads like streaming, interactive queries, and machine learning.
Map Reduce - A really simple introduction « Kaushik Sathupadi
▻http://ksat.me/map-reduce-a-really-simple-introduction-kloudo
Ever since Google published its research paper on MapReduce, you have been hearing about it here and there. If you have until now considered map-reduce a mysterious buzzword and ignored it, know that it's not. The basic concept is really very simple, and in this tutorial I try to explain it in the simplest way that I can. Note that I have intentionally left out some deeper details to make it really friendly to a beginner.
Are you more of a #SQL person or more of a #MapReduce person when it comes to analyzing your large volumes of data? Don't despair over the difficult choice: you can combine the two, says the #HadoopDB project:
►http://db.cs.yale.edu/hadoopdb/hadoopdb.html
The original article:
Data-Intensive Text Processing with MapReduce
►http://www.umiacs.umd.edu/%7Ejimmylin/MapReduce-book-final.pdf
MapReduce – The Fanfiction
►http://www.snailinaturtleneck.com/blog/2010/03/15/mapreduce-the-fanfiction
#MapReduce #nosql
Introduction_to_CouchDB_views - Couchdb Wiki
►http://wiki.apache.org/couchdb/Introduction_to_CouchDB_views
#MapReduce #couchdb
Running Hadoop MapReduce on Amazon EC2 and Amazon S3
http://developer.amazonwebservices.com/connect/entry.jspa?externalID=873&categoryID=55
#s3 #ec2 #hadoop #MapReduce