About 948,000 results
Open links in new tab
  1. frameworks - Simple explanation of MapReduce? - Stack Overflow

    May 23, 2017 · The reason MapReduce is split between Map and Reduce is because different parts can easily be done in parallel. (Especially if Reduce has certain mathematical …

  2. Newest 'mapreduce' Questions - Stack Overflow

    Im running a Hadoop Mapreduce program to calculate the average, maximum and minimum temperature. Temperature is stored in input1.csv file with three columns Date in YYYY-MM …

  3. mapreduce - Does Spark internally use Map-Reduce? - Stack …

    Feb 3, 2019 · Actually spark use DAG(Directed Acyclic Graph) not tradicational mapreduce. You can think of it as an alternative to Map Reduce. While MR has just two steps (map and …

  4. How does the MapReduce sort algorithm work? - Stack Overflow

    One of the main examples that is used in demonstrating the power of MapReduce is the Terasort benchmark. I'm having trouble understanding the basics of the sorting algorithm used in the …

  5. MapReduce - How sort reduce output by value - Stack Overflow

    Jul 23, 2013 · Best way to do it is to use the output of your first MapReduce job as the input of another job, which I call Sort.java. Since the Hadoop Map function has a sorting algorithm in …

  6. mapreduce - Hadoop WordCount example stuck at map 100

    Apr 15, 2012 · My MapReduce job become Fails. 0. map 100% reduce 0% in running hadoop word count. 15. Wordcount program ...

  7. mapreduce - Hadoop one Map and multiple Reduce - Stack …

    Feb 6, 2016 · Also your use of MapReduce paradigm for the given problem is incorrect, using a single map function and multiple "different" reduce function makes no sense, it shows that you …

  8. java - Mapreduce Combiner - Stack Overflow

    If a combine function is used, then it is the same form as the reduce function (and is an implementation of Reducer), except its output types are the intermediate key and value types …

  9. mapreduce - Why is Spark faster than Hadoop Map Reduce

    Sep 14, 2015 · MapReduce starts a new JVM for each task, which can take seconds with loading JARs, JITing, parsing configuration XML, etc. Spark keeps an executor JVM running on each …

  10. Good MapReduce examples - Stack Overflow

    Sep 12, 2012 · MapReduce is a framework originally developed at Google that allows for easy large scale distributed computing across a number of domains. Apache Hadoop is an open …