MapReduce-Reference-Cited by-同舟云学术

MapReduce

Published:2008-01 Issue:1 Volume:51 Page:107-113
ISSN:0001-0782
Container-title:Communications of the ACM
language:en
Short-container-title:Commun. ACM

Author:

Dean Jeffrey¹,Ghemawat Sanjay¹

Affiliation:

1. Google, Mountain View, CA

Abstract

MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the system easy to use: more than ten thousand distinct MapReduce programs have been implemented internally at Google over the past four years, and an average of one hundred thousand MapReduce jobs are executed on Google's clusters every day, processing a total of more than twenty petabytes of data per day.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1327452.1327492

Reference17 articles.

1. Hadoop: Open source implementation of MapReduce. http://lucene. apache.org/hadoop/. Hadoop: Open source implementation of MapReduce. http://lucene. apache.org/hadoop/.

2. The Phoenix system for MapReduce programming. http://csl.stanford. edu/~christos/sw/phoenix/. The Phoenix system for MapReduce programming. http://csl.stanford. edu/~christos/sw/phoenix/.

3. High-performance sorting on networks of workstations

4. Web search for a planet: the google cluster architecture

Cited by 6572 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A temporal ontology guided clustering methodology with a case study on detection and tracking of artificial intelligence topics;Expert Systems with Applications;2024-08

2. Efficient hybrid oversampling and intelligent undersampling for imbalanced big data classification;Expert Systems with Applications;2024-07

3. Population based metaheuristics in Spark: Towards a general framework using PSO as a case study;Swarm and Evolutionary Computation;2024-03

4. Integrating systems of systems with a federation of rule engines;Journal of Industrial Information Integration;2024-03

5. Non-MapReduce computing for intelligent big data analysis;Engineering Applications of Artificial Intelligence;2024-03