MapReduce: simplified data processing on large clusters
Citations
20,557 citations
17,433 citations
6,590 citations
Cites methods from "MapReduce: simplified data processi..."
...Programming abstractions such as Google’s MapReduce [16] and its open-source counterpart Hadoop [11] allow programmers to express such tasks while hiding the operational complexity of choreographing parallel execution across hundreds of Cloud Computing servers....
[...]
...Equally important, these companies also had to develop scalable software infrastructure (such as MapReduce, the Google File System, BigTable, and Dynamo [16, 20, 14, 17]) and the operational expertise to armor their datacenters against potential physical and electronic attacks....
[...]
5,542 citations
5,198 citations
References
20,309 citations
5,429 citations
3,885 citations
"MapReduce: simplified data processi..." refers background in this paper
...Bulk Synchronous Programming [17] and some MPI primitives [11] provide higher-level abstractions that make it easier for programmers to write parallel programs....
[...]
2,666 citations
2,479 citations
"MapReduce: simplified data processi..." refers background in this paper
...Network bandwidth requirements for writing data would be reduced if the underlying file system used erasure coding [14] rather than replication....
[...]