Data-Processing-on-Large-Clusters Abstract: Map-Re

Title: Data-Processing-on-Large-Clusters

Category:
Other windows programs
Tags:
File Size:
149kb
Update:
2015-07-10
Downloads:
0 Times
Uploaded by:
Maddy

Description: Abstract: Map-Reduce is a programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters of commodity machines. Through a simple interface with two functions, map and reduce, this model facilitates parallel implementation of many real-world tasks such as data processing for search engines and machine learning. However, this model does not directly support processing multiple related heterogeneous datasets. While processing relational data is a common need, this limitation causes dif- ficulties and/or inefficiency when Map-Reduce is applied on relational operations like joins. We improve Map-Reduce into a new model called MapReduce-Merge. It adds to Map-Reduce a Merge phase that can efficiently merge data already partitioned and sorted (or hashed) by map and reduce modules. We also demonstrate that this new model can express relational algebra operators as well as implement several join algorithms.

Downloaders recently: [More information of uploader Maddy]

To Search:

File list (Check if you may need any files):

 

Map-Reduce-Merge Simplified Relational Data Processing on Large Clusters.docx

Main Category

SourceCode

Web Code

Develop Tools

Document

Other resource

Category

GUI Develop

Windows Kernel

WinSock-NDIS

Driver Develop

ADO-ODBC

GDI-Bitmap

CSharp

.net

Multimedia Develop

Communication

Shell api

ActiveX/DCOM/ATL

IME Develop

ISAPI-IE

Hook api

Screen saver

DirextX

Process-Thread

Console

File Operate

Printing program

Multi Monitor

DNA

Other

About site

CodeBus www.codebus.net