Skip to content

Rewrite of Hadoop DistCp, for Hadoop 0.20.203+. Support for multiple copy-strategies. Code also reads better.

Notifications You must be signed in to change notification settings

shwethags/DistCpV2-0.20.203

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DistCp (distributed copy) is a tool used for large inter/intra-cluster copying. 
It uses Map/Reduce to effect its distribution, error handling and recovery, 
and reporting. It expands a list of files and directories into input to map tasks, 
each of which will copy a partition of the files specified in the source list.

Version 0.1 (2010/08/02 sriksun)
 - Initial Version

About

Rewrite of Hadoop DistCp, for Hadoop 0.20.203+. Support for multiple copy-strategies. Code also reads better.

Resources

Stars

Watchers

Forks

Packages

No packages published