Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> Re: What is the best way to load data from one cluster to another cluster (Urgent requirement)


Copy link to this message
-
Re: What is the best way to load data from one cluster to another cluster (Urgent requirement)
I might be wrong but have you considered distcp?
On Jan 31, 2013 11:15 AM, "samir das mohapatra" <[EMAIL PROTECTED]>
wrote:

> Hi All,
>
>    Any one knows,  how to load data from one hadoop cluster(CDH4) to
> another Cluster (CDH4) . They way our project needs are
>    1) It should  be delta load or incremental load.
>    2) It should be based on the timestamp
>    3) Data volume are 5PB
>
> Any Help ????????????
>
> Regards,
> samir.
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB