Re: MapReduce outputs to a different cluster?
Shahab Yunus 2013-10-24, 22:42
As far as I know, you can use distcp to transfer the results of the job
from one cluster to another once the job is done. You can write a simple
script to do that. It is simple and well tested. Some pointers below:
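For illustration only (this is not from the original message), such a
script could look roughly like the sketch below. It assumes the `hadoop`
command is on the PATH of the machine running it and that both clusters
can reach each other; the NameNode hosts, port, and paths are made-up
placeholders, and `build_distcp_command` and `copy_output` are
hypothetical helper names.

```python
import subprocess
import sys

# Hypothetical NameNode addresses and paths; replace with your own.
SRC = "hdfs://cluster-a-nn:8020/user/me/job-output"
DST = "hdfs://cluster-b-nn:8020/user/me/job-output"


def build_distcp_command(src, dst):
    """Return the argument list for `hadoop distcp <src> <dst>`."""
    return ["hadoop", "distcp", src, dst]


def copy_output(src=SRC, dst=DST):
    """Run distcp and return its exit code (0 means success)."""
    return subprocess.call(build_distcp_command(src, dst))


if __name__ == "__main__":
    # Run this only after the MapReduce job on cluster A has finished.
    sys.exit(copy_output())
```

A wrapper script would typically run the job first and call this only if
the job exited successfully.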
You might be able to do this from within the job as well, by changing the
output paths of the generated files to point at the other cluster, but I
wouldn't suggest that, since there can be latency and performance issues.
Maybe others have a better idea.
On Thu, Oct 24, 2013 at 6:28 PM, S. Zhou <[EMAIL PROTECTED]> wrote:
> The scenario is: I run a MapReduce job on cluster A (all source data is
> in cluster A), but I want the output of the job to be written to
> cluster B. Is it possible? If yes, please let me know how to do it.
> Here are some notes about my MapReduce job:
> 1. The data source is an HBase table.
> 2. It has only a mapper, no reducer.