Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - TableMapReduceUtil, multiple scan objects, how to identify originating scan object in mapper


Copy link to this message
-
Re: TableMapReduceUtil, multiple scan objects, how to identify originating scan object in mapper
Nick Dimiduk 2013-10-21, 19:38
Hi Jim,

I don't see an obvious way to gain access to this information. If you don't
find a clever way to get at this, would you mind opening a ticket for this
feature request?

Thanks,
Nick
On Mon, Oct 21, 2013 at 9:44 AM, Jim Holloway
<[EMAIL PROTECTED]>wrote:

> Hello,
>
> I’m using the new 94.6 multi scan feature to pull rows from different
> tables
> into a single mapper.
>
>                         ArrayList<Scan> scans = new ArrayList<Scan>();
>                         scans.add(scanMain);
>                         scans.add(scanJunction);
>
>                         TableMapReduceUtil.initTableMapperJob(scans,
>                                         EI_ExtractType3Mapper1.class,
>                                         Text.class,
>                                         Text.class,
>                                         job1);
>
> My problem is I don’t know how to identify the source (table) of a
> particular row coming in to my mapper job.  My first couple of map/reduce
> jobs I had the luxury of being able to identify the table source based on a
> specific characteristic of the row key (one of the tables having an
> underscore(_) embedded).  Knew that was sloppy but haven’t been able to
> find
> a better way.
>
> Is there a definitive way?
>
> Jim
>
>
>
>
> --
> View this message in context:
> http://apache-hbase.679495.n3.nabble.com/TableMapReduceUtil-multiple-scan-objects-how-to-identify-originating-scan-object-in-mapper-tp4051954.html
> Sent from the HBase User mailing list archive at Nabble.com.
>