ailinykh 2010-07-29, 15:10
Hadoop does not prevent you from writing key value pair multiple times in
the same map iteration if that is what is your roadblock.
You can call collector.collect() multiple times with same or distinct key
value pairs within a single map iteration.
On Thu, Jul 29, 2010 at 8:10 AM, ailinykh <[EMAIL PROTECTED]> wrote:
> Hello, everybody!
> I have a bunch of records. Each record has key, and two fields A,B - R(k,
> I want to build two inverted indexes, one per each field. As output I
> two files
> IndexA =(A1- [k1,k2,k3..]),(A2 -[k1,k2,k4...]) ...
> IndexB =(B1- [k1,k2,k3..]),(B2-[k1,k2,k4...]) ...
> Hadoop mapper allows to write out only one key value pair. Of cause, I can
> run it twice, but I don't want to read the same record twice. What is the
> best way to handle this situation?
> Thank you,
> View this message in context:
> Sent from the Hadoop core-user mailing list archive at Nabble.com.