Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig, mail # user - adding an id to token sequence.


Copy link to this message
-
RE: adding an id to token sequence.
Santhosh Srinivasan 2010-03-27, 17:57
Someone posted the following post about adding unique row ids using
MapReduce:
http://www.data-miners.com/blog/2009/11/hadoop-and-mapreduce-parallel-pr
ogram.html

Hope that helps.

Santhosh

-----Original Message-----
From: Edward Middleton [mailto:[EMAIL PROTECTED]]
Sent: Saturday, March 27, 2010 10:36 AM
To: [EMAIL PROTECTED]
Subject: adding an id to token sequence.

I have a sequence of uniq tokens and I would like to add a sequential
unique integer id to each token.  I appreciate that this is going to be
difficult because mapping is likely to be performed in multiple tasks.
Is there a good way of doing this in pig?

Cheers,

Edward