Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> adding an id to token sequence.


Copy link to this message
-
RE: adding an id to token sequence.
Someone posted the following post about adding unique row ids using
MapReduce:
http://www.data-miners.com/blog/2009/11/hadoop-and-mapreduce-parallel-pr
ogram.html

Hope that helps.

Santhosh

-----Original Message-----
From: Edward Middleton [mailto:[EMAIL PROTECTED]]
Sent: Saturday, March 27, 2010 10:36 AM
To: [EMAIL PROTECTED]
Subject: adding an id to token sequence.

I have a sequence of uniq tokens and I would like to add a sequential
unique integer id to each token.  I appreciate that this is going to be
difficult because mapping is likely to be performed in multiple tasks.
Is there a good way of doing this in pig?

Cheers,

Edward
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB