Pig >> mail # user >> Duplicate rows when using regular expression


Re: Duplicate rows when using regular expression
Can you provide the script you're running? That will help people better
understand what you're doing.

On Saturday, March 24, 2012, Mohit Anchlia <[EMAIL PROTECTED]> wrote:
> Could someone please help me understand this, or give me some pointers?
>
> On Fri, Mar 23, 2012 at 4:57 PM, Mohit Anchlia <[EMAIL PROTECTED]> wrote:
>
>> I am running a script to load data into the database. When I use [0-4] I see
>> 2 rows being created for every record that I process. But when I run the
>> files individually, it works. Could someone please help me understand or
>> troubleshoot this behaviour?
>>
>>
>> pig -f script6.pig -p in="/examples/2/part-m-0000[0-4]" --creates 2 rows
>>
>> pig -f script6.pig -p in="/examples/2/part-m-00000" --works
>>
>> pig -f script6.pig -p in="/examples/2/part-m-00001" --works
>>
>> pig -f script6.pig -p in="/examples/2/part-m-00002" --works
>>
>> pig -f script6.pig -p in="/examples/2/part-m-00003" --works
>>
>> pig -f script6.pig -p in="/examples/2/part-m-00004" --works
>>
>
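
For anyone following along, here is a minimal sketch of which file names a
bracket pattern like the one in the quoted commands would select. It uses
Python's fnmatch only as a stand-in for glob matching; it is not Pig or Hadoop
code, and the directory listing is hypothetical since the real contents of
/examples/2 are not shown in this thread.

```python
from fnmatch import fnmatchcase

# Hypothetical listing: six part files, 00000 through 00005 (an assumption).
files = [f"/examples/2/part-m-0000{i}" for i in range(6)]

# The pattern passed as the `in` parameter in the quoted command.
pattern = "/examples/2/part-m-0000[0-4]"

# [0-4] is a character class: it matches exactly one character in the range 0..4.
matched = [path for path in files if fnmatchcase(path, pattern)]

for path in matched:
    print(path)
```

Under these assumptions each of the five files is selected exactly once, so
the pattern alone would not obviously explain doubled rows, which is why seeing
the script itself would help.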

--
*Note that I'm no longer using my Yahoo! email address. Please email me at
[EMAIL PROTECTED] going forward.*