Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> PIG Regex Problems


Copy link to this message
-
Re: PIG Regex Problems
Cool, actually, I just got what I wanted to work like this:

filtered = FILTER orig_set BY orig MATCHES  '^[\\#\\<].*';

I didn't know MATCHES worked for regex before. Sweet!

Eli

On 2/16/12 12:54 PM, Grig Gheorghiu wrote:
> Can you try with RegexMatch? I am doing something similar in one of my
> scripts and it works fine.
>
> Grig
>
> On Thu, Feb 16, 2012 at 9:50 AM, Eli Finkelshteyn<[EMAIL PROTECTED]>  wrote:
>> Hi,
>> I'm trying to do a pretty simple regex test in PIG right now and getting a
>> weird error. All I'm doing is:
>>
>> orig_set = load '/data/dictionaries/Eng-Spa.dic' USING PigStorage('\t') AS
>> (orig: CHARARRAY, trans: CHARARRAY);
>> filtered = FILTER orig_set BY REGEX_EXTRACT(orig, '^[\\#\\<]') == 1;
>>
>> The error I get is:
>> 2012-02-16 12:45:24,000 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> ERROR 1045: Could not infer the matching function for
>> org.apache.pig.builtin.REGEX_EXTRACT as multiple or none of them fit. Please
>> use an explicit cast
>>
>> Ideas?
>>
>> Cheers,
>> Eli
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB