Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> error in pig script


Copy link to this message
-
Re: error in pig script
This looks like a bug, can you please open a jira ?
If you are able to reproduce this issue using a smaller query, that
would be great !

Thanks,
THejas
On 10/10/11 12:44 PM, ipshita chatterji wrote:
> sorry there was a typo in the script.
> last line of the script is as follows:
>
> temp = filter temp by org.apache.pig.piggybank.evaluation.datetime.DiffDate(
> A.$0,elmp)/30<= 9;
> Thanks,
> Ips
>
> On Tue, Oct 11, 2011 at 1:05 AM, ipshita chatterji<[EMAIL PROTECTED]>  wrote:
>> Dear All,
>> I am consistently getting following error on execution of my pig script:
>>
>> " [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2997: Unable
>> to recreate exception from backed error: java.lang.ClassCastException:
>> org.apache.pig.data.DataByteArray cannot be cast to java.lang.String"
>>
>> I have checked my input files and they seem to have inputs in the
>> right format. My pig script is as follows:
>> At this point I am unable to figure out whats going wrong at my end.
>> Any helpful info on the problem will be appreciated.
>>
>> Thanks and regards,
>> Ips
>>
>> register piggybank.jar;
>> register joda-time-1.6.jar ;
>>
>> A= LOAD 'date.csv' USING PigStorage(',') AS (a1:chararray);
>>
>>
>>
>> o1 =  load 'obsv.csv'
>>         as (obs_id:chararray, en_id:chararray, sub_en_id:chararray,
>> o_by:chararray, v_by:chararray, rmk:chararray);
>>
>> oc1 = load 'obsv_concpt.csv' as (obs_id:chararray,
>> concept_id:chararray, concept_val:chararray);
>>
>> l1 = load 'locations.csv'
>>         as (lid:chararray, lng:chararray, lt:chararray, field1:chararray,
>> field2:chararray, field3:chararray, field4:chararray,
>>                 field5:chararray, field6:chararray, field7:chararray,
>> field8:chararray, field9:chararray);
>>
>> e1 = load 'encntr.csv'
>>         as (en_id:chararray, pid:chararray, lid:chararray, edttime:chararray);
>>
>> p1 = load 'persons.csv'
>>         as (pid:chararray, gen:chararray, giv_nm:chararray, middle:chararray,
>> fm_nm:chararray, bdt:chararray,
>>                 bdt_est:chararray, bt_pl:chararray, mnm:chararray, sps_nm:chararray,
>>                 dth_dt:chararray, dth_dt_estm:chararray, lid:chararray, mrr_dt:chararray,
>>                 mrr_dt_estm:chararray, entry_date:chararray, mrr_dt_sts:chararray,
>> contact_number:chararray,
>>                 fth_name:chararray);
>>
>> pid = load 'p_pat_id_type.csv'
>>         as (p_pat_id_type_id:chararray, pid:chararray, p_pat_id:chararray);
>>
>> f1 = load 'families.csv'
>>         as (fam_id:chararray, fam_hd:chararray, identifier:chararray);
>>
>> oc1 = filter oc1 by concept_id eq '55';
>>
>> temp = join  oc1 by obs_id, o1 by obs_id;
>>
>> temp = join temp by o1::en_id, e1 by en_id;
>>
>> temp = join p1 by pid, temp by e1::pid;
>>
>> temp = join l1 by lid, temp by p1::lid;
>>
>> temp = join f1 by fam_hd, temp by p1::pid;
>>
>> temp = join pid by pid, temp by p1::pid;
>>
>>
>> temp = group temp by p1::pid;
>>
>> temp = foreach temp generate
>> flatten(temp),org.apache.pig.builtin.MAX(temp.oc1::concept_val) as
>> elmp;
>>
>> temp  = foreach temp generate
>>             pid::p_pat_id_type_id as p_pat_id_type_id,
>>             p1::giv_nm as giv_nm,
>>                 p1::mnm as mnm,
>>                 pid::p_pat_id as p_pat_id,
>>                 l1::field3 as field3,
>>                         p1::bdt as bdt,
>>                         elmp as elmp;
>>
>> temp  = foreach temp generate
>>             p_pat_id_type_id as p_pat_id_type_id,
>>             giv_nm as giv_nm,
>>                 mnm as mnm,
>>                 p_pat_id as p_pat_id,
>>                 field3 as field3,
>>                         bdt as bdt,
>>                         elmp as elmp,
>>             org.apache.pig.piggybank.evaluation.string.SUBSTRING(bdt,0,10)
>> as bdt;
>>
>> temp  = foreach temp generate
>>             p_pat_id_type_id as p_pat_id_type_id,
>>             giv_nm as giv_nm,
>>                 mnm as mnm,
>>                 p_pat_id as p_pat_id,
>>                 field3 as field3,