Results 21 to 30 of 45.
Re: AvroStorage can't read multiple files? - Pig - [mail # user]
...Looks like you have corrupted Avro files or non-Avro files inside the directory, or your files have different schemas. Try reading about AvroStorage and running it with the debug option (see doc https:/...
   Author: Serega Sheypak, 2013-10-01, 15:05
unittest for Jython pig UDFs - Pig - [mail # user]
...Hi, I'm trying to integrate a Jython UDF into my Maven project. I have a problem running scripts where @outputSchema("blabla") is defined. Here is the error: File "/home/ssa/dev...
   Author: Serega Sheypak, 2013-09-16, 07:25
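One common reason a Jython UDF script fails outside Pig is that `outputSchema` is only available when Pig's script engine loads the file. Whether that is the error in this thread is an assumption, since the message above is truncated. A minimal sketch of a fallback decorator that lets a plain interpreter import the UDF module in unit tests (the `to_upper` UDF and its schema string are hypothetical):

```python
# Fallback for running UDF code outside Pig. Assumption: Pig's Jython
# script engine provides `outputSchema` itself, so the name is only
# missing when the module is loaded by a plain interpreter (e.g. in tests).
try:
    outputSchema
except NameError:
    def outputSchema(schema):
        """No-op stand-in: record the schema string, return the function unchanged."""
        def wrap(func):
            func.outputSchema = schema
            return func
        return wrap


@outputSchema("word:chararray")
def to_upper(word):
    # Hypothetical UDF used only to exercise the decorator.
    if word is None:
        return None
    return word.upper()
```

With the fallback in place, a test can call `to_upper` directly as an ordinary function, without starting Pig.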
Re: Python UDFs with Pig (support for Filter functions?) - Pig - [mail # user]
...It should work. filtered_result = FILTER dirty_data BY udf.my_python_filter_func(field1, field2);   2013/9/4 Max Von Tilden  ...
   Author: Serega Sheypak, 2013-09-04, 19:01
Re: COALESCE UDF? - Pig - [mail # user]
...
def coalesce(*arg):
    for el in arg:
        if el is not None:
            return el
    return None

2013/9/4 ...
   Author: Serega Sheypak, 2013-09-04, 19:00
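A short usage sketch of the function from this reply, run as plain Python rather than inside Pig. It shows that the `is not None` test skips only `None`, so falsy values such as `0` or an empty string are still returned. The sample arguments and variable names are illustrative:

```python
def coalesce(*args):
    """Return the first argument that is not None, or None if there is none."""
    for el in args:
        if el is not None:
            return el
    return None


# Falsy values such as 0 and "" are kept; only None is skipped.
first = coalesce(None, 0, 5)
empty = coalesce(None, "", "x")
nothing = coalesce(None, None)
```

Registering it as a Pig UDF would additionally need an output schema declaration, which is not shown in the reply.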
Re: Avro to Tuples during UnitTest - Pig - [mail # user]
...I understand your solution. I want to reuse existing code. A utility that converts JSON with a schema to Tuples would take too much time to implement.   2013/8/30 Ruslan Al-Fakikh  ...
   Author: Serega Sheypak, 2013-08-30, 08:18
Re: Date Function in Pig - Pig - [mail # user]
...You can use REGEX_EXTRACT with an appropriate pattern and a simple ternary operator, or write a simple Jython UDF using Java/Jython classes for datetime conversion, or you can convert it to unix second...
   Author: Serega Sheypak, 2013-08-26, 15:43
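The Jython UDF option mentioned in this reply could look roughly like the sketch below, which parses a timestamp string and returns seconds since the Unix epoch. The function name `to_unix_seconds`, the input format, and the choice to treat the input as UTC are all assumptions; the thread excerpt does not show the actual date format.

```python
import calendar
import time


def to_unix_seconds(text, fmt="%Y-%m-%d %H:%M:%S"):
    """Parse `text` with `fmt` and return epoch seconds, treating the input as UTC.

    The default format is an assumption; adjust it to match the real data.
    """
    if text is None:
        return None
    # time.strptime yields a struct_time; calendar.timegm interprets it as UTC.
    return calendar.timegm(time.strptime(text, fmt))
```

Using only the standard `time` and `calendar` modules keeps the function testable outside Pig; inside Pig it would still need an output schema declaration.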
Re: No matter what I do, pig is trying to run locally - Pig - [mail # user]
...Are you sure that your core-site, hdfs-site, and mapred-site are in Pig's classpath?   2013/8/23 Tim Chan  ...
   Author: Serega Sheypak, 2013-08-23, 06:35
Re: Distinct IDs from different time periods - Pig - [mail # user]
...Not enough information to help you. What is the nature of your data? Do you get it daily or monthly? Is 30 days a sliding window or a calendar month? IMHO the approach should be: when data arrives, find t...
   Author: Serega Sheypak, 2013-08-13, 23:44
Re: Adding dependent jars for UDF in the PIG - Pig - [mail # user]
...If you run your script using Oozie, you can control it using mapreduce.task.classpath.user.precedence ...
   Author: Serega Sheypak, 2013-08-13, 10:15
Re: size of words+counts of words getting failed. - Pig - [mail # user]
...f = join d by b.word, e by b.word; 1. d doesn't have b. 2. e doesn't have b. I don't understand how you are trying to reference them in the join. It looks like you have to join by 'group'. I suggest you...
   Author: Serega Sheypak, 2013-08-12, 12:55