Attach bag for each tuple and pass to UDF - Pig - [mail # dev]
...Hi, I have two relations: relation *rows* (>10GB) and relation *tinyDictionary* (<1MB). I want to take each tuple from *rows* and attach *tinyDictionary* to it. And then pass it to py...
   Author: Serega Sheypak, 2013-10-21, 21:21
Re: HTTP Post - Pig - [mail # user]
...It is better to implement it using a Java action in Oozie. An Oozie Java action is implemented as a single map task. http://oozie.apache.org/docs/3.2.0-incubating/WorkflowFunctionalSpec.html#a3.2.7_Java_...
   Author: Serega Sheypak, 2013-10-08, 10:01
Re: AvroStorage can't read multiple files? - Pig - [mail # user]
...It looks like you have corrupted Avro files or non-Avro files inside the directory, or your files have different schemas. Try reading about AvroStorage and running it with the debug option (see doc https:/...
   Author: Serega Sheypak, 2013-10-01, 15:05
unittest for Jython pig UDFs - Pig - [mail # user]
...Hi, I'm trying to integrate a Jython UDF into my Maven project. I have a problem running scripts where @outputSchema("blabla") is defined. Here is the error:  File "/home/ssa/dev...
   Author: Serega Sheypak, 2013-09-16, 07:25
Re: Python UDFs with Pig (support for Filter functions?) - Pig - [mail # user]
...It should work. filtered_result = FILTER dirty_data BY udf.my_python_filter_func(field1, field2);   2013/9/4 Max Von Tilden  ...
   Author: Serega Sheypak, 2013-09-04, 19:01
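
The reply names `udf.my_python_filter_func` but does not show its body. A minimal sketch of what such a Jython filter function might look like follows. The filtering rule, the schema string `keep:boolean`, and the file name `udf.py` are illustrative assumptions, not taken from the thread. Pig's Jython engine normally supplies the `outputSchema` decorator itself; the fallback definition here is also an assumption, added only so the file can be imported outside Pig.

```python
# Hypothetical UDF module (assumed name: udf.py), registered in Pig with:
#   REGISTER 'udf.py' USING jython AS udf;

# Under Pig's Jython engine `outputSchema` is already defined.
# This fallback is an assumption so the module also loads under plain Python.
try:
    outputSchema
except NameError:
    def outputSchema(schema):
        def decorator(func):
            # Record the schema string on the function and return it unchanged.
            func.outputSchema = schema
            return func
        return decorator


@outputSchema("keep:boolean")
def my_python_filter_func(field1, field2):
    """Illustrative rule: keep a row only if both fields are set and field1 is non-empty."""
    if field1 is None or field2 is None:
        return False
    return len(field1) > 0
```

Returning an explicit `True` or `False` on every path, including for null inputs, keeps the function usable as a FILTER condition.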
Re: COALESCE UDF? - Pig - [mail # user]
...def coalesce(*arg):     for el in arg:         if el is not None:             return el     return None   2013/9/4 ...
   Author: Serega Sheypak, 2013-09-04, 19:00
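
The snippet above flattens the function onto one line. Laid out with its indentation restored, it reads roughly as below; the parameter names are changed slightly for readability, and registering it as a Pig UDF with an output schema is not shown in the snippet and is left out here.

```python
def coalesce(*args):
    """Return the first argument that is not None, or None if there is none."""
    for value in args:
        if value is not None:
            return value
    return None
```

Because the test is `is not None` rather than truthiness, values such as `0` or an empty string count as present: `coalesce(None, 0, 5)` returns `0`, not `5`.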
Re: Avro to Tuples during UnitTest - Pig - [mail # user]
...I understand your solution. I want to reuse existing code. This JSON-with-schema to Tuple conversion utility would take too much time to implement.   2013/8/30 Ruslan Al-Fakikh  ...
   Author: Serega Sheypak, 2013-08-30, 08:18
Re: Date Function in Pig - Pig - [mail # user]
...You can use REGEX_EXTRACT with an appropriate pattern and a simple ternary operator, or write a simple Jython UDF using Java/Jython classes for datetime conversion, or you can convert it to unix second...
   Author: Serega Sheypak, 2013-08-26, 15:43
Re: No matter what I do, pig is trying to run locally - Pig - [mail # user]
...Are you sure that your core-site, hdfs-site, and mapred-site are in Pig's classpath?   2013/8/23 Tim Chan  ...
   Author: Serega Sheypak, 2013-08-23, 06:35
Re: Distinct IDs from different time periods - Pig - [mail # user]
...I don't know enough to help you. What is the nature of your data? Do you get it daily or monthly? Is 30 days a sliding window or a calendar month?  IMHO the approach should be: when data arrives, find t...
   Author: Serega Sheypak, 2013-08-13, 23:44