Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 12 (0.057s).
Loading phrases to help you
refine your search...
Re: Documentation bug in REGEX_EXTRACT - Pig - [mail # user]
...Well, usually for regexp, 0 match is the whole match and groups start from 1. Are you sure you are getting group (the thing in brackets) with 0? 8 жовт. 2013 13:04, користувач "Steve Bernste...
   Author: Vitalii Tymchyshyn, 2013-10-10, 01:32
Re: pig script - failed reading input from s3 - Pig - [mail # user]
...Have you tried it with native? AFAIR the limitation was raised to 5TB few years ago. 8 квіт. 2013 18:30, "Panshul Whisper"  напис.  ...
   Author: Vitalii Tymchyshyn, 2013-04-09, 07:09
Re: Aggregation for chronologically ordered dataset - Pig - [mail # user]
...I'd use rank function to join previous and next row, then filter out middle rows, then join first to last and calculate time. 15 бер. 2013 19:04, "pranjal rajput"  напис.  ...
   Author: Vitalii Tymchyshyn, 2013-03-17, 19:33
Re: S3 store and load - Pig - [mail # user]
...Well, simply use s3:// uris. It works. 2 бер. 2013 21:51, "Mohit Anchlia"  напис.  ...
   Author: Vitalii Tymchyshyn, 2013-03-02, 21:56
Re: Parse data with multiple delimiters and transpose rows and columns - Pig - [mail # user]
...You can try reading as single column, then splitting with reg exp and then flattening. 2 бер. 2013 00:29, "Mix Nin"  напис.  ...
   Author: Vitalii Tymchyshyn, 2013-03-02, 08:38
Re: COUNT() thinks non-null tuples are null if the first field is null? - Pig - [mail # user]
...Old db trick. If you want complex count, use sum( complex_boolean_expression ? 1 : 0 ) I think, it should work for pig   2013/2/6 Adair Kovac      Best regards,  Vit...
   Author: Vitalii Tymchyshyn, 2013-02-07, 21:12
[expand - 1 more] - Re: Using UDF to process whole record - Pig - [mail # user]
...BTW: http://pig.apache.org/docs/r0.10.0/basic.html has next example: C = FOREACH A GENERATE name, age, MyUDF(*); Looks like right what you need. 22 січ. 2013 09:35, "Young Ng"  напис. &...
   Author: Vitalii Tymchyshyn, 2013-01-22, 11:34
Re: Parallelism for small input data - Pig - [mail # user]
...Well, if you will set split size to 1, you should get per-line split.   2013/1/13 Dipesh Kumar Singh      Best regards,  Vitalii Tymchyshyn...
   Author: Vitalii Tymchyshyn, 2013-01-14, 10:22
[expand - 1 more] - Re: XML -> Pig UDF - Pig - [mail # user]
...Nope, sorry, I wish I could open source this. I did some patches to the loader (e.g. it did not like empty tags) - those are submitted as pull requests.  Some more hints: 1) I've found ...
   Author: Vitalii Tymchyshyn, 2012-12-29, 23:00
Re: Re: Re: Pig UT last nearly 8 hours and TestEvalPipeline2 lasts for 37 minutes - Pig - [mail # user]
...Well, you could try multi JVM parallelization. AFAIR maven has the option....
   Author: Vitalii Tymchyshyn, 2012-11-21, 13:12
Sort:
project
Zookeeper (13)
Pig (12)
Hadoop (5)
HDFS (3)
Cassandra (1)
type
mail # user (11)
issue (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (12)
author
Daniel Dai (404)
Dmitriy Ryaboy (346)
Alan Gates (334)
Cheolsoo Park (312)
Jonathan Coveney (237)
Rohini Palaniswamy (188)
Russell Jurney (177)
Bill Graham (131)
Olga Natkovich (131)
Prashant Kommireddi (108)
Aniket Mokashi (87)
Julien Le Dem (84)
Thejas Nair (71)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
Pradeep Gollakota (36)
"Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
Santhosh Srinivasan (29)
Vitalii Tymchyshyn