Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 33 (0.115s).
Loading phrases to help you
refine your search...
Re: problems with .gz - Pig - [mail # user]
...I know what's going on, and it's kind of dumb on my part, but I'll post anyways to help someone else who might be puzzled.  To review, I had data that looked like this (and yes, it's co...
   Author: William Oberman, 2013-06-12, 20:16
Re: problems with .gz - Pig - [mail # user]
...I'm using gzip as I had a huge S3 bucket of uncompressed files, and s3distcp only supported {gz, lzo, snappy}.  I haven't ever done this, but can I mix/match files?  My backup proc...
   Author: William Oberman, 2013-06-10, 17:12
Re: problems with .gz - Pig - [mail # user]
...I still don't fully understand (and am still debugging), but I have a "problem file" and a theory.  The file has a "corrupt line" that is a huge block of null characters followed by a "...
   Author: William Oberman, 2013-06-10, 16:06
Re: problems with .gz - Pig - [mail # user]
...They are all *.gz, I confirmed that first :-)  On Saturday, June 8, 2013, Niels Basjes wrote:    Will Oberman Civic Science, Inc. 6101 Penn Avenue, Fifth Floor Pittsburgh, PA ...
   Author: William Oberman, 2013-06-08, 12:00
problems with .gz - Pig - [mail # user]
...I'm using pig 0.11.2.  I had been processing ASCII files of json with schema: (key:chararray, columns:bag {column:tuple (timeUUID:chararray, value:chararray, timestamp:long)}) For what ...
   Author: William Oberman, 2013-06-07, 21:10
Re: udf + boolean constructor - Pig - [mail # user]
...I hacked the UDF, but it looks like I'll have to rewrite REGEX_EXTRACT_ALL.  If I get it working I'll see about submitting a patch.  I'm tying to match all of my URI query params, ...
   Author: William Oberman, 2013-05-08, 17:30
udf + boolean constructor - Pig - [mail # user]
...I'm trying to set useMatches=false in REGEX_EXTRACT_ALL as per the javadoc: http://pig.apache.org/docs/r0.11.0/api/org/apache/pig/builtin/REGEX_EXTRACT_ALL.html (and yes, I'm using pig 0.11)...
   Author: William Oberman, 2013-05-08, 17:20
Re: String Representation of DataBag and its Schema - Pig - [mail # user]
...We managed to piece this together.  It's not fully generic (we assume a single field).  But, it gets the job done for unit testing. package com.civicscience.util;  import org....
   Author: William Oberman, 2013-03-21, 15:51
Re: ERROR 2999: Unexpected internal error. null - Pig - [mail # user]
...For what it's worth, the error is on the cassandra side, so I'd post to that mailing list.   On Tue, Dec 11, 2012 at 2:13 PM, James Schappet  wrote:  ...
   Author: William Oberman, 2012-12-11, 19:20
Re: ERROR 2999: Unexpected internal error. null - Pig - [mail # user]
...Your line numbers aren't matching up to the 1.1.7 release, which is weird.  Based on the "stock" 1.1.7 source, there was a null check on str before predicateFromString(str), making your...
   Author: William Oberman, 2012-12-11, 18:50
Sort:
project
Pig (33)
MapReduce (1)
type
mail # user (33)
date
last 7 days (1)
last 30 days (5)
last 90 days (8)
last 6 months (8)
last 9 months (33)
author
Dmitriy Ryaboy (1351)
Alan Gates (959)
Jonathan Coveney (736)
Daniel Dai (545)
Russell Jurney (493)
Olga Natkovich (453)
Prashant Kommireddi (370)
Bill Graham (333)
Cheolsoo Park (263)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (156)
Jeff Zhang (146)
Santhosh Srinivasan (142)
William Oberman