Re: Do we have any plan for "Cost based optimizer"? - Pig - [mail # user]
|
|
...Not sure for Hadoop, but the Spark/Shark guys have one that works against Hive stats. Don't have any idea how portable it is. Guy On Thursday, December 6, 2012, Bill Graham wrote: ...
|
|
|
Author: Guy Bayes,
2012-12-06, 17:14
|
|
|
Re: Getting Data into Data Warehouse from Pig - Pig - [mail # user]
|
|
...you might want to take a look at ZooKeeper as a coordination mechanism for when to process what file. On Tue, Jan 10, 2012 at 12:42 PM, rakesh sharma <[EMAIL PROTECTED] ...
|
|
|
Author: Guy Bayes,
2012-01-10, 21:27
|
|
|
Re: creating a graph over time - Pig - [mail # user]
|
|
...ahh TV, that explains it. A 12G data file is a bit too big for R unless you sample; not sure if the use case is conducive to sampling? If it is, could sample it down and structure in...
|
|
|
Author: Guy Bayes,
2011-10-31, 16:58
|
|
|
Re: creating a graph over time - Pig - [mail # user]
|
|
...if it fits in R, it's trivial: draw a density plot or a histogram, about three lines of R code, which is why I was wondering about the data volume. His example is students attending classe...
|
|
|
Author: Guy Bayes,
2011-10-28, 15:02
|
|
|
Re: creating a graph over time - Pig - [mail # user]
|
|
...how big is your dataset? On Thu, Oct 27, 2011 at 9:23 AM, Marco Cadetg wrote: ...
|
|
|
Author: Guy Bayes,
2011-10-27, 20:05
|
|
|
Re: Benchmark Haddop and Pig UDFs - Pig - [mail # user]
|
|
...One thing I would say is don't benchmark on EC2, do it on physical hardware... There is a test harness infrastructure for generic benchmarking at http://bbltest.sourceforge.net/ ...
|
|
|
Author: Guy Bayes,
2011-04-20, 22:04
|
|
|
Re: Any reason a bunch of nearly-identical jobs would suddenly stop working? - Pig - [mail # user]
|
|
...Question, do normal map-reduce jobs run on this cluster? Like the example jar jobs? Guy On Mar 9, 2011, at 2:29 PM, Kris Coward wrote: n e s h 439200,1...
|
|
|
Author: Guy Bayes,
2011-03-10, 00:48
|
|
|
Re: Displaying source log file names in pig logs - Pig - [mail # user]
|
|
...I'm pretty sure they are supposed to be on the Input split of the tasktracker logs, aren't they? For some reason all the Input-splits are null Input-split file: null Input-split sta...
|
|
|
Author: Guy Bayes,
2010-10-25, 16:09
|
|
|
Displaying source log file names in pig logs - Pig - [mail # user]
|
|
...We have a job that processes several hundred files in a directory. We generally glob the directory in a single load statement. Sometimes the job chokes on a bad row in a single fi...
|
|
|
Author: Guy Bayes,
2010-10-21, 16:57
|
|
|
map.input.file from pig? - Pig - [mail # user]
|
|
...Is there any way to retrieve job parameters and task execution / environment variables from inside a pig script? I'm trying to grab the name of the file I am processing using map.input...
|
|
|
Author: Guy Bayes,
2010-01-17, 03:49
|
|
|
|