Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 21 (0.33s).
Loading phrases to help you
refine your search...
RE: How to change the separator of INSERT OVERWRITE LOCAL DIRECTORY - Hive - [mail # user]
...Works for me like this, using the most recent Hive on AWS, using a ~ delimiter:  row format delimited fields terminated by '~' stored as textfile location 's3://mybucket/path/to/data'; ...
   Author: Tony Burton, 2013-06-19, 15:21
RE: How to balance reduce job - MapReduce - [mail # user]
...The typical Partitioner method for assigning reducer r from reducers R is  r = hash(key) % count(R)  However if you find your partitioner is assigning your data to too few or one r...
   Author: Tony Burton, 2013-05-07, 15:13
RE: Bloom Filter analogy in SQL - HDFS - [mail # user]
...There’s an explanation of Bloom Filters and a MapReduce implementation in Chuck Lam’s book “Hadoop In Action”. Maybe that might guide the way a bit more.   From: Sai Sai [mailto:[EMAIL ...
   Author: Tony Burton, 2013-05-07, 14:56
[expand - 5 more] - RE: S3/EMR Hive: Load contents of a single file - Hive - [mail # user]
...No problem Keith - it was a worthwhile exercise for me to go back and double check everything was working as expected.     From: Keith Wiley [mailto:[EMAIL PROTECTED]]  Sent: ...
   Author: Tony Burton, 2013-03-27, 17:18
[expand - 1 more] - RE: Group names for custom Counters - MapReduce - [mail # user]
...Thanks Michel! Looks like that’ll do the trick.  It wasn’t clear originally from the docs that context.getCounter(groupName, counterName) will create the group and counter if they don’t...
   Author: Tony Burton, 2013-03-25, 11:02
hadoop file append - HDFS - [mail # user]
...Hi list,    I'm using Hadoop 1.0.3 for a MapReduce task and I thought it might be a simple job to append a Counter value and some text to the end of a file (which ultimately will b...
   Author: Tony Burton, 2013-03-18, 15:44
[expand - 8 more] - RE: hadoop 1.0.3 equivalent of MultipleTextOutputFormat - HDFS - [mail # user]
...Thanks for the reply Alejandro. Using a temp output directory was my first guess as well. What's the best way to proceed? I've come across FileSystem.rename but it's consistently returning f...
   Author: Tony Burton, 2013-02-01, 15:12
[expand - 7 more] - RE: HWI use on AWS/EMR - Hive - [mail # user]
...Hi Nitin (and others who have contributed)  Thanks again for the suggestions - however my original objective was to investigate HWI as an alternative to hive in CLI mode for less techie...
   Author: Tony Burton, 2013-01-21, 16:37
RE: Map output compression in Hadoop 1.0.3 - MapReduce - [mail # user]
...Hi Andy and list,  Apologies - I've not been looking at my list inbox for a while, so missed this request. I'm running some tests as I type, and will report back when they're done. I'm ...
   Author: Tony Burton, 2012-12-19, 09:19
[MAPREDUCE-4616] Improvement to MultipleOutputs javadocs - MapReduce - [issue]
...In the new API, and using MultipleOutputs it is possible to segment output into directories by using MultipleOutputs.write(KEYOUT key, VALUEOUT value, String baseOutputPath) in the Reducer t...
http://issues.apache.org/jira/browse/MAPREDUCE-4616    Author: Tony Burton, 2012-10-12, 17:04
MapReduce (8)
Hive (5)
Hadoop (4)
HDFS (3)
Pig (1)
mail # user (19)
issue (1)
mail # dev (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (21)
Ted Yu (1699)
Harsh J (1295)
Todd Lipcon (995)
Stack (978)
Jun Rao (969)
Jonathan Ellis (844)
Andrew Purtell (816)
Jean-Daniel Cryans (753)
Yusaku Sako (718)
stack (714)
Jarek Jarcec Cecho (703)
Eric Newton (688)
Jonathan Hsieh (673)
Roman Shaposhnik (663)
Namit Jain (649)
Hitesh Shah (627)
Owen O'Malley (625)
Steve Loughran (624)
Siddharth Seth (614)
Josh Elser (557)
Brock Noland (549)
Eli Collins (545)
Neha Narkhede (545)
Arun C Murthy (543)
Doug Cutting (533)
Tony Burton