Results 11 to 20 of 262 (0.072s).
Re: Regarding DataJoin contrib jar for 1.0.3 - Hadoop - [mail # user]
...DataJoin is an example. Most people doing joins use Hive or Pig rather than code them up themselves.  On Tue, Jul 24, 2012 at 5:19 PM, Abhinav M Kulkarni wrote:...
   Author: Edward Capriolo, 2012-07-25, 20:27
Re: hadoop FileSystem.close() - Hadoop - [mail # user]
...In all my experience you let FileSystem instances close themselves.  On Tue, Jul 24, 2012 at 10:34 AM, Koert Kuipers  wrote:...
   Author: Edward Capriolo, 2012-07-24, 14:46
Re: Avro vs Protocol Buffer - Hadoop - [mail # user]
...We just open sourced our protobuf support for Hive. We built it out because in our line of work protobuf is very common and it gave us the ability to log protobufs directly to files and then...
   Author: Edward Capriolo, 2012-07-20, 22:03
Re: Group mismatches? - Hadoop - [mail # user]
...In all places I have found it only to be the primary group, not all the user's supplemental groups.  On Mon, Jul 16, 2012 at 3:05 PM, Clay B. wrote:...
   Author: Edward Capriolo, 2012-07-16, 19:15
Re: stuck in safe mode after restarting dfs after found dead node - Hadoop - [mail # user]
...If the files are gone forever you should run:  hadoop fsck -delete /  To acknowledge they have moved on from existence. Otherwise things that attempt to read these files will, to pu...
   Author: Edward Capriolo, 2012-07-14, 15:23
Re: Setting number of mappers according to number of TextInput lines - Hadoop - [mail # user]
...No. The number of lines is not known at planning time. All you know is the size of the blocks. You want to look at mapred.max.split.size .  On Sat, Jun 16, 2012 at 5:31 AM, Ondřej Klimp...
   Author: Edward Capriolo, 2012-06-16, 16:12
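
The point in the snippet above, that splits are planned from byte sizes rather than line counts, can be illustrated with a small sketch. It assumes the rule split size = max(min size, min(max size, block size)), which is how the newer-API FileInputFormat is understood to combine the block size with mapred.max.split.size. The function names are hypothetical, and the estimate ignores the small slack Hadoop allows on the last split.

```python
def compute_split_size(block_size, min_size=1, max_size=2**63 - 1):
    """Assumed rule: clamp the block size between the min and max split sizes."""
    return max(min_size, min(max_size, block_size))


def estimate_num_splits(file_size, block_size, min_size=1, max_size=2**63 - 1):
    """Rough number of input splits (and so mappers) for one splittable file.

    Only byte sizes are used; the number of lines never enters the calculation.
    """
    if file_size == 0:
        return 0
    split_size = compute_split_size(block_size, min_size, max_size)
    # Integer ceiling division, avoiding floating-point rounding.
    return (file_size + split_size - 1) // split_size


MB = 1024 * 1024

# A 1024 MB file with 64 MB blocks and no cap on the split size.
print(estimate_num_splits(1024 * MB, 64 * MB))

# The same file with the max split size (mapred.max.split.size) capped at 16 MB.
print(estimate_num_splits(1024 * MB, 64 * MB, max_size=16 * MB))
```

Under these assumptions, lowering the maximum split size is what raises the mapper count; the number of lines in the input has no effect on planning.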
Re: Ideal file size - Hadoop - [mail # user]
...It does not matter what the file size is because the file is split into blocks, which is what the NN tracks.  For larger deployments you can go with a large block size like 256MB or...
   Author: Edward Capriolo, 2012-06-06, 14:55
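
As a rough illustration of why a larger block size helps larger deployments, the sketch below counts how many blocks the NameNode would have to track for one file. The 256 MB figure comes from the snippet; the 64 MB comparison value and the 10 GB file size are assumptions chosen for the example, and the function name is hypothetical.

```python
def block_count(file_size, block_size):
    """Number of blocks a file of file_size bytes occupies (last block may be partial)."""
    if file_size == 0:
        return 0
    return (file_size + block_size - 1) // block_size


MB = 1024 * 1024
GB = 1024 * MB

file_size = 10 * GB  # assumed example file size

print(block_count(file_size, 64 * MB))   # blocks tracked with 64 MB blocks
print(block_count(file_size, 256 * MB))  # blocks tracked with 256 MB blocks
```

For the same data, quadrupling the block size cuts the number of tracked blocks to a quarter.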
Re: Hadoop with Sharded MySql - Hadoop - [mail # user]
...Maybe you can do some VIEWs or unions or merge tables on the mysql side to overcome the aspect of launching so many sqoop jobs.  On Thu, May 31, 2012 at 6:02 PM, Srinivas Surasani  ...
   Author: Edward Capriolo, 2012-06-01, 00:12
Re: Hadoop on physical Machines compared to Amazon Ec2 / virtual machines - Hadoop - [mail # user]
...We were actually in an Amazon versus host-it-yourself debate with someone, which prompted us to do some calculations:  http://www.edwardcapriolo.com/roller/edwardcapriolo/entry/myth_busters_op...
   Author: Edward Capriolo, 2012-05-31, 19:22
Re: Problems with block compression using native codecs (Snappy, LZO) and MapFile.Reader.get() - Hadoop - [mail # user]
...If you are getting a SIGSEGV, it never hurts to try a more recent JVM. 21 has many bug fixes at this point.  On Tue, May 22, 2012 at 11:45 AM, Jason B wrote:...
   Author: Edward Capriolo, 2012-05-22, 15:59