Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 11 to 20 from 263 (0.157s).
Loading phrases to help you
refine your search...
[expand - 8 more] - Re: NN Memory Jumps every 1 1/2 hours - Hadoop - [mail # user]
...I tried your suggested setting and forced GC from Jconsole and once it crept up nothing was freeing up.  So just food for thought:  You said "average file name size is 32 bytes". W...
   Author: Edward Capriolo, 2012-12-27, 22:58
Re: Regarding DataJoin contrib jar for 1.0.3 - Hadoop - [mail # user]
...DataJoin is an example. Most people doing joins use Hive or Pig rather then code them up themselves.   On Tue, Jul 24, 2012 at 5:19 PM, Abhinav M Kulkarni  wrote:...
   Author: Edward Capriolo, 2012-07-25, 20:27
Re: hadoop FileSystem.close() - Hadoop - [mail # user]
...In all my experience you let FileSystem instances close themselves.  On Tue, Jul 24, 2012 at 10:34 AM, Koert Kuipers  wrote:...
   Author: Edward Capriolo, 2012-07-24, 14:46
Re: Avro vs Protocol Buffer - Hadoop - [mail # user]
...We just open sourced our protobuf support for Hive. We built it out because in our line of work protobuf is very common and it gave us the ability to log protobufs directly to files and then...
   Author: Edward Capriolo, 2012-07-20, 22:03
Re: Group mismatches? - Hadoop - [mail # user]
...In all places I have found it only to be the primary group, not all the users supplemental groups.  On Mon, Jul 16, 2012 at 3:05 PM, Clay B.  wrote:...
   Author: Edward Capriolo, 2012-07-16, 19:15
[expand - 1 more] - Re: stuck in safe mode after restarting dfs after found dead node - Hadoop - [mail # user]
...If the files are gone forever you should run:  hadoop fsck -delete /  To acknowledge they have moved on from existence. Otherwise things that attempt to read this files will, to pu...
   Author: Edward Capriolo, 2012-07-14, 15:23
Re: Setting number of mappers according to number of TextInput lines - Hadoop - [mail # user]
...No. The number of lines is not known at planning time. All you know is the size of the blocks. You want to look at mapred.max.split.size .  On Sat, Jun 16, 2012 at 5:31 AM, OndÅ™ej Klimp...
   Author: Edward Capriolo, 2012-06-16, 16:12
Re: Ideal file size - Hadoop - [mail # user]
...It does not matter what the file size is because the file size is split into blocks which is what the NN tracks.  For larger deployments you can go with a large block size like 256MB or...
   Author: Edward Capriolo, 2012-06-06, 14:55
Re: Hadoop with Sharded MySql - Hadoop - [mail # user]
...Maybe you can do some VIEWs or unions or merge tables on the mysql side to overcome the aspect of launching so many sqoop jobs.  On Thu, May 31, 2012 at 6:02 PM, Srinivas Surasani  ...
   Author: Edward Capriolo, 2012-06-01, 00:12
Re: Hadoop on physical Machines compared to Amazon Ec2 / virtual machines - Hadoop - [mail # user]
...We actually were in an Amazon/host it yourself debate with someone. Which prompted us to do some calculations:  http://www.edwardcapriolo.com/roller/edwardcapriolo/entry/myth_busters_op...
   Author: Edward Capriolo, 2012-05-31, 19:22
Sort:
project
Hive (639)
Hadoop (263)
Cassandra (64)
HBase (47)
Kafka (11)
MapReduce (6)
Pig (6)
HDFS (2)
Zookeeper (1)
type
mail # user (259)
issue (4)
date
last 7 days (0)
last 30 days (0)
last 90 days (3)
last 6 months (6)
last 9 months (263)
author
Harsh J (558)
Owen O'Malley (394)
Steve Loughran (390)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (126)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (93)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (70)
Suresh Srinivas (64)