Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 31 to 40 from 260 (0.136s).
Loading phrases to help you
refine your search...
Re: LZO with sequenceFile - Hadoop - [mail # user]
...On Sun, Feb 26, 2012 at 1:49 PM, Harsh J  wrote:  LZO confuses most because how it was added and removed. Also there is a system to make raw LZO files split-table by indexing it. &...
   Author: Edward Capriolo, 2012-02-26, 19:28
Re: Writing small files to one big file in hdfs - Hadoop - [mail # user]
...On Tue, Feb 21, 2012 at 7:50 PM, Mohit Anchlia  wro te: t ote:  I  with h)); ); de nce e want to  at  a normal e ow  You might want to look at https://github.co...
   Author: Edward Capriolo, 2012-02-22, 02:42
Re: Addendum to Hypertable vs. HBase Performance Test (w/ mslab enabled) - Hadoop - [mail # user]
...As your numbers show.  Dataset Size Hypertable Queries/s HBase Queries/s Hypertable Latency (ms) HBase Latency (ms) 0.5 TB 3256.42 2969.52 157.221 172.351 5 TB 2450.01 2066.52...
   Author: Edward Capriolo, 2012-02-18, 01:58
Re: Brisk vs Cloudera Distribution - Hadoop - [mail # user]
...Hadoop can work on a number of filessytems hdfs , s3. Local files. Brisk file system is known as cfs. Cfs stores all block and meta data in cassandra. Thus it does not use a name node. Brisk...
   Author: Edward Capriolo, 2012-02-09, 04:57
Re: Checking Which Filesystem Being Used? - Hadoop - [mail # user]
...On Tue, Feb 7, 2012 at 5:24 PM, Eli Finkelshteyn  wrote:   conf.get("fs.default.name") would return a URI such as hdfs://bla:8000 or file:///this. Although an application could hav...
   Author: Edward Capriolo, 2012-02-07, 22:42
Re: jobtracker url(Critical) - Hadoop - [mail # user]
...Task tracker sometimes so not clean up their mapred temp directories well if that is the case the tt on startup can spent many minutes deleting files. I use find to delete files older then a...
   Author: Edward Capriolo, 2012-01-27, 13:06
Re: NameNode per-block memory usage? - Hadoop - [mail # user]
...On Tue, Jan 17, 2012 at 10:08 AM, Otis Gospodnetic  wrote:   Some real world statistics. From NN web Interface. replication factor=2  Cluster Summary 22,061,605 files and dire...
   Author: Edward Capriolo, 2012-01-17, 15:22
Re: hadoop filesystem cache - Hadoop - [mail # user]
...The challenges of this design is people accessing the same data over and over again is the uncommon usecase for hadoop. Hadoop's bread and butter is all about streaming through large dataset...
   Author: Edward Capriolo, 2012-01-16, 18:07
Re: desperate question about NameNode startup sequence - Hadoop - [mail # user]
...The problem with checkpoint /2nn is that it happily "runs" and has no outward indication that it is unable to connect.  Because you have a large edits file you startup will complete, ho...
   Author: Edward Capriolo, 2011-12-17, 21:00
Re: Analysing Completed Job info programmatically apart from Jobtracker GUI - Hadoop - [mail # user]
...I would check out hitune. I have a github project that connects to the JobTracker and stores counters, job times and other stats into Cassandra.  https://github.com/edwardcapriolo/hadoo...
   Author: Edward Capriolo, 2011-12-14, 17:30
Hive (631)
Hadoop (260)
Cassandra (63)
HBase (47)
Kafka (10)
MapReduce (6)
Pig (6)
HDFS (2)
Zookeeper (1)
mail # user (257)
issue (3)
last 7 days (2)
last 30 days (2)
last 90 days (3)
last 6 months (5)
last 9 months (260)
Harsh J (553)
Owen O'Malley (396)
Steve Loughran (376)
Todd Lipcon (236)
Eli Collins (181)
Alejandro Abdelnur (160)
Arun C Murthy (160)
Chris Nauroth (141)
Allen Wittenauer (122)
Tom White (118)
Ted Yu (114)
Nigel Daley (113)
Daryn Sharp (110)
Konstantin Shvachko (106)
Aaron Kimball (93)
Doug Cutting (93)
Edward Capriolo (86)
Mark Kerzner (86)
Colin Patrick McCabe (85)
jason hadoop (82)
Hairong Kuang (74)
Runping Qi (72)
Benoy Antony (69)
Konstantin Boudnik (68)
Suresh Srinivas (63)