Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 31 to 40 from 58 (0.109s).
Loading phrases to help you
refine your search...
Eclipse target - Hadoop - [mail # dev]
...When I checkout common, run ant eclipse and then open eclipse I get this error:  The type com.sun.javadoc.RootDoc cannot be resolved. It is indirectly referenced from required .class fi...
   Author: Niels Basjes, 2011-05-18, 21:24
Re: MapReduce compilation error - Hadoop - [mail # dev]
...Today I ran into the same error and I was puzzled by the content of this file. What is the purpose of a test file that appears to have a deliberate error and no code what so ever?   201...
   Author: Niels Basjes, 2011-05-18, 20:00
Re: Unsplittable files on HDFS - Hadoop - [mail # user]
...Hi,  I did the following with a 1.6GB file    hadoop fs -Ddfs.block.size=2147483648 -put /home/nbasjes/access-2010-11-29.log.gz /user/nbasjes and I got  Total number of b...
   Author: Niels Basjes, 2011-04-27, 09:48
Unsplittable files on HDFS - Hadoop - [mail # user]
...Hi,  In some scenarios you have gzipped files as input for your map reduce job (apache logfiles is a common example). Now some of those files are several hundred megabytes and as such w...
   Author: Niels Basjes, 2011-04-27, 07:55
Re: Re-generate datanode storageID? - Hadoop - [mail # user]
...Hi,  To solve that simply do the following on the problematic nodes: 1) Stop the datanode (probably not running) 2) Remove everything inside the .../cache/hdfs/ 3) Start the datanode ag...
   Author: Niels Basjes, 2011-03-24, 19:48
Re: TextInputFormat and Gzip encoding - wordcount displaying binary data - Hadoop - [mail # user]
...Hi,  2011/3/21 Saptarshi Guha :  No, .zip is "pkzip" and .gz is gzip.  The applicable hadoop code actually chooses the decompressor on the extention of the filename.  Nie...
   Author: Niels Basjes, 2011-03-21, 23:01
Re: File formats in Hadoop - Hadoop - [mail # user]
...And then there is the matter of how you put the data in the file. I've heard that some people write the data as protocolbuffers into the sequence file.  2011/3/19 Harsh J :    ...
   Author: Niels Basjes, 2011-03-20, 11:04
Re: Efficiently partition broadly distributed keys - Hadoop - [mail # user]
...Hi Luca,  2011/3/10 Luca Aiello :   "hadoop embedded" tool to do this. This is what I suspected indeed.  The mapreduce model simply uses the  as the pivot of the processi...
   Author: Niels Basjes, 2011-03-10, 22:07
Re: Efficiently partition broadly distributed keys - Hadoop - [mail # user]
...If I understand your problem correctly you actually need some way of knowing if you need to "chop" a large set with a specific key in to subsets. In mapreduce the map only has information ab...
   Author: Niels Basjes, 2011-03-10, 20:23
Re: Comparison between Gzip and LZO - Hadoop - [mail # user]
...Question: Are you 100% sure that nothing else was running on that system during the tests? No cron jobs, no "makewhatis" or "updatedb"?  P.S. There is a permission issue with downloadin...
   Author: Niels Basjes, 2011-03-02, 20:16
Sort:
project
Hadoop (58)
MapReduce (36)
Pig (12)
HBase (4)
HDFS (1)
type
mail # dev (33)
mail # user (23)
issue (1)
mail # general (1)
date
last 7 days (2)
last 30 days (4)
last 90 days (4)
last 6 months (6)
last 9 months (58)
author
Harsh J (1373)
Steve Loughran (928)
Owen O'Malley (816)
Todd Lipcon (755)
Arun C Murthy (574)
Eli Collins (513)
Allen Wittenauer (461)
Doug Cutting (344)
Konstantin Boudnik (335)
Mark Kerzner (334)
Edward Capriolo (328)
Ted Dunning (321)
Brian Bockelman (305)
Tom White (302)
jason hadoop (279)
Niels Basjes