Re: hadoop missing file? - HDFS - [mail # user]
...(10-3) * 129 = 903  But long answer 1) which missing file? 2) how do you know it is missing?  You have a cluster with 3 datanodes, the default replication factor is 3 but not for t...
   Author: Bertrand Dechoux, 2013-07-30, 07:10
Re: Running a single cluster in multiple datacenters - HDFS - [mail # user]
...According to your own analysis, you wouldn't be more available but that was your aim. Did you consider having two separate clusters? One per datacenter, with an automatic copy of the data? I...
   Author: Bertrand Dechoux, 2013-07-15, 22:37
Re: MapReduce shuffle algorithm - HDFS - [mail # user]
...An introduction to the subject can be found in the best known reference :  Hadoop: The Definitive Guide, 3rd Edition  Storage and Analysis at Internet Scale By Tom White  Publ...
   Author: Bertrand Dechoux, 2013-05-21, 19:21
Re: Wrapping around BitSet with the Writable interface - HDFS - [mail # user]
...You can disregard my links as they are only valid for java 1.7+. The JavaSerialization might clean your code but shouldn't bring a significant boost in performance. The EWAH implementation ...
   Author: Bertrand Dechoux, 2013-05-12, 20:40
Re: Hadoop Mapreduce fails with permission management enabled - HDFS - [mail # user]
...Permission denied: user=realtime, access=EXECUTE, inode="system":hadoop:supergroup:rwx------   It seems like you tried to run a job with a user 'realtime' but this one has no ac...
   Author: Bertrand Dechoux, 2013-03-28, 14:08
Re: Naïve k-means using hadoop - HDFS - [mail # user]
...And there is also Cascading ;) : http://www.cascading.org/ But like Crunch, this is Hadoop. Both are 'only' higher-level APIs for MapReduce.  As for the number of reducers, you will have to d...
   Author: Bertrand Dechoux, 2013-03-27, 13:24
Re: namenode directory failure question - HDFS - [mail # user]
...You may want to check this JIRA: https://issues.apache.org/jira/browse/HADOOP-4885  It won't help you right now but it could allow you next time to avoid restarting.  Regards...
   Author: Bertrand Dechoux, 2013-03-18, 15:28
Re: basic question about rack awareness and computation migration - HDFS - [mail # user]
...I might have missed something but is there a reason for the input of the mappers to be a list of files and not the files themselves? The usual way is to provide a path to the files that shou...
   Author: Bertrand Dechoux, 2013-03-07, 12:35
Re: Maximum Storage size in a Single datanode - HDFS - [mail # user]
...I would say the hard limit is due to the OS local file system (and your budget).  So short answer for ext3: it doesn't seem so. http://en.wikipedia.org/wiki/Ext3  And I am not su...
   Author: Bertrand Dechoux, 2013-01-30, 09:14
Re: Mapper outputs an empty file - HDFS - [mail # user]
...You should write unit tests (MRUnit) and do debugging if that's not enough. I would assume that you are reading your file line by line. And each line is not valid XML, thus an exception ...
   Author: Bertrand Dechoux, 2012-11-30, 13:06