Re: unsubsrcibe - Hadoop - [mail # user]
...You are: 1) Not unsubscribing correctly. From the welcome email you get when you subscribed - 'To remove your address from the list, send a messag...
   Author: Devin Suiter RDX, 2014-03-12, 14:44
Re: Map-Reduce: How to make MR output one file an hour? - Hadoop - [mail # user]
...If you only want one file, then you need to set the number of reducers to 1. If the size of the data makes the original MR job impractical to use a single reducer, you run a second job on the ...
   Author: Devin Suiter RDX, 2014-03-01, 12:48
Hadoop FileCrush - Hadoop - [mail # user]
...Hi, Has anyone used Hadoop Filecrush? http://www.jointhegrid.com/hadoop_filecrush/ I was just curious about the reliability and integrity of it. It seems like a nice concept. But, if it is a nic...
   Author: Devin Suiter RDX, 2014-02-27, 17:26
Re: Logic of isSplittable() of class FileInputFormat - Hadoop - [mail # user]
...Or, as another example, I'm writing a program to analyze a large email dump. The emails are more than one line. TextInputFormat will split them up by line, in addition to deserializing them to...
   Author: Devin Suiter RDX, 2014-02-26, 13:09
Re: Performance - Hadoop - [mail # user]
...http://sortbenchmark.org/ Doesn't just cover Hadoop, but maybe the methodology will give you an idea of what you're looking for. There's too many variables to pin down a "general" average. Ever...
   Author: Devin Suiter RDX, 2014-02-25, 20:43
Re: Questions from a newbie to Hadoop - Hadoop - [mail # user]
...Pseudo-distributed mode. On Feb 21, 2014 5:19 PM, "Publius" wrote: ...
   Author: Devin Suiter RDX, 2014-02-22, 19:24
Re: How to keep data consistency? - Hadoop - [mail # user]
...Edward, It doesn't seem like your "hadoop -put ..." command will even complete - the master isn't receiving the file at any point. It instructs the node1 to connect to the client, after asking ...
   Author: Devin Suiter RDX, 2014-02-19, 15:39
Re: A hadoop command to determine the replication factor of a hdfs file ? - Hadoop - [mail # user]
...Also Raj - if you're using pseudo-distributed mode, the replication factor will be 1. This is part of pseudo-distributed configuration. So if you're working on a Cloudera preconfigured machine...
   Author: Devin Suiter RDX, 2014-02-08, 17:53
Re: Newbie: How to set up HDFS file system - Hadoop - [mail # user]
...Installing Hadoop will install HDFS, and you will need to declare storage directories on the host nodes, etc. There is also the question of what setup you want to use, there is what is calle...
   Author: Devin Suiter RDX, 2014-01-07, 14:17
MapReduce MIME Input type? - Hadoop - [mail # user]
...Hi, I am trying to puzzle this out, and am hoping for some insight - I have an IMAP inbox dump that I am analyzing - I need to track how many times a given item is referred to in the i...
   Author: Devin Suiter RDX, 2013-12-30, 18:29