Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 28 (0.117s).
Loading phrases to help you
refine your search...
[expand - 2 more] - RE: JobClient: Error reading task output - after instituting a DNS server - MapReduce - [mail # user]
...So simple I was hoping to avoid admitting to it. ;-)     I had set the tasks java options at -Xmx1.5g, that needed to be -Xmx1500m, the telltale output of a mistake like that is ra...
   Author: David Parks, 2013-05-15, 07:28
RE: Access HDFS from OpenCL - MapReduce - [mail # user]
...Hadoop just runs as a standard java process, you should find something that bridges between OpenCL and java, a quick google search yields: http://www.jocl.org/     I expect that yo...
   Author: David Parks, 2013-05-13, 14:11
600s timeout during copy phase of job - MapReduce - [mail # user]
...I have a job that's getting 600s task timeouts during the copy phase of the reduce step. I see a lot of copy tasks all moving at about 2.5MB/sec, and it's taking longer than 10 min to do tha...
   Author: David Parks, 2013-05-13, 06:05
What's the best disk configuration for hadoop? SSD's Raid levels, etc? - MapReduce - [mail # user]
...We've got a cluster of 10x 8core/24gb nodes, currently with 1 4TB disk (3 disk slots max), they chug away ok currently, only slightly IO bound on average.     I'm going to upgrade ...
   Author: David Parks, 2013-05-11, 06:30
[expand - 1 more] - RE: Uploading file to HDFS - MapReduce - [mail # user]
...I just realized another trick you might trying. The Hadoop dfs client can read input from STDIN, you could use netcat to pipe the stuff across to HDFS without hitting the hard drive, I haven...
   Author: David Parks, 2013-04-19, 08:42
Mapreduce jobs to download job input from across the internet - MapReduce - [mail # user]
...For a set of jobs to run I need to download about 100GB of data from the internet (~1000 files of varying sizes from ~10 different domains).     Currently I do this in a simple lin...
   Author: David Parks, 2013-04-17, 04:26
RE: Hadoop distcp from CDH4 to Amazon S3 - Improve Throughput - MapReduce - [mail # user]
...4-20MB/sec are common transfer rates from S3 to *1* local AWS box, this was, of course, a cluster, and s3distcp is specifically designed to take advantage of the cluster, so it was a 45 minu...
   Author: David Parks, 2013-03-31, 01:26
RE: Which hadoop installation should I use on ubuntu server? - MapReduce - [mail # user]
...Hmm, seems intriguing. I'm still not totally clear on bigtop here. It seems like they're creating and maintain basically an installer for Hadoop?     I tried following their docs f...
   Author: David Parks, 2013-03-29, 08:09
RE: - MapReduce - [mail # user]
...Can I suggest an answer of "Yes, but  you probably don't want to"?  As a "typical user" of Hadoop you would not do this. Hadoop already chooses the best server to do the work based...
   Author: David Parks, 2013-03-25, 07:16
RE: For a new installation: use the BackupNode or the CheckPointNode? - MapReduce - [mail # user]
...So... the answer is... SecondaryNameNode is what I should be installing here. And the SecondaryNameNode is essentially just an earlier version of the checkpoint node, in terms of functionali...
   Author: David Parks, 2013-03-24, 01:21
MapReduce (21)
Hadoop (14)
HDFS (11)
Pig (3)
HBase (1)
mail # user (28)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (28)
Harsh J (454)
Arun C Murthy (326)
Vinod Kumar Vavilapalli (309)
Todd Lipcon (223)
Amar Kamat (181)
Thomas Graves (165)
Jason Lowe (161)
Amareshwari Sriramadasu (152)
Sandy Ryza (124)
Tom White (111)
Siddharth Seth (109)
Aaron Kimball (107)
Owen O'Malley (105)
Alejandro Abdelnur (103)
Devaraj K (103)
Ramya Sunil (103)
Robert Joseph Evans (101)
Hemanth Yamijala (97)
Steve Loughran (90)
Ted Yu (78)
Eli Collins (77)
Ravi Gummadi (76)
Karthik Kambatla (71)
Mahadev konar (67)
Ravi Prakash (66)
David Parks