Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 50 (0.201s).
Loading phrases to help you
refine your search...
[expand - 1 more] - RE: Flattening nested bags - Pig - [mail # user]
...Ah, ok, that was very helpful, thanks. I've been able to flatten things out now. So now I'm trying to re-group 2 levels of bags that I flattened (after doing a join).  After some flatte...
   Author: David Parks, 2013-06-05, 23:57
RE: Complex joins - Pig - [mail # user]
...Hi, I'm working alongside Ha on this.  You were right and wrong about the PigStorage format. It *is* a tab delimited format, that was our mistake, but those tabs *can* contain tuples an...
   Author: David Parks, 2013-05-23, 06:24
RE: [Bulk] pig 0.10.0 JsonLoader and nested list - Pig - [mail # user]
...I'm quite new to Pig, so perhaps my input is off base here, but if you input one such record without defining the schema I believe the JsonLoader will define the schema for you, no?  If...
   Author: David Parks, 2013-05-22, 10:26
Recovering the namenode from failure - Hadoop - [mail # user]
...I'm on CDH4, and trying to recover both the namenode and cloudera manager VMs from HDFS after losing the namenode.     All of our backup VMs are on HDFS, so for the moment I just w...
   Author: David Parks, 2013-05-21, 07:30
RE: About configuring cluster setup - Hadoop - [mail # user]
...We have a box that's a bit overpowered for just running our namenode and jobtracker on a 10-node cluster and we also wanted to make use of the storage and processor resources of that node, l...
   Author: David Parks, 2013-05-15, 07:50
[expand - 2 more] - RE: JobClient: Error reading task output - after instituting a DNS server - MapReduce - [mail # user]
...So simple I was hoping to avoid admitting to it. ;-)     I had set the tasks java options at -Xmx1.5g, that needed to be -Xmx1500m, the telltale output of a mistake like that is ra...
   Author: David Parks, 2013-05-15, 07:28
RE: Access HDFS from OpenCL - MapReduce - [mail # user]
...Hadoop just runs as a standard java process, you should find something that bridges between OpenCL and java, a quick google search yields: http://www.jocl.org/     I expect that yo...
   Author: David Parks, 2013-05-13, 14:11
Using FairScheduler to limit # of tasks - Hadoop - [mail # user]
...Can I use the FairScheduler to limit the number of map/reduce tasks directly from the job configuration? E.g. I have 1 job that I know should run a more limited # of map/reduce tasks than is...
   Author: David Parks, 2013-05-13, 11:21
600s timeout during copy phase of job - MapReduce - [mail # user]
...I have a job that's getting 600s task timeouts during the copy phase of the reduce step. I see a lot of copy tasks all moving at about 2.5MB/sec, and it's taking longer than 10 min to do tha...
   Author: David Parks, 2013-05-13, 06:05
What's the best disk configuration for hadoop? SSD's Raid levels, etc? - MapReduce - [mail # user]
...We've got a cluster of 10x 8core/24gb nodes, currently with 1 4TB disk (3 disk slots max), they chug away ok currently, only slightly IO bound on average.     I'm going to upgrade ...
   Author: David Parks, 2013-05-11, 06:30
MapReduce (21)
Hadoop (14)
HDFS (11)
Pig (3)
HBase (1)
mail # user (47)
issue (3)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (50)
Ted Yu (1701)
Harsh J (1293)
Jun Rao (1059)
Todd Lipcon (1000)
Stack (978)
Jonathan Ellis (844)
Andrew Purtell (822)
Jean-Daniel Cryans (754)
Yusaku Sako (733)
stack (714)
Jarek Jarcec Cecho (702)
Eric Newton (698)
Jonathan Hsieh (673)
Brock Noland (668)
Neha Narkhede (665)
Roman Shaposhnik (665)
Namit Jain (649)
Hitesh Shah (627)
Owen O'Malley (625)
Steve Loughran (624)
Siddharth Seth (614)
Josh Elser (590)
Eli Collins (545)
Arun C Murthy (543)
Doug Cutting (533)
David Parks