Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 11 to 20 from 158 (0.103s).
Loading phrases to help you
refine your search...
Re: How can I record some position of context in Reduce()? - Hadoop - [mail # user]
...Hi,  Your cross join is supported in both pig and hive. (Cross, and Theta joins)   So there must be code to do this.   Essentially in the reducer you would have your key and t...
   Author: Michael Segel, 2013-04-09, 13:08
Re: Database insertion by HAdoop - Hadoop - [mail # user]
...Nope HBase wasn't mentioned.  The OP could be talking about using external tables and Hive.   The OP could still be stuck in the RDBMs world and hasn't flattened his data yet. &nbs...
   Author: Michael Segel, 2013-02-18, 16:57
Re: Select Linux Distro for Hbase - Hadoop - [mail # general]
...RedHat, or Centos is the best.  (Its the same thing... well sort of... ;-)   You can use other distros but YMMV and you need to make sure that you're not using the Open Source JDK ...
   Author: Michael Segel, 2013-01-23, 14:14
Re: Hello and request some advice. - Hadoop - [mail # user]
...Uhm...   Well, you can talk to Microsoft and Hortonworks about Microsoft as a platform.  Depending on the power of your laptop, you could create a VM and run hadoop in a pseudo dis...
   Author: Michael Segel, 2013-01-04, 19:53
Re: NN Memory Jumps every 1 1/2 hours - Hadoop - [mail # user]
...Hey Silly question...   How long have you had 27 million files?   I mean can you correlate the number of files to the spat of OOMs?   Even without problems... I'd say it would...
   Author: Michael Segel, 2012-12-22, 15:42
Re: What should I do with a 48-node cluster - Hadoop - [mail # user]
...While Ted ignores that the world is going to end before X-Mas, he does hit the crux of the matter head on.   If you don't have a place to put it, the cost of setting it up would kill yo...
   Author: Michael Segel, 2012-12-20, 15:38
Re: Chaining MapReduce Jobs - Hadoop - [mail # general]
...Have you looked at the ToolRunner class?   On Nov 8, 2012, at 7:03 AM, Claudio Reggiani  wrote:  ...
   Author: Michael Segel, 2012-11-08, 19:12
[expand - 1 more] - Re: Disks RAID best practice - Hadoop - [mail # user]
...Oleg, that's for an overall raid preference.   Specifically for the 'control nodes' aka (NN, SN, JT, HM, ZK...)   I tend to just use simple mirroring because these processes are no...
   Author: Michael Segel, 2012-11-01, 14:49
Re: measuring iops - Hadoop - [mail # user]
...You have two issues.   1) You need to know the throughput in terms of data transfer between  disks and controller cards on the node.  2) The actual network throughput of havin...
   Author: Michael Segel, 2012-10-23, 13:19
Re: Hadoop counter - Hadoop - [mail # user]
...Yup.  The counters at the end of the job are the most accurate.   On Oct 22, 2012, at 3:00 AM, Lin Ma  wrote:  discussion. how/when JT merge counter in the middle of the ...
   Author: Michael Segel, 2012-10-23, 03:57
Sort:
project
HBase (397)
Hadoop (155)
MapReduce (51)
HDFS (25)
Hive (2)
Spark (2)
Storm (1)
Zookeeper (1)
type
mail # user (139)
mail # general (19)
date
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (2)
last 9 months (158)
author
Harsh J (571)
Steve Loughran (438)
Owen O'Malley (393)
Todd Lipcon (240)
Allen Wittenauer (223)
Chris Nauroth (184)
Eli Collins (184)
Alejandro Abdelnur (180)
Ted Yu (170)
Arun C Murthy (168)
Tom White (121)
Daryn Sharp (117)
Nigel Daley (115)
Konstantin Shvachko (111)
Colin Patrick McCabe (110)
Doug Cutting (96)
Aaron Kimball (94)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Kai Zheng (80)
Akira AJISAKA (75)
Hairong Kuang (75)
Benoy Antony (73)
Konstantin Boudnik (73)
Michael Segel
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB