Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 6 from 6 (0.081s).
Loading phrases to help you
refine your search...
[expand - 4 more] - Re: Improving MR job disk IO - MapReduce - [mail # user]
...Yep, have several tens of terabytes of data that will easily be over couple of hundred TB in a year. Now it isn't as if I have one or two use cases to run on these data sets. I need to run s...
   Author: Xuri Nagarin, 2013-10-15, 03:50
[expand - 1 more] - Re: Hadoop graphing tools - MapReduce - [mail # user]
...Not really performance monitoring but a simple charting tool. Scan a data set, extract keys/values and place them on a 2-D chart. The way you would load a small data set in an Excel spreadsh...
   Author: Xuri Nagarin, 2013-10-15, 03:46
Modifying Grep to read Sequence/Snappy files - MapReduce - [mail # user]
...Hi,  I am trying to get the Grep example bundled with CDH to read Sequence/Snappy files.  By default, the program throws errors trying to read Sequence/Snappy files: java.io.EOFExc...
   Author: Xuri Nagarin, 2013-10-08, 17:52
[expand - 3 more] - Re: Cloudera Vs Hortonworks Vs MapR - MapReduce - [mail # user]
...I simply stated the process my team went through but in hindsight, given that I understand the Hadoop ecosystem better, I think yes, given that MapR uses HDFS, we could simply use distcp to ...
   Author: Xuri Nagarin, 2013-09-18, 18:41
Re: Hadoop Clients (Hive,Pig) and Hadoop Cluster - MapReduce - [mail # user]
...Yes, ideally you want to setup a 4th gateway node to run clients. http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/latest/CDH4-Security-Guide/AppxG-Setting-Up-Gateway.html...
   Author: Xuri Nagarin, 2013-08-29, 22:24
TB per core sweet spot - MapReduce - [mail # user]
...Hi,  I realize there is no perfect spec for data nodes as lot depends on use cases and work loads but I am curious if there are any rules of thumb or no-go zones in terms of how many te...
   Author: Xuri Nagarin, 2013-08-29, 21:33
Sort:
project
MapReduce (5)
Pig (2)
Spark (2)
HDFS (1)
Kafka (1)
type
mail # user (6)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (6)
author
Harsh J (454)
Arun C Murthy (326)
Vinod Kumar Vavilapalli (309)
Todd Lipcon (215)
Amar Kamat (181)
Thomas Graves (165)
Jason Lowe (159)
Amareshwari Sriramadasu (152)
Sandy Ryza (124)
Tom White (111)
Siddharth Seth (109)
Aaron Kimball (107)
Owen O'Malley (105)
Devaraj K (103)
Ramya Sunil (103)
Alejandro Abdelnur (102)
Robert Joseph Evans (101)
Hemanth Yamijala (97)
Steve Loughran (90)
Ted Yu (78)
Eli Collins (77)
Ravi Gummadi (76)
Karthik Kambatla (70)
Mahadev konar (67)
Ravi Prakash (66)
Xuri Nagarin