Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 13 (0.182s).
Loading phrases to help you
refine your search...
Re: how to prevent JAVA HEAP OOM in shuffle process? - HDFS - [mail # dev]
...Setting map memory at command line with the *new* api: hadoop jar hadoop-mapreduce-examples- wordcount -Dmapreduce.map.java.opts=-Xmx1024m /user/hdfs/ades /tmp/wordcount ...
   Author: Adam Muise, 2013-12-02, 13:13
Re: HDFS read/write data throttling - HDFS - [mail # dev]
...See https://issues.apache.org/jira/browse/HDFS-3475  Please note that this has met with many unexpected impacts on workload. Be careful and be mindful of your Datanode memory and networ...
   Author: Adam Muise, 2013-11-11, 19:27
Re: Cloudera Vs Hortonworks Vs MapR - MapReduce - [mail # user]
...I would just through an additional point on top of Shahab's excellent summary.  To evaluate a distribution requires more than just the technical aspects of that distribution. Even if we...
   Author: Adam Muise, 2013-09-13, 18:01
[expand - 2 more] - Re: Hadoop on IPv6 - MapReduce - [mail # user]
...Harsh is giving you a best practice for JVMs using IPv4 in general. As what I am suggesting is IPv4-only connections to the Hadoop daemons and clients on the cluster and gateway, you would n...
   Author: Adam Muise, 2013-09-10, 16:34
Re: Concatenate multiple sequence files into 1 big sequence file - Hadoop - [mail # user]
...Jerry,  It might not help with this particular file, but you might considered the approach used at Blackberry when dealing with your data. They block compressed into small avro files an...
   Author: Adam Muise, 2013-09-10, 15:20
Re: Jaspersoft -Reg - Hadoop - [mail # user]
...Note, Jaspersoft requires HiveServer version 1 for it's jdbc connection the last time I checked (a few months ago). Most distros do not start hiveserver v1 by default anymore so you will hav...
   Author: Adam Muise, 2013-09-03, 16:27
Re: Multidata center support - MapReduce - [mail # user]
...Nothing has changed. DR best practice is still one (or more) clusters per site and replication is handled via distributed copy or some variation of it. A cluster spanning multiple data cente...
   Author: Adam Muise, 2013-08-30, 10:26
Re: Is hadoop tread safe? - MapReduce - [mail # user]
...Mappers don't communicate with each other in traditional MapReduce. If you need something more MPI-ish then look to MPI over YARN or write your own YARN app.  If you need multi-threadin...
   Author: Adam Muise, 2013-08-29, 14:40
Re: updated to 1.2.1, map completed percentage keeps oscillating - MapReduce - [mail # user]
...I'm assuming you are trying the same job with the same data as before. Try taking a look at the job output for the mappers in the JobTracker. Likely you will see some failures and probably a...
   Author: Adam Muise, 2013-08-14, 01:31
Re: HDFS 1.1.0 - File Append - Hadoop - [mail # user]
...Thomas,  Try using Flume to ingest the realtime message from RabbitMQ. Flume ingests event data and has pluggable components: source -> channel -> sink.  http://flume.apache....
   Author: Adam Muise, 2013-08-06, 13:41
MapReduce (5)
HDFS (4)
Hadoop (4)
mail # user (9)
issue (2)
mail # dev (2)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (13)
Ted Yu (1688)
Harsh J (1295)
Jun Rao (1056)
Todd Lipcon (1001)
Stack (976)
Jonathan Ellis (843)
Andrew Purtell (821)
Jean-Daniel Cryans (753)
jacques@... (738)
Yusaku Sako (733)
stack (717)
Jarek Jarcec Cecho (702)
Eric Newton (697)
Jonathan Hsieh (675)
Brock Noland (666)
Roman Shaposhnik (665)
Neha Narkhede (660)
Namit Jain (649)
Hitesh Shah (626)
Owen O'Malley (625)
Steve Loughran (619)
Siddharth Seth (614)
Josh Elser (584)
Eli Collins (545)
Arun C Murthy (543)
Adam Muise