Results from 81 to 90 of 169 (0.381s).
Re: Data-Intensive Text Processing with MapReduce - Hadoop - [mail # user]
...Dear Jimmy and Chris:  I am reading your book (thank you for providing the pre-release version) and I find it great in contents and in style. Thank you!  Sincerely, Mark  On S...
   Author: Mark Kerzner, 2010-05-09, 18:06
Accepting contributions for the "Hadoop in Practice" book - Hadoop - [mail # user]
...Hi, guys,  I am working on this book for Manning, and I need your solutions. If you had a specific problem that you solved with Hadoop, and you can share your solution, even in general...
   Author: Mark Kerzner, 2010-05-04, 23:06
Re: Hadoop Cookbook? - Hadoop - [mail # user]
...Thank you  On Tue, May 4, 2010 at 4:52 AM, Steve Loughran  wrote:  ...
   Author: Mark Kerzner, 2010-05-04, 13:16
leads? - Hadoop - [mail # user]
...Hi, guys,  without imposing, any leads for cloud-based projects will be appreciated, my resume here.  Thank you, Mark...
   Author: Mark Kerzner, 2010-04-27, 14:57
Re: DeDuplication Techniques - Hadoop - [mail # user]
...Joe,  your approach would work, whether you use files to keep old data, or a database. However, it feels like a mix of new and old technologies. It just does not feel right to open a fi...
   Author: Mark Kerzner, 2010-03-26, 00:24
Re: Parallelizing HTTP calls with Hadoop - Hadoop - [mail # user]
...Phil,  what you are describing is close to what Nutch is already doing. You can look at it - all this coding is non-trivial, and you can save yourself a lot of work and debugging. ...
   Author: Mark Kerzner, 2010-03-07, 14:34
Re: Hadoop as master's thesis - Hadoop - [mail # user]
...Tonci,  here are Enron email files used in the litigation that they had: http://edrm.net/resources/data-sets/enron-data-set-files  Here is much more stuff: http://infochimps.org/ ...
   Author: Mark Kerzner, 2010-03-01, 14:28
Re: HDFS behaving strangely - Hadoop - [mail # user]
...You may be facing the other well-known problem in Hadoop - don't use many small files:  http://www.cloudera.com/blog/2009/02/02/the-small-files-problem/  On Mon, Jan 25, 2010 at 7:...
   Author: Mark Kerzner, 2010-01-26, 03:16
Re: rmr: org.apache.hadoop.hdfs.server.namenode.SafeModeException: Cannot delete /op. Name node is in safe mode. - Hadoop - [mail # user]
...A few things may help     - delete individual files under /op    - open another terminal  I don't know why, but it helps, and then the error goes away  On Mon, ...
   Author: Mark Kerzner, 2010-01-19, 04:49
Re: which hadoop-ec2 is preferred ( cloudera/hadoop ? ) - Hadoop - [mail # user]
...My personal experience led me to prefer cloudera. Can't talk for every situation, but for me the hadoop distro had many bugs and was unreliable.  Mark  On Sun, Jan 17, 2010 at 10:5...
   Author: Mark Kerzner, 2010-01-18, 05:01