Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 81 to 90 from 172 (0.148s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: Cloudera EC2 scripts - Hadoop - [mail # user]
...That would be fine, but where is the link to get them  On Fri, May 28, 2010 at 12:10 AM, Andrew Nguyen  wrote:  ...
   Author: Mark Kerzner, 2010-05-28, 05:11
Passing binary files in maps - Hadoop - [mail # user]
...Hi,  I need to put a binary file in map and then emit that map. I do it by encoding it as a string using Base64 encoding, so that's fine, but I am dealing with pretty large files, and I...
   Author: Mark Kerzner, 2010-05-28, 04:33
[expand - 2 more] - Re: Import the results into SimpleDB - Hadoop - [mail # user]
...I create this text file in Hadoop. Only I want to make the db import a separate Hadoop job, run it in Amazon EMR, and make it fast by running sufficient number of nodes.  Mark  On ...
   Author: Mark Kerzner, 2010-05-12, 02:17
Re: Data-Intensive Text Processing with MapReduce - Hadoop - [mail # user]
...Dear Jimmy and Chris:  I am reading your book (thank you for providing the pre-release version) and I find it great in contents and in style. Thank you!  Sincerely, Mark  On S...
   Author: Mark Kerzner, 2010-05-09, 18:06
Accepting contributions for the "Hadooop in Practice" book - Hadoop - [mail # user]
...Hi, guys,  I am working on this book for Manning , and I need your solutions. If you had a specific problem that you solved with Hadoop, and you can share your solution, even in general...
   Author: Mark Kerzner, 2010-05-04, 23:06
[expand - 1 more] - Re: Hadoop Cookbook? - Hadoop - [mail # user]
...Thank you  On Tue, May 4, 2010 at 4:52 AM, Steve Loughran  wrote:  ...
   Author: Mark Kerzner, 2010-05-04, 13:16
leads? - Hadoop - [mail # user]
...Hi, guys,  without imposing, any leads for cloud-based projects will be appreciated, my resume here .  Thank you, Mark...
   Author: Mark Kerzner, 2010-04-27, 14:57
[expand - 1 more] - Re: DeDuplication Techniques - Hadoop - [mail # user]
...Joe,  your approach would work, whether you use files to keep old data, or a database. However, it feels like a mix of new and old technologies. It just does not feel right to open a fi...
   Author: Mark Kerzner, 2010-03-26, 00:24
Re: Parallelizing HTTP calls with Hadoop - Hadoop - [mail # user]
...Phil,  what you are describing is close to what Nutch is already doing. You can look at it - all this coding is non-trivial, and you can save yourself a lo t of work and debugging. &nbs...
   Author: Mark Kerzner, 2010-03-07, 14:34
[expand - 1 more] - Re: Hadoop as master's thesis - Hadoop - [mail # user]
...Tonci,  here are Enron email files used in the litigation that they had: http://edrm.net/resources/data-sets/enron-data-set-files  Here is much more stuff: http://infochimps.org/ &...
   Author: Mark Kerzner, 2010-03-01, 14:28
Sort:
project
Hadoop (171)
HBase (29)
MapReduce (13)
Hive (5)
Blur (2)
Phoenix (1)
Pig (1)
type
mail # user (171)
mail # general (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (2)
last 9 months (172)
author
Harsh J (561)
Steve Loughran (405)
Owen O'Malley (394)
Todd Lipcon (237)
Eli Collins (182)
Alejandro Abdelnur (179)
Arun C Murthy (166)
Allen Wittenauer (159)
Chris Nauroth (156)
Ted Yu (137)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Colin Patrick McCabe (99)
Doug Cutting (96)
Aaron Kimball (94)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Benoy Antony (72)
Konstantin Boudnik (72)
Runping Qi (72)
Karthik Kambatla (67)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB