Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 11 (0.076s).
Loading phrases to help you
refine your search...
Introducing Parquet: efficient columnar storage for Hadoop. - Hadoop - [mail # general]
...Fellow Hadoopers,  We'd like to introduce a joint project between Twitter and Cloudera engineers -- a new columnar storage format for Hadoop called Parquet ( http://parquet.github.com)....
   Author: Dmitriy Ryaboy, 2013-03-12, 15:45
Re: Small question - Hadoop - [mail # user]
...That's a fuzzy match (join two tables not on equality, but on one table's column value matching a dynamically generated regex based on another column). I don't know of efficient ways of doin...
   Author: Dmitriy Ryaboy, 2012-10-04, 06:25
Re: Hadoop 0.22/HBase 0.92 package repos are now available - Hadoop - [mail # general]
...9.1 just went up for vote, and holding up a release to add 0.22 compatibility seems ill-advised. I would be willing to help get 9.2 out in short order to provide compatibility with 22, and I...
   Author: Dmitriy Ryaboy, 2011-09-30, 18:26
Re: Split control in Lzo index - Hadoop - [mail # user]
...Shi, bzip compresses much better than lzo. It is also significantly more expensive (we are talking orders of magnitude) than LZO, both on compression and decompression.  As for your que...
   Author: Dmitriy Ryaboy, 2011-06-23, 21:35
Re: [VOTE] Powered by Logo - Hadoop - [mail # general]
...2 6 5 1 4 3  On Wed, Jun 15, 2011 at 9:47 AM, Anupam Seth  wrote:  them all oting, ing is on't...
   Author: Dmitriy Ryaboy, 2011-06-15, 16:53
Re: HDFS and distcp issue?? - Hadoop - [mail # user]
...Do you have the failing task's log?  -Dmitriy  On Sat, Dec 4, 2010 at 12:47 PM, hadoopman  wrote:     Dmitriy V Ryaboy Twitter Analytics http://twitter.com/squarecog...
   Author: Dmitriy Ryaboy, 2010-12-07, 02:48
Re: Errors reading lzo-compressed files from Hadoop - Hadoop - [mail # user]
...Both Kevin's and Todd's branches now pass my tests. Thanks again Todd.  -D  On Thu, Apr 8, 2010 at 10:46 AM, Todd Lipcon  wrote:  - bca2053f731cdd58 d ys be s a  see...
   Author: Dmitriy Ryaboy, 2010-04-08, 18:20
Elephant Bird released - Hadoop - [mail # user]
...Hi folks, We (but mostly Kevin Weil) just open-sourced some of the code we use at Twitter to make working with Hadoop and Pig easier. Most of what is currently included in "Elephant Bird" de...
   Author: Dmitriy Ryaboy, 2010-04-02, 21:19
Re: PIG bin/labeling relation - Hadoop - [mail # user]
...Unless you actually need the ordinal numbers, you can do it all in one step: B = ORDER A by x PARALLEL 100; Store B into ......  This will create 100 ordered part files, with the first ...
   Author: Dmitriy Ryaboy, 2009-11-21, 20:04
Re: Hadoop dfs can't allocate memory with enough hard disk space when data gets huge - Hadoop - [mail # user]
...For searching (grepping) mailing list archives, I like MarkMail: http://hadoop.markmail.org/ (try searching for "small files").  For concatenating files -- cat works, if you don't care ...
   Author: Dmitriy Ryaboy, 2009-10-20, 02:01
Sort:
project
Pig (875)
Hadoop (9)
Drill (5)
MapReduce (3)
Bigtop (1)
HBase (1)
type
mail # user (8)
mail # general (3)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (11)
author
Harsh J (554)
Owen O'Malley (396)
Steve Loughran (379)
Todd Lipcon (237)
Eli Collins (182)
Alejandro Abdelnur (162)
Arun C Murthy (162)
Chris Nauroth (141)
Allen Wittenauer (126)
Tom White (118)
Ted Yu (116)
Nigel Daley (115)
Daryn Sharp (110)
Konstantin Shvachko (107)
Doug Cutting (94)
Aaron Kimball (93)
Edward Capriolo (87)
Colin Patrick McCabe (86)
Mark Kerzner (86)
jason hadoop (82)
Hairong Kuang (74)
Runping Qi (72)
Konstantin Boudnik (70)
Benoy Antony (69)
Suresh Srinivas (63)
Dmitriy Ryaboy