Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 21 (0.288s).
Loading phrases to help you
refine your search...
[HIVE-1898] The ESCAPED BY clause does not seem to pick up newlines in colums and the line terminator cannot be changed - Hive - [issue]
...If I want to preserve data in columns which contains a newline (webcrawling for instance) I cannot set the ESCAPED BY clause to escape these out (other characters such as commas escape fine,...
http://issues.apache.org/jira/browse/HIVE-1898    Author: Josh Patterson, 2012-09-06, 20:39
Re: hbase data - HBase - [mail # user]
...unless you need low latency access to all of this time series, it might be a more cost efficient path to store large archives of the data in plain HDFS.  The scanning can be done more e...
   Author: Josh Patterson, 2012-05-29, 17:13
[expand - 1 more] - Re: [jira] [Created] (MAPREDUCE-2911) Hamster: Hadoop And Mpi on the same cluSTER - MapReduce - [mail # dev]
...I've also heard that Matei Z is working on moving Spark to MRv2, but I havent confirmed that yet.  JP  On Thu, Sep 1, 2011 at 12:22 AM, Arun C Murthy  wrote: wse/MAPREDUCE-291...
   Author: Josh Patterson, 2011-09-02, 15:19
Re: Block size in HDFS - HDFS - [mail # user]
...It will only take up ~1KB of local datanode disk space (+ metadata space such as the CRC32 of every 512 bytes, along with replication @ 1KB per replicated block, in this case 2KB) but the re...
   Author: Josh Patterson, 2011-06-10, 19:34
[expand - 1 more] - Re: Reg HDFS checksum - Hadoop - [mail # user]
...If you take a look at:  https://github.com/jpatanooga/IvoryMonkey/blob/master/src/tv/floe/IvoryMonk ey/hadoop/fs/ExternalHDFSChecksumGenerator.java  you'll see a single process ver...
   Author: Josh Patterson, 2011-04-12, 14:06
Re: how to sort the output by value in reduce instead of by key? - Hadoop - [mail # user]
...Leibnitz, I think you are looking for "secondary sort" in this case where the data arrives in some sort of order at the reducer as opposed to "in a group by key". Is that the case?  For...
   Author: Josh Patterson, 2011-04-11, 14:09
Re: hbase and hdfs - HDFS - [mail # user]
...Rita, Specifically what type of data are we talking about, and what type of queries are you looking to do? Effectively, what do you need to learn from the data?  Thanks,  Josh &nbs...
   Author: Josh Patterson, 2011-03-08, 14:51
Re: md5sum of files on HDFS ? - HDFS - [mail # user]
...I actually had to pull this code out for a project about two years ago (we had to 2 hop the files due to some security issues, and the sender wanted to know if the file got to hdfs "intact")...
   Author: Josh Patterson, 2011-03-08, 14:36
Re: Digital Signal Processing Library + Hadoop - Hadoop - [mail # general]
...Roger, A basic time series construct is the "sliding" window in conjunction with sorted time/value data; A sample implementation is at my github:  https://github.com/jpatanooga/Caduceus...
   Author: Josh Patterson, 2011-03-08, 14:24
Re: how to get lzo library library loaded?(error :Caused by: java.lang.ClassNotFoundException: com.hadoop.compression.lzo.LzoCodec) - HDFS - [mail # user]
...Alex, LZO can be a pain, we've all seen it; I have a few tips I've compiled that might help you (I've posted these before):  http://mail-archives.apache.org/mod_mbox/hadoop-common-user/...
   Author: Josh Patterson, 2010-08-15, 22:50
Sort:
project
Hadoop (11)
HDFS (5)
MapReduce (3)
HBase (1)
Hive (1)
type
mail # user (16)
mail # general (3)
issue (1)
mail # dev (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (21)
author
Ted Yu (1844)
Harsh J (1304)
Jun Rao (1016)
Todd Lipcon (998)
Stack (986)
Andrew Purtell (882)
Jonathan Ellis (854)
stack (765)
Jean-Daniel Cryans (751)
Jarek Jarcec Cecho (747)
Yusaku Sako (745)
Eric Newton (707)
Hitesh Shah (684)
Jonathan Hsieh (684)
Roman Shaposhnik (679)
Josh Elser (678)
Steve Loughran (651)
Namit Jain (648)
Siddharth Seth (644)
Brock Noland (637)
Owen O'Malley (623)
Hyunsik Choi (584)
Neha Narkhede (569)
Arun C Murthy (548)
Eli Collins (545)
Josh Patterson
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB