Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 316 (0.175s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: Better file sink - Flume - [mail # user]
...Those would be ignored. The sink uses the FileSystem interface andwhen you use file:// URL it will use the LocalFileSystemimplementation so HDFS-specific configuration will just get safelyig...
   Author: Joey Echeverria, 2014-09-26, 18:41
[expand - 2 more] - Re: Flume log4j-appender - Flume - [mail # user]
...Why do you need to remove the timestamp from the file names?As far as creating the files on an hourly basis, it sounds like whatyou want is a dataset partitioned by hour. This is very easy t...
   Author: Joey Echeverria, 2014-09-24, 22:12
[AVRO-1588] ReflectData.AllowNull incorrectly handles primitive types. - Avro - [issue]
...When doing the following:private static class PoJo {  private long id;  private String name;}ReflectData.AllowNull.get().getSchema(PoJo.class);I'd expect a schema like th...
http://issues.apache.org/jira/browse/AVRO-1588    Author: Joey Echeverria, 2014-09-22, 23:30
Re: Why Avro file format is larger than CSV? - Avro - [mail # user]
...What is the schema for the data?If every field is a string, then you could end up in this situation. Your best bet is to use compression for the Avro data. If you have a lot of CSV files tha...
   Author: Joey Echeverria, 2014-09-19, 14:21
Re: Flume, what happen if the batchsize is not enough - Flume - [mail # user]
...There isn't an explicit timeout on the HBaseSink, though there is onthe AsyncHBaseSink. For the HBaseSink, it will grab events out of thechannel until it reaches the batch size or the channe...
   Author: Joey Echeverria, 2014-09-10, 12:49
Re: Avro source and sink - Flume - [mail # user]
...Hi Ed!This is definitely doable. What you want is an intercepter on source A thatwill do the conversion from log lines to Avro. The easiest way to do thiswould probably be to use Morphlines[...
   Author: Joey Echeverria, 2014-09-08, 20:28
Re: Using Flume to process data - Flume - [mail # user]
...You should be able to accomplish this with the Morplhinesintercepter[1]. It will let you build a configuration file thatconverts from JSON to CSV. There's a similar example, though thetarget...
   Author: Joey Echeverria, 2014-09-03, 21:22
[SQOOP-1483] Support passing Kite partition config when importing into parquet - Sqoop - [issue]
...When importing to Parquet, Sqoop uses Kite to create the dataset. If the dataset doesn't already exist, it'd be useful to provide a Kite partition configuration so that the resulting dataset...
http://issues.apache.org/jira/browse/SQOOP-1483    Author: Joey Echeverria, 2014-08-28, 18:56
Re: Kite Dataset sink - repo uri - Flume - [mail # user]
...Hi Roshan!Sorry for the confusion! The current release of Flume uses thedeprecated method of specifying a repository URI and a dataset namewhile the CLI documentation covers the use of the n...
   Author: Joey Echeverria, 2014-08-28, 02:30
[HIVE-7633] Warehouse#getTablePath() doesn't handle external tables - Hive - [issue]
...Warehouse#getTablePath() takes a DB and a table name. This means it will generate the wrong path for external tables. This can cause a problem if you have an external table on the local file...
http://issues.apache.org/jira/browse/HIVE-7633    Author: Joey Echeverria, 2014-08-06, 19:03
Hadoop (93)
HBase (60)
MapReduce (56)
Accumulo (36)
HDFS (32)
Flume (12)
Sqoop (12)
Avro (6)
Hive (6)
Zookeeper (2)
Bigtop (1)
mail # user (238)
mail # dev (52)
issue (26)
last 7 days (0)
last 30 days (3)
last 90 days (12)
last 6 months (26)
last 9 months (316)
Ted Yu (1694)
Harsh J (1295)
Jun Rao (1059)
Todd Lipcon (1000)
Stack (978)
Jonathan Ellis (844)
Andrew Purtell (825)
Jean-Daniel Cryans (754)
jacques@... (738)
Yusaku Sako (731)
stack (717)
Jarek Jarcec Cecho (703)
Eric Newton (698)
Jonathan Hsieh (675)
Brock Noland (666)
Roman Shaposhnik (665)
Neha Narkhede (662)
Namit Jain (649)
Hitesh Shah (625)
Owen O'Malley (625)
Steve Loughran (622)
Siddharth Seth (614)
Josh Elser (593)
Eli Collins (545)
Arun C Murthy (543)
Joey Echeverria