Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 45 (0.899s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: HDFS sink: "clever" routing - Flume - [mail # user]
...Human error is most common reason in my experience. Whether it is a configuration error or fault in app development, I was just relaying a method to make your flume infrastructure more resil...
   Author: Paul Chavez, 2014-10-16, 15:54
RE: flume on windows ? - Flume - [mail # user]
...We’ve been running the 1.4 release of flume on windows for over a year. We had to do a custom build at first before it was initially released to pick up a SpoolDir source issue.I use winsw (...
   Author: Paul Chavez, 2014-09-12, 16:32
Re: performances tuning... - Flume - [mail # user]
...Start adding additional HDFS sinks attached to the same channel. You can also tune batch sizes when writing to HDFS to increase per sink performance.On Sep 2, 2014, at 11:54 PM, "Sebastiano ...
   Author: Paul Chavez, 2014-09-03, 07:12
RE: question using multiplexing and the the same serializer for multiple sinks to multiple hbase tables - Flume - [mail # user]
...There is a configuration error in your multiplexing channel selector section. You are referencing ‘server-agent.sources.avor-Src.’ and it should be ‘server-agent.sources.mySrc.’. Otherwise, ...
   Author: Paul Chavez, 2014-08-09, 01:41
RE: Import files from a directory on remote machine - Flume - [mail # user]
...I would recommend using a scheduled script to create diff files off the log files. I have one that runs against large logs files that roll over on UTC day. It runs once a minute, checkpoints...
   Author: Paul Chavez, 2014-04-23, 16:52
[expand - 1 more] - Re: File formats supported by flume - Flume - [mail # user]
...Excel and word, no, unless you wrote an application to process the data in those files and send events to Flume. CSV can be processed line by line using spoolDir source or the built in Avro ...
   Author: Paul Chavez, 2014-02-14, 18:03
RE: distributed weblogs ingestion on HDFS via flume - Flume - [mail # user]
...Hi Asim,I have a similar use case that has been in production for about a year. We have 6 web servers sending about 15GB a day of web server logs to an 11 node Hadoop cluster. Additionally t...
   Author: Paul Chavez, 2014-02-06, 03:09
RE: File Channel Contents - Flume - [mail # user]
...The 'ChannelSize' metric will give you the number of events currently in the channel. It's one of the main metrics we alert on (well ChannelFillPercentage, actually).  Snippet from one ...
   Author: Paul Chavez, 2014-01-02, 22:31
RE: File Channel Best Practice - Flume - [mail # user]
...We co-locate our flume agents on our data nodes in order to have access to many 'spindles' for the file channels. We have a small cluster (10 nodes) so these are also our task tracker nodes ...
   Author: Paul Chavez, 2013-12-17, 17:43
RE: Recording Windows System Events - Flume - [mail # user]
...No, you would need to have some kind of script or application run to read the events and send them to flume. A script that is scheduled to run every 5 minutes and save the events since the l...
   Author: Paul Chavez, 2013-11-19, 22:04
Flume (45)
mail # user (44)
mail # dev (1)
last 7 days (1)
last 30 days (1)
last 90 days (4)
last 6 months (5)
last 9 months (45)
Hari Shreedharan (418)
Jonathan Hsieh (365)
Mike Percy (278)
Brock Noland (229)
Disabled imported user (213)
E. Sammer (112)
Roshan Naik (106)
Will McQueen (91)
Alexander Alten-Lorenz (90)
Arvind Prabhakar (83)
Juhani Connolly (54)
Bruce Mitchener (48)
Denny Ye (44)
Jeff Lord (39)
Israel Ekpo (36)
Ashish (33)
Mubarak Seyed (32)
Ashish Paliwal (30)
Jarek Jarcec Cecho (30)
Prasad Mujumdar (25)
Nicholas Verbeck (20)
Edward Sargisson (19)
Otis Gospodnetic (19)
Patrick Hunt (19)
Patrick Wendell (19)
Paul Chavez