Home | About | Sematext search-lucene.com search-hadoop.com search-devops.com metrics + logs = try SPM and Logsene for free
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 12 (0.133s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Required vs. optional methods for KeyValueStore - Samza - [mail # dev]
...Hi all,I'm looking at using embedded Solr as the KeyValueStore, as that lets me extract ranked results from the state to publish as part of the task's operation.Some of the methods defined b...
   Author: Ken Krugler, 2015-07-29, 00:30
RE: Thoughts and obesrvations on Samza - Samza - [mail # dev]
...Hi Martin,As a lurker here, this has been a very interesting thread.I would suggest talking to one of the Solr committers about their experience in merging with Lucene, as that's got many si...
   Author: Ken Krugler, 2015-07-06, 22:08
Best approach to detecting local mode in Hadoop 2.x - Hadoop - [mail # user]
...Hi all,For some of the helicopter stunts we do while instrumenting Hadoop jobs, we need to know whether we're running in local vs. distributed mode.This is done on the client side, when sett...
   Author: Ken Krugler, 2015-06-27, 16:08
[expand - 1 more] - Building skip table for Avro data - Avro - [mail # user]
...Hi all,I'm looking for suggestions on how to optimize a number of Hadoop jobs (written using Cascading) that only need a fraction of the records store in Avro files.Essentially I have a smal...
   Author: Ken Krugler, 2014-12-04, 02:08
MiniMRClientCluster and JAVA_HOME - Hadoop - [mail # user]
...Hi all,I'm having fun trying to update some mini-cluster test code to Hadoop 2.2.0One thing I ran into is that it seems as though MiniMRClientCluster depends on there being a /bin/java comma...
   Author: Ken Krugler, 2014-08-20, 21:55
[expand - 1 more] - Re: Apache Kafka in AWS - Kafka - [mail # user]
...Hi Jason,Thanks for the notes.I'm curious whether you went with using local drives (ephemeral storage) or EBS, and if with EBS then what IOPS.Thanks,On May 22, 2013, at 1:42pm, Jason Weiss w...
   Author: Ken Krugler, 2013-05-22, 21:24
Re: Relationship between Zookeeper and Kafka - Kafka - [mail # user]
...Hi Jason,On May 20, 2013, at 10:01am, Jason Weiss wrote:In my experience directly hitting an ephemeral drive on m1.large is faster than using EBS.I've seen some articles where RAIDing multip...
   Author: Ken Krugler, 2013-05-20, 17:44
[expand - 2 more] - Re: ETL with Kafka - Kafka - [mail # user]
...Hi Guy,On Jan 6, 2013, at 11:11pm, Guy Doulberg wrote:Interesting - we build ETLs on top of Hadoop using Cascading (open source workflow API), which has a lot of what it calls "Taps" for con...
   Author: Ken Krugler, 2013-01-07, 17:57
[SQOOP-350] Add support for requiring that a connector be used, otherwise the job should fail - Sqoop - [issue]
...There are situations where it is critical that a specific connector be used during a Sqoop. For example, if you have a table that doesn't have a suitable column for partitioning, and thus yo...
http://issues.apache.org/jira/browse/SQOOP-350    Author: Ken Krugler, 2012-10-12, 21:50
[AVRO-838] Support reading of files created with Avro 1.5 that use invalid characters in field and record names - Avro - [issue]
...Avro 1.4 had a bug that let users create schemas with invalid characters in field and record names.For example, the '-' character used to be allowed in field and record names, but Avro 1.5 w...
http://issues.apache.org/jira/browse/AVRO-838    Author: Ken Krugler, 2011-12-13, 19:06
Avro (3)
Kafka (3)
Hadoop (2)
Samza (2)
Sqoop (2)
Ted Yu (1081)
GitHub Import (895)
Jonathan Ellis (891)
Siddharth Seth (879)
Josh Elser (852)
stack (833)
Hitesh Shah (826)
Andrew Purtell (800)
Reynold Xin (764)
Hyunsik Choi (760)
Todd Lipcon (746)
Yusaku Sako (730)
James Taylor (711)
Andrew Onischuk (665)
Jonathan Hsieh (644)
Jarek Jarcec Cecho (637)
Eric Newton (633)
Brock Noland (626)
Namit Jain (613)
Edward J. Yoon (611)
Jaimin D Jetly (608)
Roman Shaposhnik (603)
Antonenko Alexander (598)
Bikas Saha (592)
Sergey Shelukhin (572)
Xiangrui Meng (554)
Andrii Tkach (542)
Andrii Babiichuk (537)
Srimanth Gunturi (531)
Oleg Nechiporenko (513)
Sean Busbey (512)
Alejandro Abdelnur (491)
Siddharth Wagle (489)
Vinod Kumar Vavilapalli (477)
Steve Loughran (475)
Amareshwari Sriramadasu (470)
Jun Rao (456)
Vinod Kone (455)
Eli Collins (442)
Owen O'Malley (442)
Keith Turner (437)
Aleksandr Kovalenko (428)
Chris Nauroth (428)
Robert Kanter (427)
Colin Patrick McCabe (415)
Patrick Wendell (408)
Sandy Ryza (406)
Hadoop QA (404)
John Vines (400)
Mahadev konar (399)
Ken Krugler
mail # user (6)
issue (4)
mail # dev (2)
last 7 days (0)
last 30 days (0)
last 90 days (3)
last 6 months (3)
last 9 months (12)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB