Results 1 to 10 of 24 (0.17s).
Amazon SNS and Kafka comparison - Kafka - [mail # user]
...Hi, I have a system that needs to process tens of thousands of user events per second. I've looked at both Kafka and Amazon SNS. Using SNS would mean I can avoid the operational overhead of main...
   Author: James Newhaven, 2013-06-14, 14:45
Events stored in Apache Web Logs - how best to get these into Kafka? - Kafka - [mail # user]
...I have a lot of user event data being sent to an Apache web server and written to web logs. Unfortunately I don't control this flow, but I do have access to the server the logs are being writt...
   Author: James Newhaven, 2013-06-11, 21:58
Debugging UDFs - Pig - [mail # user]
...I have defined a Pig UDF and want to track problems using warnings like this: warn("My warning", PigWarning.UDF_WARNING_1); I'm testing this in local mode first, but I never see this...
   Author: James Newhaven, 2013-04-13, 16:24
Subtracting contents of two bags - Pig - [mail # user]
...Hi,  I have two relations - A and B.  Both just contain user ids.  I want to get a list of users who are in A but not in B.  I am running Pig 0.9.1 and think this might b...
   Author: James Newhaven, 2013-01-22, 12:46
Re: UDF Performance Problem - Pig - [mail # user]
...Thanks Dmitriy, all sorted now.  James  On Mon, Sep 3, 2012 at 6:21 PM, Dmitriy Ryaboy  wrote:  ...
   Author: James Newhaven, 2012-09-03, 20:31
Nested binary conditionals - Pig - [mail # user]
...Has anyone had success nesting conditional statements?  I am trying to implement a standard if, else if, else if statement without success.  e.g.  A = FOREACH B GENERATE (cond...
   Author: James Newhaven, 2012-08-14, 16:02
Using average function is really slow - Pig - [mail # user]
...Hi,  I am using the built-in org.apache.pig.builtin.AVG function. I have a set of 100,000 items that I want to average.  The relevant pig latin is below:   L = FOREACH K GENER...
   Author: James Newhaven, 2012-07-04, 17:37
Pig script is failing during reduce phase - Pig - [mail # user]
...Hi,  I am executing a Pig script on Elastic MapReduce. It runs fine over one day's worth of data, but when I increase my dataset size to 30 days, the reducers have started failing with ...
   Author: James Newhaven, 2012-06-18, 15:30
Copying files to Amazon S3 using Pig is slow - Pig - [mail # user]
...I want to copy 26,000 HDFS files generated by a pig script to Amazon S3.  I am using the copyToLocal command, but I noticed the copy throughput is only one file per second - so it is go...
   Author: James Newhaven, 2012-06-08, 11:40
Saving files to HDFS - Pig - [mail # user]
...I have a pig script that creates tuples containing JSON.  I want to save the content of each tuple to a separate file e.g. file1.json to HDFS.  So, my question is, is it possible t...
   Author: James Newhaven, 2012-06-07, 10:09