Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 15 (0.26s).
Loading phrases to help you
refine your search...
PySpark and Cassandra 2.1 Examples - Spark - [mail # user]
...Hey all,Just thought I'd share this with the list in case any one else wouldbenefit.  Currently working on a proper integration of PySpark andDataStax's new Cassandra-Spark connector, b...
   Author: Mike Sukmanowsky, 2014-10-29, 16:02
Using the DataStax Cassandra Connector from PySpark - Spark - [mail # user]
...Hi there,I'm using Spark 1.1.0 and experimenting with trying to use the DataStaxCassandra Connector (https://github.com/datastax/spark-cassandra-connector)from within PySpark.As a baby step,...
   Author: Mike Sukmanowsky, 2014-10-21, 23:03
[PIG-4240] Python UDF output serialization is invalid - Pig - [issue]
...The serialize_output function handles str and unicode return types from UDFs inappropriately.Namely, a function could return a valid UTF-8 string, but the call to unicode will trigger a deco...
http://issues.apache.org/jira/browse/PIG-4240    Author: Mike Sukmanowsky, 2014-10-16, 23:06
[STORM-361] Add JSON-P support to Storm UI API - Storm - [issue]
...The recent API that is being released in Storm UI with 0.9.2 is great, but it'd be useful if the API supported an optional ?callback parameter that would provide a wrapped JSON-P response fo...
http://issues.apache.org/jira/browse/STORM-361    Author: Mike Sukmanowsky, 2014-08-25, 17:32
GROUP ALL Partitioning - Pig - [mail # user]
...Hi there,Just curious, can anyone provide a quick explanation or link to the sourcecode of how Pig partitions data on a GROUP alias ALL operation?  We'reseeing some odd behaviour, likel...
   Author: Mike Sukmanowsky, 2014-01-23, 19:38
[expand - 1 more] - Re: Log File Versioning and Pig - Pig - [mail # user]
...Thanks Pradeep - none of our logs currently use Proto Buf/Thrift/Avro and we were somewhat trying to stay away from these guys but they may be a good option.   On Thu, Dec 12, 2013 at 6...
   Author: Mike Sukmanowsky, 2013-12-13, 14:42
Bug in ILLUSTRATE operator - Pig - [mail # user]
...Was going to file in JIRA, but wanted to reach out here first to see if I'm just going crazy.  When using 0.11.2-SNAPSHOT I'm seeing errors only when using ILLUSTRATE (dump and describe...
   Author: Mike Sukmanowsky, 2013-08-29, 14:34
Distinct IDs from different time periods - Pig - [mail # user]
...Hi all,  Trying to produce some data using clickstream logs from Pig that does the following:     1. Pull data for the past 30 days (current period)    2. Classify G...
   Author: Mike Sukmanowsky, 2013-08-13, 20:32
Re: Welcome our newest committer Prashant Kommireddi - Pig - [mail # user]
...Congrats!   On Thu, May 2, 2013 at 3:56 PM, Julien Le Dem  wrote:      Mike Sukmanowsky  Product Lead, http://parse.ly 989 Avenue of the Americas, 3rd Floor New...
   Author: Mike Sukmanowsky, 2013-05-02, 22:41
Re: Pig write to single file - Pig - [mail # user]
...How many output files are you getting?  You can set SET DEFAULT_PARALLEL 1; so you don't have to specify parallelism on each reduce phase.  In general though, I wouldn't recommend ...
   Author: Mike Sukmanowsky, 2013-05-01, 17:17
Sort:
project
Pig (11)
Spark (2)
Hadoop (1)
Storm (1)
type
mail # user (13)
issue (2)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (1)
last 9 months (15)
author
Ted Yu (2020)
Harsh J (1318)
Jun Rao (1098)
Todd Lipcon (1014)
Andrew Purtell (1011)
Stack (1001)
GitHub Import (895)
Jonathan Ellis (862)
Josh Elser (861)
stack (828)
Jarek Jarcec Cecho (814)
Yusaku Sako (793)
Hitesh Shah (788)
Siddharth Seth (773)
Jean-Daniel Cryans (752)
Eric Newton (736)
Brock Noland (724)
Steve Loughran (717)
Jonathan Hsieh (702)
James Taylor (688)
Roman Shaposhnik (687)
Namit Jain (648)
Hyunsik Choi (646)
Owen O'Malley (618)
Bikas Saha (583)
Mike Sukmanowsky
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB