Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, cassandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
Results 1 to 10 of 15 (0.261s).
Re: filtering out non English tweets using TwitterUtils - Spark - [mail # user]
...Fwiw if you do decide to handle language detection on your machine this library works great on tweets https://github.com/carrotsearch/langid-java On Tue, Nov 11, 2014, 7:52 PM Tobias Pfeiffer ...
   Author: Ryan Compton, 2014-11-12, 05:53
Re: Json Parsing in Apache Pig - Pig - [mail # user]
...I've found Twitter's elephantbird library very useful here (https://github.com/kevinweil/elephant-bird) a = LOAD 'file3.json' USING com.twitter.elephantbird.pig.load.JsonLoader('-nestedLoad') W...
   Author: Ryan Compton, 2014-07-26, 00:08
Interconnect benchmarking - Spark - [mail # user]
...We are going to upgrade our cluster from 1g to 10g ethernet. I'd like to run some benchmarks before and after the upgrade. Can anyone suggest a few typical Spark workloads that are network-bou...
   Author: Ryan Compton, 2014-06-28, 00:08
Re: best practice: write and debug Spark application in scala-ide and maven - Spark - [mail # user]
...Sounds like there's two questions here: First, from the command line, if you "mvn package" and then run the code with "java -cp target/*jar-with-dependencies.jar com.ibm.App" do you still get th...
   Author: Ryan Compton, 2014-06-07, 19:17
Re: Java IO Stream Corrupted - Invalid Type AC? - Spark - [mail # user]
...Just ran into this today myself. I'm on branch-1.0 using a CDH3 cluster (no modifications to Spark or its dependencies). The error appeared trying to run GraphX's .connectedComponents() on a ~...
   Author: Ryan Compton, 2014-06-06, 21:21
[SPARK-1952] slf4j version conflicts with pig - Spark - [issue]
...Upgrading from Spark-0.9.1 to Spark-1.0.0 causes all Pig scripts to fail when they "register" a jar containing Spark. The error appears to be related to org.slf4j.spi.LocationAwareLogger.log...
http://issues.apache.org/jira/browse/SPARK-1952    Author: Ryan Compton, 2014-06-03, 00:35
Re: Spark 1.0: slf4j version conflicts with pig - Spark - [mail # user]
...posted a JIRA https://issues.apache.org/jira/browse/SPARK-1952 On Wed, May 28, 2014 at 1:14 PM, Ryan Compton wrote: ...
   Author: Ryan Compton, 2014-05-28, 22:19
Re: ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2998: Unhandled internal error. org.slf4j.spi.LocationAwareLogger.log - Pig - [mail # user]
...Update: Turns out I'm getting it on 11.1 as well. Must be a problem with something in my jar. On Tue, May 27, 2014 at 1:26 PM, Ryan Compton wrote: ...
   Author: Ryan Compton, 2014-05-27, 20:35
Re: Need example of python code with dependency files - Pig - [mail # user]
...Thanks, this worked for some test programs, but then I ran into some other problems when I had to import things (for some reason pig was looking for python2.4 stuff even though I explicitly ...
   Author: Ryan Compton, 2013-11-09, 20:48
Re: Reading json data - Pig - [mail # user]
...It sounds like you have two problems: parsing json and joining the datasets. For reading jsons you can use: http://stackoverflow.com/questions/11035105/processing-json-through-pig-scrip...
   Author: Ryan Compton, 2013-10-23, 02:01