Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 5 from 5 (0.2s).
Loading phrases to help you
refine your search...
Re: Python vs Scala performance - Spark - [mail # user]
...Interesting thread Marius,Btw, I'm curious about your cluster size.How small it is in terms of ram and cores.Arian2014-10-22 13:17 GMT+01:00 Nicholas Chammas : ...
   Author: Arian Pasquali, 2014-10-22, 14:19
[expand - 2 more] - Re: java.lang.OutOfMemoryError: Requested array size exceeds VM limit - Spark - [mail # user]
...That's true Guillaume.I'm currently aggregating documents considering a week as time range.I will have to make it daily and aggregate the results later.thanks for your hints anywayArian Pasq...
   Author: Arian Pasquali, 2014-10-21, 13:48
[expand - 1 more] - Re: Counting elements for each group - Pig - [mail # user]
...Thanks Gianmarco!!here the final version is like thisby_clusters = GROUP sample_data by (cluster_id, terms);by_clusters_terms_count = FOREACH by_clusters GENERATE FLATTEN(group)as (cluster_i...
   Author: Arian Pasquali, 2014-07-29, 23:44
[expand - 2 more] - Re: How do I load JSON in Pig? - Pig - [mail # user]
...I dont think you really need to build it. you can find it at any maven repository.  Arian Rodrigo Pasquali FEUP, SAPO Labs http://www.arianpasquali.com twitter @arianpasquali   &nb...
   Author: Arian Pasquali, 2012-11-19, 00:31
[expand - 4 more] - Re: problem filtering null values with pig - Pig - [mail # user]
...just for the record I m posting here the solution for my problem.  Thank you for your help.  In the end the problem seams to be with the JsonLoader I was using. I don't know why ex...
   Author: Arian Pasquali, 2012-11-17, 05:01
Sort:
project
Pig (3)
Spark (2)
type
mail # user (5)
date
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (3)
last 9 months (5)
author
Ted Yu (1835)
Harsh J (1303)
Jun Rao (1017)
Todd Lipcon (994)
Stack (989)
Andrew Purtell (876)
Jonathan Ellis (854)
stack (760)
Jean-Daniel Cryans (751)
Jarek Jarcec Cecho (747)
Yusaku Sako (744)
Eric Newton (707)
Hitesh Shah (683)
Jonathan Hsieh (682)
Roman Shaposhnik (677)
Josh Elser (676)
Steve Loughran (651)
Namit Jain (648)
Siddharth Seth (644)
Brock Noland (634)
Owen O'Malley (623)
Hyunsik Choi (582)
Neha Narkhede (568)
Arun C Murthy (548)
Eli Collins (545)
Arian Pasquali
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB