Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 86 (0.24s).
Loading phrases to help you
refine your search...
[expand - 4 more] - Re: Processing audio/video/images - Spark - [mail # user]
...So..Here is my experimental code to get a feel of itdef read_file(filename):   with open(filename) as f:        lines = [ line for line in f]   &...
   Author: jamal sasha, 2014-06-19, 21:14
Computing cosine similiarity using pyspark - Spark - [mail # user]
...Hi,  I have bunch of vectors like[0.1234,-0.231,0.23131].... and so on.and  I want to compute cosine similarity and pearson correlation usingpyspark..How do I do this?Any idea...
   Author: jamal sasha, 2014-05-22, 14:49
Frequency count in pig - Pig - [mail # user]
...Hi,   My data is in format:   user_id,movie_id,timestamp    123, abc,unix_timestamp    123, def, ...    123, abc, ... &n...
   Author: jamal sasha, 2014-05-16, 13:59
MultipleInputs.addInputPath - MapReduce - [mail # user]
...Hi,    So, I have two different directories.. which i want to process differently... For which I have to mappers for the job..  Data1 Data2  and in my driver.. I add the ...
   Author: jamal sasha, 2013-11-21, 18:10
Simple word count in pig.. - Pig - [mail # user]
...Hi,  I have data already processed in following form:   ( id ,{ bag of words}) So for example:  (foobar, {(foo), (foo),(foobar),(bar)}) (foo,{(bar),(bar)})  and so on.. d...
   Author: jamal sasha, 2013-11-19, 23:45
Dealing with stragglers in hadoop - HDFS - [mail # user]
...Hi,   I have a very simple use case... Basically I have an edge list and I am trying to convert it into adjacency list.. Basically  src target a     b a    c b ...
   Author: jamal sasha, 2013-11-15, 08:44
simple pig logic - Pig - [mail # user]
...Hi,  I have two datasets.. main_data.txt {"id":"foo", "some_field:12354, "score":0} {"id":"foobar", "some_field:12354, "score":0}   score_data.txt {"id":"foo", "score":1} {"id":"fo...
   Author: jamal sasha, 2013-10-31, 16:41
MRUNIT basic question - Hadoop - [mail # user]
...Hi,   I have been searching in mrunit documentation but hasnt been able to find it so far.. How do i pass configuration parameters in my mrunit.  So for example, if i take the word...
   Author: jamal sasha, 2013-10-26, 22:55
[expand - 1 more] - Re: Unable to use third party jar - MapReduce - [mail # user]
...OOps..forgot the code: http://pastebin.com/7XnyVnkv   On Thu, Oct 24, 2013 at 10:54 AM, jamal sasha  wrote:  ...
   Author: jamal sasha, 2013-10-24, 17:54
Reading json data - Pig - [mail # user]
...Hi,   I have three data types...  1) Base data 2) data_dict_1 3) data_dict_2  Base data is very well formatted json.. For example: {"id1":"foo", "id2":"bar" ,type:"type1"} {"i...
   Author: jamal sasha, 2013-10-22, 20:31
Sort:
project
Pig (29)
MapReduce (23)
Hadoop (15)
HDFS (14)
HBase (3)
Spark (2)
type
mail # user (86)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (3)
last 9 months (86)
author
Ted Yu (1693)
Harsh J (1294)
Jun Rao (1059)
Todd Lipcon (1001)
Stack (977)
Jonathan Ellis (844)
Andrew Purtell (824)
Jean-Daniel Cryans (754)
jacques@... (738)
Yusaku Sako (732)
stack (717)
Jarek Jarcec Cecho (702)
Eric Newton (698)
Jonathan Hsieh (674)
Brock Noland (666)
Roman Shaposhnik (665)
Neha Narkhede (662)
Namit Jain (649)
Hitesh Shah (625)
Owen O'Malley (625)
Steve Loughran (623)
Siddharth Seth (614)
Josh Elser (589)
Eli Collins (545)
Arun C Murthy (543)
jamal sasha