Results 1 to 10 of 74 (0.178s).
MLLib: LinearRegressionWithSGD performance - Spark - [mail # user]
...Hi All, I have been using MLLib's linear regression and I have some question regarding the performance. We have a cluster of 10 nodes -- each node has 24 cores and 148GB memory. I am running ...
   Author: Sameer Tilak, 2014-11-21, 19:19
Group operator and variable schema (reformatted email) - Pig - [mail # user]
...Hi All, I have the following question: Snippet of my sample.txt. First column is id, however each row can have variable number of columns. id1 100 200 300 400 500 id2 10 20 30 id1 800 900 600 id3...
   Author: Sameer Tilak, 2014-11-13, 07:21
Group operator and variable shema - Pig - [mail # user]
...Hi All, I have the following question: Snippet of my sample.txt. First column is id, however each row can have variable number of columns. id1 100 200 300 400 500 id2 10 20 30 id1 800 900 600 id3...
   Author: Sameer Tilak, 2014-11-13, 07:18
RE: Model characterization - Spark - [mail # user]
...Excellent, many thanks. Really appreciate your help. Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone -------- Original message -------- From: Xiangrui Meng Date: 11/...
   Author: Sameer Tilak, 2014-11-04, 15:54
LinearRegression and model prediction threshold - Spark - [mail # user]
...Hi All, I am using LinearRegression and have a question about the details on model.predict method. Basically it is predicting variable y given an input vector x. However, can someone point me...
   Author: Sameer Tilak, 2014-10-31, 18:19
MLLib: libsvm - default value initialization - Spark - [mail # user]
...Hi All, I have my sparse data in libsvm format. val examples: RDD[LabeledPoint] = MLUtils.loadLibSVMFile(sc, "mllib/data/sample_libsvm_data.txt") I am running Linear regression. Let us say tha...
   Author: Sameer Tilak, 2014-10-30, 04:21
Further details about Apache Pig join - Pig - [mail # user]
...Hi, I have simplified the problem further and here are the details: in1.txt: null null <=6.9 null null <7.0 in2.txt: null null <=6.9 nu...
   Author: Sameer Tilak, 2014-10-23, 06:42
A question about Pig join - Pig - [mail # user]
...Hi All, I have the following problem: This is a statement in my Pig script. cust_joined = JOIN cust_filtered BY (LOW, HIGH, NORMAL), cust_conversion BY (Low, High, Normal); When Datatypes are c...
   Author: Sameer Tilak, 2014-10-23, 06:23
RE: MLLib libsvm format - Spark - [mail # user]
...Great, I will sort them. Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone -------- Original message -------- From: Xiangrui Meng Date: 10/21/2014 3:29 PM (GMT-08:00) ...
   Author: Sameer Tilak, 2014-10-21, 22:34
RE: MLLib Linear regression - Spark - [mail # user]
...Hi Xiangrui, Changing the default step size to 0.01 made a huge difference. The results make sense when I use A + B + C + D. MSE is ~0.07 and the outcome matches the domain knowledge. I was w...
   Author: Sameer Tilak, 2014-10-08, 14:22