Accumulo, mail # user - Mahout


Re: Mahout
Jason Trost 2012-03-15, 11:57
I was going to wait to announce this until I had more time to optimize
and clean it up, but I created an AccumuloDataModel for Mahout
(specifically for recommendations).  To be honest, it has not been
thoroughly tested, and recommendations using it are pretty slow.  I
have some ideas for speeding it up, but haven't had time to implement them.

https://github.com/jt6211/mahout/tree/accumulo

These should be the basic steps to get this working:

git clone [EMAIL PROTECTED]:jt6211/mahout.git
cd mahout
# check out my branch with the AccumuloDataModel
git checkout -b accumulo origin/accumulo
# tests seem to take forever; drop -DskipTests if you want to run them
mvn package -DskipTests

Once that is done, you will want to add
integration/target/mahout-integration-0.7-SNAPSHOT.jar to your classpath.
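
As a rough sketch of that last step, assuming a POSIX shell and that you
are still in the root of the mahout checkout (the MAHOUT_JAR variable name
is just illustrative), something like this should work:

```shell
# Assumption: run from the root of the mahout checkout after the build.
MAHOUT_JAR="$PWD/integration/target/mahout-integration-0.7-SNAPSHOT.jar"

# Append the jar to CLASSPATH; the ${VAR:+...} form avoids a leading
# colon when CLASSPATH is empty or unset.
export CLASSPATH="${CLASSPATH:+$CLASSPATH:}$MAHOUT_JAR"

# Show the result so you can confirm the jar was added.
echo "$CLASSPATH"
```

You could instead pass the jar with -cp when launching your application;
the environment variable is only one way to do it.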

Feedback and pull requests are welcome.

--Jason

On Tue, Mar 13, 2012 at 12:04 PM, Cardon, Tejay E
<[EMAIL PROTECTED]> wrote:

> All,
>
> I’m looking to use Accumulo as a data source for Mahout.  It doesn’t
> appear to be built in, nor does Accumulo appear to include the code, but
> I’m hoping someone can point me at a blog post or something else that could
> help.  I appreciate whatever help I can get.
>
> Thanks,
> Tejay