Tom Seddon 2013-08-25, 20:25
A Hadoop cluster would be a good start. We're in the process right now of
putting together distributable files which will help get you to up to speed
quickly. Contribution isn't just code, there are many types and I'm sure
you can help in any number of ways. Just documenting your early
experiences and advice would be a great way to start helping out.
On Sun, Aug 25, 2013 at 1:25 PM, Tom Seddon <[EMAIL PROTECTED]> wrote:
> I'm looking to do a dissertation on Drill, as part of masters degree in
> Data Science. I'm hoping to set up a cluster to run it and then analyse
> its efficiency with different datasets, as well as make recommendations for
> its usage. I know Drill is in a fairly early stage of development but I
> have around 18 months until the project is due, so I'm hoping the timing
> will work as Drill is developed further.
> I'd be grateful for any advice on how I could get started on this. Would a
> Hadoop cluster be a good back-end to base my project on or would something
> more suited to nested data like MongoDB be more appropriate? Also, I
> haven't found much documentation on configuring Drill in a distributed
> environment, so any help on this would be appreciated.
> I'd also be willing to contribute but not sure if I have enough Java
> experience. My background is mainly in BI and database technologies.
Tom Seddon 2013-08-29, 09:25
Jacques Nadeau 2013-09-02, 00:17
Ted Dunning 2013-08-26, 06:18