You might also want to look at Gobblin, which uses Helix in a very similar way and is actually used to read data from HDFS, apply transformations, and load the results into a remote store.

Shirshanka