Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
MapReduce >> mail # user >> Accessing only particular folder using hadoop streaming


+
jamal sasha 2013-10-02, 19:58
Copy link to this message
-
Re: Accessing only particular folder using hadoop streaming
You need to use globs when passing your input path, like below perhaps:

data/shard*/d1*

On Thu, Oct 3, 2013 at 1:28 AM, jamal sasha <[EMAIL PROTECTED]> wrote:
> Hi,
>     I have data in this one folder like following:
>
> data-------shard1---d1_1
>             |          |_d2_1
>             Lshard2---d1_1
>             |          |_d2_2
>             Lshard3---d1_1
>             |          |_d2_3
>             Lshard4---d1_1
>                        |_d2_4
>
>
> Now, I want to search something in d1 (and excluding all the d2's) in it.
> So how do i do that in python?
> Thanks
>

--
Harsh J
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB