|
|
Lara Dong 2012-06-27, 01:18
As now we use the Sequence File to solve the small file problems, but the Sequence File has no index. What I mean is if I transfer my small file into a Sequence File, how do I know where is the position to get my file in the Sequence File? I know the Map File can avoid this problem, but I prefer to use Sequence File. Just now I use Lucene to build an index to record the position. Is there a better way?
+
Lara Dong 2012-06-27, 01:18
-
Re: Sequence File Index
Harsh J 2012-06-27, 06:18
MapFiles were designed for this. It does something similar to what you have done, keeps an external index of keys. Why do you not wish to use MapFiles though?
On Wed, Jun 27, 2012 at 6:48 AM, Lara Dong <[EMAIL PROTECTED]> wrote: > As now we use the Sequence File to solve the small file problems, but the > Sequence File has no index. What I mean is if I transfer my small file into > a Sequence File, how do I know where is the position to get my file in the > Sequence File? I know the Map File can avoid this problem, but I prefer to > use Sequence File. Just now I use Lucene to build an index to record the > position. Is there a better way? >
-- Harsh J
+
Harsh J 2012-06-27, 06:18
-
Re: Sequence File Index
Lara Dong 2012-06-27, 07:52
Yes, it is a solution. I just wonder if we use SequenceFile, how to retrieve the original small files? except traversaling the SequenceFile .
2012/6/27 Harsh J <[EMAIL PROTECTED]>
> MapFiles were designed for this. It does something similar to what you > have done, keeps an external index of keys. Why do you not wish to use > MapFiles though? > > On Wed, Jun 27, 2012 at 6:48 AM, Lara Dong <[EMAIL PROTECTED]> wrote: > > As now we use the Sequence File to solve the small file problems, but the > > Sequence File has no index. What I mean is if I transfer my small file > into > > a Sequence File, how do I know where is the position to get my file in > the > > Sequence File? I know the Map File can avoid this problem, but I prefer > to > > use Sequence File. Just now I use Lucene to build an index to record the > > position. Is there a better way? > > > > > > -- > Harsh J >
+
Lara Dong 2012-06-27, 07:52
|
|
All projects made searchable here are trademarks of the Apache Software Foundation.
Service operated by
Sematext