-
RandomAccessFile with HDFS
Stas Oskin 2009-05-24, 12:33
Hi.
Any idea if RandomAccessFile is going to be supported in HDFS?
Regards.
-
Re: RandomAccessFile with HDFS
Tom White 2009-05-25, 10:32
RandomAccessFile isn't supported directly, but you can seek when reading from files in HDFS (see FSDataInputStream's seek() method). Writing at an arbitrary offset in an HDFS file is not supported however.
Cheers, Tom
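For illustration, the access pattern described above (seek when reading, but no writes at an arbitrary offset) can be sketched with plain java.io on a local file. This is only an analogy under stated assumptions: the class and method names (SeekableReadSketch, readAt, appendBytes) are hypothetical, and RandomAccessFile and FileOutputStream stand in for the HDFS client. On HDFS the read side would go through FSDataInputStream's seek() method, and the write side would depend on a working append.

```java
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

// Local-file analogy only: java.io stands in for the HDFS client API.
// All names in this class are hypothetical and chosen for illustration.
class SeekableReadSketch {

    // Positioned read: jump to `offset`, then read up to `length` bytes.
    // The HDFS counterpart is FSDataInputStream.seek(offset) followed by a read.
    static byte[] readAt(Path file, long offset, int length) throws IOException {
        try (RandomAccessFile in = new RandomAccessFile(file.toFile(), "r")) {
            in.seek(offset);
            byte[] buf = new byte[length];
            int n = in.read(buf, 0, length); // may return fewer bytes, or -1 at end of file
            return Arrays.copyOf(buf, Math.max(n, 0));
        }
    }

    // Append-only write: bytes always land at the current end of the file.
    // There is deliberately no "write at offset" method, mirroring the HDFS restriction.
    static void appendBytes(Path file, byte[] data) throws IOException {
        try (FileOutputStream out = new FileOutputStream(file.toFile(), true)) {
            out.write(data);
        }
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("seek-sketch", ".txt");
        try {
            Files.write(tmp, "0123456789".getBytes(StandardCharsets.US_ASCII));
            appendBytes(tmp, "ABC".getBytes(StandardCharsets.US_ASCII));
            // Read four bytes starting at offset 8 of "0123456789ABC".
            String s = new String(readAt(tmp, 8, 4), StandardCharsets.US_ASCII);
            System.out.println(s);
        } finally {
            Files.deleteIfExists(tmp);
        }
    }
}
```

The sketch exposes only two operations, a positioned read and an append, because that is the shape of access discussed in this thread. Anything that needs to overwrite bytes in the middle of a file would have to rewrite the file instead.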
-
Re: RandomAccessFile with HDFS
Stas Oskin 2009-05-25, 10:40
Hi.
Do I understand correctly that the "append files" functionality is going to provide random-access support?
Any idea which version this is expected in?
Regards.
-
Re: RandomAccessFile with HDFS
Stas Oskin 2009-09-13, 10:08
Hi.
Any idea when the "append" functionality is expected?
Regards.
-
Re: RandomAccessFile with HDFS
Owen O'Malley 2009-09-20, 19:46
On Sep 13, 2009, at 3:08 AM, Stas Oskin wrote:
> Hi.
>
> Any idea when the "append" functionality is expected?
A working append is a blocker on HDFS 0.21.0.
The code for append is expected to be complete in a few weeks. Meanwhile, the rest of Common, HDFS, and MapReduce have feature-frozen and need to be stabilized and all of the critical bugs fixed. I'd expect the first releases of 0.21.0 in early November.
-- Owen
-
Re: RandomAccessFile with HDFS
Stas Oskin 2009-09-20, 22:58
Hi.
Just to understand the roadmap: will 0.21 be the first stable "append" implementation?
Regards.
-
Re: RandomAccessFile with HDFS
Aaron Kimball 2009-09-22, 22:50
Or maybe more pessimistically, the second "stable" append implementation.
It's not like HADOOP-1700 wasn't intended to work. It was just found not to after the fact. Hopefully this reimplementation will succeed. If you're running a cluster that contains mission-critical data that cannot tolerate corruption or loss, you shouldn't jump on the new-feature bandwagon until it's had time to prove itself in the wild.
But yes, we hope that appends will really-truly work in 0.21. Experimental/R&D projects should be able to plan on having a working append function in 0.21.
- Aaron
-
Re: RandomAccessFile with HDFS
Stas Oskin 2009-09-23, 07:21
Thanks, this is exactly what I wanted to know for our R&D roadmap.
-
Re: RandomAccessFile with HDFS
Stas Oskin 2009-09-23, 07:28
By the way, something that I forgot to ask before:
Will the append implementation be as fast as the write/copy functions?
Regards.
-
Re: RandomAccessFile with HDFS
Edward Capriolo 2009-09-23, 14:37
Also, at "Hadoop World NYC" (http://cloudera.com/hadoop-world-nyc) there is going to be a presentation:
Low Latency, Random Reads from HDFS, Jay Booth, Elastic Platforms
That might be of interest to you.
-
Re: RandomAccessFile with HDFS
Stas Oskin 2009-09-23, 14:52
Hi.
Hopefully they will put the presentation online.
Regards.
-
Re: RandomAccessFile with HDFS
Konstantin Shvachko 2009-09-23, 17:29
Stas, you can track the status of append here:
http://issues.apache.org/jira/browse/HDFS-265
The latest status is:
http://issues.apache.org/jira/browse/HDFS-265?focusedCommentId=12757764&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12757764
Thanks for your interest,
--Konstantin