Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> Moving files from JBoss server to HDFS


Copy link to this message
-
Re: Moving files from JBoss server to HDFS
Hi  financeturd <[EMAIL PROTECTED]>,
    My Point of view  second step like bellow is the good approach

{Separate server} <-- {JBoss server}
and then
{Separate server} --> HDFS

thanks
  samir

On Sat, May 12, 2012 at 6:00 AM, financeturd financeturd <
[EMAIL PROTECTED]> wrote:

> Hello,
>
> We have a large number of
> custom-generated files (not just web logs) that we need to move from our
> JBoss servers to HDFS.  Our first implementation ran a cron job every 5
> minutes to move our files from the "output" directory to HDFS.
>
> Is this recommended?  We are being told by our IT team that our JBoss
> servers should not have access to HDFS for security reasons.  The files
> must be "sucked" to HDFS by other servers that do not accept traffic
> from the outside.  In essence, they are asking for a layer of
> indirection.  Instead of:
> {JBoss server} --> {HDFS}
> it's being requested that it look like:
> {Separate server} <-- {JBoss server}
> and then
> {Separate server} --> HDFS
>
>
> While I understand in principle what is being said, the security of having
> processes on JBoss servers writing files to HDFS doesn't seem any worse
> than having Tomcat servers access a central database, which they do.
>
> Can anyone comment on what a recommended approach would be?  Should our
> JBoss servers push their data to HDFS or should the data be pulled by
> another server and then placed into HDFS?
>
> Thank you!
> FT
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB