The client is failing to ingest because the tablet servers are crashing.
However, the only log message seems to be the "Commits held" exception
repeated over and over again.
As for the .out/.err files, I can't seem to find them. Just the .log files.
We have an init.d script that starts accumulo and redirects the stdout/err
to /dev/null. If you think these might shed more light onto the problems, I
can redirect them properly and wait for the issues to arise again.
On Thu, Jan 2, 2014 at 1:27 PM, Eric Newton <[EMAIL PROTECTED]> wrote:
> Hadoop and accumulo version?
> Your client is failing to ingest because commits are being held for a long
> time? Or, are your tablet servers crashing?
> Are you seeing any warnings/errors in the tablet server log (and .out
> .err) files?
> On Thu, Jan 2, 2014 at 8:08 AM, Roshan Punnoose <[EMAIL PROTECTED]> wrote:
>> We have a consistent(10-20 per second), large document(~100MB), ingest
>> that has been running for a few weeks. However, every few days the tablet
>> servers seem to be falling over with this error:
>> ERROR: Internal error processing closeUpdate
>> org.apache.accumulo.server.tabletserver.HoldTimeoutException: Commits are
>> It may be the large documents ingesting are causing too many splits to
>> occur and freezing the ingest? Maybe increasing the split threshold for the
>> tablet would fix it. Any ideas?