r - RHadoop Stream Job Fail with Apache Oozie -


i'm looking pick community's brain leads in figuring out going on issue i'm having.

i'm writing mr job rhadoop (rmr2, v3.0.0) , things great -- io hdfs, mapping, reducing. no problems. life great.

i'm trying schedule job apache oozie, , running issues:

error in mr(map = map, reduce = reduce, combine = combine, vectorized.reduce, : hadoop streaming failed error code 1

i've read rmr2 debugging guide, nothing getting stderr because job fails before gets scheduled.

in head, points difference in environments. however, oozie running job same user i'm able run via cli, , of r environment variables (fetched sys.getenv()) same, excepting there's additional class path stuff set oozie.

i can post more of os or hadoop versions , config details, sleuthing version-specific bugs seems bit of red herring runs fine @ command line.

anybody have thoughts might helpful next steps in hunting beast down?

update:

i overwrote system function in base package log user, host name of node, , command being executed before internal call system. before system call executed, following in stderr: user@host.name /usr/bin/hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.2.0.2.0.6.0-102.jar ...

when ran oozie, command printed in stderr fails exit status of 1. when run command on user@host.name, runs successfully. exact same command same user on same node fails oozie, runs cli.


Comments

Popular posts from this blog

Android layout hidden on keyboard show -

google app engine - 403 Forbidden POST - Flask WTForms -

c - Why would PK11_GenerateRandom() return an error -8023? -