r - RHadoop Stream Job Fail with Apache Oozie -
i'm looking pick community's brain leads in figuring out going on issue i'm having.
i'm writing mr job rhadoop (rmr2, v3.0.0) , things great -- io hdfs, mapping, reducing. no problems. life great.
i'm trying schedule job apache oozie, , running issues:
error in mr(map = map, reduce = reduce, combine = combine, vectorized.reduce, : hadoop streaming failed error code 1
i've read rmr2 debugging guide, nothing getting stderr because job fails before gets scheduled.
in head, points difference in environments. however, oozie running job same user i'm able run via cli, , of r environment variables (fetched sys.getenv()
) same, excepting there's additional class path stuff set oozie.
i can post more of os or hadoop versions , config details, sleuthing version-specific bugs seems bit of red herring runs fine @ command line.
anybody have thoughts might helpful next steps in hunting beast down?
update:
i overwrote system
function in base package log user, host name of node, , command being executed before internal call system. before system call executed, following in stderr: user@host.name /usr/bin/hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming-2.2.0.2.0.6.0-102.jar ...
when ran oozie, command printed in stderr fails exit status of 1. when run command on user@host.name
, runs successfully. exact same command same user on same node fails oozie, runs cli.
Comments
Post a Comment