issues on python packages on hadoop distributed system nodes -
i utilize python hadoop streaming. utilize aws hadoop streaming distributed systems has master node, , 4 slave nodes. if need install bundle on python, need install bundle on every node of scheme create work. in cases, after install python bundle on every node of system, python hadoop streaming job still can not work. wondering why. give thanks you!
more details:
i utilize python hadoop streaming mapreduce work. , hadoop scheme utilize aws hadoop distributed scheme has 1 master node, , 4 slave nodes. sometimes, need install packages on python. distributed system, theoretically, if need install bundle on python, need install bundle on python on every node of distributed scheme create streaming job work. thing after install needed python packages on every node of hadoop distributed system, hadoop streaming job still failed. after delete "import some-package" line in hadoop streaming python script, , alter code accordingly, hadoop streaming python job work. problem on newly added python packages. confused, why can not create python streaming job work though install python packages on python of every node of hadoop distributed scheme .
python hadoop streaming packages
No comments:
Post a Comment