I want to read the ORC file in mapreduce on Python. I try to run it:
hadoop jar /usr /lib/hadoop/lib/hadoop-streaming-2.6.0.2.2.6.0-2800.jar
-file /hdfs/price/mymapper.py
-mapper’/usr/local
I want to read the ORC file in mapreduce on Python. I try to run it:
hadoop jar /usr /lib/hadoop/lib/hadoop-streaming-2.6.0.2.2.6.0-2800.jar
-file /hdfs/price/mymapper.py
-mapper’/usr/local