I took the Big Data University course on MapReduce and ran into the following issues in the lab exercise:
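1. When providing the input file location, I used /sampleData/XXX.dat instead of sampleData/XXX.dat. The leading slash makes the path absolute from the HDFS root, whereas the relative form is resolved against the user's HDFS home directory.
This resulted in: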
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: hdfs://yyy:8020/sampleData/XXX.dat
The command with the corrected relative paths:
hadoop jar test.jar com.pk.hadoop.MapReduce.Samp sampleData/XXX.dat sampleData/XXX.dat.out
The sampleData folder is located on the Hadoop file system (HDFS), not on the local file system.
test.jar is located on the local file system.
I am using MapReduce Model V1 for the programming.
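To illustrate the path issue, here is a minimal sketch in plain Java that mimics how a path argument resolves against an HDFS working directory. It assumes the working directory is the default home directory of the form /user/<username>; the user name "someuser" and the class name HdfsPathDemo are made up for the example, and java.net.URI is only standing in for Hadoop's own path handling.

```java
import java.net.URI;

// Illustration only: java.net.URI stands in for Hadoop's path resolution.
// "someuser" is a hypothetical user name, not taken from the lab setup.
class HdfsPathDemo {

    // Resolve a path argument against a working directory URI.
    // The working directory must end with "/" to be treated as a directory.
    static String resolve(String workingDir, String pathArg) {
        return URI.create(workingDir).resolve(pathArg).toString();
    }

    public static void main(String[] args) {
        String home = "hdfs://yyy:8020/user/someuser/";

        // Relative path: stays under the home directory.
        System.out.println(resolve(home, "sampleData/XXX.dat"));

        // Leading slash: resolved from the file system root instead.
        System.out.println(resolve(home, "/sampleData/XXX.dat"));
    }
}
```

The second call yields hdfs://yyy:8020/sampleData/XXX.dat, the same path that appears in the InvalidInputException above. Listing both locations with hadoop fs -ls sampleData and hadoop fs -ls /sampleData is a quick way to see where the data actually lives.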
2. Error: java.io.IOException: Type mismatch in key from map: expected org.apache.hadoop.io.Text, received org.apache.hadoop.io.LongWritable
Every time the code ran, it threw the above error.
The problem was that I forgot to set the mapper and reducer classes on the job. The job therefore fell back to its default (identity) mapper, which passes the input key through unchanged, and failed because the map output key was expected to be Text.
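For reference, a minimal sketch of a driver that sets the mapper and reducer explicitly. This is not the lab's actual code: the class names SampDriverSketch, SampMapper and SampReducer and the counting logic are placeholders I made up, and it needs the Hadoop libraries on the classpath to compile. The setMapperClass and setReducerClass calls are the ones that were missing in my case.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// Hypothetical driver; names and logic are placeholders, not the lab solution.
public class SampDriverSketch {

    // Input key is the byte offset (LongWritable); output key is Text.
    public static class SampMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);

        @Override
        protected void map(LongWritable offset, Text line, Context context)
                throws IOException, InterruptedException {
            context.write(line, ONE); // placeholder: emit each line with a count of 1
        }
    }

    public static class SampReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable v : values) {
                sum += v.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }

    public static void main(String[] args) throws Exception {
        // On older Hadoop releases, new Job(conf, "samp") plays the same role.
        Job job = Job.getInstance(new Configuration(), "samp");
        job.setJarByClass(SampDriverSketch.class);

        // Without these two lines the job uses the identity Mapper and Reducer,
        // so the LongWritable input key reaches the map output unchanged.
        job.setMapperClass(SampMapper.class);
        job.setReducerClass(SampReducer.class);

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

With the output key class set to Text but no mapper declared, the identity mapper emits the LongWritable offset as the key, which matches the "expected Text, received LongWritable" message.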