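I took the Big Data University course on MapReduce and ran into the following issues in the lab exercise.

Setup: the sampleData folder is located on the Hadoop file system (HDFS), not on the local file system. test.jar is located on the local file system. The programming uses MapReduce Model V1.

1. Wrong input path

When providing the input file location, I used /sampleData/XXX.dat instead of sampleData/XXX.dat. This resulted in:

    Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: hdfs://yyy:8020/sampleData/XXX.dat

The leading slash makes the path absolute from the HDFS root. Without it, the path is resolved relative to my HDFS home directory, which is presumably where sampleData lives. The command that worked:

    hadoop jar test.jar com.pk.hadoop.MapReduce.Samp sampleData/XXX.dat sampleData/XXX.dat.out

2. Type mismatch in key from map

    Error: java.io.IOException: Type mismatch in key from map: expected org.apache.hadoop.io.Text, received org.apache.hadoop.io.LongWritable

Every time the code executed, it threw the above error. The problem was that I forgot to declare the mapper and reducer classes for the job, so the job fell back to its defaults and failed. The default mapper is an identity mapper: it passes the input key (the LongWritable byte offset of each line) straight through, while the job expected Text keys from the map phase.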
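The difference between the two input paths in issue 1 can be sketched with plain java.net.URI, without touching Hadoop at all. This is only an approximation of how HDFS qualifies a path argument. It assumes the home directory is /user/<username>, and the user name "someuser" and the helper qualify are made up for illustration.

```java
import java.net.URI;

public class HdfsPathDemo {

    // Hypothetical helper: qualifies a path argument against an assumed
    // HDFS home directory of the form <nameNode>/user/<user>/.
    static String qualify(String nameNode, String user, String path) {
        URI home = URI.create(nameNode + "/user/" + user + "/");
        // An absolute path replaces the whole path of the base URI;
        // a relative path is appended to the home directory.
        return home.resolve(path).toString();
    }

    public static void main(String[] args) {
        String nameNode = "hdfs://yyy:8020";
        // Absolute: looked up at the HDFS root, as in the exception message.
        System.out.println(qualify(nameNode, "someuser", "/sampleData/XXX.dat"));
        // Relative: looked up under the (assumed) home directory.
        System.out.println(qualify(nameNode, "someuser", "sampleData/XXX.dat"));
    }
}
```

The first line printed matches the path in the InvalidInputException. The second points under the home directory, which would explain why the relative form found the file.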
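For issue 2, a minimal driver sketch that declares the mapper and reducer explicitly might look like the following. It is not the lab's actual code. The class names SampDriver, SampMapper and SampReducer and the line-counting logic are placeholders, and it assumes the org.apache.hadoop.mapreduce API with the Hadoop client libraries on the classpath (Job.getInstance is available in Hadoop 2 and later).

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class SampDriver {

    // Placeholder mapper: emits (line, 1) so the map output key is Text.
    public static class SampMapper
            extends Mapper<LongWritable, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);

        @Override
        protected void map(LongWritable offset, Text line, Context context)
                throws IOException, InterruptedException {
            context.write(line, ONE);
        }
    }

    // Placeholder reducer: sums the counts for each distinct line.
    public static class SampReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable value : values) {
                sum += value.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "samp");
        job.setJarByClass(SampDriver.class);

        // The two declarations that were missing. Without them the job
        // uses the identity mapper and reducer, and LongWritable keys
        // reach a job that expects Text.
        job.setMapperClass(SampMapper.class);
        job.setReducerClass(SampReducer.class);

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

If the mapper's output types differ from the final output types, job.setMapOutputKeyClass and job.setMapOutputValueClass would also need to be set. Here both phases emit Text and IntWritable, so the two setOutput calls are enough.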