Mistakes in my first MapReduce program

I took the Big Data University course on MapReduce. These are the issues I ran into in the lab exercise.

The sampleData folder is located on the Hadoop file system (HDFS), not on the local one.

test.jar is located on the local file system.

I am using MapReduce Model V1 for the programming.

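1. When providing the input file location, I used /sampleData/XXX.dat instead of sampleData/XXX.dat. The leading slash makes the path absolute, so Hadoop looks for it at the root of HDFS. A relative path is resolved against the HDFS home directory (/user/<username> by default), which is where the sampleData folder is.

This resulted in: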
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: hdfs://yyy:8020/sampleData/XXX.dat

The command that worked, with relative paths for both input and output:

hadoop jar test.jar com.pk.hadoop.MapReduce.Samp sampleData/XXX.dat sampleData/XXX.dat.out
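To see why the leading slash matters, here is a small sketch of the resolution rule. It does not use the Hadoop API; it only models the rule with java.net.URI, which treats relative and absolute paths the same way. The user name pk and the helper qualify are made up for illustration, and /user/<username> is assumed to be the HDFS home directory (the default).

```java
import java.net.URI;

// Models how a path given on the command line is qualified against HDFS.
// This is plain java.net.URI resolution, not Hadoop code.
class HdfsPathSketch {

    // fsUri: file system address, e.g. hdfs://yyy:8020
    // user:  hypothetical HDFS user name
    // path:  the path as typed on the command line
    static String qualify(String fsUri, String user, String path) {
        // Assumed home directory; the trailing slash marks it as a directory.
        URI home = URI.create(fsUri + "/user/" + user + "/");
        // A relative path is appended to the home directory;
        // a path starting with "/" replaces it entirely.
        return home.resolve(path).toString();
    }

    public static void main(String[] args) {
        String fs = "hdfs://yyy:8020";
        // Relative: ends up under the home directory.
        System.out.println(qualify(fs, "pk", "sampleData/XXX.dat"));
        // Absolute: ends up at the HDFS root, as in the exception above.
        System.out.println(qualify(fs, "pk", "/sampleData/XXX.dat"));
    }
}
```

The second line printed matches the path in the InvalidInputException, which is how I could tell that Hadoop was looking at the root and not in my home directory.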

2. Error: java.io.IOException: Type mismatch in key from map: expected org.apache.hadoop.io.Text, received org.apache.hadoop.io.LongWritable

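Every time the code executed, it threw the above error.

The problem was that I forgot to declare the mapper and reducer classes for the job, so the job fell back to its defaults and failed. The default mapper is an identity mapper that passes its input straight through, and with the default text input format the input key is the byte offset of the line, a LongWritable. The job expected Text keys from the map, hence the type mismatch.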
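A minimal sketch of a driver with the two declarations that were missing. This is not the lab's actual code: the class names SampMapper and SampReducer, the comma-separated record layout, and the counting logic are assumptions made up for illustration. Only the setMapperClass and setReducerClass calls are the point. It needs the Hadoop client libraries on the classpath.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class Samp {

    // Hypothetical mapper: emits (first comma-separated field, 1) per line.
    // Input key is the line's byte offset (LongWritable), output key is Text.
    public static class SampMapper
            extends Mapper<LongWritable, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text outKey = new Text();

        @Override
        protected void map(LongWritable offset, Text line, Context context)
                throws IOException, InterruptedException {
            outKey.set(line.toString().split(",")[0]);
            context.write(outKey, ONE);
        }
    }

    // Hypothetical reducer: sums the counts for each key.
    public static class SampReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable v : values) {
                sum += v.get();
            }
            context.write(key, new IntWritable(sum));
        }
    }

    public static void main(String[] args) throws Exception {
        // On older Hadoop releases, new Job(conf, "samp") plays the same role.
        Job job = Job.getInstance(new Configuration(), "samp");
        job.setJarByClass(Samp.class);

        // The two lines I had forgotten. Without them the identity
        // mapper forwards the LongWritable offset as the map output key.
        job.setMapperClass(SampMapper.class);
        job.setReducerClass(SampReducer.class);

        // Output types; these must match what the mapper and reducer emit.
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

If the mapper's output types differ from the reducer's, setMapOutputKeyClass and setMapOutputValueClass have to be set as well; in this sketch they are the same, so the two output setters are enough.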
