This repo contains the steps and files I used to build two EMR clusters with Spark (used for just EMR) and Hive (used for EMR Serverless) apps. See step 9 for issues I encountered to look out for if ...
hadoop jar target/scala-2.11/emr-scalding-tutorial-assembly-0.1.jar com.softwaremill.AgeCounterJob — local — input “data/hello.txt” — output data-output.txt ...