Implement some MapReduce algorithms, including Pair, Stripe, and Hybrid for Word Co-Occurence and Relative Frequency problems.
Run MapReduce jobs using Spark.
Reference:
Implement some MapReduce algorithms, including Pair, Stripe, and Hybrid for Word Co-Occurence and Relative Frequency problems.
- In Mapper WordCount
- Average
- In Mapper Average
- Pair Approach
- Stripe Appoach
- Hybrid Approach
- Run in eclipse or,
- Run the bash script file
Using Spark, compute mean and standard deviation of the amount of gas consumption in UK
- Run .scala file or,
- Databricks