Files
school-of-sre/courses/level101/big_data/tasks.md
Jana R 4239ecf473 docs (level 101): fix typos, punctuation, formatting (#160)
* docs: formatted for readability

* docs: rephrased and added punctuation

* docs: fix typos, punctuation, formatting

* docs: fix typo and format

* docs: fix caps and formatting

* docs: fix punctuation and formatting

* docs: capitalized SQL commands, fixed puntuation, formatting

* docs: fix punctuation

* docs: fix punctuation and formatting

* docs: fix caps,punctuation and formatting

* docs: fix links, punctuation, formatting

* docs: fix code block formatting

* docs: fix punctuation, indentation and formatting
2024-07-28 17:38:19 +05:30

844 B

Tasks and conclusion

Post-training tasks:

  1. Try setting up your own three-node Hadoop cluster.
    1. A VM-based solution can be found here
  2. Write a simple Spark/MR job of your choice and understand how to generate analytics from data.
    1. Sample dataset can be found here

References:

  1. Hadoop documentation
  2. HDFS Architecture
  3. YARN Architecture
  4. Google GFS paper