Skip to content

Onboarding resources for new team members in the 883 Group, covering data operations, customer engagement, Hadoop, Spark, Airflow, Kubernetes, and more. Get started on your journey to success with us!

Notifications You must be signed in to change notification settings

IlaiGigi/Onboarding-Newbies

 
 

Repository files navigation

883 Group Onboarding - Newbies 🍼

Welcome to the 883 Group Onboarding for Newbies! This repository is your gateway to a structured 10-chapter program, designed to seamlessly integrate you into our dynamic Data Ops team. The onboarding program is designed like "100 DAYS OF CODE", each chapter divided into days, and each day is dedicated to a specific topic or skill set that you'll be exploring. Each day will have a set of tasks, goals, and resources to guide you through the learning process. You'll also have the opportunity to engage in discussions, Q&A sessions, and hands-on exercises to reinforce your understanding and apply your knowledge in real-world scenarios.

PAY ATTENTION: ⚠️

  • Day is not a day, it's just a session.

Table of Contents

Chapter 00: Foundations of Data Operations and Customer Engagement in the 883 Group

  • Day 00: Welcome & Introduction - Understand the onboarding process and the 883 Group's vision and mission 🎯
  • Day 01: Introduction to Big Data - Core Concepts
  • Day 02: Customer Interaction and Insight into Data Operations & Data Lifecycle in the 883 Group :bow_and_arrow:

Chapter 01: Introduction to Hadoop Ecosystem

  • Day 03: Introduction to Data concept & Hadoop Ecosystem
  • Day 04: Hadoop Distributed File System (HDFS)
  • Day 05: MapReduce Programming Model, YARN & Oozie
  • Day 06: Introduction to Partitioning & Hive
  • Day 07: Introduction to Impala
  • Day 08: Introduction to HBase
  • [Final Exercise 01 - SKILA]: Introduction to Data Ops Team and Hadoop Ecosystem concepts Q&A ❓
  • Practical Exercise 01: Meet the Hadoop Ecosystem in Action :computer:
  • Showcase 01: Develop a high-level understanding of the Hadoop ecosystem and its role in big data processing. 🔎

Chapter 02: Basic Docker & Testing & CI-CD

Chapter 03: DevOps in Openshift - #INTERNAL

  • Day 12: Openshift Fundamentals for Data Ops using "zero to hero" Openshift labs
  • Day 13: Helm for Kubernetes Package Management using "Hero to Master" Openshift labs
  • Day 14: Openshift "Best Practises" Openshift labs

Chapter 04: Proficiency in Spark

chapter 05: Mastering Apache Airflow for Workflow Orchestration

  • Day 14: Introduction to Apache Airflow concepts, terminology and architecture
  • Day 15: exploring the Airflow web UI for workflow management

Chapter 06: Trino core concepts

  • Day 16: Introduction to Trino concepts, terminology and architecture

Chapter 07: Culture - Jira & Confluence - Agile and more

  • Day 17: Introduction to 883 Culture concepts, terminology and architecture.

Chapter 08: Real-world Project Simulation - #INTERNAL

  • BIG Final Exercise: Senior Data Infra Engineer Onboarding Final Exercise 🚀
  • Showcase: Prepare a presentation to the group to showcase the story, architecture, insights gained, and troubleshooting strategies of your final exercise solution in just two sentences.

Chapter 09: Documentation Knowledge Sharing and Open Source Contribution

  • Day 18: Create documentation for a sample project from the 883 Group and present it to the team.(ask your mentor for the link to the project)
  • Day 19: Open Source Contribution

Chapter 10: Onboarding Process Analysis & Improvement

  • Day 20: Review the onboarding process and suggest improvements.

Contribution 🙌

If you have any suggestions or ideas to enhance the onboarding for future members, feel free to contribute. Fork, create a feature branch, commit changes, and create a pull request.

About

Onboarding resources for new team members in the 883 Group, covering data operations, customer engagement, Hadoop, Spark, Airflow, Kubernetes, and more. Get started on your journey to success with us!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%