Course Overview
Dataplex is an intelligent data fabric that enables organizations to centrally discover, manage, monitor, and govern their data across data lakes, data warehouses, and data marts. You can use Dataplex to build a data mesh architecture to decentralize data ownership among domain data owners.
In this course, you will learn how to discover, manage, monitor, and govern your data across data lakes, data warehouses, and data marts through guided lectures and independent exercises using sample data.
This course does not cover the interaction of Dataplex with Dataproc Metastore nor does it do a deep dive into BigLake concepts.
Moyens d'évaluation :
- Quiz pré-formation de vérification des connaissances (si applicable)
- Évaluations formatives pendant la formation, à travers les travaux pratiques réalisés sur les labs à l’issue de chaque module, QCM, mises en situation…
- Complétion par chaque participant d’un questionnaire et/ou questionnaire de positionnement en amont et à l’issue de la formation pour validation de l’acquisition des compétences
Prerequisites
Completion of the Modernizing Data Lakes and Data Warehouses with Google Cloud (MDLDW) and Building Batch Data Pipelines on Google Cloud (BBDP) courses in the "Data Engineer" learning path or equivalent experience using Google Cloud.
Course Objectives
- Identify the importance of a modern data platform
- Configure and set up Dataplex
- Secure data lakes, zones, and assets
- Implement tagging for resources and use tags to search for assets
- Process data using Dataplex tasks
- Design, execute and report on data quality processes
Moyens Pédagogiques :