Matteo Perego
成为会员时间:2023
黄金联赛
28380 积分
成为会员时间:2023
In the last installment of the Dataflow course series, we will introduce the components of the Dataflow operational model. We will examine tools and techniques for troubleshooting and optimizing pipeline performance. We will then review testing, deployment, and reliability best practices for Dataflow pipelines. We will conclude with a review of Templates, which makes it easy to scale Dataflow pipelines to organizations with hundreds of users. These lessons will help ensure that your data platform is stable and resilient to unanticipated circumstances.
While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.
This course is part 1 of a 3-course series on Serverless Data Processing with Dataflow. In this first course, we start with a refresher of what Apache Beam is and its relationship with Dataflow. Next, we talk about the Apache Beam vision and the benefits of the Beam Portability framework. The Beam Portability framework achieves the vision that a developer can use their favorite programming language with their preferred execution backend. We then show you how Dataflow allows you to separate compute and storage while saving money, and how identity, access, and management tools interact with your Dataflow pipelines. Lastly, we look at how to implement the right security model for your use case on Dataflow.
This course helps learners create a study plan for the PDE (Professional Data Engineer) certification exam. Learners explore the breadth and scope of the domains covered in the exam. Learners assess their exam readiness and create their individual study plan.
Complete the intermediate Manage Data Models in Looker skill badge course to demonstrate skills in the following: maintaining LookML project health; utilizing SQL runner for data validation; employing LookML best practices; optimizing queries and reports for performance; and implementing persistent derived tables and caching policies.
Complete the introductory Build LookML Objects in Looker skill badge course to demonstrate skills in the following: building new dimensions and measures, views, and derived tables; setting measure filters and types based on requirements; updating dimensions and measures; building and refining Explores; joining views to existing Explores; and deciding which LookML objects to create based on business requirements.
Data Catalog is deprecated and will be discontinued on January 30, 2026. You can still complete this course if you want to. For steps to transition your Data Catalog users, workloads, and content to Dataplex Catalog, see Transition from Data Catalog to Dataplex Catalog (https://cloud.google.com/dataplex/docs/transition-to-dataplex-catalog). Data Catalog is a fully managed and scalable metadata management service that empowers organizations to quickly discover, understand, and manage all of their data. In this quest you will start small by learning how to search and tag data assets and metadata with Data Catalog. After learning how to build your own tag templates that map to BigQuery table data, you will learn how to build MySQL, PostgreSQL, and SQLServer to Data Catalog Connectors.
In this course, you will get hands-on experience applying advanced LookML concepts in Looker. You will learn how to use Liquid to customize and create dynamic dimensions and measures, create dynamic SQL derived tables and customized native derived tables, and use extends to modularize your LookML code.
完成「運用 BigQuery ML 建立機器學習模型」技能徽章中階課程,即可證明您具備下列技能: 可使用 BigQuery ML 建立及評估機器學習模型,並根據資料進行預測。 「技能徽章」是 Google Cloud 核發的獨家數位徽章, 用於肯定您在 Google Cloud 產品和服務方面的精熟技能, 代表您已通過測驗,能在互動式實作環境中應用相關知識。完成 本課程及結業評量挑戰實驗室,即可取得技能徽章 並與他人分享。
完成 從 BigQuery 資料取得深入分析結果 技能徽章入門課程,即可證明您具備下列技能: 撰寫 SQL 查詢、查詢公開資料表、將樣本資料載入 BigQuery、使用 BigQuery 的查詢驗證工具 排解常見語法錯誤,以及在 Looker Studio 中 透過連結 BigQuery 資料建立報表。
完成「為 Looker 資訊主頁和報表準備資料」技能徽章入門課程, 即可證明您具備下列技能:可篩選、排序和 pivot 資料、合併不同的 Looker 探索結果, 還能使用函式和運算子建構 Looker 資訊主頁和報表,取得資料分析結果和圖表。
完成 在 Google Cloud 為機器學習 API 準備資料 技能徽章入門課程,即可證明您具備下列技能: 使用 Dataprep by Trifacta 清理資料、在 Dataflow 執行資料管道、在 Dataproc 建立叢集和執行 Apache Spark 工作,以及呼叫機器學習 API,包含 Cloud Natural Language API、Google Cloud Speech-to-Text API 和 Video Intelligence API。
This course empowers you to develop scalable, performant LookML (Looker Modeling Language) models that provide your business users with the standardized, ready-to-use data that they need to answer their questions. Upon completing this course, you will be able to start building and maintaining LookML models to curate and manage data in your organization’s Looker instance.
In this course, you learn how to do the kind of data exploration and analysis in Looker that would formerly be done primarily by SQL developers or analysts. Upon completion of this course, you will be able to leverage Looker's modern analytics platform to find and explore relevant content in your organization’s Looker instance, ask questions of your data, create new metrics as needed, and build and share visualizations and dashboards to facilitate data-driven decision making.
這堂初級課程將介紹 Google Cloud 的資料分析工作流程,以及用於探索、分析資料並以圖表呈現的工具。您也能學會如何與相關人員分享自己的發現結果。本課程包含個案研究、實作實驗室、講座、測驗和示範,實際展示如何將原始資料集轉化為清晰的資料,進而呈現出能發揮成效的圖表和資訊主頁。無論您是資料領域從業人員、想瞭解如何透過 Google Cloud 取得成功,或有意在職涯中更上一層樓,本課程都能協助您踏出第一步。絕大多數在工作上執行或運用資料分析的學員,都能從本課程受益。