After successfully completing the Data Engineer Certificate, you will know the basics of Keboola Connection and be able to perform common tasks on the platform: downloading data from external sources, processing it in Keboola, and sending the results outside. You will become familiar with components (Keboola applications for data processing) and with advanced usage of transformations (which contain structured SQL statements), and you will learn to debug issues with your data pipelines in Keboola. You will also learn good practices for working with your data in Keboola safely and effectively.

Once you have completed the certificate, you will be ready to build an entire data flow in a project with little to no assistance.

The certificate requires completion of the following courses: Introduction, Best Practices, Debug Techniques, Common Components and Processors, and Business Data Model.

If you have already completed any of the assignments, you do not need to resubmit them to qualify for completion of this course. Likewise, if you have already gone through one of the courses outside of the certificate, just skip those lessons.

Course curriculum

  • 1

    Certificate Course Introduction

    • Certificate Course Introduction

  • 2

    Introduction

    • Architecture Overview

    • Extractors

    • Storage

    • Transformations and Workspaces

    • Old Transformations

    • Jobs

    • Writers

    • Orchestrations

    • Additional Transformations

    • Scaffolds

    • Support

    • Trash

    • Resources

  • 3

    Best Practices

    • Introduction and Architecture Explanation

    • Storage Part 1

    • Storage Part 2

    • Development Branches

    • Snowflake Transformation Tips

    • Note

    • Dependencies and Phases

    • Data Science Transformations

    • Variables

    • Shared Code

    • Workspaces

    • Input & Output Mapping Incremental

    • Orchestrations

    • Assignment Overview

    • Assignment

    • Resources

  • 4

    Debug Techniques

    • Agenda and Recap of Important Concepts

    • Common Error Root Causes

    • Storage and Job Event Details

    • Practical Example - Invalid SQL Queries

    • Practical Example - Python and Snowflake Output Errors

    • Practical Example - Missing Rows in Output

    • Note

    • Practical Example - Unable to Merge Primary Key

    • Practical Example - Merged Load Types

    • Practical Example - Invalid Cypher

    • Deleted Tables

    • Assignment

    • Resources

  • 5

    Common Components and Processors

    • Intro and Components Overview

    • Public vs Private Components

    • Component Developers

    • Component Configurations

    • Processors

    • Processor Example

    • S3 Processors Deep Dive

    • Common Components 1 - FTP, Email, HTTP

    • Common Components 2 - KBC, Geocode, Apify, Selenium

    • Common Components 3 - Generic Extractor

    • Common Components 4 - Text Analytics

    • Common Components 5 - Mailgun

    • Common Components 6 - Common Processors

    • Assignment Overview

    • Assignment

    • S3 Intro

    • How to Perform a Debug Job via API

    • Resources

  • 6

    Business Data Model

    • Methodology Overview

    • BDM Guide

    • BDM Exercise Intro

    • Live BDM Session Exercise

    • Assignment Overview

    • Assignment

    • Submission Requirements

  • 7

    Certificate Submission

    • Certificate Submission

  • 8

    Before you go ...

    • Certificate Feedback