The Data Scientist Certificate is for those users that are looking to implement Keboola into their Data Science practice/workflow.  

This certificate will be good for Data Scientists in order to learn the basics of Keboola Connection, how to prepare your data and/or create your feature store, and go on to also learn how to develop Data Science models and work with MLFlow to deploy your models. 

Existing advanced knowledge and experience preparing data, and data science is useful for this course.  

The certificate requires completion of courses: Introduction, Best Practices, Common Components and Processors, and Data Science.

If you have already completed any of the assignments, you will not be required to resubmit in order to qualify for completion of this course. Either if you went through a course outside of the certificate just skip those lessons.

Course curriculum

  • 1

    Certificate Course Introduction

    • Certificate Course Introduction

  • 2

    Introduction

    • Introduction and Architecture Overview

    • Data Sources

    • Quiz 1

    • Storage

    • Transformations and Workspaces

    • Jobs

    • Quiz 2

    • Data Destinations

    • Flows

    • Quiz 3

    • Data Flow Templates

    • Keboola Support

    • Trash

    • Free Tier vs. Enterprise Plan

    • dbt Transformations

    • Quiz 4

    • Course Feedback

    • Resources

  • 3

    Best Practices

    • Introduction and Architecture Explanation

    • Storage Introduction

    • Quiz 1

    • Storage Jobs

    • Snapshots

    • Aliases

    • Trash

    • Quiz 2

    • Data Types

    • Table Operations

    • Quiz 3

    • File Storage

    • Input and Output Mapping

    • Variables

    • Shared Code

    • SQL Tips

    • Quiz 4

    • Workspaces

    • Flows

    • Quiz 5

    • Development Branches

    • Quiz 6

    • Keboola Dev Tools

    • Assignment

    • Assignment Overview

    • Course Feedback

    • Resources

  • 4

    Common Components and Processors

    • Intro and Components Overview

    • Intro - Public vs Private Components

    • Intro - Component Developers

    • Intro - Component Configurations

    • Processors

    • Processors - Processor Example

    • Processors - S3 Processors Deep Dive

    • Common Components - FTP, Email, HTTP

    • Common Components - KBC, Geocoder, Apify, Selenium

    • Common Components - Generic Extractor

    • Common Components - Text Analytics

    • Common Components - Mailgun

    • Common Components - Common Processors

    • Assignment - Assignment Overview

    • Assignment

    • Assignment - S3 Intro

    • Assignment - How to Perform a Debug Job via API

    • Resources

    • Course Feedback

  • 5

    Data Science

    • Introduction - Shared Project

    • Introduction

    • Workspaces

    • Workspaces Demo - Introduction

    • Workspaces Demo - Creating a Workspace

    • Workspaces Demo - Loading Data and Connecting to a Workspace

    • Workspaces Demo - Additional Features

    • Experiments and Development - Workflow

    • Workspaces Demo - JupyterLab Tour

    • Experiments and Development - MLFlow

    • MLFlow - Running Experiments

    • MLFlow - Register a Model

    • MLFlow - Deploy and Use a Model

    • Assignment - Assignment Overview

    • Assignment

    • Presentation

    • Course Feedback

  • 6

    Certificate Submission

    • Certificate Submission

  • 7

    Before you go ...

    • Certificate Feedback