Duration

1-day

Target audience

Data analysts, Analytics Engineer, Data Engineer

Technology

data warehouse, data analytics, dbt, ETL, ELT, data transformation, sql

Modern Data Pipelines with DBT

In this one day workshop, you will learn how to create modern data transformation pipelines managed by DBT. Discover how you can improve your pipelines’ quality and workflow of your data team by introducing a tool aimed to standardize the way you incorporate good practices within the data team.

Requirements

SQL fluency - ability to write data transforming queries.
Basic understanding of ETL processes.
Basic experience with command-line
Laptop with stable internet connection (participants will connect to Jupyter Notebooks pre-created on Google Cloud Platform)

Training outcome

During the workshops participants will follow a shared step-by-step guideline with an overview from the perspective of augmenting the Data Team workflow with the dbt tool. We will work through typical data transformation problems you can encounter on a journey to deliver fresh & reliable data and how DBT can help to solve them. Jupyter Notebook environments will be provided for each participant.

Course agenda^*

Part 1

Introduction to DBT

Framework overview
Typical use cases
Impact on data transformation development

Part 2

Core concepts of DBT

Data models
Seeds, sources
Tests
Documentation, maintenance and data lineage
Hands-on exercises

Part 3

Advanced DBT features

Macros & hooks
Snapshots
Extensions
Other tools to integrate with (overview only)
Hands-on exercises

Part 4

Scheduling, deployment, workflow

GID DataOps Data Platform
Airflow
DBT cloud
Hands-on exercises

^* GetInData reserves the right to make any changes and adjustments to the presented agenda.

Instructors

Our workshops and training programmes are organised by experienced instructors with many years real-life Big Data experience. Get to know our team!

More information

All participants will get training materials in the form of PDF files containing slides with theory and exercises manual with the detailed description of all exercises.

Contact person

Klaudia Wachnio

klaudia@getindata.com

off

Piotr Krewski

piotr@getindata.com

+48 888 185 137

Testimonials

Completed in half the estimated time and with a fivefold improvement on data collection goals, the robust product has exponentially increased processing capabilities. GetInData’s in-depth engagement, reliability, and broad industry knowledge enabled seamless project execution and implementation.

Wojciech Ptak

CTO

GetInData had been supporting us in building production Big Data infrastructure and implementing real-time applications that process large streams of data. In light of our successful cooperation with GetInData, their unique experience and the quality of work delivered, we recommend the company as a Big Data vendor.

Miłosz Balus

CTO

GetInData delivered a robust mechanism that met our requirements. Their involvement allowed us to add a feature to our product, despite not having the required developer capacity in-house.

Stephan Ewen

CTO

Their consistent communication and responsiveness enabled GetInData to drive the project forward. They possess comprehensive knowledge of the relevant technologies and have an intuitive understanding of business needs and requirements. Customers can expect a partner that is open to feedback.

Wilson Yu Cao

Development Team Manager

We sincerely recommend GetInData as a Big Data training provider! The trainer is a very experienced practitioner and he gave us a lot of tips regarding production deployments, possible issues as well as good practices that are invaluable for a Hadoop administrator.

Mariusz Popko

Platform Manager

The engineers and administrators at GetInData are world-class experts. They have proven experience in many open-source technologies such as Hadoop, Spark, Kafka and Flink for implementing batch and real-time pipelines.

Kostas Tzoumas

CEO

Other Big Data Training

Machine Learning Operations Training (MLOps)
This four-day course will teach you how to operationalize Machine Learning models using popular open-source tools, like Kedro and Kubeflow, and deploy it using cloud computing.
See more
See more
Hadoop Administrator Training
This four-day course provides the practical and theoretical knowledge necessary to operate a Hadoop cluster. We put great emphasis on practical hands-on exercises that aim to prepare participants to work as effective Hadoop administrators.
See more
See more
Advanced Spark Training
This 2-day training is dedicated to Big Data engineers and data scientists who are already familiar with the basic concepts of Apache Spark and have hands-on experience implementing and running Spark applications.
See more
See more
Data Analyst Training
This four-day course teaches Data Analysts how to analyse massive amounts of data available in a Hadoop YARN cluster.
See more
See more
Real-Time Stream Processing
This two-day course teaches data engineers how to process unbounded streams of data in real-time using popular open-source frameworks.
See more
See more
Analytics engineering with Snowflake and dbt
This 2-day training is dedicated to data analysts, analytics engineers & data engineers, who are interested in learning how to build and deploy Snowflake data transformation workflows faster than ever before.
See more
See more
Real-time analytics with Snowflake and dbt
This 2-day training is dedicated to data analysts, analytics engineers & data engineers, who are interested in learning how to build and deploy real-time Snowlake data pipelines.
See more
See more
Mastering ML/MLOps and AI-powered Data Applications in the Snowflake Data Cloud
This 2-day training is dedicated to data engineers, data scientists, or a tech enthusiasts. This workshop will provide hands-on experience and real-world insights into architecting data applications on the Snowflake Data Cloud.
See more
See more

Contact us

Interested in our solutions?
Contact us!

Together, we will select the best Big Data solutions for your organization and build a project that will have a real impact on your organization.

What did you find most impressive about GetInData?

They did a very good job in finding people that fitted in Acast both technically as well as culturally.

Type the form or send a e-mail: hello@getindata.com

Modern Data Pipelines with DBT

Requirements

Training outcome

Course agenda*

Part 1

Introduction to DBT

Part 2

Core concepts of DBT

Part 3

Advanced DBT features

Part 4

Scheduling, deployment, workflow

Instructors

More information

Contact person

Testimonials

Other Big Data Training

Machine Learning Operations Training (MLOps)

Hadoop Administrator Training

Advanced Spark Training

Data Analyst Training

Real-Time Stream Processing

Analytics engineering with Snowflake and dbt

Real-time analytics with Snowflake and dbt

Mastering ML/MLOps and AI-powered Data Applications in the Snowflake Data Cloud

Contact us

Interested in our solutions?Contact us!

Course agenda^*

Interested in our solutions?
Contact us!