IBM Data Engineering Professional Certificate

Prepare for a career as a Data Engineer. Gain the in-demand skills and hands-on experience to get job-ready in less than 5 months. No prior experience required.

Taught in English

Instructors: Muhammad Yahya, Abhishek Gagneja, Romeo Kienzler

46,341 already enrolled

Professional Certificate - 12 course series

Earn a career credential that demonstrates your expertise

4.6 (4,031 reviews)

Beginner level

5 months at 10 hours a week
Flexible schedule
Learn at your own pace
Earn degree credit

What you'll learn

  • Master the most up-to-date practical skills and knowledge data engineers use in their daily roles

  • Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2 

  • Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming 

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards

Details to know

Shareable certificate

Add to your LinkedIn profile

Prepare for a career in Data Engineering

  • Receive professional-level training from IBM
  • Demonstrate your proficiency in portfolio-ready projects
  • Earn an employer-recognized certificate from IBM
  • Qualify for in-demand job titles: Database Engineer, Data Engineer, Junior Data Engineer

$132,000+ median U.S. salary for Data Engineering¹
59,000+ U.S. job openings in Data Engineering¹

Get exclusive access to career resources upon completion

  • Soft skills training

    Get free access to IBM’s People and Soft Skills Specialization

  • Resume review

    Improve your resume and LinkedIn with personalized feedback

  • Interview prep

    Practice your skills with interactive tools and mock interviews

  • Career support

    Plan your career move with Coursera’s job search guide

¹Lightcast™ Job Postings Report, United States, 7/1/22-6/30/23. ²Based on program graduate survey responses, United States 2021.

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV

Share it on social media and in your performance review

Professional Certificate - 12 course series

Introduction to Data Engineering

Course 1 · 12 hours · 4.7 (2,326 ratings)

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

  • Shell Script
  • Bash (Unix Shell)
  • Extract Transform and Load (ETL)
  • Linux
  • Linux Commands

Python for Data Science, AI & Development

Course 2 · 26 hours · 4.6 (35,092 ratings)

What you'll learn

  • Learn Python, the most popular programming language for Data Science and Software Development.

  • Apply Python programming logic using Variables, Data Structures, Branching, Loops, Functions, Objects & Classes.

  • Demonstrate proficiency in using Python libraries such as Pandas & NumPy, and developing code using Jupyter Notebooks.

  • Access and web scrape data using APIs and Python libraries like Beautiful Soup.

Skills you'll gain

  • Cloud Database
  • MongoDB
  • Cassandra
  • NoSQL
  • Cloudant

Python Project for Data Engineering

Course 3 · 9 hours · 4.6 (566 ratings)

What you'll learn

  • Demonstrate your skills in Python for working with and manipulating data

  • Implement web scraping and use APIs to extract data with Python

  • Play the role of a Data Engineer working on a real project to extract, transform, and load data

  • Use Jupyter notebooks and IDEs to complete your project

Skills you'll gain

  • Data Science
  • Data Analysis
  • Python Programming
  • NumPy
  • Pandas

Introduction to Relational Databases (RDBMS)

Course 4 · 18 hours · 4.6 (505 ratings)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM Db2.

Skills you'll gain

  • Python Programming
  • Relational Databases
  • SQL
  • NoSQL
  • Data Pipelines

Databases and SQL for Data Science with Python

Course 5 · 20 hours · 4.6 (19,177 ratings)

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database and work with multiple tables using DDL commands.

  • Construct basic to intermediate level SQL queries using DML commands.

  • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.

Skills you'll gain

  • Python Programming
  • Information Engineering
  • Extract Transform and Load (ETL)
  • Data Engineer
  • Web Scraping

Hands-on Introduction to Linux Commands and Shell Scripting

Course 6 · 14 hours · 4.7 (1,075 ratings)

What you'll learn

  • Describe the Linux architecture and common Linux distributions, and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Skills you'll gain

  • Extract Transform and Load (ETL)
  • Data Engineer
  • Apache Kafka
  • Apache Airflow
  • Data Pipelines

Relational Database Administration (DBA)

Course 7 · 20 hours · 4.5 (162 ratings)

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

  • Big Data
  • SparkSQL
  • SparkML
  • Apache Hadoop
  • Apache Spark

ETL and Data Pipelines with Shell, Airflow and Kafka

Course 8 · 16 hours · 4.5 (282 ratings)

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement an ETL pipeline through shell scripting.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

  • Python Programming
  • Cloud Databases
  • Relational Database Management System (RDBMS)
  • SQL
  • Jupyter Notebooks

Getting Started with Data Warehousing and BI Analytics

Course 9 · 16 hours · 4.4 (145 ratings)

What you'll learn

  • Explore the architecture, features, and benefits of data warehouses, data marts, and data lakes and identify popular data warehouse system vendors.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • Design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

  • Database Security
  • Database (DBMS)
  • Database Servers
  • Database Administration
  • Relational Database

Introduction to NoSQL Databases

Course 10 · 17 hours · 4.6 (248 ratings)

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common MongoDB tasks, including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Skills you'll gain

  • Data Science
  • Database (DBMS)
  • Information Engineering
  • SQL
  • NoSQL

Introduction to Big Data with Spark and Hadoop

Course 11 · 18 hours · 4.4 (326 ratings)

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.

  • Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

  • Database (DB) Design
  • PostgreSQL
  • Relational Database Management System (RDBMS)
  • Database Architecture
  • MySQL

Data Engineering Capstone Project

Course 12 · 13 hours · 4.7 (90 ratings)

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

  • Data Warehousing
  • Business Intelligence (BI)
  • Cognos Analytics
  • Cube and Rollup
  • Star and Snowflake Schema

Instructors

Muhammad Yahya
IBM
4 Courses · 45,896 learners
Abhishek Gagneja
IBM
4 Courses · 94,732 learners
Romeo Kienzler
IBM
10 Courses · 629,776 learners

Offered by

IBM
