PB✓
PBridge
← All certifications

PBC-DE · Professional

Data Engineer — Professional

Architect production data pipelines. Spark, Airflow, warehousing, streaming.

Duration

60h self-paced

Exam

65 Qs · 130min

Pass mark

75%

What you'll learn

Production-grade ETL/ELT, distributed processing with Spark, orchestration with Airflow, data warehousing (Snowflake / BigQuery / Redshift), streaming with Kafka, and data quality. Peer-reviewed capstone: design and partially build a real pipeline.

Course modules

  1. 01

    Data engineering foundations — what DEs actually do and the modern stack

    Data engineering is the discipline of moving and shaping data reliably at scale. This module sets the context: what DEs actually do day-to-day, how DE differs from data analysis and ML engineering, the modern data stack, and the career path for international data engineers.

    ~450min
  2. 02

    SQL deep-dive — window functions, CTEs, query optimisation

    SQL is the most valuable skill for any data engineer. This module goes beyond basics: window functions, CTEs, query optimisation, execution plans, and the analytical SQL patterns used daily in production warehouses.

    ~450min
  3. 03

    Data modeling — dimensional, normalised, data vault

    How you model data determines whether your warehouse is fast, useful, and maintainable — or a swamp. This module covers the three main approaches (Kimball dimensional, 3NF normalised, data vault), when each applies, and slowly-changing dimension patterns.

    ~450min
  4. 04

    ETL vs ELT, batch vs streaming, ingestion patterns

    Data engineering is fundamentally about moving and shaping data. This module covers the patterns: ETL vs ELT, batch vs streaming, change data capture, idempotency, and the tooling landscape (Fivetran, Airbyte, Kafka, custom Python).

    ~450min
  5. 05

    Apache Airflow — orchestration in production

    Airflow is the dominant workflow orchestrator in 2026. This module covers DAGs, operators, schedule semantics, dependency management, and the production patterns (retries, alerting, resource pools, deployment) that separate hobby Airflow from enterprise Airflow.

    ~450min
  6. 06

    dbt + cloud data warehouses — the modern transformation layer

    dbt revolutionised analytics engineering. This module covers dbt fundamentals (models, sources, tests, macros, packages), the major cloud warehouses (Snowflake, BigQuery, Redshift), and the patterns that scale dbt projects from 5 to 5000 models.

    ~450min
  7. 07

    Apache Spark — distributed processing for large-scale data

    When data exceeds warehouse capabilities or you need complex programmatic transforms, Spark handles it. This module covers Spark fundamentals, DataFrames, partitioning, performance tuning, and when Spark is appropriate (and when it is not).

    ~450min
  8. 08

    Data quality, observability, governance, and career paths

    Building pipelines is half the job. Keeping them healthy, trustworthy, and compliant is the other half. This final module covers data quality testing, observability, governance, and the career trajectories from junior to principal data engineer.

    ~450min

What you walk away with

Not just a certificate. A career-grade credential built to open doors.

01

A credential employers verify in one click

Unique cert ID, QR code, and tamper-evident signature. No more wondering if your certificate is taken seriously.

02

A portfolio piece, not just a paper

Your capstone project lives on your public PBV portfolio. Recruiters click and see actual work — that beats a cert badge every time.

03

Lifetime access, no expiry

Once you earn it, it's yours. No renewals, no subscriptions, no surprises.

04

A profile that stands out

Verified PBV credentials show up on your PBridge freelancer profile, putting you ahead of unverified candidates in client searches.

Why PBC works

  • USD pricing, transparent and global. Pay in USD, accepted globally.
  • Built for the global context. Case studies and capstone briefs use real-world business scenarios.
  • Reviewed capstone project. A human reviews your final project.
  • Verifiable digital certificate. QR code and cryptographic hash for one-scan verification.
  • PBridge job board access. Featured in front of employers hiring on PBridge.

Data Engineer certification FAQs

How much does the Data Engineer certification cost?+

The PBridge Certified Data Engineer certification costs $149 for the exam-only path and $249 for the full path which includes the capstone project review. Both prices are in US Dollars. Payment via secure international checkout (Visa, Mastercard, Amex).

Is the PBridge Certified Data Engineer certification recognized internationally?+

Yes. PBridge Certified certifications are issued with verifiable digital credentials including a unique cert ID, QR code, and hash signature. Hiring managers anywhere in the world can verify any cert at https://www.pbridgeco.com/verify/{cert-id}.

How long does it take to complete the Data Engineer certification?+

The Data Engineer certification takes approximately 60 hours of self-paced study. The exam itself is 130 minutes with 65 questions. Most candidates complete the full path including the capstone project in 4-8 weeks studying part-time alongside work.

What makes PBridge Certified the right fit for learners and employers globally?+

PBridge Certified is built for the global market — USD pricing, capstone scenarios drawn from real-world business cases, international hiring manager recognition, and direct integration with the PBridge job board so certified holders get featured in front of employers actively hiring.

Can I get a job after the Data Engineer certification?+

PBridge Certified cert holders get priority access to the PBridge job board where global and remote employers post data engineer roles. Pair the certification with the capstone project as a portfolio piece — most certified holders see callbacks within 30 days.