PBC-DE · Professional
Data Engineer — Professional
Architect production data pipelines. Spark, Airflow, warehousing, streaming.
Duration: 60h self-paced
Exam: 65 Qs · 130 min
Pass mark: 75%
What you'll learn
Production-grade ETL/ELT, distributed processing with Spark, orchestration with Airflow, data warehousing (Snowflake / BigQuery / Redshift), streaming with Kafka, and data quality. Peer-reviewed capstone: design and partially build a real pipeline.
Course modules
- 01 · ~450 min
Data engineering foundations — what DEs actually do and the modern stack
Data engineering is the discipline of moving and shaping data reliably at scale. This module sets the context: what DEs actually do day-to-day, how DE differs from data analysis and ML engineering, the modern data stack, and the career path for international data engineers.
- 02 · ~450 min
SQL deep-dive — window functions, CTEs, query optimisation
SQL is the most valuable skill for any data engineer. This module goes beyond basics: window functions, CTEs, query optimisation, execution plans, and the analytical SQL patterns used daily in production warehouses.
- 03 · ~450 min
Data modeling — dimensional, normalised, data vault
How you model data determines whether your warehouse is fast, useful, and maintainable — or a swamp. This module covers the three main approaches (Kimball dimensional, 3NF normalised, data vault), when each applies, and slowly-changing dimension patterns.
- 04 · ~450 min
ETL vs ELT, batch vs streaming, ingestion patterns
Data engineering is fundamentally about moving and shaping data. This module covers the patterns: ETL vs ELT, batch vs streaming, change data capture, idempotency, and the tooling landscape (Fivetran, Airbyte, Kafka, custom Python).
- 05 · ~450 min
Apache Airflow — orchestration in production
Airflow is among the most widely used workflow orchestrators in 2026. This module covers DAGs, operators, schedule semantics, dependency management, and the production patterns (retries, alerting, resource pools, deployment) that separate hobby Airflow from enterprise Airflow.
- 06 · ~450 min
dbt + cloud data warehouses — the modern transformation layer
dbt revolutionised analytics engineering. This module covers dbt fundamentals (models, sources, tests, macros, packages), the major cloud warehouses (Snowflake, BigQuery, Redshift), and the patterns that scale dbt projects from 5 to 5,000 models.
- 07 · ~450 min
Apache Spark — distributed processing for large-scale data
When data exceeds warehouse capabilities or you need complex programmatic transforms, Spark handles it. This module covers Spark fundamentals, DataFrames, partitioning, performance tuning, and when Spark is appropriate (and when it is not).
- 08 · ~450 min
Data quality, observability, governance, and career paths
Building pipelines is half the job. Keeping them healthy, trustworthy, and compliant is the other half. This final module covers data quality testing, observability, governance, and the career trajectories from junior to principal data engineer.
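To give a flavour of two patterns named in the modules above (idempotent loading and window functions), here is a minimal sketch using Python's built-in `sqlite3` module. It is an illustration only and not taken from the course material: the `orders` table, its columns, and the helper names `load_batch` and `latest_order_per_customer` are hypothetical, and the window-function query assumes the bundled SQLite is version 3.25 or newer.

```python
import sqlite3


def load_batch(conn, rows):
    """Load (order_id, customer, amount, updated_at) rows idempotently."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS orders (
               order_id   INTEGER PRIMARY KEY,
               customer   TEXT NOT NULL,
               amount     REAL NOT NULL,
               updated_at TEXT NOT NULL)"""
    )
    # INSERT OR REPLACE keys on the primary key, so replaying the same
    # batch overwrites existing rows instead of creating duplicates.
    conn.executemany("INSERT OR REPLACE INTO orders VALUES (?, ?, ?, ?)", rows)
    conn.commit()


def latest_order_per_customer(conn):
    """Pick each customer's most recent order with ROW_NUMBER() in a CTE."""
    return conn.execute(
        """WITH ranked AS (
               SELECT customer, order_id, amount,
                      ROW_NUMBER() OVER (
                          PARTITION BY customer
                          ORDER BY updated_at DESC) AS rn
               FROM orders)
           SELECT customer, order_id, amount
           FROM ranked
           WHERE rn = 1
           ORDER BY customer"""
    ).fetchall()


conn = sqlite3.connect(":memory:")
batch = [
    (1, "acme", 120.0, "2024-01-01"),
    (2, "acme", 80.0, "2024-01-03"),
    (3, "globex", 45.5, "2024-01-02"),
]
load_batch(conn, batch)
load_batch(conn, batch)  # simulated retry of the same batch

row_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
latest = latest_order_per_customer(conn)
print(row_count, latest)
```

Running the load twice leaves the table with the same three rows, which is the property a retried pipeline task relies on. A production warehouse would more likely use a `MERGE` or upsert statement in its own SQL dialect.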
What you walk away with
Not just a certificate. A career-grade credential built to open doors.
A credential employers verify in one click
Unique cert ID, QR code, and tamper-evident signature. No more wondering if your certificate is taken seriously.
A portfolio piece, not just a paper
Your capstone project lives on your public PBV portfolio. Recruiters click and see actual work — that beats a cert badge every time.
Lifetime access, no expiry
Once you earn it, it's yours. No renewals, no subscriptions, no surprises.
A profile that stands out
Verified PBV credentials show up on your PBridge freelancer profile, putting you ahead of unverified candidates in client searches.
Why PBC works
- ✓ Transparent USD pricing. Pay in USD from anywhere in the world.
- ✓ Built for the global context. Case studies and capstone briefs use real-world business scenarios.
- ✓ Reviewed capstone project. A human reviews your final project.
- ✓ Verifiable digital certificate. QR code and cryptographic hash for one-scan verification.
- ✓ PBridge job board access. Featured in front of employers hiring on PBridge.
Data Engineer certification FAQs
How much does the Data Engineer certification cost?
The PBridge Certified Data Engineer certification costs $149 for the exam-only path and $249 for the full path, which includes the capstone project review. Both prices are in US dollars. Payment is via secure international checkout (Visa, Mastercard, Amex).
Is the PBridge Certified Data Engineer certification recognized internationally?
Yes. PBridge Certified credentials are issued as verifiable digital certificates with a unique cert ID, QR code, and hash signature. Hiring managers anywhere in the world can verify any cert at https://www.pbridgeco.com/verify/{cert-id}.
How long does it take to complete the Data Engineer certification?
The Data Engineer certification takes approximately 60 hours of self-paced study. The exam itself is 130 minutes with 65 questions. Most candidates complete the full path including the capstone project in 4-8 weeks studying part-time alongside work.
What makes PBridge Certified the right fit for learners and employers globally?
PBridge Certified is built for the global market — USD pricing, capstone scenarios drawn from real-world business cases, international hiring manager recognition, and direct integration with the PBridge job board so certified holders get featured in front of employers actively hiring.
Can I get a job after the Data Engineer certification?
PBridge Certified holders get priority access to the PBridge job board, where global and remote employers post data engineer roles. Pair the certification with the capstone project as a portfolio piece; most certified holders see callbacks within 30 days.