Overview¶
If you have a specific data-engineering problem and want Lakehouse Plumber (LHP) to solve it, start here. Pick a category below and jump to the task.
Decide¶
Decisions — Choose between Preset, Template, and Blueprint; pick a load source, write target, and write mode.
Ingest data¶
Ingest with Auto Loader — Stream files from cloud object storage with Auto Loader (CloudFiles).
Pipeline Patterns — Apply multi-source fan-in, path filtering, and other recipes from the patterns cookbook.
Operate and monitor¶
Enable Monitoring — Set up centralized event-log monitoring across every pipeline in your project.
Quarantine Records — Route failed rows to a Dead Letter Queue (DLQ) and recycle corrected rows back into the pipeline.
Reusability and patterns¶
Multi-Flowgroup YAML Files — Combine multiple FlowGroups in a single YAML file with shared settings and inheritance.
Dynamic Templates Guide — Use Jinja2 conditionals, loops, and filters in Templates for advanced parameter patterns.
Deploy¶
Configure Bundles — Enable Databricks Asset Bundle (DAB) integration for environment-specific deployments.
How to Set Up CI/CD for an LHP Project — Apply CI/CD patterns and DataOps practices for GitHub Actions, Azure DevOps, and Bitbucket Pipelines.
See also¶
Architecture — Explanation of the LHP execution model and why generation works the way it does.
Actions Reference — Reference catalog of every Load, Transform, Write, and Test action.