Documents and Articles

Importance of Cross-Platform Data Consistency

Arguably the biggest part of data warehousing is tying data from multiple sources together to create a single version of the truth. Sometimes, lookup tables or data conversions are needed to join data. In these instances we often rely on cross-platform data consistency.

Diagnosing Airflow’s Auto-Scaling Flaw in AWS MWAA

Apache Airflow is a fantastic platform for scheduling workflows for the ETL of a Data Warehouse. MWAA is the AWS managed implementation of this, allowing for easier management and scalability of workers among other improvements. The problem with MWAA however is that the automatic scaling of workers is currently broken, with a major architectural miss.

Agile Development

Agile development is an iterative Software Development Lifecycle (SDLC) model with a strong base in communication, which differs greatly from the high-effort deliverable-based waterfall model. It is designed for environments where the bulk of the requirements may not be known at the start of the project (or change regularly), or for small projects with a …

Agile Development Read More »

Requirements Analysis

Requirements Analysis, the first stage in the software lifecycle, is the process of defining the capabilities of a system that must be met upon project completion. The length and content of a requirements document depends entirely on the scope and depth of the required system, so anything from a few pages to a few books. …

Requirements Analysis Read More »