Blog

Workflow Orchestration vs. Data Orchestration — Are Those Different?
Is Python Really a Bottleneck?
How to Make Your Data Pipelines More Dynamic Using Parameters in Prefect
How to Use Prefect and Monte Carlo to Achieve More Reliable Data Pipelines
Say Hi to Prefect Discourse — a New Forum for Data Engineers
Monitoring vs. Observability: Can You Tell The Difference?
Serverless Data Pipelines Made Easy with Prefect and AWS ECS Fargate
How I Manage Credentials in Python Using AWS Secrets Manager
3 Tricks to Simplify Your Python Data Projects
TaskFlow API in Apache Airflow 2.0 — Should You Use It?
Are NoSQL Databases Relevant for Data Engineering?
7 Reasons Why You Should Consider a Data Lake (and Event-Driven ETL)
How I Built a Data Discovery API for AWS Data Lake
Can Data Lakes Accelerate Building ML Data Pipelines?
Here is What Happens If You Decouple Your BI Stack
How I Built CI/CD For Data Pipelines in Apache Airflow on AWS
Should You Hire a Software Engineer or a Data Engineer?
How a Shared Slack Channel Can Improve Your Data Quality
“Don’t Repeat Yourself” is beneficial — Not Only in Software Engineering
10 Reasons Why Estimating Time For Data Projects is Hard
AWS Kinesis vs. SNS vs. SQS — A Comparison With Python Examples
Is Real-Time Processing Worth It For Your Analytical Use Cases?
Put a Stop to Data Swamps With Event-Driven Data Testing
Demystifying boto3: How to Use Any AWS Service with Python
10 Common Mistakes When Building Analytical Data Models
8 Must-Know Tricks to Use AWS S3 More Effectively in Python
All the Code You Will Ever Write Is Business Logic