CatInCloud Labs is a solo architecture practice run by Dave Anaya. I design data platforms that are private by default, easy to audit, and boring to run.
The Philosophy
I believe that a data platform's primary job is to be predictable. When a pipeline fails, it should fail loudly and cleanly. When it succeeds, the data should be fully auditable.
I prioritize idempotency over speed and private networking over public endpoints. I don't chase the latest "modern data stack" hype cycle. I stick to patterns that have survived production at scale: hardened VPCs, version-controlled business logic, and rigorous data testing.
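As a rough illustration of "idempotency over speed" and "fail loudly", here is a minimal sketch of a partition-overwrite load. It uses Python's built-in sqlite3 as a stand-in warehouse so it runs anywhere. The table `daily_orders`, its columns, and the helper names are hypothetical and not taken from any real project.

```python
import sqlite3


def make_demo_connection():
    """Create an in-memory database with a hypothetical target table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE daily_orders (run_date TEXT, order_id TEXT, amount REAL)"
    )
    return conn


def load_partition(conn, run_date, rows):
    """Replace one day's partition so that reruns leave the same final state."""
    if not rows:
        # Fail loudly: an empty extract is treated as an error, not a silent no-op.
        raise ValueError(f"no rows extracted for {run_date}")
    # One transaction: commits on success, rolls back if any statement raises.
    with conn:
        conn.execute("DELETE FROM daily_orders WHERE run_date = ?", (run_date,))
        conn.executemany(
            "INSERT INTO daily_orders (run_date, order_id, amount) VALUES (?, ?, ?)",
            [(run_date, order_id, amount) for order_id, amount in rows],
        )
```

Calling `load_partition` twice with the same `run_date` and rows leaves exactly one copy of the data, which is what makes a retry after a failure safe. The trade-off is the extra delete on every run, which is the price paid for predictability.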
The Reference Stack
I work exclusively within a specific, high-leverage stack where I can guarantee results. I don't dabble; I design in depth around these core components:
Orchestration: AWS MWAA (Airflow) in a private VPC.
Compute & Storage: Snowflake (PrivateLink) and S3 (KMS-encrypted).
Transformation: dbt Core/Cloud with rigorous testing and documentation.
Version Control: Git-backed workflows for all business logic and pipelines.
Security & Operations
A pipeline isn't finished until it can be handed over to an operations team. That means the "Day 2" problems are solved on Day 1.
My deliverables always include properly scoped IAM roles, least-privilege networking, and clear observability hooks (CloudWatch). I build systems that are designed to be lived with, not just demoed.
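For a sense of what "properly scoped" can mean in practice, the sketch below builds an IAM policy document that limits a pipeline role to one S3 prefix and one KMS key. The function name, bucket, prefix, and key ARN are placeholders chosen for illustration; a real policy would depend on the actual workload.

```python
import json


def pipeline_s3_policy(bucket, prefix, kms_key_arn):
    """Build a policy document scoped to one prefix in one bucket (illustrative)."""
    prefix = prefix.strip("/")
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing is allowed only for keys under the pipeline's prefix.
                "Sid": "ListOnlyPipelinePrefix",
                "Effect": "Allow",
                "Action": "s3:ListBucket",
                "Resource": f"arn:aws:s3:::{bucket}",
                "Condition": {"StringLike": {"s3:prefix": [f"{prefix}/*"]}},
            },
            {
                # Object reads and writes are limited to the same prefix.
                "Sid": "ReadWritePipelineObjects",
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:PutObject"],
                "Resource": f"arn:aws:s3:::{bucket}/{prefix}/*",
            },
            {
                # Only the named key may be used for encryption and decryption.
                "Sid": "UseBucketKey",
                "Effect": "Allow",
                "Action": ["kms:Decrypt", "kms:GenerateDataKey"],
                "Resource": kms_key_arn,
            },
        ],
    }


if __name__ == "__main__":
    # Placeholder values only; substitute real identifiers before use.
    policy = pipeline_s3_policy(
        "example-bucket", "raw/orders", "arn:aws:kms:us-east-1:111122223333:key/EXAMPLE"
    )
    print(json.dumps(policy, indent=2))
```

Generating the document from code keeps it in version control alongside the pipeline, so a change in access shows up in review like any other change.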
Need a senior perspective?
If your current data environment feels fragile, opaque, or overly complex, let's talk. I can help you stabilize your existing stack or architect a new one from the ground up.