Tristan Snyder

Software Engineer, Data

Los Angeles, California

I build data tools for humans, and infrastructure for the cloud-scale enterprise.


Block, Inc. (fka Square)

Tech Lead, Data Governance

2023 - present

I am currently building a comprehensive data discovery and lineage tool to catalog all of Block's data assets across various Kafka brokers, Snowflake, Databricks, Business Intelligence tools, and other platforms.

By exporting assets and edges to AWS S3 from their source platforms, ingesting to Elasticsearch, and exposing search via a simple REST API, I have made cross-platform lineage traversal and search possible at lightning-fast speeds. Cross-platform lineage enables propagation of semantic types and other governance metadata tags from Kafka Topic to Data Warehouse to Dashboard.

This app is built with Nuxt, a Vue meta-framework that makes TypeScript on the frontend and backend easy. I have built a number of apps in our data ecosystem using a similar tech stack, which allows for rapid prototyping of MVPs, and then painless deployment and scaling with Terraform and Kubernetes.

Senior Software Engineer

2019 - 2023

I built Squarewave, a dead-simple, user-friendly, SQL-only ETL tool that lets any data citizen deploy SQL transformations from a web browser. Built with Python and Flask on the backend and Vue on the frontend, Squarewave parses SQL, dynamically generates DAGs from parsed sources and targets, and syncs them to Airflow. Squarewave has hundreds of users and over 5,000 daily jobs, democratizing data at enterprise scale.

Beyond democratizing data transformations, Squarewave improves data quality with guardrails like automatic dependency status detection and Peer Review, encourages the adoption of software engineering principles like Unit Testing, and makes configuring alerting and anomaly detection as easy as flipping a switch.

I recently migrated Squarewave, our Airflow clusters, and other internal apps from the on-prem datacenter to Block's Kubernetes infrastructure in AWS. Notably, Airflow now runs on 15 autoscaling clusters in production.

Business Intelligence

2016 - 2019

I scaled Tableau to zero and Looker to over 3,000 weekly users and more than 300 LookML devs. At the time, this was the largest and most complex Looker instance, and it most likely still is.

To facilitate migrating from on-prem Vertica to Snowflake, I built a tool to automatically convert ~90% of SQL from one platform to the other. Using the same tool and Liquid for dynamic LookML, we achieved a nearly seamless transition that was transparent to most end users. I presented this work on stage at Looker's Join conference in 2018.

Instacart

Senior Data Analyst

2015 - 2016

As the first data hire at this startup in a rapid-growth stage, I wore many hats. I scaled up Tableau Server across operations teams and implemented the first ETL orchestration system to maintain high-quality reporting during and after the monolith decomposition.


Temple University

BBA Intl Business and Finance

2006 - 2009



Languages / Frameworks

  • python
  • flask
  • typescript
  • vue
  • nuxt


Technologies

  • aws
  • kubernetes
  • docker
  • terraform
  • snowflake
  • databricks
  • elasticsearch
  • rds/aurora
  • airflow
  • prefect
  • looker
  • tableau