
Senior data engineer specializing in financial data infrastructure, large-scale pipeline architecture, and cloud data systems on AWS/GCP/Snowflake.
I build and fix data pipelines that handle real stakes ā regulatory reporting, trading infrastructure, high-volume financial data ā and I have a track record of measurable improvements: full data loads from 3 weeks to 6 hours, ingest times from 45 minutes to 3, query runtimes improved 500%.
Recent work:
š¦ Lead data engineer at MSRB (US federal regulator, $4T municipal bond market): 30 ETL pipelines supporting $9B+ in daily trades, pricing data availability increased 800% across 2.7M securities
š¦ Data infrastructure at Trafigura: serverless pipelines processing ~500GB of multi-domain financial, industrial, and geopolitical data
š Greenfield SEC filing pipeline: 20 years of filings across 4,000+ companies, structured for LLM querying
āļø DataPraxis: container-based ingestion platform (GCE/Docker/Kubernetes) reducing ingest time from 45 minutes to 3
š Catalist: Productionizing machine learning models at 1,000,000x scale and implemented Cloudera colocation -> BigQuery transtion
MS Applied Mathematics & Statistics, Johns Hopkins University.


Lead Data Engineer at DataPraxis
Lead Data Engineer at DataPraxis
Data Engineer at the Municipal Securities Rulemaking Board (MSRB) from 2022 to 2024.
Data Engineer at the Municipal Securities Rulemaking Board (MSRB) from 2022 to 2024.
Data Engineer II (Contractor) at BlueLabs Analytics in 2022.
Data Engineer II (Contractor) at BlueLabs Analytics in 2022.