Lyle Poisson


Mentor
Rising Codementor
US$15.00
For every 15 mins
4
Sessions/Jobs
ABOUT ME
Senior data engineer with 7+ years of experience

Senior data engineer specializing in financial data infrastructure, large-scale pipeline architecture, and cloud data systems on AWS/GCP/Snowflake.

I build and fix data pipelines that handle real stakes — regulatory reporting, trading infrastructure, high-volume financial data — and I have a track record of measurable improvements: full data loads from 3 weeks to 6 hours, ingest times from 45 minutes to 3, query runtimes improved 500%.

Recent work:

šŸ¦ Lead data engineer at MSRB (US federal regulator, $4T municipal bond market): 30 ETL pipelines supporting $9B+ in daily trades, pricing data availability increased 800% across 2.7M securities
šŸ“¦ Data infrastructure at Trafigura: serverless pipelines processing ~500GB of multi-domain financial, industrial, and geopolitical data
šŸ“‘ Greenfield SEC filing pipeline: 20 years of filings across 4,000+ companies, structured for LLM querying
āš™ļø DataPraxis: container-based ingestion platform (GCE/Docker/Kubernetes) reducing ingest time from 45 minutes to 3

šŸŒ Catalist: Productionizing machine learning models at 1,000,000x scale and implemented Cloudera colocation -> BigQuery transtion

MS Applied Mathematics & Statistics, Johns Hopkins University.

Paris (+02:00)
Joined May 2025
EXPERTISE
8 years experience
8 years experience
5 years experience
3 years experience
2 years experience
6 years experience
1 year experience

SOCIAL PRESENCE
GitHub
algolia-interview
Python
0
0
package_statistics
Python
0
0
EMPLOYMENTS
Lead Data Engineer
Mandate Research
2024-03-01-Present

Lead Data Engineer at DataPraxis

  • Developed a scalable, stable, and automated solution to a pre-existing manual process for a nascent, ad hoc survey data analytics service, saving over 6 hours per survey ingestion.
  • Built a self-hosted container-based ingestion and analytics platform from scratch, using Google Compute Engine, BigQuery, dbt, Docker/Kubernetes, and Windmill workflow engine, reducing ingest time from 45 to 3 minutes.
  • Established and enforced team-wide engineering best practices, and built automated tests and toolsets that let the entire Analytics team write consistent, bug-free code.
Python
Automation
Google BigQuery
Docker
Google Compute Engine
Kubernetes
CI/CD
Automated Tests
Windmill
Data Engineer
Municipal Securities Rulemaking Board (MSRB)
2022-09-01-2024-02-01

Data Engineer at the Municipal Securities Rulemaking Board (MSRB) from 2022 to 2024.

  • Oversaw data engineering for an organization with an operating budget of over $47MM annually, creating 30 pipelines to improve transparency in the $4 trillion municipal bonds market.
  • Spearheaded a $50K pipeline upgrade project as the sole engineer, increasing pricing yield curve data availability on 2.7MM securities by 800%.
  • Served as lead subject matter expert in pricing and securities data on a team of 15, driving a 70% improvement in team performance.
  • Reduced full data load time from 3 weeks to 6 hours, cutting operating expenses by 95%.
  • Automated table DDL comparison, significantly reducing operating time, and mentored 3 data analysts in optimizing SQL queries, improving query runtime by 500%.
Python
Node.js
PostgreSQL
Amazon S3
Lambda
RDF
SPARQL
DynamoDB
Amazon Redshift
SQL Tuning
Data Engineering
Airflow
CI/CD
Data parsing
AWS
Spark optimization
Data Engineer II (Contractor)
BlueLabs Analytics
2022-01-01-2022-08-01

Data Engineer II (Contractor) at BlueLabs Analytics in 2022.

  • Managed data engineering for a company with >$15MM in annual revenue, directing a major version upgrade of data workflow architecture.
  • Overhauled architecture for >1 TB in cloud file storage, reducing costs significantly.
  • Redesigned a large government agency's HR platform by migrating to a SaaS solution, decreasing onboarding and offboarding costs.
Python
Docker
Amazon Redshift
Kubernetes
Airflow
Data architecture
Cloud migration
Spark optimization
Cloud Based file sharing
PROJECTS
Shopify Data Pipeline
Algolia
2024
This project implements a data pipeline designed for the Algolia Integration team to process Shopify configuration data. It extracts CSV files from an S3 bucket, transforms the data by filtering and enhancing it, and loads the transformed data into a PostgreSQL database. The pipeline is designed with scalability in mind to handle higher volumes of data.
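
The extract, transform, and load stages described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it uses only the Python standard library, an in-memory CSV string stands in for an object read from S3, and an in-memory SQLite database stands in for PostgreSQL. The column names (application_id, index_prefix), the derived has_custom_prefix flag, the "shopify_" default, and the shop_config table are all hypothetical.

```python
import csv
import io
import sqlite3
from typing import Dict, Iterable, Iterator


def extract(csv_text: str) -> Iterator[Dict[str, str]]:
    """Parse CSV text into row dicts (stand-in for reading a file from S3)."""
    yield from csv.DictReader(io.StringIO(csv_text))


def transform(rows: Iterable[Dict[str, str]]) -> Iterator[Dict[str, object]]:
    """Filter out rows without an application_id and add a derived flag."""
    for row in rows:
        if not (row.get("application_id") or "").strip():
            continue  # hypothetical filter rule: drop rows with no ID
        enriched: Dict[str, object] = dict(row)
        # Hypothetical enrichment: flag rows whose prefix is not the default.
        enriched["has_custom_prefix"] = int(row.get("index_prefix") != "shopify_")
        yield enriched


def load(rows: Iterable[Dict[str, object]], conn: sqlite3.Connection) -> int:
    """Insert rows into the target table and return how many were written."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS shop_config ("
        "application_id TEXT, index_prefix TEXT, has_custom_prefix INTEGER)"
    )
    batch = list(rows)
    conn.executemany(
        "INSERT INTO shop_config VALUES "
        "(:application_id, :index_prefix, :has_custom_prefix)",
        batch,
    )
    conn.commit()
    return len(batch)


SAMPLE_CSV = (
    "application_id,index_prefix\n"
    "app1,shopify_\n"
    ",shopify_\n"
    "app2,custom_\n"
)

conn = sqlite3.connect(":memory:")  # SQLite stand-in for PostgreSQL
loaded = load(transform(extract(SAMPLE_CSV)), conn)
```

Because each stage is a generator that consumes the previous one, rows stream through the pipeline one at a time until the load step batches them, which is one simple way to keep memory use flat as input volume grows.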
Python
SQL
PostgreSQL
Amazon S3
Docker
Airflow
AWS
Debian Package File Statistics Tool
Canonical
2024
This Python tool ranks Debian packages based on the number of associated files for a given architecture. It accesses a specified architecture's Contents file from the Debian repository, parses the file, and outputs the statistics of the packages that have the most files associated with them.
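
The parse-and-rank step described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: it assumes the Contents file has already been downloaded and decompressed into text lines, and that each line ends with a comma-separated list of section/package entries after the file path. The function names are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def count_files_per_package(lines: Iterable[str]) -> Counter:
    """Count how many file paths each package is associated with."""
    counts: Counter = Counter()
    for line in lines:
        # The path may contain spaces, so split only on the last whitespace run.
        parts = line.rstrip("\n").rsplit(maxsplit=1)
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        _path, packages = parts
        for qualified in packages.split(","):
            # Entries look like "section/package"; keep the package name only.
            counts[qualified.rsplit("/", 1)[-1]] += 1
    return counts


def top_packages(lines: Iterable[str], n: int = 10) -> List[Tuple[str, int]]:
    """Return the n packages with the most associated files."""
    return count_files_per_package(lines).most_common(n)


sample = [
    "bin/bash                      shells/bash",
    "usr/bin/python3               python/python3-minimal",
    "usr/share/doc/bash/README     shells/bash",
    "usr/share/my file.txt         utils/foo,utils/bar",
]
ranking = top_packages(sample, n=3)
```

Splitting from the right keeps paths with embedded spaces intact, and a file listed under several packages is counted once for each of them.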
Python
Web Scraping
Data analytics
Data Engineering