David Bros

David Bros

Mentor
Rising Codementor
US$25.00
For every 15 mins
ABOUT ME
Mentor - I explain technical details in a way that you will understand
Mentor - I explain technical details in a way that you will understand

From very early in my career I have been teaching multiple people (Senior, Medior and Junior devs) how to do things. This has been possible thanks to my ability to teach complex subjects and concepts in a very foot-on-the-ground way, in a way that combines programming logic and real-life logic.

Learning programming logic, data modeling and general IT logic is something that takes time, practice and the energy to want to generalize and simplify every problem you come across.

With me, you will be able to understand the details of deployments, CI/CD, Docker and any programming language and be able to remember it, thanks to my teaching methods.

Skills and tools as of 10/08/2022 (4 years working experience):

Data Storage/Data Lake: Azure Data Lake, Kafka 0.10+, ElasticSearch, OpenDistro.
Data Processing / Visualization: Logstash, PySpark, Spark, Kibana.
Databases: PostgreSQL, MariaDB, SQLite, MongoDB, DynamoDB, Cassandra.
Primary Languages: Python (3.+), SQL/T-SQL, C++ (17), Ruby (2.6.2+), JS (ES6+), Ansible Playbooks.
Secondary Languages: C, Java 11 (low experience).
Distributions: CentOS (7/8), Red Hat, Debian, Ubuntu.
Architecture: Nginx, ProxySQL, BDR, Galera, AWS, Azure, Docker, Zookeeper.
Cloud: Azure, AWS.
Other tools: Git, Jenkins, Postman, Ansible Playbooks, SSMS, DBeaver, Kubernetes (low experience), WSL2.

Catalan, Spanish, English
Amsterdam (+02:00)
Joined December 2019
EXPERTISE
2 years experience
6 years experience
5 years experience
3 years experience

REVIEWS FROM CLIENTS

David's profile has been carefully vetted and approved as a Codementor. Connect with David now, and leave a review for them once you're done!
SOCIAL PRESENCE
GitHub
pypdfy
Pypdfy is a python package in coninuous development that provides a set of tools to extract metadata from PDF files
Python
1
0
jobsReactAPI
First api I ever wrote JS, Express 13/01/2019
JavaScript
0
0
Stack Overflow
36 Reputation
0
0
4
EMPLOYMENTS
Data Engineer Consultant
Essent
2022-07-01-Present
New at Essent.
New at Essent.
Python
SQL
PostgreSQL
View more
Python
SQL
PostgreSQL
Spark streaming
Go (Golang)
AWS (Amazon Web Services)
View more
Data Engineer
iVent Mobile
2021-01-01-2022-07-01
- Deployed a GEO Redundant Kafka Cluster (8 nodes) in 4 POPs around the world, set up MM2 redundancy for real time data streaming. This c...
- Deployed a GEO Redundant Kafka Cluster (8 nodes) in 4 POPs around the world, set up MM2 redundancy for real time data streaming. This cluster supports data from over 2500 customer planes, around 6.5 million documents every 5 minutes. (Kafka, Centos7) - Set up and maintained a 9 node Galera cluster and proxySQL for Observium (NMS System). This system is now in charge of monitoring over 2500 VMs in 4 different data centers in the world and provides critical status updates to hundreds of support engineers. (PostgreSQL, Observium, ProxySQL, MariaDB, Galera) - Worked on data parsing, processing, analyzing, and visualizing. Created and combined several internal and external data sources to enrich our client's datasets. (Ruby, Rails, ElasticSearch, Kibana, PostgreSQL, Kafka). - Designed and implemented a Data Model capable of supporting multiple Mobile Technologies and their metadata, regardless of source and data structure. This cut implementation times by over 4 hours per each Technology and standarized all sources. (PostgreSQL, Rails) - Designed a Spark cluster (Spark, PySpark, ZooKeeper) (6 nodes) which ingests data from data collectors located in the 3 world POPs. Provided near real time availability, 100% data consistency, greater ingest, processing and writing speed using map reduce and data micro batching streaming. (Spark, PySpark, ZooKeeper, Python, Ansible, Kafka). - Designed and developed a Geo Redundancy and Fail Over system for Delayed Jobs, this made all our data processors redundant and fail safe. It also allowed us to have 4 hosts with Delayed Jobs workers in 4 locations around the world. (Ruby, PostgreSQL, Rails) - Maintained and set up redundant, internet facing Kong Proxy, 4 VMs with custom Lua plugins and 2 custom PostgreSQL clusters (2 nodes) with versions 9 and 10 in a master-slave infrastructure. (Kong, Ansible, Lua, Keepalived, Centos 7, Yum).
Ruby
Python
SQL
View more
Ruby
Python
SQL
Ruby on Rails
Git
Nginx
Elasticsearch
GitLab
Docker
Apache Kafka
View more
Data Engineer
Annual Insight
2019-04-01-2020-12-01
- Developed 8+ data crawlers that track several public data sources, these fetched financial statements for hundreds of companies, making...
- Developed 8+ data crawlers that track several public data sources, these fetched financial statements for hundreds of companies, making the Analysts have the latest reports on any company they were analyzed at the time. (MSQL, SSMS, Python, Docker, Selenium) - Developed a new Data Model for the Annual Insight's data platform, and took care of developing the necessary ETLs for that to happen. The model allowed for increased speeds when enriching data. (MSQL, SSMS, Python) - Created a PDF Parser with Python which read through crawled financial statements and saved the metrics read. These were then corrected or approved by analysts. The PDF Parser application saved about 4 hours of manual input for every analyst and every company they analyzed. (Python, OCR) - Designed, developed, and implemented an SDK for the company to use, the tools it included were: A generic Logging system that fed the monitoring dashboards with continuous statuses from every application, and, a Python Implementation of Azure AD for any of our applications to log in to Annual Insight's Key Vault and Secret Vault automatically. (Python, Azure SDK) - Designed and implemented testing and CI/CD with Azure DevOps. (Azure, Docker, Selenium) - Taught several Pythonic standards to the team in order to bring flexibility and reduce duplication and code debt.
SQL
Azure
NumPy
View more
SQL
Azure
NumPy
Pandas
Data Analysis
Algorithm
Python 3
Data Science
Data analytics
Data Engineering
View more
PROJECTS
Pypdfy
2019
Pypdfy is a package that provides a set of tools to analyse PDF structures.
Pypdfy is a package that provides a set of tools to analyse PDF structures.
Python
Regex
Data Analysis
Python
Regex
Data Analysis
Confidential Project
2019
Python
SQL
XML
View more
Python
SQL
XML
Automation
Algorithm
React
Autocad
Data parsing
View more