Josiah Berkebile

Mentor
Rising Codementor
US$10.00 per 15 minutes
5 sessions/jobs
ABOUT ME
Hadoop Big Data Cloud Engineer

I have specialized in Big Data technologies, especially Hadoop technologies like Apache Spark, Flume, HBase, HDFS, Hive LLAP, and Impala. This career has led me into developing applications that implement machine learning models, predictive algorithms, and NLP algorithms, and that ingest large datasets. I'm very well versed in concurrent and parallel programming and am equally comfortable with object-oriented and functional programming approaches.

I really love teaching people and sharing my knowledge. I promise that in the time that I spend mentoring you, I will pour into you as much of my knowledge as I can to give you the best chance possible in the industry.

English
Central Time (US & Canada) (-05:00)
Joined October 2016
EXPERTISE
4 years experience
I worked on a fast-paced, goal-oriented Ruby on Rails Agile web app development team at a healthcare IT company for 6-9 months, and then single-handedly maintained a large Ruby on Rails app for a large financial institution for about 2 years. I've also done a few hackathons that involved building a web app for a non-profit in less than 24 hours; for 2 of the 3 hackathons, I organized and led the team. Here's the end result of some of those challenges: http://principalsconnect.com/ http://adoption.kvc.org/
Software Architects
RVM
Ruby on Rails
8 years experience
I have 4 years of extensive experience writing Java applications that run in a Hadoop environment. I've developed data pipelines in Apache Crunch, created custom Flume Clients and Flume Sources that plug in to running Flume Agents via the Flume API, and written MapReduce algorithms that process data at the petabyte scale. Some of these Java applications ran as services and used Jersey to expose a web API and Solr to index data sitting in NoSQL storage. Spring is fairly heavyweight, so the various Hadoop engineering teams I've been on have chosen Dagger as the dependency injection framework for our applications, since it's faster and more lightweight.
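As a rough illustration of the MapReduce pattern mentioned above, here is a minimal single-process Python sketch with toy data (real Hadoop jobs run the map and reduce phases distributed across a cluster, but the shape of the computation is the same):

```python
from collections import defaultdict

def map_phase(lines):
    # Map: emit (word, 1) pairs, as a Hadoop mapper would
    for line in lines:
        for word in line.split():
            yield word.lower(), 1

def reduce_phase(pairs):
    # Shuffle + reduce: group pairs by key and sum the counts
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

lines = ["the quick brown fox", "jumps over the lazy dog"]
print(reduce_phase(map_phase(lines)))  # {'the': 2, 'quick': 1, ...}
```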
4 years experience
I have four years of experience using Scala in Hadoop and data engineering environments. I've written a LOT of Scala. Some of the Flume plugins I've written have been in Scala. I've written a Flume Client that used Scalatra to listen for events via webhooks and then distributed them to an array of Flume Clients using the Akka concurrency framework. I've extended and refactored a Scala Akka web scraper, and I've leveraged ScalaTest and ScalaCheck in practically all of my Scala projects. In my latest projects, I've used the Typelevel/Cats library to take advantage of the categorical types it provides (which facilitates a more Haskell-like style of code architecture). I've also written a few Spark applications in Scala: some of my smaller projects have involved implementing PageRank and collaborative filtering algorithms, and the most ambitious project I've done on Spark was an NLP classifier. I have a decent amount of experience architecting, profiling, and tuning Spark applications. All of the applications I've written in Scala have been highly concurrent; some have leveraged Software Transactional Memory libraries like ScalaSTM to simplify multi-threaded interactions with shared memory.
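To illustrate the PageRank algorithm mentioned above, here is a minimal single-machine Python sketch (the link graph and constants are toy values; a Spark version distributes the same per-iteration rank exchange across partitions):

```python
def pagerank(links, iterations=20, d=0.85):
    """Iterative PageRank over a {page: [outbound links]} dict."""
    n = len(links)
    rank = {page: 1.0 / n for page in links}
    for _ in range(iterations):
        # Each page starts with the damping base, then collects
        # shares of rank from the pages that link to it.
        new_rank = {page: (1 - d) / n for page in links}
        for page, outs in links.items():
            for target in outs:
                new_rank[target] += d * rank[page] / len(outs)
        rank = new_rank
    return rank

links = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(links)
print(max(ranks, key=ranks.get))  # "c" ends up with the highest rank
```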
Apache Hadoop
Apache Spark
StanfordNLP
8 years experience
Python is a versatile language, and my experience with it is equally versatile. I've used Python for data exploration with NumPy and Pandas, translated Python code from data scientists into PySpark or Scala Spark applications, written systems automation scripts, and written scripts for moving or ingesting data between databases and systems at the petabyte scale. Most of my experience is in Python 2.7, but I'm also familiar with legacy Python 2.6 as well as Python 3.
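A tiny example of the kind of NumPy/Pandas exploration described above (the toy DataFrame and its column names are made up purely for illustration):

```python
import numpy as np
import pandas as pd

# Toy per-event data standing in for a much larger ingested dataset
df = pd.DataFrame({
    "user": ["a", "a", "b", "b", "b"],
    "bytes": [120, 80, 300, 150, 50],
})

# Typical first pass: aggregate per key, then summarize the distribution
per_user = df.groupby("user")["bytes"].agg(["sum", "mean"])
print(per_user)
print("p95 of event size:", np.percentile(df["bytes"], 95))
```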
2 years experience
I used Haskell as my vehicle for learning functional programming. Since Haskell is a pure functional language, it would not let me fall back on my old procedural, object-oriented habits. I succeeded in learning enough Haskell on my own to become productive in the language, which enabled me to pick up Scala more easily than most other members of the engineering teams I've been a part of. If you are learning functional programming or just getting started with Haskell, I can certainly get you to a level where you will be productive in the language.

REVIEWS FROM CLIENTS

Josiah's profile has been carefully vetted and approved as a Codementor. Connect with Josiah now, and leave a review for them once you're done!
EMPLOYMENT
Senior Data Engineer
Pinsight Media
April 2018 - Present
Mostly writing Spark processing pipelines on very large (1/2 petabyte or more) datasets.
Scala
Linux
Shell
Pandas
Apache Spark
Apache Kafka
Apache Hadoop
Apache Airflow
Hadoop Architect
Triple-I Corporation | AMC Theatres
May 2017 - April 2018
I played a lead role in getting AMC Theatres' Big Data initiative off the ground. Responsibilities and accomplishments:
- Extended a Spark sentiment analyzer written in Scala using Stanford CoreNLP to analyze complex customer feedback
- Wrote a custom Flume Source plugin in Java and Scala + Cats for ingesting a vendor's real-time HTTPS event stream
- Used Scala, Akka, Scalatra, and Cats to develop an HTTP-based custom Flume Client
- Co-administered a CDH 5 (Cloudera) cluster
- Trained peers/engineers in Hadoop software development and Scala programming
- Advised on development process and workflow
- Conducted exploratory research and generated project ideas
- Developed new solutions/apps leveraging Hadoop technologies including Flume, Spark, Impala, Hive, and HBase
- Deployed new Hadoop apps and plugins to a Kerberized CDH 5 cluster
- Rigged applications to execute through SysVinit, Upstart, or systemd
- Rigged system-initiated applications to auto-authenticate to Kerberos using keytabs
- Served as automation engineer and advisor
- Applied Haskell-style functional programming in Scala using Cats
- Imported deeply nested JSON files into Hive and Impala and flattened them into a traditional SQL table structure
- Wrote real-time data-ingestion-to-HDFS apps using Linux shell scripting, Python, Java, and Scala
- Created a Docker CDH 5 development sandbox for prototyping
Java
Scala
Pandas
Machine Learning
NLP (Natural Language Processing)
Apache Spark
Apache Hadoop
Apache flume
Python 2
Hadoop Architect
Oalva, Inc
October 2016 - April 2017
Using Python, I automated full SQL translation of databases containing thousands of schemas and over 100,000 queries; the translator converted SQL between multiple data stores such as Teradata, Hive LLAP, and HAWQ/Greenplum. I ran performance profiling, tuning, and analysis on Hive LLAP and HAWQ/HDP/Greenplum databases. I wrote a data comparator using Python and Pandas that reported the differences between data sets and the reason for each difference (precision error vs. incorrect value vs. NULL value, etc.). I also wrote a concurrent Python script for automating the transfer of data from Hive into both Netezza and HBase; data extraction and loading were both done using dynamic concurrent processing in Python. Finally, I built a proof of concept running Spark SQL on top of Hive LLAP.
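The data comparator described above might be sketched roughly like this, assuming two pre-aligned columns (the label names and tolerance are illustrative, not the production tool's):

```python
import pandas as pd

def classify_diffs(expected, actual, tol=1e-6):
    """Compare two aligned Series and label why each pair of values differs.

    Labels are illustrative: match, null_mismatch,
    precision_error (floats within tol), or incorrect_value.
    """
    labels = []
    for e, a in zip(expected, actual):
        if pd.isna(e) and pd.isna(a):
            labels.append("match")
        elif pd.isna(e) or pd.isna(a):
            labels.append("null_mismatch")
        elif e == a:
            labels.append("match")
        elif isinstance(e, float) and isinstance(a, float) and abs(e - a) <= tol:
            labels.append("precision_error")
        else:
            labels.append("incorrect_value")
    return labels

expected = pd.Series([1.0, 2.0, None, 4.0])
actual   = pd.Series([1.0, 2.0000000001, 3.0, 5.0])
print(classify_diffs(expected, actual))
```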
SQL
Pandas
HBase
Teradata
Systems Programming
Netezza
Apache hawq
Hive llap
Python 2
Apache Hive
PROJECTS
Overnight Website Challenge
KVC Health Systems, The Nerdery
2014
Built a new website for KVC Health Systems in 24 hours.
HTML/CSS
Ruby on Rails
PostgreSQL
Heroku
JavaScript
Overnight Website Challenge
PrincipalsConnect
2017
Built a website in 24 hours for PrincipalsConnect
HTML/CSS
Ruby on Rails
PostgreSQL
Heroku
Continuous Integration
Docker
React
JavaScript
Continuous Deployment
Redux