Jose A Dianes
Data Engineer & Computer Science PhD
ABOUT ME
First 15 mins free
5.0
155 Followers
2 Sessions

With more than a decade of experience in software engineering, I have been involved in different aspects of distributed and enterprise systems applied to domains such as Bioinformatics, Ambient Sensing, and Real-time Simulators.
I have a special interest in architectural principles, scalability, the use of data analytics and machine learning, and how they can give organisations the edge.
EXPERTISE
Apache Spark
- 1 year experience
I came to Spark from a software engineering background. We had a Hadoop pipeline for clustering proteomics spectral data. Lately I've been working on on-line spectral search and on several Spark tutorials.

Python
- 3 years experience
If I had to choose a single language to work with, it would be Python. It might not be the most efficient one, but as a language it is very clean and expressive. Its real power, however, comes from being a language that you can use for all sorts of things, including batch processing, system scripting, web development, and data analysis.
R
- 3 years experience
I came to R by chance, while working at a bioinformatics institution. We needed to provide tools for researchers to deal with biological data. I have to confess that at first I didn't like it. Being a software engineer, I prefer languages that allow you to build and scale complex software systems. R is not meant to be that way. The language is built for statistics and data analysis work, and it probably has the richest ecosystem for that purpose. In the end I came to love working in RStudio, using DataFrames and ggplot2, and building all sorts of machine learning models with just a few lines of code. It is a very powerful tool that you can now even combine with Apache Spark!
Machine learning
- 3 years experience
I came to machine learning and data analysis from a software engineering background. I have a PhD in real-time distributed systems, and my interest is in building data products that give companies the edge. I am particularly interested in data products that can scale to large amounts of data and work in real time.
POSTS BY JOSE
Building a Movie Recommendation Service with Apache Spark & Flask - Part 2
This Apache Spark tutorial goes into detail on how to use Spark machine learning models, or other kinds of data analytics objects, within a web service. Using Python makes this task very easy, thanks to Spark's own Python capabilities and to Python-based frameworks such as Flask.
Data Science with Python & R: Dimensionality Reduction and Clustering
An important step in data analysis is data exploration and representation. In this tutorial we will see how, by combining a technique called Principal Component Analysis (PCA) with clustering, we can represent in a two-dimensional space data defined in a higher-dimensional one while, at the same time, grouping this data into clusters of similar items and finding hidden relationships in it.
REVIEWS
Average Rating
5.0
(3 ratings)
Awesome mentor!
3
Pretty good
0
Could've been better
0
Needs improvement
0
Unsatisfactory
0
He helps in a short time.
He prepares the session in advance.
All my doubts were solved.
Maria
Aug 17, 2015
Very helpful!!
He answers and solves my doubts very quickly.
Maria
Aug 17, 2015
© Copyright 2017 Codementor