Data Science Tutorials and Insights

Learn about the latest trends in Data Science. Read tutorials, posts, and insights from top Data Science experts and developers for free.

Data Science tutorials, posts, and more

Trending Developer Skills, Based on my Analysis of “Ask HN: Who’s Hiring?”

The most popular programming languages, databases and software development tools from "Ask HN: Who is hiring?"

Machine Learning in Plain English: Building a Decision Tree Model to Classify Names by Gender: Part One

Learn how to build a decision tree model to classify names by gender, using machine learning, explained in plain English.

How I learned R

How I learned R. A little about me and the techniques I used to learn this robust data science language.

UC Berkeley Machine Learning Crash Course: Part 1

Learn all the basics of machine learning: regression, cost functions, and gradient descent. This is the first article in Machine Learning at Berkeley's Crash Course series.

Cheat Sheet: Python For Data Science

The cheat sheet is a handy addition to your learning, as it covers the basics, brought together in seven topics, that any beginner needs to know to get started doing data science with Python.

How to Set Up Hadoop in Pseudo Distributed Mode on Single Cluster

This step-by-step tutorial will teach you how to set up Apache Hadoop in pseudo-distributed mode on a single cluster.

Getting Started with Cassandra and Spark

This tutorial walks through the steps required to install Cassandra and Spark on a Debian system and get them to work together via Scala.

Exploring geographical data using SparkR and ggplot2

This analysis uses SparkR to work with large datasets in order to explore the 2013 American Community Survey dataset, specifically its geographical features.

Extending Apache Pig with Python UDFs

Apache Pig is a popular system for executing complex Hadoop map-reduce based data-flows. Pig is especially great because it is extensible. By the end of this tutorial, you will be able to write PigLatin scripts that execute Python code as a part of a larger map-reduce workflow.

Building Data Products with Python: Using Machine Learning to Provide Recommendations

This is the third part of our tutorial on how to build a web-based wine review and recommendation system using Python technologies such as Django, Pandas, SciPy, and Scikit-learn. In this part, you will learn how to use machine learning to recommend wines to users based on their preferences.

Spark & R: data frame operations with SparkR

In this third tutorial (see the previous one) we will introduce more advanced concepts about SparkSQL with R that you can find in the SparkR documentation, applied to the 2013 American Community Survey housing data. These concepts relate to data frame manipulation, including data slicing, summary statistics, and aggregations.

Building Data Products with Python: A Wine Review Website using Django and Bootstrap

This tutorial starts a series about how to build data products with Python. As a leitmotif, we want to build a wine review and recommendation website using Python technologies such as Django and Pandas. We have chosen to build a wine reviews and recommendations website, but the concepts and the technology stack...

Spark & R: Loading Data into SparkSQL Data Frames

In this second Spark & R tutorial, we will read data into a SparkSQL data frame as well as have a quick look at the schema.

Spark & R: Downloading data and Starting with SparkR using Jupyter notebooks

In this tutorial we will use the 2013 American Community Survey dataset and start up a SparkR cluster using IPython/Jupyter notebooks.

Spark & Python: SQL & DataFrames

This tutorial introduces Spark's SQL and data frame capabilities. By using SQL and data frames, you can perform exploratory data analysis easily.

Building a Movie Recommendation Service with Apache Spark & Flask - Part 2

This Apache Spark tutorial goes into detail on how to use Spark machine learning models, or other kinds of data analytics objects, within a web service. Using Python makes this task very easy, thanks to Spark's own Python capabilities and to Python-based frameworks such as Flask.

Data Science with Python & R: Exploratory Data Analysis

In this article, we will take an exploratory look at the crucial steps in Python's and R's data analytics process.

Spark & Python: MLlib Logistic Regression

In this tutorial, you will learn how to use Spark's machine learning library MLlib to build a Logistic Regression classifier for network attack detection.

Data Science with Python & R: Data Frames I

This series of tutorials on Data Science compares how different concepts in the discipline can be implemented in the two dominant ecosystems today: R and Python.

Spark & Python: Working with RDDs (I)

This tutorial introduces two different ways of getting data into the basic Spark data structure, RDD.
