× {{alert.msg}} Never ask again
Receive New Tutorials GET IT FREE

Data Science Data Science Tutorials

Learn more about crucial data analysis concepts to get started on data science. Check out our Beginner and Intermediate Tutorials and start analyzing data!


Cheat Sheet: Python For Data Science Codementor Team Codementor Team  ●  Beginner · Intermediate · Python · Data Science · Cheatsheet  ●  Jan 11, 2017
Cheat Sheet: Python For Data Science

The cheat sheet is a handy addition to your learning, as it covers the basics, brought together in seven topics, that any beginner needs to know to get started doing data science with Python.

Continue Reading
How to Set Up Hadoop in Pseudo Distributed Mode on Single Cluster Lakshay Nagpal Lakshay Nagpal  ●  Java · Hadoop · Bigdata · Data Science · Single cluster · Pseudo distributed  ●  Apr 22, 2016
How to Set Up Hadoop in Pseudo Distributed Mode on Single Cluster

This step-by-step tutorial will teach you how to set up Apache Hadoop in Pseudo-Distributed Mode on Single cluster.

Continue Reading
Intro to Machine Learning & NLP with Python and Weka Benjamin Cohen Benjamin Cohen  ●  Python · Data Science  ●  Apr 08, 2016
Intro to Machine Learning & NLP with Python and Weka

In this tutorial, you’ll be briefly introduced to machine learning with Python and Weka, a data processing and machine learning tool. The activity is to build a simple spam filter for emails and learn machine learning concepts.

Continue Reading
Getting Started with Cassandra and Spark Sheena Sheena  ●  Data Science · Spark · Cassandra · Linux  ●  Nov 03, 2015
Getting Started with Cassandra and Spark

This tutorial is going to go through the steps required to install Cassandra and Spark on a Debian system and how to get them to play nice via Scala.

Continue Reading
Linear Models with SparkR 1.5: Uses and Present Limitations Jose A Dianes Jose A Dianes  ●  Spark · R · Machine learning · Data Science  ●  Oct 02, 2015
Linear Models with SparkR 1.5: Uses and Present Limitations

In this analysis we will use SparkR machine learning capabilities in order to try to predict property value in relation to other variables present in the 2013 American Community Survey dataset.

Continue Reading
Exploring geographical data using SparkR and ggplot2 Jose A Dianes Jose A Dianes  ●  Spark · R · Ggplot2 · Data Science  ●  Sep 24, 2015
Exploring geographical data using SparkR and ggplot2

The present analysis will exploit the power of SparkR to analyse large datasets in order to explore the 2013 American Community Survey dataset, more concretely its geographical features.

Continue Reading
Spark & R: data frame operations with SparkR Jose A Dianes Jose A Dianes  ●  Spark · SQL · R · Data Science  ●  Sep 21, 2015
Spark & R: data frame operations with SparkR

In this third tutorial (see the previous one) we will introduce more advanced concepts about SparkSQL with R that you can find in the SparkR documentation, applied to the 2013 American Community Survey housing data. These concepts are related with data frame manipulation, including data slicing, summary statistics, and aggregations.

Continue Reading
Spark & R: loading data into SparkSQL data frames Jose A Dianes Jose A Dianes  ●  Spark · SQL · R · Data Science  ●  Sep 18, 2015
Spark & R: loading data into SparkSQL data frames

In this second Spark & R tutorial, we will read data into a SparkSQL data frame as well as have a quick look at the schema.

Continue Reading
Spark & R: Downloading data and Starting with SparkR using Jupyter notebooks Jose A Dianes Jose A Dianes  ●  Spark · R · Data Science  ●  Sep 17, 2015
Spark & R: Downloading data and Starting with SparkR using Jupyter notebooks

In this tutorial we will use the 2013 American Community Survey dataset and start up a SparkR cluster using IPython/Jupyter notebooks.

Continue Reading
Building Data Products with Python: Using Machine Learning to Provide Recommendations Jose A Dianes Jose A Dianes  ●  Python · Django · Machine learning · Data Science  ●  Sep 14, 2015
Building Data Products with Python: Using Machine Learning to Provide Recommendations

This is the third part of our tutorial on how to build a web-based wine review and recommendation system using Python technologies such as Django, Pandas, SciPy, and Scikit-learn. In this part, you will learn how to use machine-learning to recommend users wines based on their preferences.

Continue Reading