Data Science Tutorials and Insights

Learn about the latest trends in Data Science. Read tutorials, posts, and insights from top Data Science experts and developers for free.

GET STARTED

Data Science tutorials, posts, and more

Building Data Products with Python: Adding User Management to a Django website

This is the second tutorial on our series on how to build data products with Python. In this second tutorial, we will add user management. This is an important part. Once we are able to identify individual users, we will be ready to generate user recommendations through machine learning.
Building Data Products with Python: Adding User Management to a Django website

Exploring geographical data using SparkR and ggplot2

The present analysis will use the power of SparkR to analyse large datasets in order to explore the 2013 American Community Survey dataset, more concretely its geographical features.
Exploring geographical data using SparkR and ggplot2

Extending Apache Pig with Python UDFs

Apache Pig is a popular system for executing complex Hadoop map-reduce based data-flows. Pig is especially great because it is extensible. By the end of this tutorial, you will be able to write PigLatin scripts that execute Python code as a part of a larger map-reduce workflow.
Extending Apache Pig with Python UDFs

Linear Models with SparkR 1.5: Uses and Present Limitations

In this analysis we will use SparkR machine learning capabilities in order to try to predict property value in relation to other variables present in the 2013 American Community Survey dataset.
Linear Models with SparkR 1.5: Uses and Present Limitations

Spark & R: data frame operations with SparkR

In this third tutorial (see the previous one) we will introduce more advanced concepts about SparkSQL with R that you can find in the SparkR documentation, applied to the 2013 American Community Survey housing data. These concepts are related with data frame manipulation, including data slicing, summary statistics, and aggregations.
Spark & R: data frame operations with SparkR

Getting Started with Cassandra and Spark

This tutorial is going to go through the steps required to install Cassandra and Spark on a Debian system and how to get them to play nice via Scala.
Getting Started with Cassandra and Spark

How to Set Up Hadoop in Pseudo Distributed Mode on Single Cluster

This step-by-step tutorial will teach you how to set up Apache Hadoop in Pseudo-Distributed Mode on Single cluster.
How to Set Up Hadoop in Pseudo Distributed Mode on Single Cluster

Cheat Sheet: Python For Data Science

The cheat sheet is a handy addition to your learning, as it covers the basics, brought together in seven topics, that any beginner needs to know to get started doing data science with Python.
Cheat Sheet: Python For Data Science

Basic Pandas

Demo how to use pandas to do basic data analysis
Basic Pandas

New Jupyter Client Released

A new implementation of the Jupyter notebook with realtime synchronization, written using React.js.
New Jupyter Client Released

Ingredients in the making of a Data Scientist

What exactly is data science? How does one prepare for a career in data science? Well, Data Scientist is the hottest profession in the 21st century.

Is Google Tensorflow Object Detection API the easiest way to implement image recognition?

Google Tensorflow Object Detection API
Is Google Tensorflow Object Detection API the easiest way to implement image recognition?

Get curated posts in your inbox

Learn programming by reading more posts like this

YOU MAY ALSO BE INTERESTED IN

Share ideas
with an editor
built for developers

LEARN MORE