data science Archives

10 things I didn’t know about Data Science a year ago

Data Science & SQL, PythonBy Jörn November 12, 2018

In my article My personal road map for learning data science in 2018 I wrote about how I try to tackle the data science knowledge sphere. Due to the fact that 2018 is slowly coming to an end I think it is time for a little wrap up. What are the things I learned about…

Classification: Precision and Recall

Data Science & SQLBy Jörn June 28, 2018

In the realms of Data Science you’ll encounter sooner or the later the terms “Precision” and “Recall”. But what do they mean? Clarification Living together with little kids You very often run into classification issues: My daughter really likes dogs, so seeing a dog is something positive. When she sees a normal dog e.g. a…

Lesson 2: Naive Bayes

Data Science & SQL, PythonBy Jörn June 19, 2018

Lesson 2 of the Udacity Course UD120 – Intro to Machine Learning deals with Naive Bayes classification.

Linear Algebra with numpy

Data Science & SQL, PythonBy Jörn May 4, 2018

Numpy is a package for scientific computing in Python. It is blazing fast due to its implementation in C. It is often used together with pandas, matplotlib and Jupyter notebooks. Often these packages are referred to as the datascience stack. Installation You can install numpy via pip pip install numpy Basic Usage In the datascience…

Introduction to Jupyter Notebook

Data Science & SQL, Python, ToolsBy Jörn April 25, 2018

JuPyteR Do You know the feeling of being already late to a party when encountering something new? But when you actually start telling others about it, you realize that it is not too common knowledge at all, e.g. Jupyter Notebooks. What is a Jupyter notebook? In my own words: a browser-based document-oriented command line style…

Data Science Datasets: Iris flower data set

Data Science & SQLBy Jörn April 25, 2018

Motivation When you are going to learn some data science the aquisition of data is often the first step. To get you started scikit-learn comes with a bunch of so called “toy datasets”. One of them is the Iris dataset. Prerequisites & Imports Besides scikit-learn we will use pandas for data handling and matplotlib with…

Data Science Overview

Data Science & SQLBy Jörn March 7, 2018

Questions Data Science tries to answer one of the following questions: Classification -> “Is it A or B?” Clustering -> “Are there groups which belong together?” Regression -> “How will it develop in the future?” Association -> “What is happening very often together?” There are two ways to tackle these problem domains with machine learning:…

My personal roadmap for learning data science in 2018

Data Science & SQL, Self-Improvement & Personal FinanceBy Jörn December 13, 2017

I got confused by all the buzzwords: data science, machine learning, deep learning, neural nets, artificial intelligence, big data, and so on and so on. As an engineer I like to put some structure to the chaos. Inspired by Roadmap: How to Learn Machine Learning in 6 Months and Tetiana Ivanova – How to become…

Bayes’ Theorem

Data Science & SQLBy Jörn December 3, 2017

Imagine that you come home from a party and you are stopped by the police. They ask you to take a drug test and you accept. The test result is positive. You are guilty. But wait a minute! Is it really that simple?

Tag Archives: data science