Differences in end of line characters

When you work with different operating systems, you encounter different line endings. A file edited on a Linux system and opened on a Windows machine can look wrong, for example with all the text on a single line in editors that expect CR + LF.

Here is a short overview of which system uses which control characters to end a line:

OS            Control character
Windows       CR + LF
Linux         LF
Mac OS <= 9   CR
Mac OS X      LF
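
To make the overview concrete, here is a minimal Python sketch that guesses which convention a file uses and converts everything to LF. The function names detect_line_ending and normalize_to_lf are made up for this illustration, and the "most frequent ending wins" rule is just one simple assumption, not a standard.

```python
def detect_line_ending(data: bytes) -> str:
    """Return the most frequent line-ending style in raw file bytes."""
    crlf = data.count(b"\r\n")
    # Lone CR and lone LF: subtract the ones already counted as part of CR + LF.
    cr = data.count(b"\r") - crlf
    lf = data.count(b"\n") - crlf
    counts = {"CRLF": crlf, "CR": cr, "LF": lf}
    style = max(counts, key=counts.get)
    # If nothing was found at all, report that instead of guessing.
    return style if counts[style] > 0 else "none"


def normalize_to_lf(data: bytes) -> bytes:
    """Convert CR + LF and lone CR line endings to LF."""
    # Replace CR + LF first so its CR is not converted a second time.
    return data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")


if __name__ == "__main__":
    windows_text = b"first line\r\nsecond line\r\n"
    print(detect_line_ending(windows_text))
    print(normalize_to_lf(windows_text))
```

The file has to be read in binary mode (open(path, "rb")) for this to work, because Python's text mode translates line endings while reading by default.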

Classification: Precision and Recall

In data science you will sooner or later encounter the terms “Precision” and “Recall”. But what do they mean?

Clarification

Living with little kids, you very often run into classification issues:

My daughter really likes dogs, so seeing a dog is something positive. When she sees an ordinary dog, e.g. a Labrador, and proclaims “Look, there is a dog!”, that’s a True Positive (TP).
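
As a rough sketch of where this is heading: the excerpt above only covers the True Positive case, so the code below relies on the standard textbook definitions, not on anything specific to this post. A False Positive (FP) would be calling something a dog that is not one, and a False Negative (FN) would be missing a real dog. The counts are invented for illustration.

```python
def precision(tp: int, fp: int) -> float:
    """Of everything called a dog, how much really was one: TP / (TP + FP)."""
    total = tp + fp
    return tp / total if total else 0.0


def recall(tp: int, fn: int) -> float:
    """Of all real dogs, how many were spotted: TP / (TP + FN)."""
    total = tp + fn
    return tp / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical afternoon in the park: 6 dogs correctly spotted,
    # 2 non-dogs mistaken for dogs, 4 real dogs overlooked.
    tp, fp, fn = 6, 2, 4
    print(precision(tp, fp))  # 0.75
    print(recall(tp, fn))     # 0.6
```

Returning 0.0 when the denominator is zero is a convenience chosen for this sketch; other conventions treat that case as undefined.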

Lesson 4: Decision Trees

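from sklearn.tree import DecisionTreeClassifier

# A node must hold at least 40 samples before it may be split further.
clf = DecisionTreeClassifier(min_samples_split=40)
# features_train and labels_train are assumed to be the training data prepared earlier.
clf.fit(features_train, labels_train)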

UD120 – Intro to Machine Learning

One part of my bucket list for 2018 was finishing the Udacity Course UD120: Intro to Machine Learning.

The hosts of this course are Sebastian Thrun, formerly of Google X and founder of Udacity, and Katie Malone, creator of the Linear Digressions podcast.

The course consists of 17 lessons. Every lesson has a couple of hours of video and lots and lots of quizzes in it.

  • [x] Lesson 1: Only introduction 🙂
  • [x] Lesson 2: Naive Bayes
  • [x] Lesson 3: Support Vector Machines
  • [x] Lesson 4: Decision Trees
  • [x] Lesson 5: Choose your own algorithm
  • [ ] Lesson 6: Datasets and questions
  • [ ] Lesson 7: Regression
  • [ ] Lesson 8: Outliers
  • [ ] Lesson 9: Clustering
  • [ ] Lesson 10: Feature Scaling
  • [ ] Lesson 11: Text Learning
  • [ ] Lesson 12: Feature Selection
  • [ ] Lesson 13: PCA
  • [ ] Lesson 14: Validation
  • [ ] Lesson 15: Evaluation Metrics
  • [ ] Lesson 16: Tying it all together
  • [ ] Lesson 17: Final project