Machine Learning Study Goals โ€“ November 2022: Back to projects and exploration

In this monthly post, I tell you what I plan to study or improve on in the area of machine learning (including an update at the end of the month). Last month was a bit … non-optimal. So this month I’m learning from my experience and going back to practical goals despite loving theory. Let’s……

Time Series Analysis and Forecasting – An Introduction

You need to analyze a time series but have no idea where to start? Then youโ€™re in the right place. In this post, I give you a rough overview and pointers on what to learn next for your specific problem. Here you can see an example time series of monthly airline passengers. We will discuss……

3 Lessons from the paper “Attention Is All You Need” as a Beginner

With the recent surge of news about image, or even art, generating AI, I’m sure I’m not the only one who was interested in how this works under the hood. Most of these results are achieved by so-called “transformer” models and even though it seems like they are coming out just now with the huge……

Kaggle Playground Series, August 2022: What I learned

I participated in this month’s beginner’s challenge on a simulated dataset that Kaggle releases every month. In the August 2022 challenge we are given simulated data from a fictional product test series and given the measured data, the task is to predict whether the product will fail or not in each case. In this post……

How is Data Science different from Machine Learning?

I have been working in a “Data Science” consulting team for 8 months now. Before that I got a Master’s degree in Computer Science with a machine learning specialization. So I could argue that I have seen both sides of the coin here and I have noticed some differences. Disclaimer: You will have trouble finding……

How to build a Decision Tree for Classification with Python

As promised in my July 2022 Machine Learning Study Plans, here is content on decision trees. Specifically, let’s talk about how you can build a trained decision tree for a classification problem with the Python library Scikit-Learn. I will also address what steps you need to take before using the example dataset in terms of……

One Hot Encoding – How to deal with categorical data in Machine Learning

Many models in machine learning don’t work with categorical data. So what do we do in that case? Of course you can always just remove them, but you would lose a lot of valuable information. So in this post, I share how you can use one hot encoding to make that information usable. I stumbled……

Consent Management Platform by Real Cookie Banner