I’m challenging myself to do a full data science project on a Kaggle data set in 14 days – including data understanding, preprocessing, modeling and evaluation. I’m sharing this to make sure I actually do it and to show a realistic portrayal of what is possible in 14 days if you’re not a Kaggle grandmaster.……
Category: Machine Learning
Machine Learning Study Goals – January 2023: SHAP paper & getting started with Scala
In this monthly post, I tell you what I plan to study or improve on in the area of machine learning (including an update at the end of the month). Oh, and if you want to receive (bi)weekly updates on how I’m doing with these goals, consider subscribing to my newsletter. Goal 1: Read……
Machine Learning Study Goals – November 2022: Back to projects and exploration
In this monthly post, I tell you what I plan to study or improve on in the area of machine learning (including an update at the end of the month). Last month was a bit … non-optimal. So this month I’m learning from my experience and going back to practical goals despite loving theory. Let’s……
Time Series Analysis and Forecasting – An Introduction
You need to analyze a time series but have no idea where to start? Then you’re in the right place. In this post, I give you a rough overview and pointers on what to learn next for your specific problem. Here you can see an example time series of monthly airline passengers. We will discuss……
3 Lessons from the paper “Attention Is All You Need” as a Beginner
With the recent surge of news about image, or even art, generating AI, I’m sure I’m not the only one who was interested in how this works under the hood. Most of these results are achieved by so-called “transformer” models and even though it seems like they are coming out just now with the……
Machine Learning Study Goals – October 2022: Time Series
Welcome to another installment of: I share my goals for improving at Machine Learning and we can all guess how many of those I will reach. I will post an update at the end of the month as always. My theme for this month is time series. Last month I had some time series……
Machine Learning Study Goals – September 2022
By now, this is turning into a monthly tradition on this blog: I’m sharing my goals for improving at Machine Learning with you and at the end of the month I update you on how I did to keep myself accountable. This one looks quite ambitious, but I have some free brain capacity due to……
Kaggle Playground Series, August 2022: What I learned
I participated in this month’s beginner’s challenge on a simulated dataset that Kaggle releases every month. In the August 2022 challenge we are given simulated data from a fictional product test series and given the measured data, the task is to predict whether the product will fail or not in each case. In this post……
How to impute missing values in Python DataFrames
It would be great if our data was always ready to go and be fed into a machine learning model. That is however not the case in most projects. Among the many possible issues, a likely one is that your data has missing values. Possible reasons for missing data are that perhaps the data was……
How is Data Science different from Machine Learning?
I have been working in a “Data Science” consulting team for 8 months now. Before that I got a Master’s degree in Computer Science with a machine learning specialization. So I could argue that I have seen both sides of the coin here and I have noticed some differences. Disclaimer: You will have trouble finding……