With the recent surge of news about AI that generates images, or even art, I’m sure I’m not the only one who is interested in how this works under the hood. Most of these results are achieved by so-called “transformer” models, and even though it seems like they are coming out just now with the huge……
Category: Machine Learning
Machine Learning Study Goals – October 2022: Time Series
Welcome to another installment of: I share my goals for improving at Machine Learning and we can all guess how many of those I will reach 😉 I will post an update at the end of the month as always. My theme for this month is time series. Last month I had some time series……
Machine Learning Study Goals – September 2022
By now, this is turning into a monthly tradition on this blog: I’m sharing my goals for improving at Machine Learning with you and at the end of the month I update you on how I did to keep myself accountable. This one looks quite ambitious, but I have some free brain capacity due to……
Kaggle Playground Series, August 2022: What I learned
I participated in this month’s beginner’s challenge on a simulated dataset that Kaggle releases every month. In the August 2022 challenge, we are given simulated data from a fictional product test series. Given the measured data, the task is to predict whether the product will fail in each case. In this post……
How to impute missing values in Python DataFrames
It would be great if our data were always ready to be fed into a machine learning model. That is, however, not the case in most projects. Among the many possible issues, a likely one is that your data has missing values. Possible reasons for missing data are that perhaps the data was……
How is Data Science different from Machine Learning?
I have been working in a “Data Science” consulting team for 8 months now. Before that I got a Master’s degree in Computer Science with a machine learning specialization. So I could argue that I have seen both sides of the coin here and I have noticed some differences. Disclaimer: You will have trouble finding……
First project as a Data Science Consultant: tasks, tools, meetings
Consulting can be a frustrating business to enter as a newcomer, because every time you ask a consultant a question about their typical work, they tend to answer with “It depends.” That is because a lot of our day-to-day depends on the specific data science consulting project but also on the client. So after recently completing……
Machine Learning Study Goals – August 2022
What do I want to learn this month? I liked doing this last month, so let’s continue this tradition of sharing my study goals for now. Quick context: I already have university degrees, and in January I started a job in Data Science that I enjoy, so my goals are largely internally motivated. The number……
How to build a Decision Tree for Classification with Python
As promised in my July 2022 Machine Learning Study Plans, here is content on decision trees. Specifically, let’s talk about how you can build a trained decision tree for a classification problem with the Python library Scikit-Learn. I will also address what steps you need to take before using the example dataset in terms of……
One Hot Encoding – How to deal with categorical data in Machine Learning
Many models in machine learning don’t work with categorical data. So what do we do in that case? Of course, you can always just remove the categorical columns, but you would lose a lot of valuable information. So in this post, I share how you can use one hot encoding to make that information usable. I stumbled……