The problem: Do you have many topics you would like to study or areas of coding you want to improve, but you can’t decide which is the best to focus on so you end up studying a little bit of everything and never making noticeable progress? Or you end up so overwhelmed that you just……
Category: Data Science
Completing a Data Science Project in 14 days – Fraud Detection Challenge Announcement
I’m challenging myself to do a full data science project on a Kaggle data set in 14 days – including data understanding, preprocessing, modeling and evaluation. I’m sharing this to make sure I actually do it and to show a realistic portrayal of what is possible in 14 days if you’re not a Kaggle grandmaster.……
Machine Learning Study Goals – November 2022: Back to projects and exploration
In this monthly post, I tell you what I plan to study or improve on in the area of machine learning (including an update at the end of the month). Last month was a bit … non-optimal. So this month I’m learning from my experience and going back to practical goals despite loving theory. Let’s……
Time Series Analysis and Forecasting – An Introduction
You need to analyze a time series but have no idea where to start? Then you’re in the right place. In this post, I give you a rough overview and pointers on what to learn next for your specific problem. Here you can see an example time series of monthly airline passengers. We will discuss……
How to impute missing values in Python DataFrames
It would be great if our data was always ready to go and be fed into a machine learning model. That is however not the case in most projects. Among the many possible issues, a likely one is that your data has missing values. Possible reasons for missing data are that perhaps the data was……
How is Data Science different from Machine Learning?
I have been working in a “Data Science” consulting team for 8 months now. Before that I got a Master’s degree in Computer Science with a machine learning specialization. So I could argue that I have seen both sides of the coin here and I have noticed some differences. Disclaimer: You will have trouble finding……
First project as a Data Science Consultant: tasks, tools, meetings
Consulting can be a frustrating business to enter into as a newcomer. Because every time you ask a consultant a question about their typical work, they tend to answer with “It depends.” Because a lot of our day-to-day depends on the specific data science consulting project but also on the client. So after recently completing……
Machine Learning Study Goals – August 2022
What do I want to learn this month? I liked doing this last month, so let’s continue this tradition of sharing my study goals for now. Quick context, I already have university degrees and I started a job in January in Data Science that I enjoy, so my goals are largely internally motivated. The number……
One Hot Encoding – How to deal with categorical data in Machine Learning
Many models in machine learning don’t work with categorical data. So what do we do in that case? Of course you can always just remove them, but you would lose a lot of valuable information. So in this post, I share how you can use one hot encoding to make that information usable. I stumbled……
Are Decision Trees the Siblings of Neural Networks? – Interesting research
I recently stumbled upon a research paper about decision trees that made me feel interested in decision trees for the first time in my life. Let me tell you why. My fascination with Neural Networks During my master’s degree (2020-2021) in Computer Science I developed a fascination for neural networks. But not because of their……