Data Leakage

Data leakage occurs when a predictive model uses data in the training phase that are unavailable when the model is in production. Consider the example below (source: Mostafa Saad Ibrahim): The main concern about the data is related to splitting it, where images of the same animal might have occurred in both train and test …

Data Leakage Read More »

t-test

Suppose we have collected students’ grades at a certain test, and we have found that the mean of grades for male students is 70.2 and the mean for female students is 72.59.We can see that the two means are different but, is this difference statistically significant? T-Test can determine if there is a significant difference …

t-test Read More »

What is Data Science?

Data Science is one of the best-paying jobs around the world, but what does it really mean? Data science involves studying the data to conclude useful insights by using different methods such as visualizing the data, building machine learning models, and applying statistical tests. or in simple words according to Cassie Kozyrkov: Data science is …

What is Data Science? Read More »