All the things I am writing good or bad
Data Analysis Tools - Week 1
Data for this study comes from the Gapminder World Dataset collected by the Gapminder Foundation. The Gapminder World Dataset contains data collected from more than 200 countries/areas for more 500 variables.
Below is the description of the variables
Continents
Female Employment Rate (variable code: femaleemployrate, Unit: Percentage) - Employed females (age > 15) as a percentage of the total female population. Female Employment Rate is the response variable
Start with import
data = pd.read_csv('gapminder.csv',low_memory=False)
I will be using url to get the data online
join the two dataframe
We create a dataframe sub
out from the merge dataframe df_outer
Since our p-value is 0.0455
which is smaller than 0.05
, the data provides significant evidence against the null hypothesis. But, we cannot reject the null hypothesis and accept the alternate hypothesis, right away. To avoid Type I error we need to perform the POST HOC test
From the result above we can only say that there is a significant difference between Africa and Asia’s female employment rate.
For other pair of continents we fail to reject the NULL Hypothesis