Skip to content

Use Python, Pandas, Spark etc to demontrate that correlation can be used as a basis for decision making

License

Notifications You must be signed in to change notification settings

devonfw-forge/python-data-driven-decisions

 
 

Repository files navigation

Data Driven Decisions

Use Python, Pandas, Spark etc to demontrate that correlation can be used as a basis for decision making.

This project consists of finding the correlation between the GDP (Gross Domestic Product) and social and economical indicators, such as population growth, fertility rates, investment in specific sectors or prices.


Explanation of the followed process

The Hypothesis: It is assumed that there exists a correlation between economic growth and indicators as infant mortality, access to education... We want to demonstarte the validity of this assumption based on available datasets.

In order to check the veracity of this hypothesis the following steps are going to be followed:

Execute the project

Execute the notebooks in the following order:

  1. Data_load
  2. Data_normalization and outliers
  3. Data_filling
  4. Data clustering by countries
  5. Data clustering by indicators
  6. Data predictions
  7. Data sequencies

This will create a series of output DataFrames as .csv files.

First step : Choose the indicators

In order to study the correlation between the economic indicators and some socio-demographic indicators, we have to choose the different indicators :

  1. Gdp from 1850 to 2020 in pounds

  2. Infant mortality of children under 5 years old

  3. Percentage of population age 15+ with tertiary schooling.

  4. Fertility rate

  5. gender inequality

  6. Life expectancy

I choose to measure the economic growth to compare the indicators with the GDP of the country.

Select source of information

I choose to extract datasets about these indicators from the website Our world in data

Showing the charts

image

About

Use Python, Pandas, Spark etc to demontrate that correlation can be used as a basis for decision making

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •