Skip to content

Commit 8392dac

Browse files
authored
Update Tasks register report
1 parent 55cfcc8 commit 8392dac

File tree

1 file changed

+5
-2
lines changed

1 file changed

+5
-2
lines changed

Tasks register report

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,10 @@
99
As we were not experts through Python, we started learning the fundamentals and basic functions throughout some videos: one recorded internally by Capgemini and the other ones mainly from Pluralsight courses, such as Python for Data Analysts, Pandas Fundamentals and Finding Relationships in Data.
1010
At the same time, we began programming the initial functions to open the files and have an overview of the data. Through this inspection of data, we observed that:
1111
- Data was not normalized.
12-
- Depending on the indicator, its corresponding file was order in a different way.
13-
Having in mind that these two observations implied a "problem" for us, we realized that the files (CSV) available in the webpage Our World in Data were being extracted from other sources, mainly FAO. So, we decided to extract directly the data
12+
- Depending on the indicator, its corresponding file was ordered in a different way.
13+
Having in mind that these two observations implied a "problem" for us, we realized that the files (CSVs) available in the webpage Our World in Data were being extracted from other sources, mainly FAO. So, we decided to extract directly the data
1414
from there getting the benefits of having data normalized and unified.
15+
We had 68 files that could be related to GDP growth. In each one, the information cointained categories of country, year, units, value and others (consider not relevant for our study).
16+
Therefore, what we made was putting all the files in the same table, joining data basing on the conditions of same country and year. So, what we got was a table containing the value of each indicator in each year and region.
17+
It is important to note that indicators´files were not all same size, some registering more ancient historical values or more in depth data by region than others.
1518

0 commit comments

Comments
 (0)