Skip to content

This project applies data modeling concepts such as data normalization (BCNF), and schema design (snowflake) to structure and optimize datasets, allowing analytical queries to enable efficient insights into food consumption patterns and correlation with hunger and malnutrition across countries and regions.

Notifications You must be signed in to change notification settings

mtwchuang/CSC2008-Global-Hunger-Insights

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CSC2008-Global-Hunger-Insights

Overview

The Global Hunger & Dietary Insights project aims to analyze global malnutrition trends, food consumption patterns, and nutritional availability using structured datasets.

Datasets

This project processes and models data related to:

  • Dietary Composition – analyzing per capita food consumption across countries and regions.
  • Nutritional Composition – mapping food types to macronutrient content (protein, carbohydrates, fats).
  • Global Hunger Index (GHI) – measuring hunger severity per country.
  • Children’s Malnutrition Rates – tracking stunting, wasting, and underweight prevalence.

Data Modelling Approach

Figure 1: Entity Relational Diagram

For efficiency, scalability, and accuracy, the project followed data modelling process involving normalisation, data cleaning, and proper schema design.

Normalization

The data was normalized down to the Boyce-Codd Normal Form (BCNF) standard, achieved by addressing repeating groups (1NF), partial dependencies (2NF), and transitive dependencies (3NF).

  • First Normal Form (1NF): The raw dataset contained repeating groups where multiple food types were stored as separate columns rather than rows. These columns were unpivoted into rows.

  • Second Normal Form (2NF): Partial dependencies were removed by splitting dimension attributes into separate tables, leaving quantitative metrics and references in the fact tables.

Third Normal Form (3NF): Transitive dependencies were eliminated by creating an additional nutrition_dim table to separate food nutrient values from food_dim.

Data Cleaning

Schema Design

About

This project applies data modeling concepts such as data normalization (BCNF), and schema design (snowflake) to structure and optimize datasets, allowing analytical queries to enable efficient insights into food consumption patterns and correlation with hunger and malnutrition across countries and regions.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages