Skip to content

Project developed for the course Data Warehouse at UFSC. Using data from the Entrance Exam of the UFSC from 2008 to 2012 a Data Mart and Dashboard were implemented for visualization and analysis support.

License

Notifications You must be signed in to change notification settings

nicolasantero/DataWarehouse-ETL-Analytics_EntranceExamUFSC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DataWarehouse_DataViz-Entrance Exam UFSC

Data Visualization

SOFTWARES

  • UniController
  • HeidiSQL
  • Pentaho Data Integration
  • SQL Power Architect
  • Power BI

PROJECT

  1. Analyze the socio-academic model of the Coperve entrance exams from 2008 to 2012 in order to implement a Data Mart to support the following analysis;

    • To support the decision making of the Secretary of State of SC and to evaluate the performance in the subjects of the candidates from the state public schools, comparing them to the private and federal schools.
  2. Create a star schema (dimensional model) to support the analyses analyses;

  3. Define the physical design for the Data Mart, following standardization, establish a transition area for the ETL process;

  4. Creating the Data Mart Front End in an OLAP Tool.

FOLDERS

  • Back-end (information needed to access the database, folder with the scripts/kettle files used in ETL, dimensional modeling);
  • Front-end (information needed to access the data visualization);

INSTRUCTIONS

  • Create a local MySQL connection , in this project UniController was used to create the connection;

  • Using a MySQL database manager (HeidiSQL) and connect to the MySQL connection;

  • Open the vestibular data dump in the database manager;

  • Use the file Script_Create_DW to create the DataWarehouse Fact and Dimensions table, but still without any data;

  • With the modeling created in HeidiSQL, Spoon (Pentaho Data Integration) was used to perform the ETL steps, connecting to the same SQL connection of the DataWarehouse and the vestibular data source;

  • In Spoon, the sql connection is used to pull the acceptance exam data, perform the transformations and then load in the respective dimensions and facts from the DataWarehouse database;

  • The file dw_ufsc_with_data.sql contains the DataWarehouse populated with the data after the Load Step;

MODELING

  • Relational Modeling of the source database Relational Model
  • Dimensional Modeling of the DataWarehouse Modelo Dimensional

About

Project developed for the course Data Warehouse at UFSC. Using data from the Entrance Exam of the UFSC from 2008 to 2012 a Data Mart and Dashboard were implemented for visualization and analysis support.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published