The repository contains Structured Query Language (SQL) Scripts. The Multiple SQL scripts for various projects which includes data cleaning, data pre-processing, data processing, data transformation and insights gaining through Query Language.
The data is collected by Loan Lending Institution for many individuals from multiple banks whether to sanction the loan approval or not based on their past behaviour in repayment of loans or account mainteinance in their respective banks. The data provided by our stake holder is as follows.
- Account - The dataset has account id, district id, frequency & date. (No. of Records - 4,500)
- Card - The dataset has card id, disposition id, type & issued. (No. of Records - 892)
- Client - The dataset has client id , birth number alomg with district id. (No. of Records - 5,369)
- Disp - The dataset has disposition id, client id, account id, type. (No. of Records - 5,369)
- District - The dataset has different factors such as A1 - A16 which includes much datas than the other datasets but not explained in a well manner. (No. of Records - 77)
- Loan - The dataset has loan id, account id, date, amount, duartion, payment & status. (No. of Records - 682)
- Order - The dataset has order id, account id, bank to, account to, amount, k_symbol. (No. of Records - 6,471)
- Transaction Data - The dataset has transaction id, account id, date, type, operation, amount, balance, k_symbol, bank & account. (No. of Records - 10,56,320)