You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+15-34Lines changed: 15 additions & 34 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,47 +6,28 @@ This is a personal repository for Udacity Programming For Data Science projects.
6
6
7
7
### Overview
8
8
9
-
In this project, the student had to use SQL to explore a database related to movie rentals. We had to write SQL code to run SQL queries and answer interesting questions about the provided database.
9
+
This project consisted in the exploration of a provided database movie rentals. The student had to run SQL queries and build visualizations to showcase the output of the student's queries. For the presentation component, the student had to create four slides, and each should have (a) a question of interest, (b) a supporting SQL query needed to answer the question, (c) a supporting visualization created using the final data of the SQL queries that answer the questions of interest, and (d) a small summary on each slide.
10
10
11
-
As part of the Project Submission, the student had to run SQL queries and build visualizations to showcase the output of the student's queries.
12
-
13
-
The Project Submission is a presentation, which will be reviewed, and for which you will need to meet the criteria as specified in the Rubric to pass. For the presentation component, the student had to create four slides. Each slide should have (a) a question of interest, (b) a supporting SQL query needed to answer the question, (c) a supporting visualization created using the final data of the SQL queries that answer the questions of interest, and (d) a small summary on each slide. [Project Submission](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree#project-submission).
14
-
15
-
### Question Sets
16
-
17
-
A set of questions have been provided by Udacity so that we were free to consider and include them in our Project Submission. These were solely provided for our convenience, and we were able to choose between any of the questions in these sets or none at all in your project submission.
18
-
19
-
Despite having not used any of the questions provided by Udacity, I still chose to query each question as shown below.
20
-
21
-
#### Question Set 1
22
-
23
-
Question 1: We want to understand more about the movies that families are watching. The following categories are considered family movies: Animation, Children, Classics, Comedy, Family and Music. Create a query that lists each movie, the film category it is classified in, and the number of times it has been rented out. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-1-question-1.sql).
24
-
25
-
Question 2: Now we need to know how the length of rental duration of these family-friendly movies compares to the duration that all movies are rented for. Can you provide a table with the movie titles and divide them into 4 levels (first_quarter, second_quarter, third_quarter, and final_quarter) based on the quartiles (25%, 50%, 75%) of the rental duration for movies across all categories? Make sure to also indicate the category that these family-friendly movies fall into. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-1-question-2.sql).
26
-
27
-
Question 3: Finally, provide a table with the family-friendly film category, each of the quartiles, and the corresponding count of movies within each combination of film category for each corresponding rental duration category. The resulting table should have three columns: Category, Rental length category and Count. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-1-question-3.sql).
28
-
29
-
#### Question Set 2
30
-
31
-
Question 1: We want to find out how the two stores compare in their count of rental orders during every month for all the years we have data for. Write a query that returns the store ID for the store, the year and month and the number of rental orders each store has fulfilled for that month. Your table should include a column for each of the following: year, month, store ID and count of rental orders fulfilled during that month. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-2-question-1.sql).
32
-
33
-
Question 2: We would like to know who were our top 10 paying customers, how many payments they made on a monthly basis during 2007, and what was the amount of the monthly payments. Can you write a query to capture the customer name, month and year of payment, and total payment amount for each month by these top 10 paying customers? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-2-question-2.sql).
11
+
### Project Submission
34
12
35
-
Question 3: Finally, for each of these top 10 paying customers, I would like to find out the difference across their monthly payments during 2007. Please go ahead and write a query to compare the payment amounts in each successive month. Repeat this for each of these 10 paying customers. Also, it will be tremendously helpful if you can identify the customer name who paid the most difference in terms of payments. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-2-question-3.sql).
13
+
You can download the Project Submission [here](https://drive.google.com/open?id=1sfcOviwrgdNjnhghi6N4IOobMzDNGz4w).
36
14
37
-
### Chosen Questions
15
+
Question 1: How do the rentals for the 10 top renting countries compare to the other countries for each category?
Question 1: How do the rentals for the 10 top renting countries compare to the other countries for each category? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/1st-presentation-query.sql).
18
+
Question 2: How are the movies distributed by the amount of times they were rented inside of a category?
Question 2: How are the movies distributed by the amount of times they were rented inside of a category? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/2nd-presentation-query.sql).
21
+
Question 3: How much did the top 10 paying customers spend on DVD rental over 2007?
Question 3: How much did the top 10 paying customers spend on DVD rental over 2007? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/3rd-presentation-query.sql).
24
+
Question 4: How many customers have paid less in rentals compared to the previous month?
Question 4: How many customers have paid less in rentals compared to the previous month? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/4th-presentation-query.sql).
27
+
### Question Sets
46
28
47
-
### Project Submission
29
+
A set of questions have also been provided by Udacity so that we were free to consider and include them in our Project Submission. These were solely provided for our convenience, and we were able to choose between any of the questions in these sets or none at all in your project submission.Despite having not used any of the questions provided by Udacity, I still chose to query each [question](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/tree/master/1st-project/set-questions).
48
30
49
-
You can find the project submission here.
50
31
51
32
## 2nd-project
52
33
@@ -58,7 +39,7 @@ In this project, the student had to make use of Python to explore data related t
58
39
59
40
The developed CLI program allows the user to explore an US bikeshare system database and retrieve statistics information from the database. The user is able filter the information by city, month and weekday, in order to visualize statistics information related to a specific subset of data. The user is also able to chose to view raw data and to sort this data by columns, in ascending or descending order.
0 commit comments