Commit f11974b

Update README.md
1 parent a758c07 commit f11974b

1 file changed

+15
-34
lines changed

README.md

Lines changed: 15 additions & 34 deletions
@@ -6,47 +6,28 @@ This is a personal repository for Udacity Programming For Data Science projects.
66

77
### Overview
88

9-
In this project, the student had to use SQL to explore a database related to movie rentals. We had to write SQL code to run SQL queries and answer interesting questions about the provided database.
9+
This project consisted of exploring a provided movie rental database. The student had to run SQL queries and build visualizations to showcase the query output. For the presentation component, the student had to create four slides, each containing (a) a question of interest, (b) the SQL query needed to answer the question, (c) a supporting visualization built from the final output of that query, and (d) a short summary.
1010

11-
As part of the Project Submission, the student had to run SQL queries and build visualizations to showcase the output of the student's queries.
12-
13-
The Project Submission is a presentation, which will be reviewed, and for which you will need to meet the criteria as specified in the Rubric to pass. For the presentation component, the student had to create four slides. Each slide should have (a) a question of interest, (b) a supporting SQL query needed to answer the question, (c) a supporting visualization created using the final data of the SQL queries that answer the questions of interest, and (d) a small summary on each slide. [Project Submission](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree#project-submission).
14-
15-
### Question Sets
16-
17-
A set of questions have been provided by Udacity so that we were free to consider and include them in our Project Submission. These were solely provided for our convenience, and we were able to choose between any of the questions in these sets or none at all in your project submission.
18-
19-
Despite having not used any of the questions provided by Udacity, I still chose to query each question as shown below.
20-
21-
#### Question Set 1
22-
23-
Question 1: We want to understand more about the movies that families are watching. The following categories are considered family movies: Animation, Children, Classics, Comedy, Family and Music. Create a query that lists each movie, the film category it is classified in, and the number of times it has been rented out. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-1-question-1.sql).
24-
25-
Question 2: Now we need to know how the length of rental duration of these family-friendly movies compares to the duration that all movies are rented for. Can you provide a table with the movie titles and divide them into 4 levels (first_quarter, second_quarter, third_quarter, and final_quarter) based on the quartiles (25%, 50%, 75%) of the rental duration for movies across all categories? Make sure to also indicate the category that these family-friendly movies fall into. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-1-question-2.sql).
26-
27-
Question 3: Finally, provide a table with the family-friendly film category, each of the quartiles, and the corresponding count of movies within each combination of film category for each corresponding rental duration category. The resulting table should have three columns: Category, Rental length category and Count. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-1-question-3.sql).
28-
29-
#### Question Set 2
30-
31-
Question 1: We want to find out how the two stores compare in their count of rental orders during every month for all the years we have data for. Write a query that returns the store ID for the store, the year and month and the number of rental orders each store has fulfilled for that month. Your table should include a column for each of the following: year, month, store ID and count of rental orders fulfilled during that month. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-2-question-1.sql).
32-
33-
Question 2: We would like to know who were our top 10 paying customers, how many payments they made on a monthly basis during 2007, and what was the amount of the monthly payments. Can you write a query to capture the customer name, month and year of payment, and total payment amount for each month by these top 10 paying customers? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-2-question-2.sql).
11+
### Project Submission
3412

35-
Question 3: Finally, for each of these top 10 paying customers, I would like to find out the difference across their monthly payments during 2007. Please go ahead and write a query to compare the payment amounts in each successive month. Repeat this for each of these 10 paying customers. Also, it will be tremendously helpful if you can identify the customer name who paid the most difference in terms of payments. [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/set-2-question-3.sql).
13+
You can download the Project Submission [here](https://drive.google.com/open?id=1sfcOviwrgdNjnhghi6N4IOobMzDNGz4w).
3614

37-
### Chosen Questions
15+
Question 1: How do the rentals for the 10 top renting countries compare to the other countries for each category?
16+
* [Query](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/1st-presentation-query.sql).
3817

39-
Question 1: How do the rentals for the 10 top renting countries compare to the other countries for each category? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/1st-presentation-query.sql).
18+
Question 2: How are the movies distributed by the number of times they were rented within a category?
19+
* [Query](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/2nd-presentation-query.sql).
4020

41-
Question 2: How are the movies distributed by the amount of times they were rented inside of a category? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/2nd-presentation-query.sql).
21+
Question 3: How much did the top 10 paying customers spend on DVD rental over 2007?
22+
* [Query](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/3rd-presentation-query.sql).
4223

43-
Question 3: How much did the top 10 paying customers spend on DVD rental over 2007? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/3rd-presentation-query.sql).
24+
Question 4: How many customers have paid less in rentals compared to the previous month?
25+
* [Query](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/4th-presentation-query.sql).
4426

45-
Question 4: How many customers have paid less in rentals compared to the previous month? [Solution](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/1st-project/4th-presentation-query.sql).
27+
### Question Sets
4628

47-
### Project Submission
29+
A set of questions was also provided by Udacity, which we were free to consider and include in our Project Submission. These were provided solely for our convenience, and we could choose any of the questions in these sets or none at all. Although I did not use any of the questions provided by Udacity, I still chose to query each [question](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/tree/master/1st-project/set-questions).
4830

49-
You can find the project submission here.
5031

5132
## 2nd-project
5233

@@ -58,7 +39,7 @@ In this project, the student had to make use of Python to explore data related t
5839

5940
The developed CLI program allows the user to explore a US bikeshare system database and retrieve statistics from it. The user can filter the data by city, month and weekday in order to view statistics for a specific subset of the data. The user can also choose to view the raw data and sort it by column, in ascending or descending order.
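
As a rough illustration of the month/weekday filtering and column sorting described above, here is a minimal sketch. It is not taken from `bikeshare.py`: the `Start Time` column name, the timestamp format, and the `filter_trips` and `sort_rows` helpers are all assumptions made for this example, and the rows are made-up sample data.

```python
from datetime import datetime

# Assumed column name and timestamp format; the real CSV files may differ.
START_COLUMN = "Start Time"
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"
WEEKDAYS = ["monday", "tuesday", "wednesday", "thursday",
            "friday", "saturday", "sunday"]


def filter_trips(rows, month=None, weekday=None):
    """Keep rows whose start time matches the month (1-12) and weekday name.

    A filter left as None is not applied. (Hypothetical helper.)
    """
    kept = []
    for row in rows:
        start = datetime.strptime(row[START_COLUMN], TIME_FORMAT)
        if month is not None and start.month != month:
            continue
        # datetime.weekday() returns 0 for Monday through 6 for Sunday.
        if weekday is not None and WEEKDAYS[start.weekday()] != weekday.lower():
            continue
        kept.append(row)
    return kept


def sort_rows(rows, column, descending=False):
    """Return a new list of rows sorted by one column, ascending by default."""
    return sorted(rows, key=lambda row: row[column], reverse=descending)


# Made-up sample rows standing in for one city's data.
sample = [
    {"Start Time": "2017-01-02 09:00:00", "Trip Duration": 300},
    {"Start Time": "2017-01-03 10:30:00", "Trip Duration": 120},
    {"Start Time": "2017-02-06 08:15:00", "Trip Duration": 450},
]

january_mondays = filter_trips(sample, month=1, weekday="Monday")
longest_first = sort_rows(sample, "Trip Duration", descending=True)
```

Filtering by city would amount to choosing which CSV file to load before applying these helpers.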
6041

61-
Project Submission.
42+
[Project Submission](https://github.com/decarvalhohenrique/programming-for-data-science-nanodegree/blob/master/2nd-project/bikeshare.py).
6243

6344
#### Files Used
6445

@@ -68,7 +49,7 @@ The required files for running this program are:
6849
* new_york_city.csv
6950
* chicago.csv
7051

71-
You can download them here.
52+
You can download them [here](https://drive.google.com/open?id=1sfcOviwrgdNjnhghi6N4IOobMzDNGz4w).
7253

7354
#### Requirements
7455
