GitHub - THEPEACEMAKER/Logs-Analysis-Project: The first project in the Full Stack Web Developer Nanodegree

Logs Analysis Project - Udacity Full Stack Web Developer Nanodegree

DESCRIPTION

This is the first project in the Full Stack Web Developer Nanodegree, and it goes like this:

I'm hired onto a team working on a newspaper site. The user-facing newspaper site frontend itself, and the database behind it, are already built and running. I'm asked to build an internal reporting tool that will use information from the database to discover what kind of articles the site's readers like.

My task is to create a reporting tool that runs from the command line and prints plain-text reports based on the data in the database. The tool is a Python program that uses the psycopg2 module to connect to the database.

The database contains newspaper articles, as well as the web server log for the site. The log has a database row for each time a reader loaded a web page. Using that information, my code will answer the following three questions:

What are the three most popular articles of all time?
Who are the most popular article authors of all time?
On which days did more than 1% of requests lead to errors?
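The overall shape of such a tool can be sketched as below. This is a minimal illustration, not the code in this repository: the helper names run_query and format_report, the report heading, and the sample rows are all hypothetical. The only psycopg2 calls used are connect, cursor, execute, fetchall and close.

```python
def run_query(sql, dbname="news"):
    """Run one SELECT statement and return all rows (hypothetical helper)."""
    # psycopg2 is imported lazily so the formatting helper below can be
    # used on machines where the PostgreSQL driver is not installed.
    import psycopg2

    conn = psycopg2.connect(dbname=dbname)
    try:
        cur = conn.cursor()
        cur.execute(sql)
        return cur.fetchall()
    finally:
        conn.close()


def format_report(heading, rows, unit):
    """Turn (label, value) rows into a plain-text report (hypothetical helper)."""
    lines = [heading]
    for label, value in rows:
        lines.append(f"  {label} - {value} {unit}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up rows, standing in for the result of run_query(...).
    sample = [("Example article", 120), ("Another article", 75)]
    print(format_report("Most popular articles", sample, "views"))
```

Keeping the database access and the text formatting in separate functions means the formatting can be checked without a running database.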

RUNNING THE PROGRAM

To get started, I recommend running the project in a virtual machine so that you use the same environment it was developed in. You can download Vagrant and VirtualBox to install and manage the virtual machine. Use vagrant up to bring the virtual machine online and vagrant ssh to log in.
Download the data provided by Udacity here. Unzip the file in order to extract newsdata.sql. Put the file inside the Vagrant folder.
Load the database using psql -d news -f newsdata.sql. Running this command will connect to your installed database server and execute the SQL commands in the downloaded file, creating tables and populating them with data.
Connect to the database using psql -d news.
Create the views given below, then exit psql by typing \q.
Now execute the Python file - python Logs-Analysis-Report.py.

CREATE THE FOLLOWING VIEWS:

Views for Question 1

-- Count page views per article by matching each log path to an article slug.
CREATE VIEW popular_articles AS
select title, count(path) as num
from articles, log
where articles.slug = replace(log.path, '/article/', '')
group by title;
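One way the report could query this view for Question 1 is sketched below. To keep the example runnable without PostgreSQL, it uses Python's built-in sqlite3 module and an in-memory database as a stand-in for the news database. That substitution, the made-up rows, and the final SELECT are all assumptions, and the tables contain only the columns the view touches.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL "news" database (assumption).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Minimal tables and made-up rows, followed by the view from this README.
cur.executescript("""
CREATE TABLE articles (title TEXT, slug TEXT);
CREATE TABLE log (path TEXT);
INSERT INTO articles VALUES
    ('Article A', 'article-a'),
    ('Article B', 'article-b'),
    ('Article C', 'article-c'),
    ('Article D', 'article-d');
INSERT INTO log VALUES
    ('/article/article-a'), ('/article/article-a'),
    ('/article/article-a'), ('/article/article-a'),
    ('/article/article-b'), ('/article/article-b'), ('/article/article-b'),
    ('/article/article-c'), ('/article/article-c'),
    ('/article/article-d'),
    ('/');

CREATE VIEW popular_articles AS
select title, count(path) as num
from articles, log
where articles.slug = replace(log.path, '/article/', '')
group by title;
""")

# Hypothetical report query: the three articles with the most views.
TOP_ARTICLES_SQL = """
select title, num
from popular_articles
order by num desc
limit 3;
"""

top_articles = cur.execute(TOP_ARTICLES_SQL).fetchall()
for title, num in top_articles:
    print(f"{title} - {num} views")
conn.close()
```

The request for '/' matches no slug, so it is not counted toward any article.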

Views for Question 2

-- Total the per-article view counts from popular_articles for each author.
CREATE VIEW popular_authors AS
select authors.name as name, sum(popular_articles.num) as num
from articles, authors, popular_articles
where articles.author = authors.id and
articles.title = popular_articles.title
group by name;
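A possible query for Question 2 is sketched below, again with an in-memory sqlite3 database standing in for PostgreSQL. The substitution, the author and article rows, and the final SELECT are assumptions made for illustration. Both views are created exactly as written in this README.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL "news" database (assumption).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Minimal tables with made-up rows, then the two views from this README.
cur.executescript("""
CREATE TABLE authors (id INTEGER, name TEXT);
CREATE TABLE articles (author INTEGER, title TEXT, slug TEXT);
CREATE TABLE log (path TEXT);
INSERT INTO authors VALUES (1, 'Author One'), (2, 'Author Two');
INSERT INTO articles VALUES
    (1, 'Article A', 'article-a'),
    (1, 'Article B', 'article-b'),
    (2, 'Article C', 'article-c');
INSERT INTO log VALUES
    ('/article/article-a'), ('/article/article-a'),
    ('/article/article-b'),
    ('/article/article-c'), ('/article/article-c'),
    ('/article/article-c'), ('/article/article-c');

CREATE VIEW popular_articles AS
select title, count(path) as num
from articles, log
where articles.slug = replace(log.path, '/article/', '')
group by title;

CREATE VIEW popular_authors AS
select authors.name as name, sum(popular_articles.num) as num
from articles, authors, popular_articles
where articles.author = authors.id and
articles.title = popular_articles.title
group by name;
""")

# Hypothetical report query: authors ranked by total article views.
TOP_AUTHORS_SQL = """
select name, num
from popular_authors
order by num desc;
"""

top_authors = cur.execute(TOP_AUTHORS_SQL).fetchall()
for name, num in top_authors:
    print(f"{name} - {num} views")
conn.close()
```

Author One has two articles with 2 and 1 views, which the view sums to 3, while Author Two's single article has 4 views and ranks first.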

Views for Question 3

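-- Per day: total requests, plus how many of them returned 404 NOT FOUND.
-- The FM prefix removes the blank padding TO_CHAR adds after the month name.
CREATE VIEW day_requests AS
SELECT TO_CHAR(a.time, 'FMMonth DD, YYYY') as day, count(a.status) as requests, count(b.status) as errors
from log as a left join log as b
on a.id = b.id and
b.status = '404 NOT FOUND'
group by day;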

CREATE VIEW errors_percentage AS
select day, round(errors * 100.0 / requests, 3) as percentage
from day_requests;
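For Question 3, the report only needs the days whose percentage exceeds 1. The sketch below shows one way that filter could look, using an in-memory sqlite3 database in place of PostgreSQL. Because SQLite has no TO_CHAR, the day is derived with SQLite's date() function instead, so the day format differs from the real view. The substitution, the made-up log rows, and the final SELECT are assumptions.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL "news" database (assumption).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE log (id INTEGER PRIMARY KEY, status TEXT, time TEXT)")

# Made-up traffic: 3 errors in 100 requests on one day (3%),
# and 1 error in 200 requests on the next (0.5%).
rows = []
rows += [("404 NOT FOUND", "2016-07-17 10:00:00")] * 3
rows += [("200 OK", "2016-07-17 10:00:00")] * 97
rows += [("404 NOT FOUND", "2016-07-18 10:00:00")] * 1
rows += [("200 OK", "2016-07-18 10:00:00")] * 199
cur.executemany("INSERT INTO log (status, time) VALUES (?, ?)", rows)

# Same views as in this README, except date() replaces TO_CHAR (SQLite only).
cur.executescript("""
CREATE VIEW day_requests AS
SELECT date(a.time) as day, count(a.status) as requests, count(b.status) as errors
from log as a left join log as b
on a.id = b.id and
b.status = '404 NOT FOUND'
group by day;

CREATE VIEW errors_percentage AS
select day, round(errors * 100.0 / requests, 3) as percentage
from day_requests;
""")

# Hypothetical report query: days on which more than 1% of requests failed.
ERROR_DAYS_SQL = """
select day, percentage
from errors_percentage
where percentage > 1.0
order by day;
"""

error_days = cur.execute(ERROR_DAYS_SQL).fetchall()
for day, percentage in error_days:
    print(f"{day} - {percentage}% errors")
conn.close()
```

The left join keeps every request in the requests count, while count(b.status) only counts rows where the joined copy matched the 404 condition.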

