Skip to content

Commit 1c2c80b

Browse files
author
hridayns
committed
update README.md + remove unwanted script
1 parent 2041032 commit 1c2c80b

File tree

2 files changed

+9
-13
lines changed

2 files changed

+9
-13
lines changed

README.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,9 @@
1-
Analysis of Apache server logs to find the most visited website using Pig, Hadoop and Python UDF script.
1+
# Project Details
2+
3+
This repository contains code to analyze Apache server logs to find the most visited website using Apache Hadoop's Pig script extended using Python's User-defined Functions (UDF). It was run on an Ubuntu instance deployed on the Oracle's VMware with the help of Vagrant.
4+
5+
# Contents
6+
7+
- *shareFiles/pig_script .py* contains code to compute the page hits and store them.
8+
- *shareFiles/scipt .py* contains the Python UDF to parse the sample Apache logs.
9+
- *shareFiles/sample_log* contains the sample logs on which the scripts are run.

shareFiles/pig_test_script.pig

Lines changed: 0 additions & 12 deletions
This file was deleted.

0 commit comments

Comments
 (0)