Commit

Update README.md
artkostm authored Nov 9, 2018
1 parent 39f2795 commit 4501e95
Showing 1 changed file with 32 additions and 3 deletions.
35 changes: 32 additions & 3 deletions README.md
@@ -45,9 +45,7 @@ Console output:
[root@sandbox-hdp ~]# hive -d HOTELS_TABLE_NAME=hotels \
-d HOTELS_FILE_LOCATION=/tmp/test/hotels \
-f create_hotels_table.hql
log4j:WARN No such property [maxFileSize] in org.apache.log4j.DailyRollingFileAppender.

Logging initialized using configuration in file:/etc/hive/2.6.5.0-292/0/hive-log4j.properties

OK
Time taken: 4.625 seconds
```
@@ -151,3 +149,34 @@ Time taken: 53.434 seconds, Fetched: 3 row(s)
```

![To calculate top 3 most popular hotels which were not booked](./img/top3_hotels_from_clicks.png "To calculate top 3 most popular hotels which were not booked")

### To run the tasks sequentially

Run `./run.sh hotels /tmp/test/hotels`, where `hotels` is the name of the table and `/tmp/test/hotels` is the HDFS location where `train.csv.gz` is stored.

Console output:

```shell
[root@sandbox-hdp ~]# ./run.sh hotels /tmp/test/hotels
Task #1 is running
Creating tables...

OK
Time taken: 3.614 seconds

c.country c.num
50 1616055
198 197228
70 102651
Task #2 is running
days
390
Task #3 is running
hotel_continent hotel_country hotel_market cnt
2 50 628 1640731
2 50 675 1490187
2 50 682 809776
Done
```
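
For illustration only, a wrapper of this kind could be sketched as below. This is not the repository's actual `run.sh`: the function names, the `HIVE_BIN` override, the task file names passed as extra arguments, and the exact order of the progress messages are all assumptions. Only the `hive -d ... -f ...` invocation and `create_hotels_table.hql` come from the commands shown earlier in this README.

```shell
#!/bin/sh
# Hypothetical sketch of a sequential task runner; NOT the repository's run.sh.
set -eu

# HIVE_BIN is an assumed override so the hive binary can be swapped (e.g. for a dry run).
HIVE_BIN="${HIVE_BIN:-hive}"

# Run one HQL file, passing the table name and HDFS location as Hive variables.
run_hql() {
  "$HIVE_BIN" -d HOTELS_TABLE_NAME="$1" \
              -d HOTELS_FILE_LOCATION="$2" \
              -f "$3"
}

# Create the table, then run each remaining HQL file in order.
# Because of `set -e`, the first failing step stops the whole run.
run_tasks() {
  table="$1"
  location="$2"
  shift 2
  echo "Creating tables..."
  run_hql "$table" "$location" create_hotels_table.hql
  n=1
  for file in "$@"; do
    echo "Task #$n is running"
    run_hql "$table" "$location" "$file"
    n=$((n + 1))
  done
  echo "Done"
}

# Only run when called with at least a table name and a location.
if [ "$#" -ge 2 ]; then
  run_tasks "$@"
fi
```

Under these assumptions, `HIVE_BIN=echo ./run.sh hotels /tmp/test/hotels task1.hql` prints the hive commands instead of executing them, which is a cheap way to check the argument wiring before touching the cluster.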

