Skip to content

Commit

Permalink
Project description added
Browse files Browse the repository at this point in the history
  • Loading branch information
hmhard committed Aug 11, 2024
1 parent 88156a8 commit 5c019f0
Showing 1 changed file with 29 additions and 2 deletions.
31 changes: 29 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,39 @@
# [Tikvah](https://t.me/s/tikvahethiopia) Telegram channel analysis repo
! this repo analysis is done for **learning purpose**

## one of analysis dashboard

![Image](./image.png)

### libraries used
- bs4

### install by using
```bash
pip install -r requirements.txt
```

### Preprocessing Steps Completed:

- Fetched HTML data.
- Extracted data into JSON format.
- Filtered Amharic keywords, removing entries with:
- Emojis
- English characters
- Special characters
- Numbers
- Filtered out stop words.

### change top_n (default:500) in top-words.py and run

```bash
python top-words.py
```

## final data be like

```bash
let data = {
data = {
"ሰዎች": 14109,
"ከተማ": 10457,
"ክልል": 9968,
Expand Down Expand Up @@ -39,7 +66,7 @@ let data = {
...
```
## you can also do extra processing and analysis, create pull request.
## you can also do extra processing and analysis and create pull request.
---
## if you like this repo, please give it the star.

0 comments on commit 5c019f0

Please sign in to comment.