telegramSimilarChannels

Collecting similar channels data on Telegram

How to collect the data:

Create a new schema in your MySQL server
Run db_creates.sql
Populate the Collections table with a row in the format {collection_id, date, notes}. Example: 1,2024-07-31,first collection
Populate the ScrapingJobs table with rows in the format {job_id, entity_id, username, status}. Example: 1,1224624669,TelegramTips,pending
Run getSimliarChannels.py. It will get the list of similar channels for every Entity marked "pending" in the ScrapingJobs table. It will add them to the Entities table and add the pair of channels (channel, suggested channel) to the ChannelSuggestions table.
Repeat steps 4-5 for as many generations as you want to run.

Some tips for analyzing the data:

You can create a directed social network with each channel as a node and the suggestion relationship as the edges.
Since there may be up to 100 similar channels for each channel, the number of edges will be very large, so you may want to prune the data by generation, or by degree (number of edges that a node has).
See the db_queries.sql file for some suggestions of helpful queries

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
LICENSE		LICENSE
README.md		README.md
TelegramOSINTDefConPresentation.pdf		TelegramOSINTDefConPresentation.pdf
db_creates.sql		db_creates.sql
db_queries.sql		db_queries.sql
getSimilarChannels.py		getSimilarChannels.py

Provide feedback