Indian Scriptures

This repository contains various Indian scriptures 📜 in a structured .csv format. The files contain the verses in their original Sanskrit language and their verse number.

The data folder contains both raw and processed data. The raw data is the direct output of the scrapy spiders and the processed data contains data after additional processing.

The notebooks folder contain the notebooks used to create the processed dataset.

The scriptures folder is a scrapy project which contains the scrapy spiders to scrape data from the web.

The project aims to provide Indian scriptures in a format that is suitable for text mining and natural language processing. If you would like to propose any changes, kindly send a pull request.

All the files are scraped from https://www.upanishads.iitk.ac.in using scrapy 🕷️ framework.

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
.github/workflows		.github/workflows
data		data
notebooks		notebooks
scriptures		scriptures
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Indian Scriptures

About

Releases

Packages

Contributors 2

Languages

License

hrgupta/indian-scriptures

Folders and files

Latest commit

History

Repository files navigation

Indian Scriptures

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages