An intuitive web application that converts web pages into markdown format - perfect for AI analysis and content processing.
- 🔄 Convert any webpage to markdown format
- 🎨 Clean, modern interface with light/dark mode
- 💾 Save and manage previous conversions
- 🤖 AI-friendly output format
- 🚀 Fast and efficient processing
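To make the conversion idea concrete, here is a toy sketch of turning HTML into markdown with only Python's standard library. It is an illustration under stated assumptions, not the project's implementation: the app delegates scraping to crawl4ai, and the `MiniMarkdown` class and `html_to_markdown` helper are hypothetical names that handle only headings, paragraphs, links, and list items.

```python
import re
from html.parser import HTMLParser


class MiniMarkdown(HTMLParser):
    """Toy converter (hypothetical): handles h1-h3, p, a, and li only."""

    HEADINGS = {"h1": "# ", "h2": "## ", "h3": "### "}

    def __init__(self):
        super().__init__()
        self.parts = []   # markdown fragments, joined at the end
        self.href = None  # target of the <a> tag currently open, if any

    def handle_starttag(self, tag, attrs):
        if tag in self.HEADINGS:
            self.parts.append(self.HEADINGS[tag])
        elif tag == "li":
            self.parts.append("- ")
        elif tag == "a":
            self.href = dict(attrs).get("href") or ""
            self.parts.append("[")

    def handle_endtag(self, tag):
        if tag == "a" and self.href is not None:
            self.parts.append(f"]({self.href})")
            self.href = None
        elif tag in self.HEADINGS or tag == "p":
            self.parts.append("\n\n")
        elif tag == "li":
            self.parts.append("\n")

    def handle_data(self, data):
        # Skip whitespace-only text (e.g. indentation between tags) and
        # collapse internal whitespace runs to a single space.
        if data.strip():
            self.parts.append(re.sub(r"\s+", " ", data))


def html_to_markdown(html: str) -> str:
    parser = MiniMarkdown()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip()
```

Calling `html_to_markdown("<h1>Title</h1><p>Some text.</p>")` yields a heading line followed by a paragraph. A real page needs far more than this (tables, images, code blocks, script stripping), which is why the project relies on a dedicated scraping library.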
Running this project in a virtual environment or dev container is recommended.
- Clone the repository and enter the project directory:
  git clone https://github.com/ebrown-32/webcrawleb url-to-markdown
  cd url-to-markdown
- Install the required Python packages:
  pip install -r requirements.txt
- Create a directory for scraped content:
  mkdir scraped_content
- Start the server:
  python app.py
- Open your browser and navigate to:
  http://localhost:8000
- Enter a URL in the input field and click "Scrape" or press Enter.
- The webpage will be converted to markdown format and displayed below.
- Previous conversions are automatically saved and can be:
  - Viewed by clicking the "View" button
  - Deleted by clicking the "Delete" button
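The save, view, and delete behavior described above could be backed by plain files in `scraped_content/`. The following is a minimal sketch under that assumption. The function names and the URL-to-filename scheme are hypothetical and may differ from what `app.py` actually does.

```python
import re
from pathlib import Path
from typing import List

CONTENT_DIR = Path("scraped_content")  # directory name taken from the setup steps


def slug_for(url: str) -> str:
    """Turn a URL into a filesystem-safe file stem (hypothetical naming scheme)."""
    stem = re.sub(r"^https?://", "", url.strip().lower())
    stem = re.sub(r"[^a-z0-9]+", "-", stem).strip("-")
    return stem or "page"


def save_conversion(url: str, markdown: str, root: Path = CONTENT_DIR) -> Path:
    """Write the markdown for a URL to disk and return the file path."""
    root.mkdir(parents=True, exist_ok=True)
    path = root / f"{slug_for(url)}.md"
    path.write_text(markdown, encoding="utf-8")
    return path


def list_conversions(root: Path = CONTENT_DIR) -> List[str]:
    """Return the saved file names in sorted order (empty if none exist)."""
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.glob("*.md"))


def delete_conversion(name: str, root: Path = CONTENT_DIR) -> bool:
    """Delete a saved conversion by file name; return True if a file was removed."""
    # Path(name).name drops any directory components, so a name such as
    # "../app.py" cannot reach outside the content directory.
    path = root / Path(name).name
    if path.is_file():
        path.unlink()
        return True
    return False
```

One file per URL keeps the sketch simple, but it also means converting the same URL twice overwrites the earlier result. Adding a timestamp to the file name would be one way to keep both.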
- Backend: FastAPI (Python)
- Frontend: Vanilla JavaScript
- Styling: Custom CSS with theme support
- Markdown Processing: marked.js
- Code Highlighting: highlight.js
- Web Scraping: crawl4ai
- Python 3.7+
- FastAPI
- uvicorn
- crawl4ai
- Other dependencies listed in requirements.txt
url-to-markdown/
├── app.py               # FastAPI application
├── requirements.txt     # dependencies
├── scraped_content/     # results
└── static/
    ├── index.html       # frontend
    ├── styles.css       # styling
    └── script.js        # logic
To run the application in development mode with auto-reload:
uvicorn app:app --reload
MIT License