pythonicshariful

Follow

Shariful Islam pythonicshariful

Follow

Web Scraping & ML Enthusiast

6 followers · 0 following

Achievements

Achievements

pythonicshariful/README.md

🕸️ What I Do

Web scraping at scale with Python (Selenium, Playwright, Scrapy, BeautifulSoup)
Automation pipelines for data collection, cleaning, and storage
API integrations (REST/GraphQL) and browser automation
Data wrangling with Pandas, exporting to CSV/JSON/DB
Learning ML & AI to build smarter data products

🔧 Tech Stack

✨ Highlights

Built bots that extract thousands of pages/day with rotating proxies & retries
Designed resilient anti-bot bypass flows (stealth drivers, human-like waits, captchas via services)
Delivered clean datasets ready for analysis & model training
Currently exploring feature engineering, vector databases, and LLM-powered scraping assistants

/

📊 GitHub Stats

🧪 ML & AI Learning Journey

🎯 Current focus: data labeling, feature engineering, small ML models for classification/regression
🧠 Next up: LLM-assisted scraping, RAG for document-heavy sites, agent workflows
📚 Notes & experiments live here → /labs

🗂️ Example Services I Offer

Full-site data extraction (anti-bot aware) → CSV/JSON/DB
PDF/image capture & text extraction (OCR)
API discovery & reverse engineering for private endpoints
Dashboard/API to deliver data (FastAPI + simple UI)
Ongoing monitoring for price changes, stock, new listings

💌 Need data? Open an issue or reach out!

💬 Connect

🐍 Fun

Made with ❤️, Python, and a lot of headless browsers.

Popular repositories Loading

insurance-charge-predictor insurance-charge-predictor Public

This project predicts medical insurance charges based on personal details such as age, gender, BMI, number of children, smoking habits, and region. It uses a Machine Learning model trained on the i…

Jupyter Notebook 3 1
phone-number-extractor phone-number-extractor Public

A Python script that extracts phone numbers from images using Tesseract OCR and Regex. Automatically organizes processed images into success and failed folders, and saves results to a CSV file.

Python 2
-Vehicle-Detection-And-Counting-using-OpenCV -Vehicle-Detection-And-Counting-using-OpenCV Public

This project implements a vehicle counting system using OpenCV and background subtraction techniques. It detects moving vehicles in a video or webcam feed, tracks their motion, and counts how many …

Python 2
pythonicshariful pythonicshariful Public

Python developer specializing in web scraping, automation, and machine learning. Experienced in building scalable scrapers, bypassing anti-bot systems, processing large datasets, and applying ML/NL…

1
info_extractor info_extractor Public

Forked from shaiful019/info_extractor

This project uses OpenAI's GPT-4 Vision model to extract customer information from images and compile it into an Excel spreadsheet. It's particularly useful for digitizing customer information from…

Python 1
Traffic_Detector_-_Instance_Segmentation Traffic_Detector_-_Instance_Segmentation Public

A small project demonstrating traffic object detection and instance segmentation using Ultralytics YOLOv8 models. This repository contains a template script for running inference, and two pretraine…

Python 1