Skip to content
View pythonicshariful's full-sized avatar

Block or report pythonicshariful

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pythonicshariful/README.md

Typing SVG

coding

followers stars profile views


πŸ•ΈοΈ What I Do

  • Web scraping at scale with Python (Selenium, Playwright, Scrapy, BeautifulSoup)
  • Automation pipelines for data collection, cleaning, and storage
  • API integrations (REST/GraphQL) and browser automation
  • Data wrangling with Pandas, exporting to CSV/JSON/DB
  • Learning ML & AI to build smarter data products

πŸ”§ Tech Stack


✨ Highlights

  • Built bots that extract thousands of pages/day with rotating proxies & retries
  • Designed resilient anti-bot bypass flows (stealth drivers, human-like waits, captchas via services)
  • Delivered clean datasets ready for analysis & model training
  • Currently exploring feature engineering, vector databases, and LLM-powered scraping assistants

/


πŸ“Š GitHub Stats


πŸ§ͺ ML & AI Learning Journey

  • 🎯 Current focus: data labeling, feature engineering, small ML models for classification/regression
  • 🧠 Next up: LLM-assisted scraping, RAG for document-heavy sites, agent workflows
  • πŸ“š Notes & experiments live here β†’ /labs

πŸ—‚οΈ Example Services I Offer

  • Full-site data extraction (anti-bot aware) β†’ CSV/JSON/DB
  • PDF/image capture & text extraction (OCR)
  • API discovery & reverse engineering for private endpoints
  • Dashboard/API to deliver data (FastAPI + simple UI)
  • Ongoing monitoring for price changes, stock, new listings

πŸ’Œ Need data? Open an issue or reach out!


πŸ’¬ Connect


🐍 Fun

snake animation


Made with ❀️, Python, and a lot of headless browsers.

Popular repositories Loading

  1. insurance-charge-predictor insurance-charge-predictor Public

    This project predicts medical insurance charges based on personal details such as age, gender, BMI, number of children, smoking habits, and region. It uses a Machine Learning model trained on the i…

    Jupyter Notebook 3 1

  2. phone-number-extractor phone-number-extractor Public

    A Python script that extracts phone numbers from images using Tesseract OCR and Regex. Automatically organizes processed images into success and failed folders, and saves results to a CSV file.

    Python 2

  3. -Vehicle-Detection-And-Counting-using-OpenCV -Vehicle-Detection-And-Counting-using-OpenCV Public

    This project implements a vehicle counting system using OpenCV and background subtraction techniques. It detects moving vehicles in a video or webcam feed, tracks their motion, and counts how many …

    Python 2

  4. info_extractor info_extractor Public

    Forked from shaiful019/info_extractor

    This project uses OpenAI's GPT-4 Vision model to extract customer information from images and compile it into an Excel spreadsheet. It's particularly useful for digitizing customer information from…

    Python 1

  5. pythonicshariful pythonicshariful Public

  6. purchase-atomation-target.com purchase-atomation-target.com Public

    This project automates actions using SKUs from sku.csv and settings from config.json. Follow these steps before running the program.

    Python