We love scraping, don't we? But sometimes, we face Cloudflare protection. This script is designed to bypass the Cloudflare protection on websites, allowing you to interact with them programmatically.
Capsolver.com is an AI-powered service that specializes in solving various types of captchas automatically. It supports captchas such as reCAPTCHA V2, reCAPTCHA V3, hCaptcha, FunCaptcha, DataDome, AWS Captcha, Geetest, and Cloudflare Captcha / Challenge 5s, Imperva / Incapsula, among others.
For developers, Capsolver offers API integration options detailed in their documentation, facilitating the integration of captcha solving into applications. They also provide browser extensions for both Chrome and Firefox, making it easy to use their service directly within a browser. Different pricing packages are available to accommodate varying needs, ensuring flexibility for users.
If you use Selenium, you may have noticed that it cannot get past Cloudflare protection. Even if you click the "I'm not a robot" button, you remain stuck on the "Checking your browser before accessing" page. This is because Cloudflare detects automation tools and blocks them, which leaves the webdriver on that page indefinitely.
This script uses DrissionPage, which is a controller for the browser itself. Because of this, the browser is not detected as a webdriver and the Cloudflare protection is bypassed.
You can install the required packages by running the following command:
pip install -r requirements.txt
Create a new instance of the CloudflareBypasser class and call the bypass method when you need to bypass the Cloudflare protection.
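from DrissionPage import ChromiumPage
from CloudflareBypasser import CloudflareBypasser
# Open a browser controlled by DrissionPage and load the protected page.
driver = ChromiumPage()
driver.get('https://nopecha.com/demo/cloudflare')
# Hand the page to the bypasser and let it handle the challenge.
cf_bypasser = CloudflareBypasser(driver)
cf_bypasser.bypass()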
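If you only need the page content once the challenge is cleared, the steps above can be wrapped in a small helper. This is a minimal sketch, not part of the project: `fetch_html` is a hypothetical name, and it assumes the driver exposes `get()`, `html` and `quit()` (as DrissionPage's `ChromiumPage` does) and that the bypasser exposes the `bypass()` method shown above.

```python
def fetch_html(driver, bypasser, url):
    """Load url, run the bypass, and return the page HTML.

    Hypothetical helper: `driver` is expected to behave like a
    DrissionPage ChromiumPage and `bypasser` like a CloudflareBypasser
    built around that same driver.
    """
    try:
        driver.get(url)       # navigate to the protected page
        bypasser.bypass()     # let the bypasser handle the challenge
        return driver.html    # HTML of the page after the bypass
    finally:
        driver.quit()         # close the browser even if a step fails
```

With the objects from the snippet above, the call would look like `fetch_html(driver, cf_bypasser, 'https://nopecha.com/demo/cloudflare')`. The `try`/`finally` is a design choice to avoid leaving a browser process behind when something goes wrong.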
You can run the test script to see how it works:
python test.py
Recently, @frederik-uni introduced a new feature called "Server Mode". It lets you bypass the Cloudflare protection remotely and retrieve either the cookies or the HTML content of the website.
Install the packages required for server mode by running the following command:
pip install -r server_requirements.txt
Start the server by running the following command:
python server.py
Two endpoints are available:
/cookies?url=<URL>&retries=<>: This endpoint returns the cookies of the website (including the Cloudflare cookies).
/html?url=<URL>&retries=<>: This endpoint returns the HTML content of the website.
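The two endpoints above can also be called from Python. The sketch below uses only the standard library and percent-encodes the target URL so that its own `?` or `&` characters do not get mixed up with the server's query string. The helper names, the default base address `http://localhost:8000` (taken from the curl example in this README) and the timeout value are assumptions for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_endpoint_url(endpoint, target_url, retries=None,
                       base="http://localhost:8000"):
    """Build the request URL for the /cookies or /html endpoint."""
    params = {"url": target_url}
    if retries is not None:
        params["retries"] = retries
    # urlencode percent-encodes the target URL as a query value
    return f"{base.rstrip('/')}/{endpoint.lstrip('/')}?{urlencode(params)}"


def get_cookies(target_url, retries=None, timeout=120):
    """Call /cookies and return the "cookies" object as a dict."""
    url = build_endpoint_url("cookies", target_url, retries)
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)["cookies"]
```

`get_cookies('https://nopecha.com/demo/cloudflare')` needs the server from `python server.py` to be running. The generous timeout is a guess, since solving a challenge in a real browser can take a while.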
Send a GET request to the desired endpoint, passing the URL of the website whose Cloudflare protection you want to bypass.
sarp@IdeaPad:~/$ curl http://localhost:8000/cookies?url=https://nopecha.com/demo/cloudflare
{"cookies":{"_ga":"GA1.1.942857389.1723294929","_ga_70K7MBE4SF":"GS1.1.1723294928.1.1.1723295750.11.0.0","cf_clearance":"tAKO42SMzkTFNLh7VMgrcWK87swKOibXF3mcywc4U.Y-1723294918-1.0.1.1-cO8AihASCe6WogYzLl2Q_5tIltWrvHzESncITD2X5ZJr8sF6wIZoAJSYVP9OyJw.J2Y8ST15mh.qx.O8m6ujxA"}}
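A response like the one above can be turned into a `Cookie` header for follow-up requests made without a browser. This is a minimal sketch using only the standard library. `cookie_header` and `build_request` are hypothetical helper names, and the JSON shape is assumed to match the sample output. Note that Cloudflare clearance cookies are commonly tied to the client that solved the challenge, so reuse may fail unless the request comes from the same IP and sends the same User-Agent as the browser behind the server.

```python
import json
from urllib.request import Request


def cookie_header(response_text):
    """Turn the /cookies JSON body into a value for the Cookie header."""
    cookies = json.loads(response_text)["cookies"]
    return "; ".join(f"{name}={value}" for name, value in cookies.items())


def build_request(url, response_text, user_agent):
    """Prepare (but do not send) a request that carries the cookies."""
    return Request(url, headers={
        "Cookie": cookie_header(response_text),
        # assumption: should match the browser that solved the challenge
        "User-Agent": user_agent,
    })
```

The prepared request can then be sent with `urllib.request.urlopen(...)`.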
This script does not help if your IP is blocked by Cloudflare. In that case, you need a clean IP to access the website. The script is designed to bypass the Cloudflare protection, not an IP block.
To find out more about DrissionPage, see the following links:
Be sure you use a translation tool if you don't speak Chinese.