
PainPoint.er Scraper 🔍

License: MIT

Discover Software Pain Points & Product Opportunities

PainPoint.er Scraper is an open-source tool that helps entrepreneurs, product managers, and developers identify software pain points and product opportunities by analyzing user feedback from Reddit communities.

🚀 Key Features

  • Platform Analysis: Scrape and analyze user feedback from Reddit communities, with support for additional platforms planned
  • Pain Point Identification: Automatically detect user complaints and frustrations with existing software
  • AI-Powered Insights: Leverage OpenAI or Azure/GitHub Models API for analysis
  • Modular Design: Extensible architecture that makes it easy to add new data sources
  • Simple Reporting: Generate analysis results with actionable insights

📋 Table of Contents

  • Installation
  • Quick Start
  • Usage Examples
  • Input Format
  • Output Format
  • Advanced Configuration
  • AI Integration
  • Contributing
  • Project Structure
  • License

🔧 Installation

Prerequisites

  • Python 3.7 or higher
  • pip (Python package installer)

Setup

  1. Clone the repository:
git clone https://github.com/sdotdev/painpointer-search.git
cd painpointer-search
  2. Install dependencies:
pip install -r requirements.txt

🚀 Quick Start

  1. Copy .env.example to .env and add your API keys:

    • Reddit API credentials (required)
    • OpenAI API key or Azure/GitHub API key (at least one required)
  2. Run the application:

python main.py

This will:

  1. Load your configuration from .env
  2. Prompt you to choose an AI provider (OpenAI or Azure)
  3. Process URLs from urls.csv
  4. Scrape comments from Reddit subreddits
  5. Analyze the comments for pain points and opportunities
  6. Save results to the ./output directory
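
Step 4 above relies on the Reddit API credentials from .env. As a rough, standalone sketch of what that scraping step involves (illustrative only, using the praw package; the project's actual logic lives in scraper.py):

import os
import praw
from dotenv import load_dotenv

load_dotenv()  # pull the REDDIT_* credentials from .env

reddit = praw.Reddit(
    client_id=os.environ["REDDIT_CLIENT_ID"],
    client_secret=os.environ["REDDIT_CLIENT_SECRET"],
    user_agent=os.environ["REDDIT_USER_AGENT"],
)

comments = []
for submission in reddit.subreddit("productivity").hot(limit=10):
    submission.comments.replace_more(limit=0)  # flatten "load more comments" stubs
    comments.extend(comment.body for comment in submission.comments.list())

print(f"Collected {len(comments)} comments from r/productivity")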

💡 Usage Examples

Basic Usage

python main.py

Customizing the Configuration

Edit the .env file to customize your configuration:

# Reddit API Credentials (Required)
REDDIT_CLIENT_ID="your_reddit_client_id"
REDDIT_CLIENT_SECRET="your_reddit_client_secret"
REDDIT_USER_AGENT="your_reddit_user_agent"

# AI API Keys (At least one required)
OPENAI_API_KEY="your_openai_api_key"
AZURE_GITHUB_API_KEY="your_azure_github_api_key"

Customizing URLs to Scrape

Edit the urls.csv file to specify which subreddits to analyze:

url,category,notes
https://reddit.com/r/productivity,productivity,Popular productivity subreddit
https://reddit.com/r/software,software,Discussions about various software

📄 Input Format

The input CSV file must contain a column named url with the Reddit URLs to analyze. Additional columns are preserved for reference.

Example urls.csv:

url,category,notes
https://reddit.com/r/productivity,productivity,Popular productivity subreddit
https://reddit.com/r/software,software,Discussions about various software
https://reddit.com/r/SaaS,saas,Software as a Service discussions

Note: Currently, only Reddit URLs are supported. Non-Reddit URLs in the CSV will be skipped.
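
As a hedged sketch of how such a file can be read and non-Reddit rows filtered out (the project's own loader lives in utils.py and may differ):

import csv
from urllib.parse import urlparse

reddit_urls = []
with open("urls.csv", newline="", encoding="utf-8") as f:
    for row in csv.DictReader(f):
        host = urlparse(row["url"]).netloc.lower()
        if host == "reddit.com" or host.endswith(".reddit.com"):
            reddit_urls.append(row["url"])  # keep Reddit URLs
        else:
            print(f"Skipping non-Reddit URL: {row['url']}")

print(reddit_urls)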

📊 Output Format

PainPoint.er Scraper generates a single output file in the output directory:

[ai_provider]_analysis_[timestamp].txt

For example: openai_analysis_20230615_123045.txt

This file contains the analysis results from the AI provider, including identified:

  • Product opportunities
  • Feature requests
  • Pain points
  • New tool suggestions
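
The timestamped filename shown above can be reproduced with the standard library; here is a minimal sketch (not the project's exact code) of how such a path is built:

import os
from datetime import datetime

provider = "openai"  # or "azure", depending on the selected provider
timestamp = datetime.now().strftime("%Y%m%d_%H%M%S")
os.makedirs("output", exist_ok=True)
output_path = os.path.join("output", f"{provider}_analysis_{timestamp}.txt")
print(output_path)  # e.g. output/openai_analysis_20230615_123045.txt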

⚙️ Advanced Configuration

Configuration Options

The application loads its configuration from the .env file and the config.py module. You can modify the following settings:

Setting                 Description                      Location
reddit_client_id        Reddit API client ID             .env file
reddit_client_secret    Reddit API client secret         .env file
reddit_user_agent       Reddit API user agent            .env file
openai_api_key          OpenAI API key                   .env file
azure_github_api_key    Azure/GitHub Models API key      .env file
urls_file               CSV file with URLs to analyze    config.py
output_dir              Output directory for results     config.py
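
These settings map directly onto environment variables and module-level defaults. A minimal sketch of how config.py might load and validate them with python-dotenv (the real module may differ):

import os
from dotenv import load_dotenv

load_dotenv()  # read key=value pairs from .env into the process environment

reddit_client_id = os.getenv("REDDIT_CLIENT_ID")
reddit_client_secret = os.getenv("REDDIT_CLIENT_SECRET")
reddit_user_agent = os.getenv("REDDIT_USER_AGENT")
openai_api_key = os.getenv("OPENAI_API_KEY")
azure_github_api_key = os.getenv("AZURE_GITHUB_API_KEY")

urls_file = "urls.csv"    # defaults kept in config.py
output_dir = "./output"

if not (openai_api_key or azure_github_api_key):
    raise ValueError("At least one AI API key must be set in .env")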

🤖 AI Integration

PainPoint.er Scraper supports two AI providers for analysis:

1. OpenAI

To use OpenAI's models (default: gpt-3.5-turbo):

  1. Add your OpenAI API key to the .env file:

# OpenAI API Key
OPENAI_API_KEY=your_openai_api_key_here

  2. When prompted, select option 1 for OpenAI.

2. Azure/GitHub Models API

To use Azure's AI models via the GitHub Models API:

  1. Add your Azure/GitHub API key to the .env file:

# Azure/GitHub Models API Key
AZURE_GITHUB_API_KEY=your_azure_github_api_key_here

  2. When prompted, select option 2 for Azure.

The application will:

  1. Prompt you to choose between available AI providers
  2. Verify that the required API key is present in the .env file
  3. Initialize the selected AI client for analysis
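
For the OpenAI path, step 3 amounts to creating a chat-completions client with the key from .env. A short, self-contained sketch using the official openai package (ai_clients.py may wrap this differently, and the Azure/GitHub path uses AZURE_GITHUB_API_KEY with its own client):

import os
from dotenv import load_dotenv
from openai import OpenAI

load_dotenv()
api_key = os.getenv("OPENAI_API_KEY")
if not api_key:
    raise SystemExit("OPENAI_API_KEY is missing from .env")  # step 2: key check

client = OpenAI(api_key=api_key)  # step 3: initialize the client
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "Identify software pain points and product opportunities in the user comments."},
        {"role": "user", "content": "Example comment: 'I wish my to-do app worked offline.'"},
    ],
)
print(response.choices[0].message.content)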

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

📂 Project Structure

PainPoint.er Scraper has a modular architecture for maintainability and extensibility:

├── main.py                 # Main entry point script
├── config.py               # Configuration loading and validation
├── utils.py                # Utility functions (file I/O, URL parsing)
├── scraper.py              # Reddit scraping functionality
├── ai_clients.py           # AI provider client implementations
├── analysis.py             # Text analysis and processing
├── requirements.txt        # Python dependencies
├── .env.example            # Example environment variables
└── urls.csv                # Input file with URLs to analyze

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.


🔍 Why PainPoint.er Scraper?

In today's competitive software landscape, understanding user pain points is crucial for building successful products. PainPoint.er Scraper automates the process of discovering what users are struggling with and where opportunities exist for new or improved software solutions.

By analyzing real user feedback across multiple platforms, you can:

  • Validate your product ideas with real-world data
  • Discover underserved markets and niches
  • Prioritize features based on user demand
  • Understand competitor weaknesses to exploit
  • Generate new product ideas backed by evidence

Whether you're a solo entrepreneur, product manager, or part of a development team, PainPoint.er Scraper helps you make data-driven decisions about what to build next.
