Amazon Wishlist Scraper

This project is a Python-based web scraper that collects data from Amazon wishlists. The scraper extracts information such as product names, prices, availability, ratings, and more. It can handle multiple wishlist URLs and uses Selenium to scroll through the pages.

Features

Scrape product details such as name, price, stock status, and more.
Supports multiple wishlist URLs.
Automatically scrolls to the end of each wishlist.
CAPTCHA handling in place in case of issues.
Stores scraped data in a PostgreSQL database.

Setup and Installation

Prerequisites

Python 3.x
pip (Python package manager)
PostgreSQL (if using a database to store scraped data)
BrowserDriver (e.g., Chrome WebDriver)

Installation

Clone the repository:

git clone https://your-repo-url.git
cd your-repo-directory

Set up a Python virtual environment (recommended):

python3 -m venv venv
source venv/bin/activate   # On Windows: venv\Scripts\activate

Install the required packages:
```
pip install -r requirements.txt
```

Set up environment variables:

Create a .env file in the project root directory and add your database credentials:

File: .env

DB_USER=your_postgres_username
DB_PASSWORD=your_postgres_password
DB_HOST=localhost
DB_PORT=5432
DB_NAME=your_postgres_db

Set up your wishlist URLs:
- The scraper reads wishlist URLs from a wishlist_URL.txt file.
- Copy the sample file provided (wishlist_URL.sample.txt) and rename it to wishlist_URL.txt:
```
cp wishlist_URL.sample.txt wishlist_URL.txt
```
- Replace the sample URLs with actual Amazon wishlist URLs.

Running the Scraper

To run the scraper, use the following command:
```
python scraper.py
```
The scraper will automatically navigate through the wishlists provided in wishlist_url.txt, extract the product information, and store it in your desired format (CSV or database).

Example Output

The scraper outputs the collected data to a CSV file (output.csv) or can store it in a PostgreSQL database if configured.

Configuration

The scraper can be configured to store the scraped data in a PostgreSQL database. To do this, make sure you have PostgreSQL installed and the correct credentials in your .env file.

Sample Files

.env.example: Provides the structure for your environment variables. Copy this file and rename it to .env to configure your credentials.
wishlist_url_sample.txt: A sample file with placeholder URLs. Rename this file to wishlist_url.txt and update it with actual wishlist URLs.

Contributing

Feel free to submit pull requests for new features, improvements, or bug fixes.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
scripts		scripts
.gitignore		.gitignore
README.MD		README.MD
env.example		env.example
requirements.txt		requirements.txt
wishlist_URL.sample.txt		wishlist_URL.sample.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon Wishlist Scraper

Features

Setup and Installation

Prerequisites

Installation

Running the Scraper

Example Output

Configuration

Sample Files

Contributing

License

About

Releases

Packages

Languages

cheynolds/Amazon-Wishlist-PostgresDB

Folders and files

Latest commit

History

Repository files navigation

Amazon Wishlist Scraper

Features

Setup and Installation

Prerequisites

Installation

Running the Scraper

Example Output

Configuration

Sample Files

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages