This tool is designed to scrape data from Instagram. It utilizes the Selenium library to perform web scraping and extract information directly from Instagram without relying on the Instagram API. Additionally, it provides a Streamlit-powered dashboard for visualizing the scraped data in a user-friendly interface.
- python >= 3.10
-
Clone this repository
-
Move to the project directory
-
Install the required Python packages:
pip install -r requirements.txt
To use this script, follow these steps:
- You need to configure the necessary settings in the {project}/config/config.ini file.
- Set "timout_sec" appropriately to prevent infinite loading.
- Set the maximum number of video to scrape with the "video_limit_cnt" setting.
- If you need proxy settings, put them in "proxy_server"
- For instagram, you need to put account info in INSTAGRAM section
python main.py --keyword "Search Keyword" --output <Output File Path>
# If you want to include the dashboard additionally:
python main.py --keyword "Search Keyword" --output <Output File Path> --dashboard
- If you use a proxy, you may be required to provide an authentication code after logging in. This code will typically be sent via email or phone.
- There is a risk of the account being blocked.
- As a result, this script may not be suitable for full automation and is better used for single transactions.
- 2025.01.15 : Initial implementation started.
For any feedback or queries, please reach out to me at kimyk0120@gmail.com.