Skip to content
View rainmana's full-sized avatar

Block or report rainmana

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

OSINt

232 repositories

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python 23,362 1,236 Updated Feb 13, 2025

Tool to index and serve HTML files. Powered by Datasette.

HTML 96 6 Updated Mar 2, 2022

A bunch of website scraping scripts

Ruby 8 1 Updated Jul 9, 2013

List of libraries, tools and APIs for web scraping and data processing.

Makefile 6,916 803 Updated Dec 27, 2024

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Python 6,663 697 Updated Oct 12, 2024

Lighter web automation with Python

Python 7,572 459 Updated Feb 20, 2025

Faster requests on Python 3

Nim 1,110 90 Updated Feb 13, 2025

Get info from any web service or page

PHP 2,103 311 Updated Jan 2, 2025

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 17,025 763 Updated Mar 7, 2025

artoo.js - the client-side scraping companion.

JavaScript 1,105 93 Updated Mar 31, 2021

Websites crawler with built-in exploration and control web interface

JavaScript 341 62 Updated Jan 29, 2025

A webmining CLI tool & library for python.

Python 307 27 Updated Feb 20, 2025

DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any sc…

Go 814 22 Updated Dec 5, 2021

🧹 Python package for text cleaning

Python 970 78 Updated May 9, 2023

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 4,002 284 Updated Feb 17, 2025
Python 150 22 Updated Jun 30, 2023

An #OSINT Framework to perform various recon techniques on Companies, People, Phone Number, Bitcoin Addresses, etc., aggregate all the raw data, and give data in multiple formats.

Python 3,081 432 Updated May 23, 2020

Retrieve and parse whois data for IPv4 and IPv6 addresses

Python 562 124 Updated Oct 15, 2024

🔎 Most Advanced Open Source Intelligence (OSINT) Framework for scanning IP Address, Emails, Websites, Organizations.

Python 2,204 333 Updated Sep 26, 2023

A rapid API for the Project Sonar dataset

Go 643 97 Updated May 5, 2023

A very high performance Domain Name parser package in Go.

Go 45 15 Updated Aug 27, 2021

Expand CIDR ranges to IPv4 addresses

Go 14 4 Updated Jul 28, 2022

A Linux eBPF rootkit with a backdoor, C2, library injection, execution hijacking, persistence and stealth capabilities.

C 1,818 229 Updated Apr 7, 2024

A OSINT tool to obtain a target's phone number just by having his email address

Python 2,275 267 Updated Jul 26, 2024

Hunt down social media accounts by username across social networks

Python 62,852 7,247 Updated Feb 17, 2025

A temporary email right from your terminal written in POSIX sh

Shell 3,926 157 Updated Aug 17, 2024

strings2: An improved strings extraction tool.

C++ 316 64 Updated May 30, 2022

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.

Python 46,985 2,151 Updated Apr 18, 2024

A machine learning tool that ranks strings based on their relevance for malware analysis.

Python 701 124 Updated Jul 15, 2024

Parse and stringify URL query strings

JavaScript 6,819 455 Updated Nov 21, 2024