Skip to content
View rainmana's full-sized avatar

Block or report rainmana

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

OSINt

231 repositories

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

Python 22,910 1,217 Updated Jan 9, 2025

Tool to index and serve HTML files. Powered by Datasette.

HTML 95 6 Updated Mar 2, 2022

A bunch of website scraping scripts

Ruby 8 1 Updated Jul 9, 2013

List of libraries, tools and APIs for web scraping and data processing.

Makefile 6,817 795 Updated Dec 27, 2024

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Python 6,573 682 Updated Oct 12, 2024

Lighter web automation with Python

Python 7,138 436 Updated Dec 13, 2024

Faster requests on Python 3

Nim 1,113 90 Updated Jan 9, 2025

Get info from any web service or page

PHP 2,102 315 Updated Jan 2, 2025

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 16,483 730 Updated Jan 10, 2025

artoo.js - the client-side scraping companion.

JavaScript 1,102 93 Updated Mar 31, 2021

Websites crawler with built-in exploration and control web interface

JavaScript 328 61 Updated Dec 13, 2024

A webmining CLI tool & library for python.

Python 292 27 Updated Jan 10, 2025

DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any sc…

Go 815 23 Updated Dec 5, 2021

🧹 Python package for text cleaning

Python 963 78 Updated May 9, 2023

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,818 270 Updated Dec 28, 2024
Python 149 22 Updated Jun 30, 2023

An #OSINT Framework to perform various recon techniques on Companies, People, Phone Number, Bitcoin Addresses, etc., aggregate all the raw data, and give data in multiple formats.

Python 3,059 427 Updated May 23, 2020

Retrieve and parse whois data for IPv4 and IPv6 addresses

Python 560 123 Updated Oct 15, 2024

🔎 Most Advanced Open Source Intelligence (OSINT) Framework for scanning IP Address, Emails, Websites, Organizations.

Python 2,142 327 Updated Sep 26, 2023

A rapid API for the Project Sonar dataset

Go 643 96 Updated May 5, 2023

A very high performance Domain Name parser package in Go.

Go 45 15 Updated Aug 27, 2021

Expand CIDR ranges to IPv4 addresses

Go 13 4 Updated Jul 28, 2022

A Linux eBPF rootkit with a backdoor, C2, library injection, execution hijacking, persistence and stealth capabilities.

C 1,799 225 Updated Apr 7, 2024

A OSINT tool to obtain a target's phone number just by having his email address

Python 2,226 264 Updated Jul 26, 2024

Hunt down social media accounts by username across social networks

Python 61,661 7,089 Updated Nov 13, 2024

A temporary email right from your terminal written in POSIX sh

Shell 3,914 156 Updated Aug 17, 2024

strings2: An improved strings extraction tool.

C++ 310 64 Updated May 30, 2022

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.

Python 46,799 2,147 Updated Apr 18, 2024

A machine learning tool that ranks strings based on their relevance for malware analysis.

Python 691 125 Updated Jul 15, 2024

Parse and stringify URL query strings

JavaScript 6,791 456 Updated Nov 21, 2024