Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity
-
Updated
Jan 26, 2025 - TypeScript
Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity
🔗🧹 Normalize URLs to a standardized form. HTTPS by default, flexible configuration, custom protocols, domain extraction, humazing URL, and punycode support. Both CJS & ESM modules available.
URL normalizer to canonicalize (standardize) the text representation of a URL to determine if differently-formatted URLs are identical
🔗 Pathor is a PHP library for normalizing, analyzing, and comparing URLs.
Add a description, image, and links to the url-normalizer topic page so that developers can more easily learn about it.
To associate your repository with the url-normalizer topic, visit your repo's landing page and select "manage topics."