PicoToolkit
Extracted data:
View Settings
Applies to real TAB characters.
0 characters
0 without spaces
0 words
0 lines
IndexValue
No matching items found
Spotted a bug or have an idea for a new feature? Let us know here »

URL Extractor

Extract all URLs from the text with just two clicks

Extract URLs from text or HTML — online URL extractor from text and HTML

Quickly extract every URL from plain text or HTML. This online URL extractor from text and HTML finds absolute, protocol-agnostic and relative links and returns them to the Extraction toolbar (original input is unchanged).

How to use

  • Paste your text or HTML into the editor.
  • Click Extract → URLs.
  • Results appear in the Extraction toolbar — use Copy to clipboard, Paste to editor, or Append to editor to continue your workflow.

What this tool extracts

  • Absolute links: http:// and https://
  • File and FTP links: file:// and ftp://
  • Protocol-agnostic links starting with // (e.g., //cdn.example.com)
  • Relative links starting with / (site-relative URLs)
  • URLs found inside common HTML attributes (href, src, srcset and similar)
  • Plain-text URLs embedded in text

Note: the extractor does not capture mailto: links. It returns raw matches and does not normalize or modify the links.

Examples

Example — plain text input → output

Input:
Visit https://example.com and http://site.test/page.

Output:
https://example.com
http://site.test/page

Example — HTML snippet → output

Input:
AboutContact

Output:
/about
//cdn.example.com/img.jpg
https://example.com/contact

Combining tools & typical workflows

  • Remove duplicate links: Extract → URLs, then Remove → Duplicates.
  • Keep only external links: Extract → URLs, then Remove → Filter with pattern ^http (Leave matches).
  • Extract only image URLs: Extract → URLs, then Remove → Filter with pattern (jpg|jpeg|png|gif)$.
  • Advanced extraction or pattern matching: pipe results into the Pattern Extractor or use Remove → Lines containing for custom filters.

All extracted results can be appended back to the editor for further processing or exported manually from the editor area.

Tips & edge cases

  • Extraction is atomic — duplicates are preserved. Run Remove → Duplicates to unique the list.
  • Order is preserved: links are returned in the sequence they appear in the source.
  • The extractor does not strip trailing punctuation automatically (commas, parentheses) — check results when extracting from prose.
  • For HTML-heavy input, consider running HTML Stripper first if you want to simplify the source before extraction.

FAQ

Does the tool remove duplicate URLs?

No. Extraction is an atomic operation and returns every match. To remove duplicates, run Remove → Duplicates after extraction.

Does the extractor pull links from HTML attributes like href and src?

Yes. It captures URLs from common attributes (href, src, srcset and similar) as well as plain-text URLs. It also finds protocol-agnostic (//) and relative (/) links.

Will extraction change my original input?

No. The original editor content is left unchanged. Extracted results appear in the Extraction toolbar where you can copy, paste to the editor, or append to the editor.

Can I export results as CSV or download a file?

Extraction places results in the toolbar for copying or appending to the editor. From the editor you can save or export manually; there is no built-in one-click CSV download on this tool.

How do I extract only external/internal/image links?

Use Extract → URLs, then run Remove → Filter with appropriate patterns (examples are provided in the page). For advanced patterns use the Pattern Extractor.

Related tools

PicoToolkit evolves fast. Stay ahead.

Get early access to new tools, features, and productivity upgrades.

Unsubscribe anytime.
© PicoToolkit 2022-2026 All rights reserved. Before using this website read and accept terms of use and privacy policy. Icons by Icons8