Index	Value
No matching items found

URL Extractor

Table of Contents

Extract URLs from text or HTML — online URL extractor from text and HTML
How to use
What this tool extracts
Examples
Combining tools & typical workflows
Tips & edge cases
FAQ
Does the tool remove duplicate URLs?
Does the extractor pull links from HTML attributes like href and src?
Will extraction change my original input?
Can I export results as CSV or download a file?
How do I extract only external/internal/image links?
Related tools

Extract all URLs from the text with just two clicks

Extract URLs from text or HTML — online URL extractor from text and HTML

Quickly extract every URL from plain text or HTML. This online URL extractor from text and HTML finds absolute, protocol-agnostic and relative links and returns them to the Extraction toolbar (original input is unchanged).

How to use

Paste your text or HTML into the editor.
Click Extract → URLs.
Results appear in the Extraction toolbar — use Copy to clipboard, Paste to editor, or Append to editor to continue your workflow.

What this tool extracts

Absolute links: http:// and https://
File and FTP links: file:// and ftp://
Protocol-agnostic links starting with // (e.g., //cdn.example.com)
Relative links starting with / (site-relative URLs)
URLs found inside common HTML attributes (href, src, srcset and similar)
Plain-text URLs embedded in text

Note: the extractor does not capture mailto: links. It returns raw matches and does not normalize or modify the links.

Examples

Example — plain text input → output

Input:
Visit https://example.com and http://site.test/page.

Output:
https://example.com
http://site.test/page

Example — HTML snippet → output

Input:
AboutContact

Output:
/about
//cdn.example.com/img.jpg
https://example.com/contact

Combining tools & typical workflows

Remove duplicate links: Extract → URLs, then Remove → Duplicates.
Keep only external links: Extract → URLs, then Remove → Filter with pattern ^http (Leave matches).
Extract only image URLs: Extract → URLs, then Remove → Filter with pattern (jpg|jpeg|png|gif)$.
Advanced extraction or pattern matching: pipe results into the Pattern Extractor or use Remove → Lines containing for custom filters.

All extracted results can be appended back to the editor for further processing or exported manually from the editor area.

Tips & edge cases

Extraction is atomic — duplicates are preserved. Run Remove → Duplicates to unique the list.
Order is preserved: links are returned in the sequence they appear in the source.
The extractor does not strip trailing punctuation automatically (commas, parentheses) — check results when extracting from prose.
For HTML-heavy input, consider running HTML Stripper first if you want to simplify the source before extraction.

FAQ

Does the tool remove duplicate URLs?

No. Extraction is an atomic operation and returns every match. To remove duplicates, run Remove → Duplicates after extraction.

Does the extractor pull links from HTML attributes like href and src?

Yes. It captures URLs from common attributes (href, src, srcset and similar) as well as plain-text URLs. It also finds protocol-agnostic (//) and relative (/) links.

Will extraction change my original input?

No. The original editor content is left unchanged. Extracted results appear in the Extraction toolbar where you can copy, paste to the editor, or append to the editor.

Can I export results as CSV or download a file?

Extraction places results in the toolbar for copying or appending to the editor. From the editor you can save or export manually; there is no built-in one-click CSV download on this tool.

How do I extract only external/internal/image links?

Use Extract → URLs, then run Remove → Filter with appropriate patterns (examples are provided in the page). For advanced patterns use the Pattern Extractor.

Related tools

Pattern Extractor — custom regex extraction.
Remove Duplicates — deduplicate extracted links.
Remove Lines Containing — filter unwanted links.
HTML Stripper — remove HTML before extracting if needed.