How to parse URL and extract components in Perl?

Question

Free Perl Code · Accepted Answer

Parsing URLs and extracting their components—such as scheme, host, port, path, query, and fragment—is a common task in networking and web programming. In Perl, you have multiple ways to achieve this, from using regular expressions (quick but error-prone) to using specialized modules that conform to URL standards. Recommended Approach: Using the Core URI Module The URI module comes with modern Perl installations and provides a clean, object-oriented way to parse URIs (Uniform Resource Identifiers, which includes URLs). It handles different URL schemes, edge cases, and percent encoding correctly. Here’s how you can use URI to parse a URL and extract components: use strict; use warnings; use URI; # Example URL my $url = 'https://user:pass@example.com:8080/path/to/file.html?key=value&foo=bar#section2'; # Create a URI object my $uri = URI->new($url); # Extract components my $scheme = $uri->scheme; # https my $userinfo = $uri->userinfo; # user:pass my $host = $uri->host; # example.com my $port = $uri->port; # 8080 my $path = $uri->path; # /path/to/file.html my $query = $uri->query; # key=value&foo=bar my $fragment = $uri->fragment; # section2 print "Scheme: $scheme
"; print "Userinfo: $

How to parse URL and extract components in Perl?

Question

Recommended Approach: Using the Core `URI` Module

Explanation of Perl Concepts

Alternative: Parsing Manually With a Regex (Not Generally Recommended)

Common Pitfalls and Gotchas

Summary

Verified Code

Was this helpful?

Related Questions