Full Possible URL syntax and grammar

Tags:

url

I was reviewing some information about the components of a URL, but I can't find a reasonable explanation of the fullest possible URL and what each component can be. I want to know what a full URL could look like, taking advantage of all of the intricacies. I also hope to build a little GUI to help explain them once I understand them better, but until then, here is my attempt with the components I am aware of:

[ ] Brackets contain a full component

| Pipe separates possible subcomponents of a component

( ) Parentheses contain notes, thoughts, and assumptions about the sub/components

My full understanding:

[type][://][subdomain][domain][port][path][file][query][hash]

Here are the descriptions of each component. If it has an *, it is optional.

[type]* = [ (type {http | https | ftp | file | etc...}) ] (although this is optional, I believe that it is also required, meaning that modern browsers insert the type to request it to the server, and the server may return a different type as well)

[://] = (don't know what this is called)

[subdomain]* = [ [subdomain] | [subdomain]subdomain ]

[domain] = [ name . (type {com | org | etc..}) ]

[port]* = [ (blank which is by default port:80) | port:** ]

[path]* = [ (blank) | [path] | [path]path ]

[file] = [ name . (type {html | php | (etc...) }) ]

[query]* = [ ?[ blank (i.e. no query) | parameter=value | parameter=value&parameter=value (etc...) ] ]

[hash]* = [ #[ blank (i.e. no hash) | anyStringToBeParsedClientSide (usually for persistence) ] ] (just learned a hash is also known as a fragment identifier)

What else am I forgetting, or am I overlooking a good site that explains them? Please correct my naming, as it is likely incorrect; I am also trying to learn what the components are called.

asked Nov 14 '12 19:11

chris Frisina

1 Answer

If you really want all the intricacies, standards documents are the only way to go, and learning to find and read them definitely pays off. RFCs typically aren't very hard to read, either.

In this case, RFC 1738 (Uniform Resource Locators) is the resource you want. It's no more "overly technical" than what you've come up with so far; in fact, section 5 has a formal BNF grammar similar to what you wrote.

You might also be interested in RFC 3986 (Uniform Resource Identifiers), which describes the URI format; URIs are more general than URLs.
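
As a rough illustration of the names RFC 3986 uses (scheme, authority, path, query, fragment), here is a minimal Python sketch that splits a URL with the standard library's urllib.parse. The URL itself is made up for the example, and the parser is only an approximation of the RFC grammar, not a substitute for reading it.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical URL, chosen only so that every component is present.
url = "https://user:secret@www.example.com:8443/docs/guide/index.html?lang=en&page=2#section-3"

parts = urlsplit(url)

print("scheme:   ", parts.scheme)    # what the question calls [type]
print("authority:", parts.netloc)    # userinfo + host + port
print("userinfo: ", parts.username, parts.password)
print("host:     ", parts.hostname)  # subdomain and domain together
print("port:     ", parts.port)
print("path:     ", parts.path)      # includes what the question calls [file]
print("query:    ", parts.query)
print("fragment: ", parts.fragment)  # what the question calls [hash]

# The query string is commonly, though not necessarily, key=value pairs.
params = parse_qs(parts.query)
print("params:   ", params)
```

Note that the grammar has no separate "subdomain" or "file" component: the host is a single unit, and the file name is just the last segment of the path.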

Some of the things you mention are specific to HTTP, described in RFC 2616 (Hypertext Transfer Protocol 1.1). Section 3.2 briefly touches on URIs.
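
The default port is one example of a scheme-specific detail: for http URLs, port 80 is assumed when no port is given. A small sketch of that rule follows, assuming Python's urllib.parse; the helper name effective_port and the short table of defaults are made up for illustration and are not exhaustive.

```python
from urllib.parse import urlsplit

# Well-known default ports for a few schemes (illustrative, not exhaustive).
DEFAULT_PORTS = {"http": 80, "https": 443, "ftp": 21}

def effective_port(url):
    """Return the explicit port if the URL has one, else the scheme's default.

    Returns None when the scheme has no entry in DEFAULT_PORTS.
    """
    parts = urlsplit(url)
    if parts.port is not None:
        return parts.port
    return DEFAULT_PORTS.get(parts.scheme)

print(effective_port("http://example.com/"))             # no port given
print(effective_port("https://example.com:8443/index"))  # explicit port
print(effective_port("file:///tmp/notes.txt"))           # no default known
```

So a blank port does not always mean 80; it depends on the scheme.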

answered Oct 19 '22 00:10

Thomas
