Architecture of the World Wide Web

Terms

web
a set of resources delivered over the Internet using the HTTP protocol
web resource
any information that can be accessed using the web (document, image, video)
web server
a software program on a computer that hosts web resources
web browser
an application used to access web resources on web servers

The web is distributed, addressable, and linked

Distributed

There is no centralised organisation or system that decides the content that can be published. Any computer connected to the Internet can run a web server and the person running it can make available any content they wish.
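To make this concrete, here is a minimal sketch of running a web server with Python's standard library and fetching a resource from it. The handler name `HelloHandler`, the helper `fetch_once`, and the response text are made up for illustration. The server binds only to the local machine, so nothing is actually published to the Internet here.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


class HelloHandler(BaseHTTPRequestHandler):
    """Hypothetical handler: returns the same plain-text resource for every GET."""

    def do_GET(self):
        body = b"Hello from my own web server"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # suppress per-request logging to keep the example quiet


def fetch_once():
    """Start the server on a free local port, request "/", then shut down."""
    server = HTTPServer(("127.0.0.1", 0), HelloHandler)  # port 0: the OS picks a free port
    port = server.server_address[1]
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        # Act as a very small web client talking HTTP to our own server.
        conn = http.client.HTTPConnection("127.0.0.1", port, timeout=5)
        conn.request("GET", "/")
        response = conn.getresponse()
        status, text = response.status, response.read().decode("utf-8")
        conn.close()
        return status, text
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    print(fetch_once())
```

No registration or approval step is involved: the person running the server decides what `do_GET` returns.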

Addressable

Every resource on the web has a unique address that includes both its location and how to access it (URL):

  • scheme
    • specifies the [Application Layer](Application_Layer_of_Internet_Protocol.md) protocol for accessing the resource. On the web this is typically HTTP, but it could be FTP or another protocol. The scheme is always followed by a colon
  • authority
    • the DNS hostname of the server hosting the resource. This could instead be an IP address, and it may also include a port number
  • path
    • the location of the resource on the web server, organised hierarchically and analogous to a file path
  • query
    • a modifier of the resource returned to the client. Its usage varies across sites but is typically determined by backend implementation details
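The four components can be picked out of a URL with Python's `urllib.parse`. This is only a sketch: the URL below is a made-up example (hostname, port, path, and query parameters are all assumptions), chosen so that every component is present.

```python
from urllib.parse import parse_qs, urlsplit

# Hypothetical URL containing every component described above.
url = "http://www.example.com:8080/articles/web/index.html?lang=en&page=2"

parts = urlsplit(url)

components = {
    "scheme": parts.scheme,        # protocol, the text before the first colon
    "authority": parts.netloc,     # hostname plus optional port
    "hostname": parts.hostname,    # authority without the port
    "port": parts.port,            # None when the URL does not give one
    "path": parts.path,            # hierarchical location on the server
    "query": parts.query,          # raw query string, without the "?"
    "query_params": parse_qs(parts.query),  # query decoded into name -> list of values
}

if __name__ == "__main__":
    for name, value in components.items():
        print(f"{name:>12}: {value}")
```

How the server interprets `lang` and `page` is up to its backend; the URL syntax only says where the query starts and ends.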

Note that a URL does not always have to contain all of these components. Consider an index page.
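As a rough illustration, the sketch below reports which components a URL leaves out. It assumes an index page is reached with a URL that gives only the scheme and authority. The helper name `missing_components` and the example URLs are hypothetical.

```python
from urllib.parse import urlsplit


def missing_components(url):
    """Return the names of the URL components that are empty in `url`."""
    parts = urlsplit(url)
    found = {
        "scheme": parts.scheme,
        "authority": parts.netloc,
        "path": parts.path,
        "query": parts.query,
    }
    # urlsplit represents an absent component as an empty string.
    return [name for name, value in found.items() if not value]


if __name__ == "__main__":
    for example in (
        "http://www.example.com",                     # index page: no path, no query
        "http://www.example.com/",                    # root path, no query
        "http://www.example.com/index.html?lang=en",  # everything present
    ):
        print(example, "->", missing_components(example))
```

Which resource the server returns when the path is empty is a server-side decision, not something the URL itself states.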