Class: java.net.URL
- public final class URL
- implements Serializable
URL represents a Uniform Resource
Locator, a pointer to a "resource" on the World
Wide Web. A resource can be something as simple as a file or a
directory, or it can be a reference to a more complicated object,
such as a query to a database or to a search engine. More
information on the types of URLs and their formats can be found at:
http://archive.ncsa.uiuc.edu/SDG/Software/Mosaic/Demo/url-primer.html
In general, a URL can be broken into several parts. The previous
example of a URL indicates that the protocol to use is
http (HyperText Transfer Protocol) and that the
information resides on a host machine named
www.ncsa.uiuc.edu. The information on that host
machine is named /SDG/Software/Mosaic/Demo/url-primer.html. The exact
meaning of this name on the host machine is both protocol
dependent and host dependent. The information normally resides in
a file, but it could be generated on the fly. This component of
the URL is called the path component.
A URL can optionally specify a "port", which is the
port number to which the TCP connection is made on the remote host
machine. If the port is not specified, the default port for
the protocol is used instead. For example, the default port for
http is 80. An alternative port could be
specified as:
http://archive.ncsa.uiuc.edu:80/SDG/Software/Mosaic/Demo/url-primer.html
The syntax of URL is defined by RFC 2396: Uniform
Resource Identifiers (URI): Generic Syntax, amended by RFC 2732: Format for
Literal IPv6 Addresses in URLs. The Literal IPv6 address format
also supports scope_ids. The syntax and usage of scope_ids is described
here.
A URL may have appended to it a "fragment", also known as a "ref" or a "reference". The fragment is indicated by the sharp sign character "#" followed by more characters. For example,
http://java.sun.com/index.html#chapter1
This fragment is not technically part of the URL. Rather, it
indicates that after the specified resource is retrieved, the
application is specifically interested in that part of the
document that has the tag chapter1 attached to it. The
meaning of a tag is resource specific.
An application can also specify a "relative URL", which contains only enough information to reach the resource relative to another URL. Relative URLs are frequently used within HTML pages. For example, if the contents of the URL:
http://java.sun.com/index.html
contained within it the relative URL:
FAQ.html
it would be a shorthand for:
http://java.sun.com/FAQ.html
The relative URL need not specify all the components of a URL. If the protocol, host name, or port number is missing, the value is inherited from the fully specified URL. The file component must be specified. The optional fragment is not inherited.
The URL class does not itself encode or decode any URL components
according to the escaping mechanism defined in RFC2396. It is the
responsibility of the caller to encode any fields, which need to be
escaped prior to calling URL, and also to decode any escaped fields,
that are returned from URL. Furthermore, because URL has no knowledge
of URL escaping, it does not recognise equivalence between the encoded
or decoded form of the same URL. For example, the two URLs:
http://foo.com/hello world/ and http://foo.com/hello%20worldwould be considered not equal to each other.
Note, the java.net.URI class does perform escaping of its component fields in certain circumstances. The recommended way to manage the encoding and decoding of URLs is to use java.net.URI, and to convert between these two classes using java.net.URL.toURI() and java.net.URI.toURL().
The java.net.URLEncoder and java.net.URLDecoder classes can also be used, but only for HTML form encoding, which is not the same as the encoding scheme defined in RFC2396.
Methods
-
URLtop
public URL(String spec) throws MalformedURLExceptionCreates aURLobject from theStringrepresentation.This constructor is equivalent to a call to the two-argument constructor with a
nullfirst argument. -
URLtop
Creates aURLobject from the specifiedprotocol,host,portnumber, andfile.hostcan be expressed as a host name or a literal IP address. If IPv6 literal address is used, it should be enclosed in square brackets ('[' and ']'), as specified by RFC 2732; However, the literal IPv6 address format defined in RFC 2373: IP Version 6 Addressing Architecture is also accepted.Specifying a
portnumber of-1indicates that the URL should use the default port for the protocol.If this is the first URL object being created with the specified protocol, a stream protocol handler object, an instance of class
URLStreamHandler, is created for that protocol:- If the application has previously set up an instance of
URLStreamHandlerFactoryas the stream handler factory, then thecreateURLStreamHandlermethod of that instance is called with the protocol string as an argument to create the stream protocol handler. - If no
URLStreamHandlerFactoryhas yet been set up, or if the factory'screateURLStreamHandlermethod returnsnull, then the constructor finds the value of the system property:
If the value of that system property is notjava.protocol.handler.pkgsnull, it is interpreted as a list of packages separated by a vertical slash character '|'. The constructor tries to load the class named:
where <package> is replaced by the name of the package and <protocol> is replaced by the name of the protocol. If this class does not exist, or if the class exists but it is not a subclass of<package>.<protocol>.HandlerURLStreamHandler, then the next package in the list is tried. - If the previous step fails to find a protocol handler, then the
constructor tries to load from a system default package.
If this class does not exist, or if the class exists but it is not a subclass of<system default package>.<protocol>.HandlerURLStreamHandler, then aMalformedURLExceptionis thrown.
Protocol handlers for the following protocols are guaranteed to exist on the search path :-
Protocol handlers for additional protocols may also be available.http, https, ftp, file, and jarNo validation of the inputs is performed by this constructor.
- If the application has previously set up an instance of
-
URLtop
public URL(String protocol, String host, int port, String file, URLStreamHandler handler) throws MalformedURLExceptionCreates aURLobject from the specifiedprotocol,host,portnumber,file, andhandler. Specifying aportnumber of-1indicates that the URL should use the default port for the protocol. Specifying ahandlerofnullindicates that the URL should use a default stream handler for the protocol, as outlined for: java.net.URL#URL(java.lang.String, java.lang.String, int, java.lang.String)If the handler is not null and there is a security manager, the security manager's
checkPermissionmethod is called with aNetPermission("specifyStreamHandler")permission. This may result in a SecurityException. No validation of the inputs is performed by this constructor. -
URLtop
Creates a URL from the specifiedprotocolname,hostname, andfilename. The default port for the specified protocol is used.This method is equivalent to calling the four-argument constructor with the arguments being
protocol,host,-1, andfile. No validation of the inputs is performed by this constructor. -
URLtop
Creates a URL by parsing the given spec within a specified context. The new URL is created from the given context URL and the spec argument as described in RFC2396 "Uniform Resource Identifiers : Generic * Syntax" :
The reference is parsed into the scheme, authority, path, query and fragment parts. If the path component is empty and the scheme, authority, and query components are undefined, then the new URL is a reference to the current document. Otherwise, the fragment and query parts present in the spec are used in the new URL.<scheme>://<authority><path>?<query>#<fragment>If the scheme component is defined in the given spec and does not match the scheme of the context, then the new URL is created as an absolute URL based on the spec alone. Otherwise the scheme component is inherited from the context URL.
If the authority component is present in the spec then the spec is treated as absolute and the spec authority and path will replace the context authority and path. If the authority component is absent in the spec then the authority of the new URL will be inherited from the context.
If the spec's path component begins with a slash character "/" then the path is treated as absolute and the spec path replaces the context path.
Otherwise, the path is treated as a relative path and is appended to the context path, as described in RFC2396. Also, in this case, the path is canonicalized through the removal of directory changes made by occurences of ".." and ".".
For a more detailed description of URL parsing, refer to RFC2396.
-
URLtop
Creates a URL by parsing the given spec with the specified handler within a specified context. If the handler is null, the parsing occurs as with the two argument constructor. -
equalstop
public boolean equals(Object obj)Compares this URL for equality with another object.If the given object is not a URL then this method immediately returns
false.Two URL objects are equal if they have the same protocol, reference equivalent hosts, have the same port number on the host, and the same file and fragment of the file.
Two hosts are considered equivalent if both host names can be resolved into the same IP addresses; else if either host name can't be resolved, the host names must be equal without regard to case; or both host names equal to null.
Since hosts comparison requires name resolution, this operation is a blocking operation.
Note: The defined behavior for
equalsis known to be inconsistent with virtual hosting in HTTP. -
getAuthoritytop
public String getAuthority()Gets the authority part of thisURL. -
getContenttop
public final Object getContent() throws IOExceptionGets the contents of this URL. This method is a shorthand for:openConnection().getContent() -
getContenttop
Gets the contents of this URL. This method is a shorthand for:openConnection().getContent(Class[]) -
getDefaultPorttop
public int getDefaultPort()Gets the default port number of the protocol associated with thisURL. If the URL scheme or the URLStreamHandler for the URL do not define a default port number, then -1 is returned. -
getFiletop
public String getFile()Gets the file name of thisURL. The returned file portion will be the same asgetPath(), plus the concatenation of the value ofgetQuery(), if any. If there is no query portion, this method andgetPath()will return identical results. -
getHosttop
public String getHost()Gets the host name of thisURL, if applicable. The format of the host conforms to RFC 2732, i.e. for a literal IPv6 address, this method will return the IPv6 address enclosed in square brackets ('[' and ']'). -
getPathtop
public String getPath()Gets the path part of thisURL. -
getPorttop
public int getPort()Gets the port number of thisURL. -
getProtocoltop
public String getProtocol()Gets the protocol name of thisURL. -
getQuerytop
public String getQuery()Gets the query part of thisURL. -
getReftop
public String getRef()Gets the anchor (also known as the "reference") of thisURL. -
getUserInfotop
public String getUserInfo()Gets the userInfo part of thisURL. -
hashCodetop
public synchronized int hashCode()Creates an integer suitable for hash table indexing.The hash code is based upon all the URL components relevant for URL comparison. As such, this operation is a blocking operation.
-
openConnectiontop
public URLConnection openConnection() throws IOExceptionReturns aURLConnectionobject that represents a connection to the remote object referred to by theURL.A new connection is opened every time by calling the
openConnectionmethod of the protocol handler for this URL.If for the URL's protocol (such as HTTP or JAR), there exists a public, specialized URLConnection subclass belonging to one of the following packages or one of their subpackages: java.lang, java.io, java.util, java.net, the connection returned will be of that subclass. For example, for HTTP an HttpURLConnection will be returned, and for JAR a JarURLConnection will be returned.
-
openConnectiontop
Same as openConnection(), except that the connection will be made through the specified proxy; Protocol handlers that do not support proxing will ignore the proxy parameter and make a normal connection. Calling this method preempts the system's default ProxySelector settings. -
openStreamtop
public final InputStream openStream() throws IOExceptionOpens a connection to thisURLand returns anInputStreamfor reading from that connection. This method is a shorthand for:openConnection().getInputStream() -
sameFiletop
public boolean sameFile(URL other)Compares two URLs, excluding the fragment component.Returns
trueif thisURLand theotherargument are equal without taking the fragment component into consideration. -
settop
Sets the fields of the URL. This is not a public method so that only URLStreamHandlers can modify URL fields. URLs are otherwise constant. -
settop
protected void set(String protocol, String host, int port, String authority, String userInfo, String path, String query, String ref)Sets the specified 8 fields of the URL. This is not a public method so that only URLStreamHandlers can modify URL fields. URLs are otherwise constant. -
setURLStreamHandlerFactorytop
public static void setURLStreamHandlerFactory(URLStreamHandlerFactory fac)Sets an application'sURLStreamHandlerFactory. This method can be called at most once in a given Java Virtual Machine.The
URLStreamHandlerFactoryinstance is used to construct a stream protocol handler from a protocol name.If there is a security manager, this method first calls the security manager's
checkSetFactorymethod to ensure the operation is allowed. This could result in a SecurityException. -
toExternalFormtop
public String toExternalForm()Constructs a string representation of thisURL. The string is created by calling thetoExternalFormmethod of the stream protocol handler for this object. -
toStringtop
public String toString()Constructs a string representation of thisURL. The string is created by calling thetoExternalFormmethod of the stream protocol handler for this object. -
toURItop
public URI toURI() throws URISyntaxExceptionReturns a java.net.URI equivalent to this URL. This method functions in the same way asnew URI (this.toString()).Note, any URL instance that complies with RFC 2396 can be converted to a URI. However, some URLs that are not strictly in compliance can not be converted to a URI.
