Manual of Lua-URI: _util
lua-uri-_util - Utility functions for Lua URI library
Description
This module contains various utility functions used by the rest of the library. They are mostly intended only for internal use, and are subject to change in future versions, but the URI encoding and decoding functions may be more widely useful.
On loading, the module returns a table containing the functions, but like all the modules in this library it does not install itself into any global variables.
local Util = require "uri._util"
Functions
The following functions can be found in the table returned from require
:
- uri_encode(text, pattern)
Use URI encoding (or 'percent encoding') to encode any unsafe characters in
text
. Ifpattern
is specified then it should be part of a Lua pattern which can be enclosed in square brackets to make a character class. Usually it will start with^
so that the rest of the characters will be considered the 'safe' ones, not to be encoded. Any character matched by the pattern will be encoded.print(Util.uri_encode("foo bar!")) ---> foo%20bar! print(Util.uri_encode("foo bar!", "^A-Za-z0-9")) ---> foo%20bar%21
The default pattern is:
^A-Za-z0-9%-_.!~*'()
- uri_decode(text, pattern)
Decode any URI encoding in
text
. Ifpattern
is nil then all encoded characters will be decoded. If a pattern is supplied then it should be in the same form as foruri_encode
. Any character not matched by the pattern will be left encoded as it was.print(Util.uri_decode("foo%20bar%21")) ---> foo bar! print(Util.uri_decode("foo%20bar%21", "^!")) ---> foo bar%21
- remove_dot_segments(path)
Removes single and double dot segments from a URI path.
This is the 'remove_dot_segments' algorithm from RFC 3986 section 5.2.4. The value of
path
is used as the input buffer, and the contents of the output buffer are returned.- split(pattern, str, max)
Split the string
str
whereverpattern
matches it, returning the pieces as individual strings in an array. Ifmax
is not nil, then stop splitting after that many pieces have been created.- attempt_require(name)
Calling this function is the same as calling Lua's built in
require
function, except that if a module calledname
cannot be found, it returns nil instead of throwing an exception. If loading the module is successful then the result ofrequire
is returned. An exception is thrown if any error occurs loading the module other than it not being found.- subclass_of(class, baseclass)
Sets up the metatable and a few other things for the table
class
so that it will be a subclass ofbaseclass
. This is used by the classes in this library to implement inheritance.- do_class_changing_change(uri, baseclass, changedesc, newvalue, changefunc)
This is used when a mutator method changes something about a URI which leads it to need to belong to a different class.
uri
is the URI object to change,baseclass
is the class to reset it to before making the change,changedesc
is a description to be included in an error message if necessary,newvalue
is the new value to be set (which must be a string, as it is also included in error messages), andchangefunc
is a function which is called with a temporary URI object it should adjust andnewvalue
.- uri_part_not_allowed (class, method)
This should be called in scheme-specific classes where certain parts of URIs are not allowed to be present (e.g., the 'host' part in a URN). It will override the named method in the class with one which throws an exception if an attempt is made to set the part to anything other than nil. If the rest of the code for the scheme keeps objects internally consistent then the new method should always return nil, although when a URI is being validated during the
init
method's execution, it may return other things, which can be used to detect disallowed parts in a URI being parsed.