HTTP Utilities
HTTP Utilities
Werkzeug provides a couple of functions to parse and generate HTTP headersthat are useful when implementing WSGI middlewares or whenever you areoperating on a lower level layer. All this functionality is also exposedfrom request and response objects.
Date Functions
The following functions simplify working with times in an HTTP context.Werkzeug uses offset-naive datetime [http://docs.python.org/dev/library/datetime.html#datetime.datetime] objects internallythat store the time in UTC. If you're working with timezones in yourapplication make sure to replace the tzinfo attribute with a UTC timezoneinformation before processing the values.
werkzeug.http.cookie_date(expires=None)
Formats the time to ensure compatibility with Netscape's cookiestandard.
Accepts a floating point number expressed in seconds since the epoch in, adatetime object or a timetuple. All times in UTC. The parse_date()function can be used to parse such a date.
Outputs a string in the format Wdy,DD-Mon-YYYYHH:MM:SSGMT.
werkzeug.http.http_date(timestamp=None)
Formats the time to match the RFC1123 date format.
Accepts a floating point number expressed in seconds since the epoch in, adatetime object or a timetuple. All times in UTC. The parse_date()function can be used to parse such a date.
Outputs a string in the format Wdy,DDMonYYYYHH:MM:SSGMT.
werkzeug.http.parse_date(value)
Parse one of the following date formats into a datetime object:
Sun, 06 Nov 1994 08:49:37 GMT ; RFC 822, updated by RFC 1123
Sunday, 06-Nov-94 08:49:37 GMT ; RFC 850, obsoleted by RFC 1036
Sun Nov 6 08:49:37 1994 ; ANSI C's asctime() format
If parsing fails the return value is None.
参数: | value – a string with a supported date format. |
---|---|
返回: | a datetime.datetime [http://docs.python.org/dev/library/datetime.html#datetime.datetime] object. |
Header Parsing
The following functions can be used to parse incoming HTTP headers.Because Python does not provide data structures with the semantics requiredby RFC 2616 [http://tools.ietf.org/html/rfc2616.html], Werkzeug implements some custom data structures that aredocumented separately.
werkzeug.http.parse_options_header(value)
Parse a Content-Type like header into a tuple with the contenttype and the options:
>>> parse_options_header('text/html; charset=utf8')
('text/html', {'charset': 'utf8'})
This should not be used to parse Cache-Control like headers that usea slightly different format. For these headers use theparse_dict_header() function.
0.5 新版功能.
参数: | value – the header to parse. |
---|---|
返回: | (str, options) |
werkzeug.http.parse_set_header(value, on_update=None)
Parse a set-like header and return aHeaderSet object:
>>> hs = parse_set_header('token, "quoted value"')
The return value is an object that treats the items case-insensitivelyand keeps the order of the items:
>>> 'TOKEN' in hs
True
>>> hs.index('quoted value')
1
>>> hs
HeaderSet(['token', 'quoted value'])
To create a header from the HeaderSet again, use thedump_header() function.
参数: |
|
---|---|
返回: |
werkzeug.http.parse_list_header(value)
Parse lists as described by RFC 2068 Section 2.
In particular, parse comma-separated lists where the elements ofthe list may include quoted-strings. A quoted-string couldcontain a comma. A non-quoted string could have quotes in themiddle. Quotes are removed automatically after parsing.
It basically works like parse_set_header() just that itemsmay appear multiple times and case sensitivity is preserved.
The return value is a standard list [http://docs.python.org/dev/library/stdtypes.html#list]:
>>> parse_list_header('token, "quoted value"')
['token', 'quoted value']
To create a header from the list [http://docs.python.org/dev/library/stdtypes.html#list] again, use thedump_header() function.
参数: | value – a string with a list header. |
---|---|
返回: | list [http://docs.python.org/dev/library/stdtypes.html#list] |
werkzeug.http.parse_dict_header(value, cls=<type 'dict'>)
Parse lists of key, value pairs as described by RFC 2068 Section 2 andconvert them into a python dict (or any other mapping object created fromthe type with a dict like interface provided by the cls arugment):
>>> d = parse_dict_header('foo="is a fish", bar="as well"')
>>> type(d) is dict
True
>>> sorted(d.items())
[('bar', 'as well'), ('foo', 'is a fish')]
If there is no value for a key it will be None:
>>> parse_dict_header('key_without_value')
{'key_without_value': None}
To create a header from the dict [http://docs.python.org/dev/library/stdtypes.html#dict] again, use thedump_header() function.
在 0.9 版更改: Added support for cls argument.
参数: |
|
---|---|
返回: | an instance of cls |
werkzeug.http.parse_accept_header(value[, class])
Parses an HTTP Accept-* header. This does not implement a completevalid algorithm but one that supports at least value and qualityextraction.
Returns a new Accept object (basically a list of (value,quality)tuples sorted by the quality with some additional accessor methods).
The second parameter can be a subclass of Accept that is createdwith the parsed values and returned.
参数: |
|
---|---|
返回: | an instance of cls. |
werkzeug.http.parse_cache_control_header(value, on_update=None, cls=None)
Parse a cache control header. The RFC differs between response andrequest cache control, this method does not. It's your responsibilityto not use the wrong control statements.
0.5 新版功能: The cls was added. If not specified an immutableRequestCacheControl is returned.
参数: |
|
---|---|
返回: | a cls object. |
werkzeug.http.parse_authorization_header(value)
Parse an HTTP basic/digest authorization header transmitted by the webbrowser. The return value is either None if the header was invalid ornot given, otherwise an Authorizationobject.
参数: | value – the authorization header to parse. |
---|---|
返回: | a Authorization object or None. |
werkzeug.http.parse_www_authenticate_header(value, on_update=None)
Parse an HTTP WWW-Authenticate header into aWWWAuthenticate object.
参数: |
|
---|---|
返回: | a WWWAuthenticate object. |
werkzeug.http.parse_if_range_header(value)
Parses an if-range header which can be an etag or a date. Returnsa IfRange object.
0.7 新版功能.
werkzeug.http.parse_range_header(value, make_inclusive=True)
Parses a range header into a Rangeobject. If the header is missing or malformed None is returned.ranges is a list of (start,stop) tuples where the ranges arenon-inclusive.
0.7 新版功能.
werkzeug.http.parse_content_range_header(value, on_update=None)
Parses a range header into aContentRange object or None ifparsing is not possible.
0.7 新版功能.
参数: |
|
---|
Header Utilities
The following utilities operate on HTTP headers well but do not parsethem. They are useful if you're dealing with conditional responses or ifyou want to proxy arbitrary requests but want to remove WSGI-unsupportedhop-by-hop headers. Also there is a function to create HTTP headerstrings from the parsed data.
werkzeug.http.is_entity_header(header)
Check if a header is an entity header.
0.5 新版功能.
参数: | header – the header to test. |
---|---|
返回: | True if it's an entity header, False otherwise. |
werkzeug.http.is_hop_by_hop_header(header)
Check if a header is an HTTP/1.1 “Hop-by-Hop” header.
0.5 新版功能.
参数: | header – the header to test. |
---|---|
返回: | True if it's an entity header, False otherwise. |
werkzeug.http.remove_entity_headers(headers, allowed=('expires', 'content-location'))
Remove all entity headers from a list or Headers object. Thisoperation works in-place. Expires and Content-Location headers areby default not removed. The reason for this is RFC 2616 [http://tools.ietf.org/html/rfc2616.html] section10.3.5 which specifies some entity headers that should be sent.
在 0.5 版更改: added allowed parameter.
参数: |
|
---|
werkzeug.http.remove_hop_by_hop_headers(headers)
Remove all HTTP/1.1 “Hop-by-Hop” headers from a list orHeaders object. This operation works in-place.
0.5 新版功能.
werkzeug.http.is_byte_range_valid(start, stop, length)
Checks if a given byte content range is valid for the given length.
0.7 新版功能.
werkzeug.http.quote_header_value(value, extra_chars='', allow_token=True)
Quote a header value if necessary.
0.5 新版功能.
参数: |
|
---|
werkzeug.http.unquote_header_value(value, is_filename=False)
Unquotes a header value. (Reversal of quote_header_value()).This does not use the real unquoting but what browsers are actuallyusing for quoting.
0.5 新版功能.
werkzeug.http.dump_header(iterable, allow_token=True)
Dump an HTTP header again. This is the reversal ofparse_list_header(), parse_set_header() andparse_dict_header(). This also quotes strings that include anequals sign unless you pass it as dict of key, value pairs.
>>> dump_header({'foo': 'bar baz'})
'foo="bar baz"'
>>> dump_header(('foo', 'bar baz'))
'foo, "bar baz"'
参数: |
|
---|
Cookies
werkzeug.http.parse_cookie(header, charset='utf-8', errors='replace', cls=None)
Parse a cookie. Either from a string or WSGI environ.
Per default encoding errors are ignored. If you want a different behavioryou can set errors to 'replace' or 'strict'. In strict mode aHTTPUnicodeError is raised.
在 0.5 版更改: This function now returns a TypeConversionDict instead of aregular dict. The cls parameter was added.
参数: |
|
---|
werkzeug.http.dump_cookie(key, value='', max_age=None, expires=None, path='/', domain=None, secure=False, httponly=False, charset='utf-8', sync_expires=True)
Creates a new Set-Cookie header without the Set-Cookie prefixThe parameters are the same as in the cookie Morsel object in thePython standard library but it accepts unicode data, too.
On Python 3 the return value of this function will be a unicodestring, on Python 2 it will be a native string. In both cases thereturn value is usually restricted to ascii as the vast majority ofvalues are properly escaped, but that is no guarantee. If a unicodestring is returned it's tunneled through latin1 as required byPEP 3333.
The return value is not ASCII safe if the key contains unicodecharacters. This is technically against the specification buthappens in the wild. It's strongly recommended to not usenon-ASCII values for the keys.
参数: |
|
---|
Conditional Response Helpers
For conditional responses the following functions might be useful:
werkzeug.http.parse_etags(value)
Parse an etag header.
参数: | value – the tag header to parse |
---|---|
返回: | an ETags object. |
werkzeug.http.quote_etag(etag, weak=False)
Quote an etag.
参数: |
|
---|
werkzeug.http.unquote_etag(etag)
Unquote a single etag:
>>> unquote_etag('w/"bar"')
('bar', True)
>>> unquote_etag('"bar"')
('bar', False)
参数: | etag – the etag identifier to unquote. |
---|---|
返回: | a (etag,weak) tuple. |
werkzeug.http.generate_etag(data)
Generate an etag for some data.
werkzeug.http.is_resource_modified(environ, etag=None, data=None, last_modified=None)
Convenience method for conditional requests.
参数: |
|
---|---|
返回: | True if the resource was modified, otherwise False. |
Constants
werkzeug.http.HTTP_STATUS_CODES
A dict of status code -> default status message pairs. This is usedby the wrappers and other places where an integer status code is expandedto a string throughout Werkzeug.
Form Data Parsing
Werkzeug provides the form parsing functions separately from the requestobject so that you can access form data from a plain WSGI environment.
The following formats are currently supported by the form data parser:
- application/x-www-form-urlencoded
- multipart/form-data
Nested multipart is not currently supported (Werkzeug 0.9), but it isn't usedby any of the modern web browsers.
Usage example:
>>> from cStringIO import StringIO
>>> data = '--foo\r\nContent-Disposition: form-data; name="test"\r\n' \
... '\r\nHello World!\r\n--foo--'
>>> environ = {'wsgi.input': StringIO(data), 'CONTENT_LENGTH': str(len(data)),
... 'CONTENT_TYPE': 'multipart/form-data; boundary=foo',
... 'REQUEST_METHOD': 'POST'}
>>> stream, form, files = parse_form_data(environ)
>>> stream.read()
''
>>> form['test']
u'Hello World!'
>>> not files
True
Normally the WSGI environment is provided by the WSGI gateway with theincoming data as part of it. If you want to generate such fake-WSGIenvironments for unittesting you might want to use thecreate_environ() function or the EnvironBuilder instead.
class werkzeug.formparser.FormDataParser(stream_factory=None, charset='utf-8', errors='replace', max_form_memory_size=None, max_content_length=None, cls=None, silent=True)
This class implements parsing of form data for Werkzeug. By itselfit can parse multipart and url encoded form data. It can be subclassedand extended but for most mimetypes it is a better idea to use theuntouched stream and expose it as separate attributes on a requestobject.
0.8 新版功能.
参数: |
|
---|
werkzeug.formparser.parse_form_data(environ, stream_factory=None, charset='utf-8', errors='replace', max_form_memory_size=None, max_content_length=None, cls=None, silent=True)
Parse the form data in the environ and return it as tuple in the form(stream,form,files). You should only call this method if thetransport method is POST, PUT, or PATCH.
If the mimetype of the data transmitted is multipart/form-data thefiles multidict will be filled with FileStorage objects. If themimetype is unknown the input stream is wrapped and returned as firstargument, else the stream is empty.
This is a shortcut for the common usage of FormDataParser.
Have a look at Dealing with Request Data for more details.
0.5 新版功能: The max_form_memory_size, max_content_length andcls parameters were added.
0.5.1 新版功能: The optional silent flag was added.
参数: |
|
---|---|
返回: | A tuple in the form (stream, form, files). |
werkzeug.formparser.parse_multipart_headers(iterable)
Parses multipart headers from an iterable that yields lines (includingthe trailing newline symbol). The iterable has to be newline terminated.
The iterable will stop at the line where the headers ended so it can befurther consumed.