scrachy.settings.defaults.storage
The default settings for configuring AlchemyCacheStorage.
Module Attributes
Treat a cached page as stale (do not retrieve it from the cache) until it has been in the cache for at least this many seconds. |
|
A list of tuples consisting of a pattern and a delay time in seconds. |
|
Similar to |
|
Expire all responses that do not match a |
|
Expire any response whose URL matches the given pattern according to the corresponding schedule. |
|
Sometimes it is not possible to determine the encoding of a page because it was not set properly at the source. |
|
The cache stores quite a bit of information about each response. |
|
This specifies the database dialect to use; it must be supported by SQLAlchemy. |
|
This specifies the name of the driver used to connect to the database. |
|
The hostname (or IP address) where the database server is running. |
|
The port number the database server is listening on. |
|
For SQLite, this is the path to the database file, which will be created if it does not already exist. |
|
This will set the schema for databases that support them (e.g., PostgreSQL). |
|
The username used to connect to the database. |
|
The password (if any) used to connect to the database. |
|
Any other arguments that should be passed to |
|
Whether to store the full scrape history for each page (identified by its fingerprint). |
|
A class implementing the |
|
The parser to use for constructing the DOM. |
|
A boilerpy |
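
The pattern-based settings described above take a list of tuples, each pairing a pattern with a delay in seconds. As a rough illustration of how such a list might be consulted, the sketch below looks up the delay for a URL. The helper names (`delay_for_url`, `is_expired`) are hypothetical and not part of scrachy, and both the use of `re.search` and the first-match-wins ordering are assumptions rather than documented behavior.

```python
import re
from typing import Optional, Sequence, Tuple

# Hypothetical shape of a schedule: (pattern, seconds) tuples, checked in order.
Schedule = Sequence[Tuple[str, float]]


def delay_for_url(
    url: str, schedule: Schedule, default: Optional[float] = None
) -> Optional[float]:
    """Return the delay of the first pattern that matches the URL.

    Assumption: the first matching pattern wins. Falls back to `default`
    when no pattern matches.
    """
    for pattern, seconds in schedule:
        if re.search(pattern, url):
            return seconds
    return default


def is_expired(
    age_secs: float, url: str, schedule: Schedule, default: Optional[float] = None
) -> bool:
    """Report whether a cached response of the given age should be expired."""
    limit = delay_for_url(url, schedule, default)
    # With no applicable limit, the response is never expired in this sketch.
    return limit is not None and age_secs >= limit


schedule = [(r"/news/", 3600), (r"\.pdf$", 86400)]
print(delay_for_url("https://example.com/news/today", schedule))
print(is_expired(100, "https://example.com/news/today", schedule))
```

A global default (such as the single "this many seconds" value described at the top of the list) would be passed as `default` here, so that pattern-specific delays override it only for matching URLs.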
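
The connection settings above (dialect, driver, hostname, port, database, username, password) correspond to the parts of a SQLAlchemy-style database URL of the form `dialect+driver://username:password@host:port/database`. The sketch below assembles such a string using only the standard library. The function `build_database_url` and its parameter names are hypothetical; how scrachy itself combines these settings is not shown in this section, and the schema and extra connection arguments are deliberately left out because they are not part of the URL.

```python
from typing import Optional
from urllib.parse import quote


def build_database_url(
    dialect: str,
    driver: Optional[str] = None,
    username: Optional[str] = None,
    password: Optional[str] = None,
    host: Optional[str] = None,
    port: Optional[int] = None,
    database: Optional[str] = None,
) -> str:
    """Assemble a URL string in the dialect+driver://user:pass@host:port/db form."""
    scheme = f"{dialect}+{driver}" if driver else dialect

    # Percent-encode credentials so characters like '@' or ':' stay unambiguous.
    auth = ""
    if username:
        auth = quote(username, safe="")
        if password:
            auth += ":" + quote(password, safe="")
        auth += "@"

    netloc = auth + (host or "")
    if port is not None:
        netloc += f":{port}"

    return f"{scheme}://{netloc}/{database or ''}"


# A file-based SQLite database needs only the dialect and the file path.
print(build_database_url("sqlite", database="cache.db"))

# A server-based database uses the remaining settings as well.
print(
    build_database_url(
        "postgresql", "psycopg2", "scrapy", "p@ss", "localhost", 5432, "cache"
    )
)
```

In real code, SQLAlchemy's own `sqlalchemy.engine.URL.create` is likely the better choice, since it handles escaping and query arguments itself; the string version here is only meant to show how the individual settings fit together.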