| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
| |
etc to improve deserialization performance with a factor 25 but now require python 2.4 of more recent
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@343 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
| |
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@342 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
| |
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@341 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
| |
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@336 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
in a map allowing them to be serialized
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@293 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
quoted strings
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@288 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
and add FIXME
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@287 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
| |
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@283 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
crawler.postprocess() functions
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@279 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
reserialized again
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@276 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
| |
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@270 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
| |
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@269 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
constructor's default value
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@267 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
deserialized links to avoid duplicates as a link can be deserialized multiple times
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@266 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
|
|
|
|
| |
could be a partial deserialize that needs more tweaking to the site before the call to crawl()
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@265 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|
|
deserializing all crawler state (site and links) to and from a file, this module is not called anywhere yet
git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@257 86f53f14-5ff3-0310-afe5-9b438ce3f40c
|