Arthur de Jong

Open Source / Free Software developer

summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
* get files ready for 1.9.8 release1.9.8Arthur de Jong2007-01-158-127/+213
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@309 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* catch any exception in HTTP module and report is as a ↵Arthur de Jong2007-01-151-0/+7
| | | | | | link problem git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@308 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* move section on webcheck design into HACKING documentArthur de Jong2007-01-153-30/+28
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@307 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* fix the bugreporting section to more clearly state the ↵Arthur de Jong2007-01-151-6/+16
| | | | | | needed information git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@306 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* switch to using latest syntax of python-supportArthur de Jong2007-01-133-3/+1
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@305 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* present sitemap with all basesArthur de Jong2006-10-231-1/+5
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@304 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add USE_ROBOTS optionArthur de Jong2006-10-231-0/+4
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@303 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* include list of bases in Site classArthur de Jong2006-10-231-10/+13
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@302 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* print reason why link is yanked if availableArthur de Jong2006-10-231-1/+6
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@301 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* include link to homepage in package descriptionArthur de Jong2006-09-291-0/+2
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@300 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* explicitly transform username and password to string in ↵Arthur de Jong2006-09-041-2/+2
| | | | | | case either one isn't supplied git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@299 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* also handle SSL related socket errors (e.g. SSL time-out)Arthur de Jong2006-07-131-1/+1
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@298 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add set_encoding method to Link object to do some basic ↵Arthur de Jong2006-07-133-14/+23
| | | | | | encoding sanity checks git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@297 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* get files ready for 1.9.7 release1.9.7Arthur de Jong2006-07-026-26/+442
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@295 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* always keep navigation on topArthur de Jong2006-06-291-0/+2
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@294 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* store internal, external and yanked regular expressions ↵Arthur de Jong2006-06-242-12/+12
| | | | | | in a map allowing them to be serialized git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@293 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* switch to using python-support and follow recent python ↵Arthur de Jong2006-06-233-3/+7
| | | | | | policy git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@292 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* split Build-Depends-Indep into Build-Depends and ↵Arthur de Jong2006-06-051-1/+2
| | | | | | Build-Depends-Indep git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@291 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* also install favicon.ico in deb package (plus cosmetic fix)Arthur de Jong2006-06-051-1/+2
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@290 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* fix typos and fix example explanationArthur de Jong2006-06-041-3/+4
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@289 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* do not split list of strings on comma's inside the ↵Arthur de Jong2006-06-041-2/+4
| | | | | | quoted strings git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@288 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* make DeSerializeException a class instead of a function ↵Arthur de Jong2006-06-041-1/+2
| | | | | | and add FIXME git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@287 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add --continue option to resume the crawling from the ↵Arthur de Jong2006-06-043-4/+40
| | | | | | point where the previous crawl stopped git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@286 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* handle break signals in all codeArthur de Jong2006-06-021-6/+6
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@285 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add code to serialize crawled data during crawl and ↵Arthur de Jong2006-06-021-1/+12
| | | | | | again after crawl git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@284 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* raise a custom exception instead of IOErrorArthur de Jong2006-06-021-9/+11
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@283 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add TODOsArthur de Jong2006-05-311-0/+2
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@282 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* upgrade to standards-version 3.7.2 (no changes needed)Arthur de Jong2006-05-311-1/+1
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@281 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* update feature list from deb package descriptionArthur de Jong2006-05-311-2/+3
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@280 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* split crawler.crawl() function into crawler.crawl() and ↵Arthur de Jong2006-05-163-7/+12
| | | | | | crawler.postprocess() functions git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@279 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* also serialize remaining links after crawlArthur de Jong2006-05-161-0/+8
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@278 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* remove anchor debugging statementsArthur de Jong2006-05-161-2/+0
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@277 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* flag deserialized links as changed so they will be ↵Arthur de Jong2006-05-161-0/+1
| | | | | | reserialized again git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@276 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* fix sortingArthur de Jong2006-05-161-1/+1
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@275 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* update link to fancytooltipsArthur de Jong2006-05-162-2/+2
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@274 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add makebackup option to open_file() so we can implement ↵Arthur de Jong2006-05-151-10/+15
| | | | | | updating files (e.g. serialization files) git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@273 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* fix some stupid typosArthur de Jong2006-05-151-3/+3
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@272 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add code to serialize links to a file while crawling the ↵Arthur de Jong2006-05-151-2/+16
| | | | | | site git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@271 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* import crawler late as to simplify dependenciesArthur de Jong2006-05-151-1/+1
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@270 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* fix typo in FIXMEArthur de Jong2006-05-151-3/+3
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@269 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* add _ischanged attribute to link objects to indicate ↵Arthur de Jong2006-05-151-0/+10
| | | | | | change since the constructor (or serialization) git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@268 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* only write serialized data if it is different from the ↵Arthur de Jong2006-05-151-10/+20
| | | | | | constructor's default value git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@267 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* clear anchors, linkproblems and pageproblems from to be ↵Arthur de Jong2006-05-151-0/+4
| | | | | | deserialized links to avoid duplicates as a link can be deserialized multiple times git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@266 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* remove the call to crawl() from deserialize as this ↵Arthur de Jong2006-05-151-3/+3
| | | | | | could be a partial deserialize that needs more tweaking to the site before the call to crawl() git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@265 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* make decoding try/fall-back code a lot simpler and ↵Arthur de Jong2006-05-151-12/+7
| | | | | | handle case where encoding is specified as empty string git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@264 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* improve warning text and add comment concerning trying ↵Arthur de Jong2006-05-121-1/+2
| | | | | | of encodings git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@263 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* ignore unknown entities instead of throwing an errorArthur de Jong2006-05-121-2/+5
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@262 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* include favicon.ico file in generated reportArthur de Jong2006-05-073-0/+3
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@261 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* ensure that we are not importing anything weird by using ↵Arthur de Jong2006-05-071-0/+9
| | | | | | invalid scheme names git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@260 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* support floats as parameter for --waitArthur de Jong2006-05-071-1/+1
| | | | git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@259 86f53f14-5ff3-0310-afe5-9b438ce3f40c