2007-11-04 08:27 arthur * [r362] debian/control: move Homepage pseudo header to control header and remove XS- prefix for Vcs tags 2007-11-04 07:38 arthur * [r361] webcheck.1: add comma for readability 2007-11-04 07:37 arthur * [r360] debian/copyright: fix reference to GPL file in common-licenses 2007-11-04 07:35 arthur * [r359] README: updated information about Python versions to use and add section about Distutils 2007-10-09 19:34 arthur * [r358] AUTHORS, crawler.py, myurllib.py, plugins/sitemap.py: add workaround for Python 2.3 (based on a patch by Claire Connelly ) 2007-09-17 19:11 arthur * [r357] parsers/html/beautifulsoup.py: add a warning if the used version of BeautifulSoup contains a bug 2007-07-22 14:32 arthur * [r356] config.py, webcheck.py: implement undocumented --profile option to write profiling information in output directory 2007-07-22 14:23 arthur * [r355] debian/control: remove old linbot provides/conflicts/replaces stuff as linbot last shipped in woody 2007-07-22 13:27 arthur * [r354] debian/control: update recommends to python-beautifulsoup version 3.0.2 or later since that version fixes a problem with find(attr=True) 2007-07-15 13:07 arthur * [r352] ChangeLog, NEWS, README, TODO, config.py, debian/changelog, webcheck.1: get files ready for 1.10.1 release 2007-07-15 08:23 arthur * [r351] crawler.py: output which parser module is used in debug mode 2007-07-15 08:13 arthur * [r350] ChangeLog: fix spelling in ChangeLog messages 2007-07-15 07:56 arthur * [r349] parsers/html/beautifulsoup.py: also handle http-equiv refresh meta header 2007-07-15 07:27 arthur * [r348] crawler.py: just ignore setting encoding to None 2007-07-14 18:20 arthur * [r347] crawler.py: fix printing of None encoding 2007-07-14 18:18 arthur * [r346] myurllib.py: simplify _normalize_escapes() function to improve performance 2007-07-14 10:26 arthur * [r345] myurllib.py: replace double slashes in file URL paths with single ones 2007-07-13 18:48 arthur * [r344] myurllib.py: add note about improving performance more 2007-07-13 18:47 arthur * [r343] crawler.py, plugins/__init__.py, plugins/sitemap.py, serialize.py: use sets instead of sequences for children, embedded, etc to improve deserialization performance with a factor 25 but now require python 2.4 of more recent 2007-07-13 13:56 arthur * [r342] serialize.py: give the matched URL a name to make code more readable 2007-07-13 13:55 arthur * [r341] serialize.py: be a little more verbose when raising parsing exceptions 2007-07-13 13:50 arthur * [r340] plugins/badlinks.py: get rid of unneeded sort 2007-07-07 14:02 arthur * [r339] crawler.py, myurllib.py, parsers/html/beautifulsoup.py, parsers/html/htmlparser.py: split out URL cleaning code into own module 2007-07-07 13:54 arthur * [r338] schemes/http.py: do not handle control-C and pass it along to the main exception handler and log http exceptions with a higher level 2007-07-07 13:39 arthur * [r337] debian/control: added XS-Vcs-Svn and XS-Vcs-Browser as specified in #391023 2007-07-06 14:18 arthur * [r336] crawler.py, serialize.py: improve deserialization and handling of Unicode strings 2007-07-06 13:51 arthur * [r335] plugins/problems.py, plugins/size.py: some extra precautions for handling Unicode data and correct HTML escaping 2007-05-12 20:57 arthur * [r333] ChangeLog, NEWS, README, TODO, config.py, debian/changelog, debian/copyright, webcheck.1: get files ready for 1.10.0 release 2007-05-12 07:49 arthur * [r332] crawler.py: also lower-case reqanchor 2007-05-11 22:03 arthur * [r331] crawler.py, plugins/anchors.py, plugins/badlinks.py, plugins/problems.py, schemes/http.py: fix some copyright dates 2007-05-11 22:01 arthur * [r330] config.py, webcheck.1, webcheck.py: switch robots.txt handling to default on again (broken in 1.9.8) and add new --ignore-robots option to be able to ignore robots retrieval 2007-05-09 19:58 arthur * [r329] webcheck.py: present the default number of redirects 2007-05-08 21:33 arthur * [r328] plugins/about.py: update copyright information 2007-04-24 20:09 arthur * [r327] plugins/__init__.py, plugins/badlinks.py, plugins/problems.py: fixes to make output XHTML 1.1 compliant 2007-04-24 18:53 arthur * [r326] parsers/html/beautifulsoup.py: handle ID attribute as anchor on any tag 2007-04-24 18:52 arthur * [r325] crawler.py, plugins/anchors.py: lower-case anchor and errors to include id as option 2007-04-20 10:11 arthur * [r324] parsers/html/beautifulsoup.py: correctly parse author information 2007-04-20 09:42 arthur * [r323] debian/control, parsers/html, parsers/html.py, parsers/html/__init__.py, parsers/html/beautifulsoup.py, parsers/html/htmlparser.py: introduce HTML parsing using BeautifulSoup with a fall-back mechanism to the old HTMLParser based solution 2007-04-20 09:40 arthur * [r322] crawler.py: mark encoding problems and output more debugging 2007-04-20 08:34 arthur * [r321] debian/changelog: fix formatting of previous changelog entry 2007-04-20 08:20 arthur * [r320] plugins/anchors.py: fix typo 2007-04-06 12:38 arthur * [r319] schemes/http.py: add workaround for bug in idna module 2007-04-06 12:31 arthur * [r318] crawler.py: add some comments to the follow_link() method 2007-04-06 12:29 arthur * [r317] crawler.py: make parsing of URLs and conversion to Link objects a little more consistent 2007-04-06 12:02 arthur * [r316] plugins/__init__.py: use consistent Unicode conversion 2007-04-06 11:46 arthur * [r315] webcheck.1: document the fact that --force should be used for non-interactive use 2007-04-06 11:35 arthur * [r314] plugins/__init__.py: bail out if reading user input failed 2007-03-31 11:39 arthur * [r313] parsers/html.py: evaluate archive attribute of tag instead of code attribute if that is present 2007-03-14 21:47 arthur * [r312] crawler.py: get rid of old base (singular) as bases is now used everywhere 2007-03-10 12:49 arthur * [r311] plugins/sitemap.py: clean up a little and simplify 2007-01-15 20:27 arthur * [r309] ChangeLog, NEWS, README, TODO, config.py, debian/changelog, webcheck.1, webcheck.py: get files ready for 1.9.8 release 2007-01-15 20:26 arthur * [r308] schemes/http.py: catch any exception in HTTP module and report is as a link problem 2007-01-15 08:25 arthur * [r307] HACKING, README, debian/rules: move section on webcheck design into HACKING document 2007-01-15 08:21 arthur * [r306] README: fix the bugreporting section to more clearly state the needed information 2007-01-13 08:19 arthur * [r305] debian/control, debian/pycompat, debian/rules: switch to using latest syntax of python-support 2006-10-23 21:09 arthur * [r304] plugins/sitemap.py: present sitemap with all bases 2006-10-23 21:08 arthur * [r303] config.py: add USE_ROBOTS option 2006-10-23 21:08 arthur * [r302] crawler.py: include list of bases in Site class 2006-10-23 21:05 arthur * [r301] plugins/__init__.py: print reason why link is yanked if available 2006-09-29 10:27 arthur * [r300] debian/control: include link to homepage in package description 2006-09-04 10:47 arthur * [r299] schemes/http.py: explicitly transform username and password to string in case either one isn't supplied 2006-07-13 20:47 arthur * [r298] schemes/http.py: also handle SSL related socket errors (e.g. SSL time-out) 2006-07-13 20:44 arthur * [r297] crawler.py, parsers/html.py, schemes/http.py: add set_encoding method to Link object to do some basic encoding sanity checks 2006-07-02 21:42 arthur * [r295] ChangeLog, NEWS, TODO, config.py, debian/changelog, webcheck.1: get files ready for 1.9.7 release 2006-06-29 21:09 arthur * [r294] webcheck.css: always keep navigation on top 2006-06-24 15:35 arthur * [r293] crawler.py, serialize.py: store internal, external and yanked regular expressions in a map allowing them to be serialized 2006-06-23 21:01 arthur * [r292] debian/control, debian/pycompat, debian/rules: switch to using python-support and follow recent python policy 2006-06-05 20:21 arthur * [r291] debian/control: split Build-Depends-Indep into Build-Depends and Build-Depends-Indep 2006-06-05 20:19 arthur * [r290] debian/rules: also install favicon.ico in deb package (plus cosmetic fix) 2006-06-04 21:28 arthur * [r289] webcheck.1: fix typos and fix example explanation 2006-06-04 21:09 arthur * [r288] serialize.py: do not split list of strings on comma's inside the quoted strings 2006-06-04 20:41 arthur * [r287] serialize.py: make DeSerializeException a class instead of a function and add FIXME 2006-06-04 20:40 arthur * [r286] config.py, webcheck.1, webcheck.py: add --continue option to resume the crawling from the point where the previous crawl stopped 2006-06-02 11:47 arthur * [r285] webcheck.py: handle break signals in all code 2006-06-02 11:42 arthur * [r284] webcheck.py: add code to serialize crawled data during crawl and again after crawl 2006-06-02 11:37 arthur * [r283] serialize.py: raise a custom exception instead of IOError 2006-05-31 20:24 arthur * [r282] parsers/html.py: add TODOs 2006-05-31 20:23 arthur * [r281] debian/control: upgrade to standards-version 3.7.2 (no changes needed) 2006-05-31 20:22 arthur * [r280] README: update feature list from deb package description 2006-05-16 19:18 arthur * [r279] crawler.py, serialize.py, webcheck.py: split crawler.crawl() function into crawler.crawl() and crawler.postprocess() functions 2006-05-16 19:07 arthur * [r278] crawler.py: also serialize remaining links after crawl 2006-05-16 19:05 arthur * [r277] crawler.py: remove anchor debugging statements 2006-05-16 18:23 arthur * [r276] serialize.py: flag deserialized links as changed so they will be reserialized again 2006-05-16 18:21 arthur * [r275] plugins/size.py: fix sorting 2006-05-16 18:19 arthur * [r274] plugins/about.py, webcheck.css: update link to fancytooltips 2006-05-15 21:30 arthur * [r273] plugins/__init__.py: add makebackup option to open_file() so we can implement updating files (e.g. serialization files) 2006-05-15 21:00 arthur * [r272] crawler.py: fix some stupid typos 2006-05-15 20:51 arthur * [r271] crawler.py: add code to serialize links to a file while crawling the site 2006-05-15 20:50 arthur * [r270] serialize.py: import crawler late as to simplify dependencies 2006-05-15 20:36 arthur * [r269] serialize.py: fix typo in FIXME 2006-05-15 20:35 arthur * [r268] crawler.py: add _ischanged attribute to link objects to indicate change since the constructor (or serialization) 2006-05-15 19:17 arthur * [r267] serialize.py: only write serialized data if it is different from the constructor's default value 2006-05-15 19:15 arthur * [r266] serialize.py: clear anchors, linkproblems and pageproblems from to be deserialized links to avoid duplicates as a link can be deserialized multiple times 2006-05-15 19:13 arthur * [r265] serialize.py: remove the call to crawl() from deserialize as this could be a partial deserialize that needs more tweaking to the site before the call to crawl() 2006-05-15 17:26 arthur * [r264] parsers/html.py: make decoding try/fall-back code a lot simpler and handle case where encoding is specified as empty string 2006-05-12 21:32 arthur * [r263] parsers/html.py: improve warning text and add comment concerning trying of encodings 2006-05-12 21:23 arthur * [r262] parsers/html.py: ignore unknown entities instead of throwing an error 2006-05-07 10:31 arthur * [r261] favicon.ico, plugins/__init__.py, webcheck.py: include favicon.ico file in generated report 2006-05-07 10:31 arthur * [r260] schemes/__init__.py: ensure that we are not importing anything weird by using invalid scheme names 2006-05-07 10:26 arthur * [r259] webcheck.py: support floats as parameter for --wait 2006-05-07 10:25 arthur * [r258] webcheck.1: fix usage of dash 2006-05-07 10:19 arthur * [r257] serialize.py: add serialize module that allows serializing and deserializing all crawler state (site and links) to and from a file, this module is not called anywhere yet 2006-05-07 09:56 arthur * [r256] crawler.py: fix typo in docstring and add comment 2006-05-07 09:36 arthur * [r255] parsers/html.py, plugins/__init__.py: move html escaping and unescaping functions to parsers.html 2006-05-07 09:25 arthur * [r254] parsers/html.py: use unichr() to generate Unicode characters, not chr() 2006-05-07 09:20 arthur * [r253] schemes/file.py: return None explicitly 2006-05-07 09:12 arthur * [r252] crawler.py, parsers/html.py, plugins/__init__.py, schemes/file.py, schemes/ftp.py: some more small code improvements thanks to pychecker 2006-05-06 15:44 arthur * [r251] parsers/html.py: implement checking for id and name tags in anchors 2006-05-06 15:28 arthur * [r250] schemes/ftp.py, schemes/https.py, webcheck.css: bump copyright notices 2006-04-27 21:53 arthur * [r249] crawler.py: also add all unfetched links from a site to make this method recallable 2006-04-27 21:47 arthur * [r248] crawler.py: make get_link() function a public class function 2006-04-27 21:43 arthur * [r247] crawler.py: move URL checking bit to right function and improve anchor debugging messages even further 2006-04-27 21:39 arthur * [r246] plugins/__init__.py: fix remaining references to escape instead of htmlescape 2006-04-27 21:32 arthur * [r245] crawler.py: support passing a URL to add_reqanchor() plus some minor comments changes 2006-04-27 21:25 arthur * [r244] webcheck.py: handle problems in regular expressions passed on the command line a little more gracefully 2006-04-23 14:52 arthur * [r243] plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/problems.py: rename escape() function to htmlescape() to make it a little clearer what we're escaping 2006-04-23 11:31 arthur * [r242] TODO, config.py, crawler.py, debugio.py, parsers/__init__.py, parsers/css.py, parsers/html.py, plugins/__init__.py, plugins/about.py, plugins/anchors.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/sitemap.py, plugins/size.py, plugins/urllist.py, schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py, schemes/https.py, webcheck.py: code improvements thanks to pylint 2006-04-23 11:26 arthur * [r241] plugins/__init__.py: also sort parent list by URL if titles are the same 2006-04-23 11:25 arthur * [r240] schemes/http.py: also properly handle time-out problems which only pass one parameter with the exception 2006-04-11 21:40 arthur * [r239] config.py, schemes/http.py: implement a time-out setting with a default of 10 seconds 2006-04-11 21:35 arthur * [r238] webcheck.css: revert to borderless links as they look ugly in some (most) cases 2006-04-11 21:06 arthur * [r237] config.py, plugins/size.py, plugins/slow.py: rename slow plugin to size 2006-04-07 17:58 arthur * [r236] parsers/html.py: do not fail on unknown encodings (fall back to system encoding) and add some TODOs to do extra encoding checking 2006-03-26 19:05 arthur * [r235] crawler.py, parsers/html.py: split urlescape() from _urlclean() and ensure that all anchors are consistently URL-encoded 2006-03-26 19:01 arthur * [r234] plugins/anchors.py: only report missing anchors for pages that were fetched and some clean-ups 2006-03-26 18:58 arthur * [r233] webcheck.css: put a border around links 2006-03-26 16:47 arthur * [r232] plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/slow.py: properly close html files on no output 2006-03-10 23:02 arthur * [r231] parsers/html.py: revert catching Exception instead of IOError that was there for testing 2006-03-10 22:58 arthur * [r230] config.py, crawler.py, parsers/html.py, plugins/anchors.py: implement checking of anchors (there should be no double anchors and all referenced anchors should exist) 2006-03-10 22:56 arthur * [r229] plugins/__init__.py: do not include navigation for plugins that do not generate output 2006-03-10 22:48 arthur * [r228] parsers/html.py, plugins/notitles.py: trim spaces from title and author fields and check that title is not empty string (apart from undefined) 2006-03-09 22:10 arthur * [r227] plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, webcheck.py: restructure plugin code to open output files from within plugin itself to be able to write different kinds of files 2006-01-30 16:27 arthur * [r225] ChangeLog, NEWS, README, TODO, config.py, debian/changelog, debian/copyright, webcheck.1, webcheck.py: get files ready for 1.9.6 release 2006-01-29 22:39 arthur * [r224] crawler.py: bug fix in matching URL-encoding 2006-01-29 21:24 arthur * [r223] crawler.py: actually decode URL-encoded character as hex not decimal 2006-01-29 20:50 arthur * [r222] fancytooltips/fancytooltips.js: html escape all content that is retreived from attributes 2006-01-29 20:48 arthur * [r221] crawler.py, parsers/html.py: make sure all URLs are consistently URL-encoded where it counts 2006-01-29 20:15 arthur * [r220] schemes/http.py: add some more debugging information (cache hit or miss) 2006-01-29 20:14 arthur * [r219] plugins/about.py: update copyright notice and indicate that we're using GPL2+ 2006-01-25 23:16 arthur * [r218] parsers/html.py: fix typo (thanks Andrew Kim ) 2006-01-19 21:38 arthur * [r217] plugins/__init__.py: ignore errors when converting to Unicode string and uses system encoding instead of UTF-8 as default 2006-01-19 21:35 arthur * [r216] plugins/__init__.py: also escape the URL when generating links 2006-01-19 20:46 arthur * [r215] plugins/__init__.py: explicitly convert strings to Unicode to avoid potential problems with non-ASCII characters in strings 2006-01-19 20:45 arthur * [r214] parsers/html.py: quote links so that they do not contain any non-ASCII characters to avoid problems later on (and add some more debugging) 2006-01-19 20:32 arthur * [r213] crawler.py: fix debug message to print url instead of object reference 2006-01-15 08:44 arthur * [r212] crawler.py: give some more debugging info while following base URLs and no longer delete unreferenced followed links 2005-12-30 22:33 arthur * [r210] ChangeLog, NEWS, TODO, config.py, debian/changelog, webcheck.1: get files ready for 1.9.5 release 2005-12-30 22:09 arthur * [r209] crawler.py: fix copy-pasto from r204 2005-12-30 21:21 arthur * [r208] webcheck.1: add some clarifications to --internal and URL classes sections 2005-12-30 20:45 arthur * [r207] debian, debian/changelog, debian/compat, debian/control, debian/copyright, debian/postinst, debian/rules: import updated Debian package configuration data, partially from old webcheck package 2005-12-29 00:53 arthur * [r206] crawler.py, schemes/http.py: trim empty ports (http://host:/) from URLs and do not crash on improperly formatted URLs 2005-12-29 00:51 arthur * [r205] plugins/slow.py: fix typo 2005-12-28 22:29 arthur * [r204] crawler.py, webcheck.1, webcheck.py: add --internal option to match internal URLs with a regular expression 2005-12-28 21:37 arthur * [r203] webcheck.1: clarify section on URL classes that yanked URLs can be internal or external and some typo fixes 2005-12-28 21:26 arthur * [r202] AUTHORS: add Stefan Schröder to the contributors list 2005-12-28 21:23 arthur * [r201] plugins/about.py: make text even shorter 2005-12-28 00:10 arthur * [r200] plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/slow.py: first generate (with filter and lambda expressions) a list of links that should be reported by the plugin and just then present the result, including a nicer message when there is nothing to report 2005-12-28 00:08 arthur * [r199] plugins/about.py: make copyright information a little more compact 2005-12-27 21:51 arthur * [r198] plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py: move printing of description into plugin instead of from __init__.py 2005-12-27 21:23 arthur * [r197] plugins/about.py: fix indenting and closing li of generated html code 2005-12-27 21:16 arthur * [r196] plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/slow.py: replace backslashes from end of lines where they are not required 2005-12-27 20:52 arthur * [r195] webcheck.css: give more areas a rounder look, change opacity of tooltips and try to use some css 3.0 attributes 2005-12-27 20:26 arthur * [r194] plugins/about.py: include reference to FancyTooltips in about screen 2005-12-27 20:26 arthur * [r193] README: s/contains/includes/ FancyTooltips 2005-12-26 08:47 arthur * [r192] schemes/http.py: catch all relevant exceptions when looking up content-type header 2005-12-26 08:46 arthur * [r191] parsers/html.py: bug fix to handle numeric character references better (Unicode characters) 2005-12-17 22:22 arthur * [r190] README, plugins/__init__.py, webcheck.css, webcheck.py: reference and install FancyTooltips from webcheck 2005-12-17 22:08 arthur * [r189] fancytooltips/fancytooltips.js: local customisations of fancyurltips: don't trim long strings and replace newlines with html
's 2005-12-17 21:43 arthur * [r188] fancytooltips, fancytooltips/fancytooltips.css, fancytooltips/fancytooltips.js, fancytooltips/readme.txt: import FancyTooltips 1.2.1 from http://victr.lm85.com/Design/css/fancytooltips-a-la-victr.php 2005-12-17 20:34 arthur * [r187] webcheck.py: update --help output to take multiple base URLs into account 2005-12-17 18:32 arthur * [r186] README, config.py, crawler.py, debugio.py, parsers/__init__.py, parsers/css.py, parsers/html.py, plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py, schemes/https.py, webcheck.1, webcheck.py: add copyright clarification to specify that generated output files are not covered by our copyright 2005-12-17 17:57 arthur * [r185] schemes/http.py: remove trailing : from netloc if it is present 2005-12-17 17:10 arthur * [r184] crawler.py: fix wrapping of text in pydoc 2005-12-17 17:09 arthur * [r183] webcheck.1: add section to document URL classes 2005-09-18 14:55 arthur * [r182] config.py, schemes/http.py: add configuration option to disable proxy caching 2005-09-18 14:26 arthur * [r181] webcheck.1, webcheck.py: add long command-line options as equivalents for the short options 2005-09-17 20:54 arthur * [r180] plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/problems.py: implement out own proper escape function and use it instead of the functions from saxutils (this one escapes as much as possible to result in a 7 bit clean file 2005-09-17 16:05 arthur * [r179] crawler.py, parsers/html.py, plugins/__init__.py, plugins/problems.py: store author and title in Unicode internally and ensure that they are output as UTF-8 2005-09-17 15:58 arthur * [r178] parsers/html.py: also try to get character encoding from XML declaration and http-equiv meta tag 2005-09-17 15:55 arthur * [r177] plugins/__init__.py: fix typo 2005-09-17 15:40 arthur * [r176] parsers/html.py: parse character entries as normal data, these entities will be expanded later on (they are also used in attribute values 2005-09-17 15:21 arthur * [r175] crawler.py, schemes/http.py: try to extract character encoding from http response and store it in the link object 2005-09-16 21:38 arthur * [r174] plugins/__init__.py: improve code and documentation of the open_file() function, adding an istext flag (defaults to True) to open files as text 2005-09-16 19:51 arthur * [r173] webcheck.py: do not prepend output directory twice (thanks to Stefan Schröder for spotting this) 2005-09-16 09:48 arthur * [r172] webcheck.py: turn error into warning 2005-09-13 20:49 arthur * [r171] schemes/http.py: support basic authentication for http proxies and some initial fixes to get proxying HTTPS traffic working 2005-09-10 08:10 arthur * [r170] plugins/about.py: present some more information about webcheck and the generated report instead of a plain list of plugins (and change names and descriptions where needed) 2005-09-10 07:50 arthur * [r169] plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py: remove version numbers from plugins since they were not really used or useful 2005-09-10 07:48 arthur * [r168] schemes/__init__.py: fix wrapping of documentation 2005-09-03 09:20 arthur * [r167] NEWS: some fixes to NEWS file 2005-09-03 09:05 arthur * [r163] ChangeLog, NEWS, TODO, config.py, webcheck.1: get files ready for 1.9.4 release 2005-09-01 21:04 arthur * [r162] plugins/__init__.py, webcheck.py: make error handling more robust and have consistent error messages 2005-09-01 20:12 arthur * [r161] AUTHORS: add Herbert Weinhandl to contributors 2005-09-01 20:11 arthur * [r160] README: add some design notes for developers 2005-09-01 20:10 arthur * [r159] webcheck.py: add extra checks not to overwrite our own input file while copying files into place 2005-09-01 20:06 arthur * [r158] plugins/__init__.py: typo fix 2005-09-01 18:47 arthur * [r157] plugins/__init__.py, webcheck.css: highlight current plugin in the navigation, based on a patch by Herbert Weinhandl 2005-08-30 17:47 arthur * [r156] config.py, plugins/__init__.py: make specifying of target in links configurable (disabled by default to keep page valid XHTML 1.1) 2005-08-25 19:27 arthur * [r155] crawler.py: add note about making instances of Link class 2005-08-23 15:15 arthur * [r154] webcheck.py: handle passing file names (instead of URLs) on the command line 2005-08-23 15:14 arthur * [r153] webcheck.py: add initial support for passing URLs to install_file() function 2005-08-23 14:29 arthur * [r152] plugins/badlinks.py: include transfer problem in pageproblem description 2005-08-23 14:28 arthur * [r151] plugins/problems.py: make problem lists sorted by URL and problem description 2005-08-21 18:18 arthur * [r150] plugins/about.py: include short description in plugin overview page 2005-08-21 14:23 arthur * [r149] AUTHORS: add some other people to the AUTHORS file, mostly based on contents of the Debian bug tracking system 2005-08-20 16:32 arthur * [r148] parsers/html.py: also feed style tag content to the CSS parser to parse inline CSS 2005-08-20 16:31 arthur * [r147] parsers/css.py: remove some debugging functions from CSS parser 2005-08-20 16:30 arthur * [r146] parsers/css.py: first attempt at a very simple CSS parser that just summarises links to images and imported CSS files 2005-08-20 09:24 arthur * [r145] crawler.py, plugins/__init__.py, schemes/http.py: set status to result of fetching the document (not an error indicator) 2005-08-20 08:06 arthur * [r144] parsers/html.py: add checking of unescaped spaces to the html parser, including line and column information 2005-08-19 20:48 arthur * [r143] webcheck.py: pass site as parameter to parse_args() instead of declaring it global 2005-08-19 20:44 arthur * [r142] crawler.py: fix bug with following redirects where otherwise unreferenced links were removed and implement redirect loop detection 2005-08-19 20:27 arthur * [r141] crawler.py, schemes/file.py, schemes/ftp.py, schemes/http.py: move redirect handling code to crawler module, including redirect loop detection code 2005-08-19 20:24 arthur * [r140] plugins/badlinks.py: fix html bug and improve bad link string 2005-08-19 18:16 arthur * [r139] plugins/badlinks.py, plugins/new.py, plugins/old.py, plugins/problems.py, plugins/slow.py, webcheck.css: change html display of problems to a nicer list 2005-08-19 18:14 arthur * [r138] crawler.py, parsers/html.py, plugins/__init__.py, plugins/badlinks.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/slow.py, schemes/file.py, schemes/ftp.py, schemes/http.py: split problems into page problems (parsing errors, wrong links, etc) and link problems (errors retreiving the document) 2005-08-16 20:50 arthur * [r136] ChangeLog, NEWS, TODO, config.py, webcheck.1: get files ready for 1.9.3 release 2005-08-16 20:36 arthur * [r135] config.py, schemes/file.py, schemes/ftp.py: pick up configured filenames if present in directories 2005-08-16 18:25 arthur * [r134] schemes/ftp.py: add extra debugging info 2005-08-13 19:19 arthur * [r133] schemes/ftp.py: use a pool of ftp connections to keep ftp connection to a host open to do multiple requests (this greatly speeds up crawling of ftp sites) 2005-08-13 19:08 arthur * [r132] schemes/ftp.py: almost complete reimplementation of the ftp scheme, handling errors more gracefully and also crawl normal ftp directories 2005-08-13 19:06 arthur * [r131] plugins/__init__.py: add missing newline and trim trailing newline of extra link info 2005-08-12 19:04 arthur * [r130] schemes/file.py: complete reimplementation of file module, reading index.html from directory, otherwise read directory contents 2005-08-12 18:20 arthur * [r129] schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py: rename parameter to acceptedtypes to not conflict with mimetypes module 2005-08-12 17:27 arthur * [r128] crawler.py, parsers/__init__.py, schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py: also pass mimetypes to scheme modules to only fetch content if we can parse the content type 2005-08-12 17:02 arthur * [r127] plugins/__init__.py: don't print referenced from if there are no parents 2005-08-12 16:57 arthur * [r126] crawler.py: add checkurl method to clean up URLs and report problems (currently only checks for spaces in URLs) 2005-08-12 16:55 arthur * [r125] parsers/html.py: put compiled regular expression on module level so that it is compiled only once 2005-08-12 16:52 arthur * [r124] webcheck.css: small fix to render menu better under MSIE 2005-08-11 21:41 arthur * [r123] plugins/__init__.py: add some extra information to every link with a nicely formatted size 2005-08-01 17:58 arthur * [r122] parsers/html.py: make parsing handle errors a little more gracefully, thanks to Stefan Schröder for all the testing 2005-07-31 20:58 arthur * [r120] ChangeLog, NEWS, TODO, config.py: get files ready for 1.9.2 release 2005-07-31 20:44 arthur * [r119] parsers/html.py: also catch AttributeError for problem in HTMLParser not fully supporting continuing after errors 2005-07-31 10:50 arthur * [r118] README: add note about supported versions of python 2005-07-31 09:45 arthur * [r117] parsers/html.py: replace numeric entity refs with their proper values based on patch by Eric W.Brown 2005-07-31 09:21 arthur * [r116] parsers/html.py: put new html parser in place 2005-07-31 09:14 arthur * [r115] schemes/https.py: add https module as a wrapper to the http module 2005-07-31 09:02 arthur * [r114] crawler.py: while cleaning URLs also make host part lower-case and also clean added internal URLs 2005-07-30 15:34 arthur * [r113] crawler.py: fix a thinko 2005-07-30 15:32 arthur * [r112] crawler.py: fix typo 2005-07-30 15:20 arthur * [r111] crawler.py: follow_link() now returns None when trying to follow a redirect who's target is not crawled, also don't add children and embeds when we are an external link 2005-07-30 14:05 arthur * [r110] plugins/__init__.py: remove version and author from module as no other module has one (except the plugins themselves) 2005-07-30 14:04 arthur * [r109] config.py: remove support for extra configurable headers 2005-07-30 14:04 arthur * [r108] schemes/http.py: reimplement http module to be a little more generic and clean and handle errors cleaner and more consistently 2005-07-30 14:00 arthur * [r107] crawler.py: give second search through website a slightly different debug message 2005-07-30 13:59 arthur * [r106] crawler.py: also ignore io errors when retrieving robots.txt files 2005-07-30 13:59 arthur * [r105] crawler.py: make a _urlclean() function to always store a proper URL without a fragment and with at least a slash for URLs with path elements 2005-07-30 13:55 arthur * [r104] README: some minor tweaks in the documentation 2005-07-29 14:36 arthur * [r103] crawler.py: import time as we need it for sleep 2005-07-29 14:32 arthur * [r102] crawler.py, plugins/sitemap.py: do an extra breadth first traversal of the site to combine links into pages, combining page children and determining depth of every page and using all this in the sitemap 2005-07-29 10:20 arthur * [r101] AUTHORS, README, config.py, webcheck.1: change email address from arthur@tiefighter.et.tudelft.nl to arthur@ch.tudelft.nl (including URLs etc) 2005-07-29 10:18 arthur * [r100] webcheck.css: remove another reference of an email address 2005-07-29 10:11 arthur * [r99] NEWS, README, config.py, crawler.py, debugio.py, parsers/__init__.py, parsers/css.py, parsers/html.py, plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/new.py, plugins/notchkd.py, plugins/notitles.py, plugins/old.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py, webcheck.py: remove references to email addresses where they are not useful, based on a partial patch by Evelyn Mitchell 2005-07-27 20:38 arthur * [r98] plugins/__init__.py, plugins/badlinks.py, plugins/problems.py, plugins/sitemap.py: fix a couple of typos, also thanks to Scott Kirkwood for spotting another one 2005-07-27 20:32 arthur * [r97] crawler.py: turn tocheck list into fifo queue 2005-07-26 20:40 arthur * [r96] plugins/new.py, plugins/old.py: fix typo spotted by Scott Kirkwood 2005-07-25 17:29 arthur * [r94] ChangeLog, NEWS, config.py: get files ready for 1.9.1 release 2005-07-25 17:17 arthur * [r93] webcheck.1: fix typo, thanks to Stefan Schröder 2005-07-25 17:16 arthur * [r92] plugins/slow.py: only report on internal links 2005-07-25 17:13 arthur * [r91] parsers/css.py: empty module as place holder to parse CSS (referenced from __init__.py already) 2005-07-25 17:11 arthur * [r90] parsers/html.py: don't replace an already set title 2005-07-24 09:32 arthur * [r88] ChangeLog: add ChangeLog for release 2005-07-24 09:30 arthur * [r87] NEWS, TODO: get files ready for release 2005-07-24 08:56 arthur * [r86] README: clean up README removing sections that should be in the manual page 2005-07-24 08:55 arthur * [r85] config.py, plugins/new.py, plugins/old.py, plugins/whatsnew.py, plugins/whatsold.py: rename whatsold and whatsnew plugins to old and new 2005-07-24 08:52 arthur * [r84] schemes/http.py: handle socket errors properly 2005-07-24 08:52 arthur * [r83] schemes/http.py: fix for incomplete change in r76, now version should not be referenced any more 2005-07-24 08:49 arthur * [r82] plugins/__init__.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py: call make_link() with a link object instead of a URL, removing the need for a mySite in plugins 2005-07-24 08:47 arthur * [r81] plugins/badlinks.py: remove HTTP status code handling from here as this should be done by the HTTP module 2005-07-24 08:47 arthur * [r80] plugins/whatsnew.py, plugins/whatsold.py: only report on internal links 2005-07-24 08:46 arthur * [r79] crawler.py: only add links to crawl list if they are not in there allready 2005-07-24 08:45 arthur * [r78] debugio.py: flush stdout after each message so that redirecting stdout and stderr together to a file works reliably 2005-07-23 14:02 arthur * [r77] crawler.py: fix regular expression matching 2005-07-23 12:55 arthur * [r76] config.py, plugins/__init__.py, schemes/http.py, version.py, webcheck.1, webcheck.py: integrate versio.py into config.py, clean up config.py removing unused settings and clean up boolean types 2005-07-23 11:00 arthur * [r75] config.py, webcheck.1, webcheck.py: remove logo option since the current output does not use one 2005-07-23 10:53 arthur * [r74] schemes/file.py: most systems already know about .shtml files 2005-07-23 08:34 arthur * [r73] BUGS, INSTALL, README, webcheck.1: first step in cleaning up documentation, integrating INSTALL in README and BUGS in manual page and adding section on robots handling in manual 2005-07-23 08:28 arthur * [r72] AUTHORS, crawler.py, debugio.py, parsers/html.py, plugins/__init__.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/file.py, schemes/ftp.py, schemes/http.py, version.py, webcheck.1, webcheck.py: Mike Meyer -> Mike W. Meyer 2005-07-22 21:21 arthur * [r71] crawler.py: add support for sleep between requests 2005-07-22 21:11 arthur * [r70] webcheck.py: don't add . to python path as it's not needed and put command line handling in same order as options 2005-07-22 21:05 arthur * [r69] plugins/__init__.py, webcheck.css: change layout to have a simpler layout that also should work in MSIE 2005-07-22 21:04 arthur * [r68] debugio.py: fix docstrings 2005-07-22 21:01 arthur * [r67] plugins/__init__.py, webcheck.py: do not use start_time from webcheck saving an import 2005-07-22 19:17 arthur * [r66] crawler.py, myUrlLib.py, parsers/__init__.py, parsers/html.py, plugins/__init__.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py, webcheck.py: almost complete rewrite of crawling and site state code making children and parents link objects instead of URLs and giving link member variables better names, change plugins accordingly, make scheme handling more pluggable and only use one function call and have a better pluggable structure for content parsing (currently only html) 2005-07-17 08:46 arthur * [r65] myUrlLib.py, plugins/__init__.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notitles.py, plugins/problems.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/file.py, schemes/ftp.py, schemes/http.py, webcheck.py: use lower-case URL attribute in Link instead of upper-case URL 2005-07-16 15:35 arthur * [r64] plugins/__init__.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py, webcheck.py: move functionality of rptlib.py to __init__.py so that we can just use the plugins package 2005-07-16 15:33 arthur * [r63] plugins/__init__.py: remove __init__.py to be replaced by contents of rptlib.py 2005-07-16 10:24 arthur * [r62] webcheck.1: add note about pattern matching 2005-07-10 14:08 arthur * [r61] myUrlLib.py, schemes/__init__.py, schemes/file.py, schemes/ftp.py, schemes/http.py: rework scheme code to use more logical function names, more clearly mark internal functions and do some major clean-up of the scheme modules code 2005-07-10 12:26 arthur * [r60] myUrlLib.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/file.py, schemes/http.py: store mtime in link object instead of age in days 2005-07-10 12:00 arthur * [r59] schemes/ftp.py, webcheck.py: remove unneeded import and print 2005-07-09 20:22 arthur * [r58] htmlparse.py, myUrlLib.py, parsers, parsers/__init__.py, parsers/html.py: move htmlparse to a more generic parsers package, cleaning up the code and simplifying dependencies 2005-07-09 13:54 arthur * [r57] plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py, webcheck.css, webcheck.py: clean up HTML output generating XHTML 1.1 without frames and using CSS for styling also getting rid of the images 2005-07-04 21:25 arthur * [r56] config.py: put plugins in a more logical order 2005-07-04 20:39 arthur * [r55] plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py: implement consistent sorting of all lists removing sort functions from rptlib and using lambda functions where needed 2005-07-03 07:04 arthur * [r54] config.py, plugins/rptlib.py, schemes/http.py, webcheck.1: handle and document proxy settings with environment variables 2005-07-03 06:36 arthur * [r53] INSTALL, README, config.py, myUrlLib.py, plugins/rptlib.py, schemes/http.py, webcheck.1, webcheck.py: name webcheck with lower case 2005-06-28 20:32 arthur * [r52] schemes/http.py: clean up get_reply() function to uses proper recursion and don't use self where it doesn't make sense 2005-06-22 19:24 arthur * [r51] COPYING, debugio.py, htmlparse.py, myUrlLib.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/urllist.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/file.py, schemes/ftp.py, schemes/http.py, version.py, webcheck.1, webcheck.py: change to most recent version of the GPL (FSF address change) and update notices 2005-06-18 19:59 arthur * [r50] plugins/external.py: sort external links by URL 2005-06-18 13:48 arthur * [r49] webcheck.py: split main() part into it's own function 2005-06-18 13:32 arthur * [r48] plugins/rptlib.py, webcheck.py: restructure a couple of things to reduce the number of mutual imports and reduce the number of stuff gathered in webcheck.py 2005-06-18 13:31 arthur * [r47] config.py, plugins/urllist.py: add simple urllist plugin to list all visited URLs 2005-06-18 13:20 arthur * [r46] plugins/sitemap.py: only include internal links in sitemap 2005-06-18 12:49 arthur * [r45] config.py, webcheck.py: add problems plugin to config instead of hard-coding 2005-06-18 10:25 arthur * [r44] plugins/rptlib.py: remove ugly redirection for overwrite file question since we now write all html through a file descriptor 2005-06-15 21:01 arthur * [r43] TODO, myUrlLib.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/http.py, webcheck.py: pass reference to Link class to plugins with parameter and make import config where it is used instead of accessing it through another module 2005-06-15 20:55 arthur * [r42] myUrlLib.py, plugins/rptlib.py, plugins/sitemap.py, webcheck.py: make use of base consistent, do not modify it to make a nicer URL (at least not now) and do not overwrite it with something silly from webcheck.py 2005-06-14 19:17 arthur * [r41] myUrlLib.py: also set URL attribute on yanked links 2005-06-12 06:21 arthur * [r40] plugins/badlinks.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py: again use the URL as link title for some links 2005-06-11 21:52 arthur * [r39] httpcodes.py, plugins/about.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py: general clean-up of plugins structure and code, moving httpcodes to the only place they were used, cleaning up plugin titles, version numbers and descriptions, adding docstrings and using slightly more logical and consistent names (plus some other cleanups) 2005-06-11 21:39 arthur * [r38] plugins/rptlib.py: make_link(): if no title is specified, try to look up the title of the page and fall back to the URL as title 2005-06-11 21:24 arthur * [r37] plugins/about.py: adapt plugin to using file descriptor etc 2005-06-11 18:52 arthur * [r36] contrib, plugins/about.py: move about plugin to plugins directory 2005-06-08 19:29 arthur * [r35] plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, webcheck.py: write html files using file descriptors instead of through redirection using stdout, split writing of navigation frame and plugin pages plus some minor clean-ups to calling plugins 2005-06-08 19:10 arthur * [r34] plugins/__init__.py, schemes/__init__.py: claiming copyright on empty files is silly 2005-06-06 21:22 arthur * [r33] debugio.py, htmlparse.py, myUrlLib.py, plugins/rptlib.py, schemes/ftp.py, schemes/http.py, webcheck.1, webcheck.py: redo output writing using a cleaner debugio and change debug command line option 2005-06-06 20:11 arthur * [r32] plugins/badlinks.py, plugins/notchkd.py: replace a couple more tabs 2005-06-06 20:05 arthur * [r31] webcheck.1: initial version of manual page loosely based on documentation 2005-06-06 19:22 arthur * [r30] AUTHORS: added myself as copyright holder and added Bastian Kleineidam (previous Debian package maintainer) as contributor 2005-06-06 19:20 arthur * [r29] webcheck.py: small text improvement 2005-05-27 20:39 arthur * [r28] webcheck.sh: remove unneeded shell script 2005-05-27 20:28 arthur * [r27] webcheck.py: also support --force 2005-05-27 20:18 arthur * [r26] webcheck.py: redo command-line checking 2005-04-13 19:41 arthur * [r25] contrib/plugins/about.py: general cleanup 2005-04-13 19:41 arthur * [r24] plugins/sitemap.py: rework recursion to make it simpler plus some general cleanups 2005-04-13 19:20 arthur * [r23] contrib/plugins/about.py, myUrlLib.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/http.py, webcheck.py: rename linkList to linkMap 2005-04-13 19:18 arthur * [r22] myUrlLib.py, robotparser.py: remove local copy of robotparser, just use python\'s 2005-04-09 20:03 arthur * [r21] myUrlLib.py: qualify references to types functions 2005-04-09 13:48 arthur * [r20] htmlparse.py, myUrlLib.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/rptlib.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/http.py: indent with spaces instead of tabs (tabs are evil) 2005-04-08 21:31 arthur * [r19] myUrlLib.py: move finding of scheme module to separate function 2005-04-08 21:25 arthur * [r18] schemes/http.py: rebump loglevel to debug 2005-04-08 16:24 arthur * [r17] myUrlLib.py, schemes/file.py, schemes/filelink.py, schemes/ftp.py, schemes/ftplink.py, schemes/http.py, schemes/httplink.py: remove link part from scheme modules 2005-04-07 22:37 arthur * [r16] schemes/httplink.py: clean up http request code a little and do not set host header (it is sent by HTTPConnection already 2005-04-07 20:29 arthur * [r15] contrib/plugins/about.py, debugio.py, htmlparse.py, httpcodes.py, myUrlLib.py, plugins/__init__.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/__init__.py, schemes/filelink.py, schemes/ftplink.py, version.py, webcheck.py: make nicer file (copyrights) headers 2005-04-07 20:23 arthur * [r14] schemes/httplink.py: fix problem with incorrect indent 2005-04-07 20:06 arthur * [r13] config.py, httpcodes.py, plugins/notitles.py: tabs to spaces (tabs are evil) 2005-04-07 20:05 arthur * [r12] config.py, contrib/plugins/about.py, httpcodes.py, plugins/badlinks.py, plugins/external.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, schemes/filelink.py, schemes/ftplink.py, schemes/httplink.py: tabs to spaces (tabs are evil) 2005-04-07 20:04 arthur * [r11] AUTHORS, schemes/httplink.py: include patch from Sebastien Delafond (from http://bugs.debian.org/286017) to fix problems with recent versions of python 2005-04-06 19:38 arthur * [r10] INSTALL, config.py, htmlparse.py, plugins/images.py, plugins/rptlib.py, schemes/ftplink.py, schemes/httplink.py, webcheck.css, webcheck.py: import Debian package patches 2005-03-31 12:47 arthur * [r9] COPYING: install updated file without millennium bug 2005-03-31 12:45 arthur * [r8] AUTHORS: reformat file to better match suggested layout 2005-03-31 12:44 arthur * [r7] NEWS: put news items in a little more standard format 2005-03-31 12:42 arthur * [r6] AUTHORS, CHANGES, CREDITS, ChangeLog-1999, ChangeLog-2002, HISTORY, HISTORY.linbot, NEWS: rename files to more standard names 2005-03-31 12:32 arthur * [r5] config.py, plugins/rptlib.py, version.py: remove checks for updates (registry) 2005-03-31 12:28 arthur * [r4] ., contrib, contrib/plugins, plugins, schemes: ignore compiled python objects 2005-03-29 12:08 arthur * [r2] BUGS, CHANGES, COPYING, CREDITS, HISTORY, HISTORY.linbot, INSTALL, README, TODO, config.py, contrib, contrib/plugins, contrib/plugins/about.py, debugio.py, htmlparse.py, httpcodes.py, myUrlLib.py, plugins, plugins/__init__.py, plugins/badlinks.py, plugins/external.py, plugins/images.py, plugins/notchkd.py, plugins/notitles.py, plugins/problems.py, plugins/rptlib.py, plugins/sitemap.py, plugins/slow.py, plugins/whatsnew.py, plugins/whatsold.py, robotparser.py, schemes, schemes/__init__.py, schemes/filelink.py, schemes/ftplink.py, schemes/httplink.py, version.py, webcheck.css, webcheck.py, webcheck.sh: import of release 1.0 2005-03-28 12:57 arthur * [r1] .: create webcheck directory