Arthur de Jong

Open Source / Free Software developer

summaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeFilesLines
* get files ready for 1.10.0 release1.10.0Arthur de Jong2007-05-128-18/+154
* also lower-case reqanchorArthur de Jong2007-05-121-0/+2
* fix some copyright datesArthur de Jong2007-05-125-5/+5
* switch robots.txt handling to default on again (broken in...Arthur de Jong2007-05-123-2/+15
* present the default number of redirectsArthur de Jong2007-05-091-2/+3
* update copyright informationArthur de Jong2007-05-081-2/+2
* fixes to make output XHTML 1.1 compliantArthur de Jong2007-04-243-8/+20
* handle ID attribute as anchor on any tagArthur de Jong2007-04-241-5/+5
* lower-case anchor and errors to include id as optionArthur de Jong2007-04-242-2/+4
* correctly parse author informationArthur de Jong2007-04-201-2/+2
* introduce HTML parsing using BeautifulSoup with a fall-ba...Arthur de Jong2007-04-204-64/+256
* mark encoding problems and output more debuggingArthur de Jong2007-04-201-2/+2
* fix formatting of previous changelog entryArthur de Jong2007-04-201-3/+2
* fix typoArthur de Jong2007-04-201-1/+1
* add workaround for bug in idna moduleArthur de Jong2007-04-061-0/+5
* add some comments to the follow_link() methodArthur de Jong2007-04-061-0/+4
* make parsing of URLs and conversion to Link objects a lit...Arthur de Jong2007-04-061-9/+28
* use consistent Unicode conversionArthur de Jong2007-04-061-8/+14
* document the fact that --force should be used for non-int...Arthur de Jong2007-04-061-1/+2
* bail out if reading user input failedArthur de Jong2007-04-061-1/+6
* evaluate archive attribute of <applet> tag instead of cod...Arthur de Jong2007-03-311-2/+5
* get rid of old base (singular) as bases is now used every...Arthur de Jong2007-03-141-3/+0
* clean up a little and simplifyArthur de Jong2007-03-101-8/+3
* get files ready for 1.9.8 release1.9.8Arthur de Jong2007-01-158-127/+213
* catch any exception in HTTP module and report is as a lin...Arthur de Jong2007-01-151-0/+7
* move section on webcheck design into HACKING documentArthur de Jong2007-01-153-30/+28
* fix the bugreporting section to more clearly state the ne...Arthur de Jong2007-01-151-6/+16
* switch to using latest syntax of python-supportArthur de Jong2007-01-133-3/+1
* present sitemap with all basesArthur de Jong2006-10-231-1/+5
* add USE_ROBOTS optionArthur de Jong2006-10-231-0/+4
* include list of bases in Site classArthur de Jong2006-10-231-10/+13
* print reason why link is yanked if availableArthur de Jong2006-10-231-1/+6
* include link to homepage in package descriptionArthur de Jong2006-09-291-0/+2
* explicitly transform username and password to string in c...Arthur de Jong2006-09-041-2/+2
* also handle SSL related socket errors (e.g. SSL time-out)Arthur de Jong2006-07-131-1/+1
* add set_encoding method to Link object to do some basic e...Arthur de Jong2006-07-133-14/+23
* get files ready for 1.9.7 release1.9.7Arthur de Jong2006-07-026-26/+442
* always keep navigation on topArthur de Jong2006-06-291-0/+2
* store internal, external and yanked regular expressions i...Arthur de Jong2006-06-242-12/+12
* switch to using python-support and follow recent python p...Arthur de Jong2006-06-233-3/+7
* split Build-Depends-Indep into Build-Depends and Build-De...Arthur de Jong2006-06-051-1/+2
* also install favicon.ico in deb package (plus cosmetic fix)Arthur de Jong2006-06-051-1/+2
* fix typos and fix example explanationArthur de Jong2006-06-041-3/+4
* do not split list of strings on comma's inside the quoted...Arthur de Jong2006-06-041-2/+4
* make DeSerializeException a class instead of a function a...Arthur de Jong2006-06-041-1/+2
* add --continue option to resume the crawling from the poi...Arthur de Jong2006-06-043-4/+40
* handle break signals in all codeArthur de Jong2006-06-021-6/+6
* add code to serialize crawled data during crawl and again...Arthur de Jong2006-06-021-1/+12
* raise a custom exception instead of IOErrorArthur de Jong2006-06-021-9/+11
* add TODOsArthur de Jong2006-05-311-0/+2