Arthur de Jong

Open Source / Free Software developer

summaryrefslogtreecommitdiffstats
path: root/schemes
Commit message (Expand)AuthorAgeFilesLines
* do not handle control-C and pass it along to the main exc...Arthur de Jong2007-07-071-1/+4
* fix some copyright datesArthur de Jong2007-05-121-1/+1
* add workaround for bug in idna moduleArthur de Jong2007-04-061-0/+5
* catch any exception in HTTP module and report is as a lin...Arthur de Jong2007-01-151-0/+7
* explicitly transform username and password to string in c...Arthur de Jong2006-09-041-2/+2
* also handle SSL related socket errors (e.g. SSL time-out)Arthur de Jong2006-07-131-1/+1
* add set_encoding method to Link object to do some basic e...Arthur de Jong2006-07-131-1/+1
* ensure that we are not importing anything weird by using ...Arthur de Jong2006-05-071-0/+9
* return None explicitlyArthur de Jong2006-05-071-1/+1
* some more small code improvements thanks to pycheckerArthur de Jong2006-05-072-2/+7
* bump copyright noticesArthur de Jong2006-05-062-2/+2
* code improvements thanks to pylintArthur de Jong2006-04-235-58/+66
* also properly handle time-out problems which only pass on...Arthur de Jong2006-04-231-3/+7
* implement a time-out setting with a default of 10 secondsArthur de Jong2006-04-111-0/+3
* add some more debugging information (cache hit or miss)Arthur de Jong2006-01-291-1/+11
* trim empty ports (http://host:/) from URLs and do not cra...Arthur de Jong2005-12-291-1/+1
* catch all relevant exceptions when looking up content-typ...Arthur de Jong2005-12-261-1/+1
* add copyright clarification to specify that generated out...Arthur de Jong2005-12-175-0/+15
* remove trailing : from netloc if it is presentArthur de Jong2005-12-171-0/+3
* add configuration option to disable proxy cachingArthur de Jong2005-09-181-0/+4
* try to extract character encoding from http response and ...Arthur de Jong2005-09-171-0/+8
* support basic authentication for http proxies and some in...Arthur de Jong2005-09-131-5/+11
* fix wrapping of documentationArthur de Jong2005-09-101-4/+5
* set status to result of fetching the document (not an err...Arthur de Jong2005-08-201-1/+2
* move redirect handling code to crawler module, including ...Arthur de Jong2005-08-193-19/+4
* split problems into page problems (parsing errors, wrong ...Arthur de Jong2005-08-193-10/+11
* pick up configured filenames if present in directoriesArthur de Jong2005-08-162-46/+66
* add extra debugging infoArthur de Jong2005-08-161-8/+15
* use a pool of ftp connections to keep ftp connection to a...Arthur de Jong2005-08-131-18/+25
* almost complete reimplementation of the ftp scheme, handl...Arthur de Jong2005-08-131-62/+64
* complete reimplementation of file module, reading index.h...Arthur de Jong2005-08-121-21/+49
* rename parameter to acceptedtypes to not conflict with mi...Arthur de Jong2005-08-124-6/+6
* also pass mimetypes to scheme modules to only fetch conte...Arthur de Jong2005-08-124-6/+8
* add https module as a wrapper to the http moduleArthur de Jong2005-07-311-0/+26
* reimplement http module to be a little more generic and c...Arthur de Jong2005-07-301-97/+91
* remove references to email addresses where they are not u...Arthur de Jong2005-07-294-10/+10
* handle socket errors properlyArthur de Jong2005-07-241-1/+6
* fix for incomplete change in r76, now version should not ...Arthur de Jong2005-07-241-1/+1
* integrate versio.py into config.py, clean up config.py re...Arthur de Jong2005-07-231-2/+1
* most systems already know about .shtml filesArthur de Jong2005-07-231-4/+1
* Mike Meyer -> Mike W. MeyerArthur de Jong2005-07-233-3/+3
* almost complete rewrite of crawling and site state code m...Arthur de Jong2005-07-224-70/+74
* use lower-case URL attribute in Link instead of upper-cas...Arthur de Jong2005-07-173-13/+13
* rework scheme code to use more logical function names, mo...Arthur de Jong2005-07-104-162/+121
* store mtime in link object instead of age in daysArthur de Jong2005-07-102-2/+3
* remove unneeded import and printArthur de Jong2005-07-101-1/+0
* handle and document proxy settings with environment varia...Arthur de Jong2005-07-031-6/+3
* name webcheck with lower caseArthur de Jong2005-07-031-2/+2
* clean up get_reply() function to uses proper recursion an...Arthur de Jong2005-06-281-23/+16
* change to most recent version of the GPL (FSF address cha...Arthur de Jong2005-06-223-3/+3