Arthur de Jong
Open Source / Free Software developer
webcheck
master
A website link and structure checker
path: root/schemes
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| catch any exception in HTTP module and report is as a lin... | Arthur de Jong | 2007-01-15 | 1 | -0/+7 |
| explicitly transform username and password to string in c... | Arthur de Jong | 2006-09-04 | 1 | -2/+2 |
| also handle SSL related socket errors (e.g. SSL time-out) | Arthur de Jong | 2006-07-13 | 1 | -1/+1 |
| add set_encoding method to Link object to do some basic e... | Arthur de Jong | 2006-07-13 | 1 | -1/+1 |
| ensure that we are not importing anything weird by using ... | Arthur de Jong | 2006-05-07 | 1 | -0/+9 |
| return None explicitly | Arthur de Jong | 2006-05-07 | 1 | -1/+1 |
| some more small code improvements thanks to pychecker | Arthur de Jong | 2006-05-07 | 2 | -2/+7 |
| bump copyright notices | Arthur de Jong | 2006-05-06 | 2 | -2/+2 |
| code improvements thanks to pylint | Arthur de Jong | 2006-04-23 | 5 | -58/+66 |
| also properly handle time-out problems which only pass on... | Arthur de Jong | 2006-04-23 | 1 | -3/+7 |
| implement a time-out setting with a default of 10 seconds | Arthur de Jong | 2006-04-11 | 1 | -0/+3 |
| add some more debugging information (cache hit or miss) | Arthur de Jong | 2006-01-29 | 1 | -1/+11 |
| trim empty ports (http://host:/) from URLs and do not cra... | Arthur de Jong | 2005-12-29 | 1 | -1/+1 |
| catch all relevant exceptions when looking up content-typ... | Arthur de Jong | 2005-12-26 | 1 | -1/+1 |
| add copyright clarification to specify that generated out... | Arthur de Jong | 2005-12-17 | 5 | -0/+15 |
| remove trailing : from netloc if it is present | Arthur de Jong | 2005-12-17 | 1 | -0/+3 |
| add configuration option to disable proxy caching | Arthur de Jong | 2005-09-18 | 1 | -0/+4 |
| try to extract character encoding from http response and ... | Arthur de Jong | 2005-09-17 | 1 | -0/+8 |
| support basic authentication for http proxies and some in... | Arthur de Jong | 2005-09-13 | 1 | -5/+11 |
| fix wrapping of documentation | Arthur de Jong | 2005-09-10 | 1 | -4/+5 |
| set status to result of fetching the document (not an err... | Arthur de Jong | 2005-08-20 | 1 | -1/+2 |
| move redirect handling code to crawler module, including ... | Arthur de Jong | 2005-08-19 | 3 | -19/+4 |
| split problems into page problems (parsing errors, wrong ... | Arthur de Jong | 2005-08-19 | 3 | -10/+11 |
| pick up configured filenames if present in directories | Arthur de Jong | 2005-08-16 | 2 | -46/+66 |
| add extra debugging info | Arthur de Jong | 2005-08-16 | 1 | -8/+15 |
| use a pool of ftp connections to keep ftp connection to a... | Arthur de Jong | 2005-08-13 | 1 | -18/+25 |
| almost complete reimplementation of the ftp scheme, handl... | Arthur de Jong | 2005-08-13 | 1 | -62/+64 |
| complete reimplementation of file module, reading index.h... | Arthur de Jong | 2005-08-12 | 1 | -21/+49 |
| rename parameter to acceptedtypes to not conflict with mi... | Arthur de Jong | 2005-08-12 | 4 | -6/+6 |
| also pass mimetypes to scheme modules to only fetch conte... | Arthur de Jong | 2005-08-12 | 4 | -6/+8 |
| add https module as a wrapper to the http module | Arthur de Jong | 2005-07-31 | 1 | -0/+26 |
| reimplement http module to be a little more generic and c... | Arthur de Jong | 2005-07-30 | 1 | -97/+91 |
| remove references to email addresses where they are not u... | Arthur de Jong | 2005-07-29 | 4 | -10/+10 |
| handle socket errors properly | Arthur de Jong | 2005-07-24 | 1 | -1/+6 |
| fix for incomplete change in r76, now version should not ... | Arthur de Jong | 2005-07-24 | 1 | -1/+1 |
| integrate versio.py into config.py, clean up config.py re... | Arthur de Jong | 2005-07-23 | 1 | -2/+1 |
| most systems already know about .shtml files | Arthur de Jong | 2005-07-23 | 1 | -4/+1 |
| Mike Meyer -> Mike W. Meyer | Arthur de Jong | 2005-07-23 | 3 | -3/+3 |
| almost complete rewrite of crawling and site state code m... | Arthur de Jong | 2005-07-22 | 4 | -70/+74 |
| use lower-case URL attribute in Link instead of upper-cas... | Arthur de Jong | 2005-07-17 | 3 | -13/+13 |
| rework scheme code to use more logical function names, mo... | Arthur de Jong | 2005-07-10 | 4 | -162/+121 |
| store mtime in link object instead of age in days | Arthur de Jong | 2005-07-10 | 2 | -2/+3 |
| remove unneeded import and print | Arthur de Jong | 2005-07-10 | 1 | -1/+0 |
| handle and document proxy settings with environment varia... | Arthur de Jong | 2005-07-03 | 1 | -6/+3 |
| name webcheck with lower case | Arthur de Jong | 2005-07-03 | 1 | -2/+2 |
| clean up get_reply() function to uses proper recursion an... | Arthur de Jong | 2005-06-28 | 1 | -23/+16 |
| change to most recent version of the GPL (FSF address cha... | Arthur de Jong | 2005-06-22 | 3 | -3/+3 |
| pass reference to Link class to plugins with parameter an... | Arthur de Jong | 2005-06-15 | 1 | -1/+1 |
| claiming copyright on empty files is silly | Arthur de Jong | 2005-06-08 | 1 | -17/+0 |
| redo output writing using a cleaner debugio and change de... | Arthur de Jong | 2005-06-06 | 2 | -16/+15 |