Arthur de Jong
Open Source / Free Software developer
index
:
webcheck
master
A website link and structure checker
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
schemes
/
http.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
add some more debugging information (cache hit or miss)
Arthur de Jong
2006-01-29
1
-1
/
+11
*
trim empty ports (http://host:/) from URLs and do not cra...
Arthur de Jong
2005-12-29
1
-1
/
+1
*
catch all relevant exceptions when looking up content-typ...
Arthur de Jong
2005-12-26
1
-1
/
+1
*
add copyright clarification to specify that generated out...
Arthur de Jong
2005-12-17
1
-0
/
+3
*
remove trailing : from netloc if it is present
Arthur de Jong
2005-12-17
1
-0
/
+3
*
add configuration option to disable proxy caching
Arthur de Jong
2005-09-18
1
-0
/
+4
*
try to extract character encoding from http response and ...
Arthur de Jong
2005-09-17
1
-0
/
+8
*
support basic authentication for http proxies and some in...
Arthur de Jong
2005-09-13
1
-5
/
+11
*
set status to result of fetching the document (not an err...
Arthur de Jong
2005-08-20
1
-1
/
+2
*
move redirect handling code to crawler module, including ...
Arthur de Jong
2005-08-19
1
-15
/
+2
*
split problems into page problems (parsing errors, wrong ...
Arthur de Jong
2005-08-19
1
-6
/
+6
*
rename parameter to acceptedtypes to not conflict with mi...
Arthur de Jong
2005-08-12
1
-2
/
+2
*
also pass mimetypes to scheme modules to only fetch conte...
Arthur de Jong
2005-08-12
1
-2
/
+3
*
reimplement http module to be a little more generic and c...
Arthur de Jong
2005-07-30
1
-97
/
+91
*
remove references to email addresses where they are not u...
Arthur de Jong
2005-07-29
1
-3
/
+3
*
handle socket errors properly
Arthur de Jong
2005-07-24
1
-1
/
+6
*
fix for incomplete change in r76, now version should not ...
Arthur de Jong
2005-07-24
1
-1
/
+1
*
integrate versio.py into config.py, clean up config.py re...
Arthur de Jong
2005-07-23
1
-2
/
+1
*
Mike Meyer -> Mike W. Meyer
Arthur de Jong
2005-07-23
1
-1
/
+1
*
almost complete rewrite of crawling and site state code m...
Arthur de Jong
2005-07-22
1
-52
/
+58
*
use lower-case URL attribute in Link instead of upper-cas...
Arthur de Jong
2005-07-17
1
-5
/
+5
*
rework scheme code to use more logical function names, mo...
Arthur de Jong
2005-07-10
1
-61
/
+20
*
store mtime in link object instead of age in days
Arthur de Jong
2005-07-10
1
-1
/
+1
*
handle and document proxy settings with environment varia...
Arthur de Jong
2005-07-03
1
-6
/
+3
*
name webcheck with lower case
Arthur de Jong
2005-07-03
1
-2
/
+2
*
clean up get_reply() function to uses proper recursion an...
Arthur de Jong
2005-06-28
1
-23
/
+16
*
change to most recent version of the GPL (FSF address cha...
Arthur de Jong
2005-06-22
1
-1
/
+1
*
pass reference to Link class to plugins with parameter an...
Arthur de Jong
2005-06-15
1
-1
/
+1
*
redo output writing using a cleaner debugio and change de...
Arthur de Jong
2005-06-06
1
-12
/
+11
*
rename linkList to linkMap
Arthur de Jong
2005-04-13
1
-3
/
+3
*
indent with spaces instead of tabs (tabs are evil)
Arthur de Jong
2005-04-09
1
-5
/
+5
*
rebump loglevel to debug
Arthur de Jong
2005-04-08
1
-1
/
+1
*
remove link part from scheme modules
Arthur de Jong
2005-04-08
1
-0
/
+173