Arthur de Jong
Open Source / Free Software developer
index
:
webcheck
master
A website link and structure checker
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
upgrade to standards-version 3.7.2 (no changes needed)
Arthur de Jong
2006-05-31
1
-1
/
+1
*
update feature list from deb package description
Arthur de Jong
2006-05-31
1
-2
/
+3
*
split crawler.crawl() function into crawler.crawl() and c...
Arthur de Jong
2006-05-16
3
-7
/
+12
*
also serialize remaining links after crawl
Arthur de Jong
2006-05-16
1
-0
/
+8
*
remove anchor debugging statements
Arthur de Jong
2006-05-16
1
-2
/
+0
*
flag deserialized links as changed so they will be reseri...
Arthur de Jong
2006-05-16
1
-0
/
+1
*
fix sorting
Arthur de Jong
2006-05-16
1
-1
/
+1
*
update link to fancytooltips
Arthur de Jong
2006-05-16
2
-2
/
+2
*
add makebackup option to open_file() so we can implement ...
Arthur de Jong
2006-05-15
1
-10
/
+15
*
fix some stupid typos
Arthur de Jong
2006-05-15
1
-3
/
+3
*
add code to serialize links to a file while crawling the ...
Arthur de Jong
2006-05-15
1
-2
/
+16
*
import crawler late as to simplify dependencies
Arthur de Jong
2006-05-15
1
-1
/
+1
*
fix typo in FIXME
Arthur de Jong
2006-05-15
1
-3
/
+3
*
add _ischanged attribute to link objects to indicate chan...
Arthur de Jong
2006-05-15
1
-0
/
+10
*
only write serialized data if it is different from the co...
Arthur de Jong
2006-05-15
1
-10
/
+20
*
clear anchors, linkproblems and pageproblems from to be d...
Arthur de Jong
2006-05-15
1
-0
/
+4
*
remove the call to crawl() from deserialize as this could...
Arthur de Jong
2006-05-15
1
-3
/
+3
*
make decoding try/fall-back code a lot simpler and handle...
Arthur de Jong
2006-05-15
1
-12
/
+7
*
improve warning text and add comment concerning trying of...
Arthur de Jong
2006-05-12
1
-1
/
+2
*
ignore unknown entities instead of throwing an error
Arthur de Jong
2006-05-12
1
-2
/
+5
*
include favicon.ico file in generated report
Arthur de Jong
2006-05-07
3
-0
/
+3
*
ensure that we are not importing anything weird by using ...
Arthur de Jong
2006-05-07
1
-0
/
+9
*
support floats as parameter for --wait
Arthur de Jong
2006-05-07
1
-1
/
+1
*
fix usage of dash
Arthur de Jong
2006-05-07
1
-1
/
+1
*
add serialize module that allows serializing and deserial...
Arthur de Jong
2006-05-07
1
-0
/
+313
*
fix typo in docstring and add comment
Arthur de Jong
2006-05-07
1
-1
/
+2
*
move html escaping and unescaping functions to parsers.html
Arthur de Jong
2006-05-07
2
-36
/
+55
*
use unichr() to generate Unicode characters, not chr()
Arthur de Jong
2006-05-07
1
-1
/
+1
*
return None explicitly
Arthur de Jong
2006-05-07
1
-1
/
+1
*
some more small code improvements thanks to pychecker
Arthur de Jong
2006-05-07
5
-4
/
+11
*
implement checking for id and name tags in anchors
Arthur de Jong
2006-05-06
1
-12
/
+39
*
bump copyright notices
Arthur de Jong
2006-05-06
3
-3
/
+3
*
also add all unfetched links from a site to make this met...
Arthur de Jong
2006-04-27
1
-0
/
+5
*
make get_link() function a public class function
Arthur de Jong
2006-04-27
1
-5
/
+5
*
move URL checking bit to right function and improve ancho...
Arthur de Jong
2006-04-27
1
-5
/
+5
*
fix remaining references to escape instead of htmlescape
Arthur de Jong
2006-04-27
1
-7
/
+7
*
support passing a URL to add_reqanchor() plus some minor ...
Arthur de Jong
2006-04-27
1
-3
/
+7
*
handle problems in regular expressions passed on the comm...
Arthur de Jong
2006-04-27
1
-39
/
+43
*
rename escape() function to htmlescape() to make it a lit...
Arthur de Jong
2006-04-23
4
-10
/
+10
*
code improvements thanks to pylint
Arthur de Jong
2006-04-23
27
-372
/
+460
*
also sort parent list by URL if titles are the same
Arthur de Jong
2006-04-23
1
-1
/
+1
*
also properly handle time-out problems which only pass on...
Arthur de Jong
2006-04-23
1
-3
/
+7
*
implement a time-out setting with a default of 10 seconds
Arthur de Jong
2006-04-11
2
-0
/
+7
*
revert to borderless links as they look ugly in some (mos...
Arthur de Jong
2006-04-11
1
-2
/
+0
*
rename slow plugin to size
Arthur de Jong
2006-04-11
2
-10
/
+13
*
do not fail on unknown encodings (fall back to system enc...
Arthur de Jong
2006-04-07
1
-3
/
+6
*
split urlescape() from _urlclean() and ensure that all an...
Arthur de Jong
2006-03-26
2
-6
/
+14
*
only report missing anchors for pages that were fetched a...
Arthur de Jong
2006-03-26
1
-6
/
+6
*
put a border around links
Arthur de Jong
2006-03-26
1
-4
/
+6
*
properly close html files on no output
Arthur de Jong
2006-03-26
9
-0
/
+9
[prev]
[next]