Arthur de Jong

Open Source / Free Software developer

summaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeFilesLines
* get files ready for 1.9.7 release1.9.7Arthur de Jong2006-07-026-26/+442
* always keep navigation on topArthur de Jong2006-06-291-0/+2
* store internal, external and yanked regular expressions i...Arthur de Jong2006-06-242-12/+12
* switch to using python-support and follow recent python p...Arthur de Jong2006-06-233-3/+7
* split Build-Depends-Indep into Build-Depends and Build-De...Arthur de Jong2006-06-051-1/+2
* also install favicon.ico in deb package (plus cosmetic fix)Arthur de Jong2006-06-051-1/+2
* fix typos and fix example explanationArthur de Jong2006-06-041-3/+4
* do not split list of strings on comma's inside the quoted...Arthur de Jong2006-06-041-2/+4
* make DeSerializeException a class instead of a function a...Arthur de Jong2006-06-041-1/+2
* add --continue option to resume the crawling from the poi...Arthur de Jong2006-06-043-4/+40
* handle break signals in all codeArthur de Jong2006-06-021-6/+6
* add code to serialize crawled data during crawl and again...Arthur de Jong2006-06-021-1/+12
* raise a custom exception instead of IOErrorArthur de Jong2006-06-021-9/+11
* add TODOsArthur de Jong2006-05-311-0/+2
* upgrade to standards-version 3.7.2 (no changes needed)Arthur de Jong2006-05-311-1/+1
* update feature list from deb package descriptionArthur de Jong2006-05-311-2/+3
* split crawler.crawl() function into crawler.crawl() and c...Arthur de Jong2006-05-163-7/+12
* also serialize remaining links after crawlArthur de Jong2006-05-161-0/+8
* remove anchor debugging statementsArthur de Jong2006-05-161-2/+0
* flag deserialized links as changed so they will be reseri...Arthur de Jong2006-05-161-0/+1
* fix sortingArthur de Jong2006-05-161-1/+1
* update link to fancytooltipsArthur de Jong2006-05-162-2/+2
* add makebackup option to open_file() so we can implement ...Arthur de Jong2006-05-151-10/+15
* fix some stupid typosArthur de Jong2006-05-151-3/+3
* add code to serialize links to a file while crawling the ...Arthur de Jong2006-05-151-2/+16
* import crawler late as to simplify dependenciesArthur de Jong2006-05-151-1/+1
* fix typo in FIXMEArthur de Jong2006-05-151-3/+3
* add _ischanged attribute to link objects to indicate chan...Arthur de Jong2006-05-151-0/+10
* only write serialized data if it is different from the co...Arthur de Jong2006-05-151-10/+20
* clear anchors, linkproblems and pageproblems from to be d...Arthur de Jong2006-05-151-0/+4
* remove the call to crawl() from deserialize as this could...Arthur de Jong2006-05-151-3/+3
* make decoding try/fall-back code a lot simpler and handle...Arthur de Jong2006-05-151-12/+7
* improve warning text and add comment concerning trying of...Arthur de Jong2006-05-121-1/+2
* ignore unknown entities instead of throwing an errorArthur de Jong2006-05-121-2/+5
* include favicon.ico file in generated reportArthur de Jong2006-05-073-0/+3
* ensure that we are not importing anything weird by using ...Arthur de Jong2006-05-071-0/+9
* support floats as parameter for --waitArthur de Jong2006-05-071-1/+1
* fix usage of dashArthur de Jong2006-05-071-1/+1
* add serialize module that allows serializing and deserial...Arthur de Jong2006-05-071-0/+313
* fix typo in docstring and add commentArthur de Jong2006-05-071-1/+2
* move html escaping and unescaping functions to parsers.htmlArthur de Jong2006-05-072-36/+55
* use unichr() to generate Unicode characters, not chr()Arthur de Jong2006-05-071-1/+1
* return None explicitlyArthur de Jong2006-05-071-1/+1
* some more small code improvements thanks to pycheckerArthur de Jong2006-05-075-4/+11
* implement checking for id and name tags in anchorsArthur de Jong2006-05-061-12/+39
* bump copyright noticesArthur de Jong2006-05-063-3/+3
* also add all unfetched links from a site to make this met...Arthur de Jong2006-04-271-0/+5
* make get_link() function a public class functionArthur de Jong2006-04-271-5/+5
* move URL checking bit to right function and improve ancho...Arthur de Jong2006-04-271-5/+5
* fix remaining references to escape instead of htmlescapeArthur de Jong2006-04-271-7/+7