Arthur de Jong

Open Source / Free Software developer

path: root/serialize.py
| Commit message | Author | Date | Files | Lines |
|---|---|---|---|---|
| use sets instead of sequences for children, embedded, etc... | Arthur de Jong | 2007-07-13 | 1 | -1/+1 |
| give the matched URL a name to make code more readable | Arthur de Jong | 2007-07-13 | 1 | -1/+2 |
| be a little more verbose when raising parsing exceptions | Arthur de Jong | 2007-07-13 | 1 | -5/+5 |
| improve deserialization and handling of Unicode strings | Arthur de Jong | 2007-07-06 | 1 | -16/+13 |
| store internal, external and yanked regular expressions i... | Arthur de Jong | 2006-06-24 | 1 | -3/+3 |
| do not split list of strings on comma's inside the quoted... | Arthur de Jong | 2006-06-04 | 1 | -2/+4 |
| make DeSerializeException a class instead of a function a... | Arthur de Jong | 2006-06-04 | 1 | -1/+2 |
| raise a custom exception instead of IOError | Arthur de Jong | 2006-06-02 | 1 | -9/+11 |
| split crawler.crawl() function into crawler.crawl() and c... | Arthur de Jong | 2006-05-16 | 1 | -2/+2 |
| flag deserialized links as changed so they will be reseri... | Arthur de Jong | 2006-05-16 | 1 | -0/+1 |
| import crawler late as to simplify dependencies | Arthur de Jong | 2006-05-15 | 1 | -1/+1 |
| fix typo in FIXME | Arthur de Jong | 2006-05-15 | 1 | -3/+3 |
| only write serialized data if it is different from the co... | Arthur de Jong | 2006-05-15 | 1 | -10/+20 |
| clear anchors, linkproblems and pageproblems from to be d... | Arthur de Jong | 2006-05-15 | 1 | -0/+4 |
| remove the call to crawl() from deserialize as this could... | Arthur de Jong | 2006-05-15 | 1 | -3/+3 |
| add serialize module that allows serializing and deserial... | Arthur de Jong | 2006-05-07 | 1 | -0/+313 |