Arthur de Jong

Open Source / Free Software developer

path: root/parsers/html/beautifulsoup.py
Commit message | Author | Age | Files | Lines
* also handle http-equiv refresh meta header | Arthur de Jong | 2007-07-15 | 1 | -3/+13
  git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@349 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* split out URL cleaning code into own module | Arthur de Jong | 2007-07-07 | 1 | -15/+16
  git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@339 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* handle ID attribute as anchor on any tag | Arthur de Jong | 2007-04-24 | 1 | -5/+5
  git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@326 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* correctly parse author information | Arthur de Jong | 2007-04-20 | 1 | -2/+2
  git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@324 86f53f14-5ff3-0310-afe5-9b438ce3f40c
* introduce HTML parsing using BeautifulSoup with a fall-back mechanism to the old HTMLParser based solution | Arthur de Jong | 2007-04-20 | 1 | -0/+137
  git-svn-id: http://arthurdejong.org/svn/webcheck/webcheck@323 86f53f14-5ff3-0310-afe5-9b438ce3f40c