Parsing HTML

February 24, 2006

How to deal with strings that have been URI escaped mulitple times

ONLamp.com: A Canary Trap for URI Escaping

Posted by pj at 12:19 PM

October 12, 2005

More on entity codes

Index of HTML 4.0 Character Entity References

Posted by pj at 02:31 PM

September 29, 2005

HTML character entity code list

Numeric Code Character Entities

Posted by pj at 09:27 AM

May 27, 2005

More information about Python HTMLTidy wrappers

XML.com: Wrestling HTML

Posted by pj at 12:03 PM