• Show log

    Commit

  • Hash : 868d92da
    Author : Denis Pauk
    Date : 2012-05-10T15:34:57

    Add HTML parser support for HTML5 meta charset encoding declaration
    
    For https://bugzilla.gnome.org/show_bug.cgi?id=655218
    
    http://www.w3.org/TR/2011/WD-html5-20110525/semantics.html#the-meta-element
    
    """
    The charset attribute specifies the character encoding used by the document.
    This is a character encoding declaration. If the attribute is present in an XML
    document, its value must be an ASCII case-insensitive match for the string
    "UTF-8" (and the document is therefore forced to use UTF-8 as its
    encoding).
    """
    
    However, while <meta http-equiv="Content-Type" content="text/html;
    charset=utf8"> works, <meta charset="utf8"> does not.
    
    While libxml2 HTML parser is not tuned for HTML5, this is a simple
    addition
    
    Also added a testcase