读书人

帮忙看上html转化为xml有什么有关问题

发布时间: 2012-12-29 10:28:09 作者: rapoo

帮忙看下html转化为xml有什么问题呢?


def flawcoverRead(self):

file = open(self.inputdir)
rst = file.read()
"""</b> (buffer) <i> --> </b> <type> (buffer) </type> <i>"""
"""</i> --> </i></li>"""
xmlstr = '<ul>' + rst.split('<ul>')[1].split('</ul>')[0] + '</ul>'
xmlstr = xmlstr.replace("</b>", "</b><type>").replace("<i>", "</type><i>").replac("</i>", "</i></li>")
root = etree.fromstring(xmlstr)
xmlrst = []
for c in root:
els = []
file, line = c.text.split(':')[:-1]
els.append(file)
els.append(line)

level, ty, msg = [i.text.strip() for i in c]
level = level[1]
ty = ty[1:-1]
temp = msg.split(":")
func = temp[0]
msg = ''.join(temp[1:]).strip().replace("\n", '')

els.append(level)
els.append(ty)
els.append(func)
els.append(msg)
xmlrst.append(els)

return xmlrst


inputdir是html的输入路径,我在调用的时候有如下错误
line 23, in flawcoverRead root = etree.fromstring(xmlstr)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1301, in XML parser.feed(text)
File "C:\Python27\lib\xml\etree\ElementTree.py", line 1643, in feed self._raiseerror(v) File "C:\Python27\lib\xml\etree\ElementTree.py", line 1507, in _raiseerror raise err xml.etree.ElementTree.ParseError: mismatched tag: line 71, column 2

[解决办法]
按解释器提示的信息除错,看看xmlstr的内容吧...

读书人网 >perl python

热点推荐