報(bào)錯(cuò)1
xpath語(yǔ)句:
contents = response.xpath("(//div[@id='article_content']/text()) | (//div[@id='article_content']/br) | (//div[@id='article_content']/p/img)").extract()
報(bào)錯(cuò):
ValueError: All strings must be XML compatible: Unicode or ASCII, no NULL bytes or control characters
解決辦法:
contents = response.xpath(u"(//div[@id='article_content']/text()) | (//div[@id='article_content']/br)|(//div[@id='article_content']/p/img)").extract()