【Question title】: php remove invalid chars
【Posted】: 2012-05-17 18:39:39
【Question description】:

I get the following warnings when calling $dom->loadHTML('<?xml version="1.0" encoding="UTF-8"?>' . $html);

Warning: DOMDocument::loadHTML() [domdocument.loadhtml]: Char 0xD860 out of allowed range in Entity, line: 1 in D:\xampp\xampp\htdocs\xampp\similarity\functions.php on line 438
Warning: DOMDocument::loadHTML() [domdocument.loadhtml]: Char 0xDEE2 out of allowed range in Entity, line: 1 in D:\xampp\xampp\htdocs\xampp\similarity\functions.php on line 438
Warning: DOMDocument::loadHTML() [domdocument.loadhtml]: Invalid char in CDATA 0x3 in Entity, line: 1 in D:\xampp\xampp\htdocs\xampp\similarity\functions.php on line 438

How can I locate and remove these "invalid" characters with PHP?

安德烈

【Comments】:

Tags: php regex dom character


【Solution 1】:

Untested, but this should work (it requires the tidy extension):

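// Let tidy repair the markup first, treating input and output as UTF-8.
$tidy = new tidy();
$myHTML = $tidy->repairString('<?xml version="1.0" encoding="UTF-8"?>' . $html, array(), 'utf8');

// Then load the repaired markup as before.
$dom = new DOMDocument();
$dom->loadHTML($myHTML);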
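
If tidy is not available, or the warnings persist, another option is to strip the offending characters yourself before calling loadHTML(). The following is only a minimal sketch, not part of the original answer: the function name strip_invalid_xml_chars is hypothetical, and it assumes $html is valid UTF-8. It removes raw characters outside the XML 1.0 Char range (which would cover the 0x3 control character) and numeric character references that point outside that range (which is probably where 0xD860 and 0xDEE2, two surrogate code points, come from).

```php
<?php
// Hypothetical helper: drops characters that fall outside the XML 1.0
// "Char" production. Assumes the input string is valid UTF-8.
function strip_invalid_xml_chars(string $html): string
{
    // 1. Remove raw characters that are not
    //    #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF].
    $clean = preg_replace(
        '/[^\x{9}\x{A}\x{D}\x{20}-\x{D7FF}\x{E000}-\x{FFFD}\x{10000}-\x{10FFFF}]/u',
        '',
        $html
    );
    if ($clean === null) {
        // With the /u modifier, preg_replace() returns null on malformed UTF-8.
        throw new InvalidArgumentException('Input is not valid UTF-8.');
    }

    // 2. Remove numeric character references (&#123; or &#x7B;) whose
    //    code point is outside the same range, e.g. &#xD860; or &#3;.
    return preg_replace_callback(
        '/&#(?:x([0-9a-f]+)|([0-9]+));/i',
        function (array $m): string {
            // $m[1] is non-empty only when the hexadecimal branch matched.
            $cp = $m[1] !== '' ? hexdec($m[1]) : (float) $m[2];
            $ok = $cp == 0x9 || $cp == 0xA || $cp == 0xD
                || ($cp >= 0x20 && $cp <= 0xD7FF)
                || ($cp >= 0xE000 && $cp <= 0xFFFD)
                || ($cp >= 0x10000 && $cp <= 0x10FFFF);
            return $ok ? $m[0] : '';
        },
        $clean
    );
}
```

It would be used as $dom->loadHTML('<?xml version="1.0" encoding="UTF-8"?>' . strip_invalid_xml_chars($html)); Note that this simply discards the bad characters. A surrogate pair written as two references, such as the 0xD860 and 0xDEE2 above, is dropped rather than recombined into the character it was meant to encode.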

【Comments】:
