修复teleport软件中文爬取网站的错误。 http://blog.yoqi.me/?p=4081
|
|
7 years ago | |
|---|---|---|
| .vscode | 7 years ago | |
| src | 7 years ago | |
| .classpath | 7 years ago | |
| .gitignore | 7 years ago | |
| .project | 7 years ago | |
| LICENSE | 8 years ago | |
| README.md | 7 years ago | |
| covert.php | 8 years ago | |
| js_convert.py | 7 years ago | |
| pom.xml | 7 years ago |
修复teleport软件中文爬取网站的错误。
| 查找 | 替换 |
|---|---|
| /\*tpa=.*\*/ | |
| \btppabs="h[^"]*"或者tppabs="h[^"]*" | |
| href="javascript:if\(confirm\('htt[^"]*" | href=www.xxx.com |
| href=" *javascript:if\(confirm\('(htt[^"\s]*).*?" | href="$1" |
| utf-8"utf-8" | utf-8 |
| css文件: | |
| tpa=http://[^\s]*.gif | |
| /\*tpa.*?\*/ |
中文乱码,使用工具:
http://others.yoqi.me/convert.php
##20171223更新
批量更改为:
href="http://www.beian.gov.cn/portal/registerSystemInfo?recordcode=31011502004838"