Hobbies『Favourite』 Archives

秀人集 Crawler [Updated] 【23.05.13】【Windows】

May 13, 2023 · 76 comments

C:\Users\obaby>F:\Pycharm_Projects\meitulu-spider\dist\xiurenji2\xiurenji2.exe
****************************************************************************************************
秀人集 Crawler [Updated]
Verson: 23.05.13
Current server address: https://www.xiuren5.vip
Blog: http://oba.by
How do you like big sister's domain above? Say it's bad and you don't get to use it!! Hmph!!
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search>
Arguments:
         -a <download all site images>
         -q <query the image with keywords>
         -h <display help text, just this>
Option Arguments:
         -p <image download path>
         -r <random index category list>
         -c <single category url>
         -e <early stop, work in site crawl mode only>
         -s <site url eg: http://www.xiurenji.vip (no last backslash "/")>
****************************************************************************************************
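
The help text above lists the flags but not how they are parsed. As a rough sketch only (this is not the author's code), a command line with the same flags could be modelled with `argparse` as below. The `dest` names, the choice to treat `-a`, `-r` and `-e` as plain switches, and the trailing-slash stripping for `-s` are all assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build a parser for a hypothetical CLI mirroring the flags in the help text."""
    # argparse adds -h/--help on its own, so it is not declared here.
    parser = argparse.ArgumentParser(prog="spider")
    parser.add_argument("-a", dest="all", action="store_true",
                        help="download all site images")
    parser.add_argument("-q", dest="query", metavar="KEYWORDS",
                        help="query images by keyword")
    parser.add_argument("-p", dest="path", metavar="DIR",
                        help="image download path")
    # Assumption: -r and -e take no value and act as on/off switches.
    parser.add_argument("-r", dest="random", action="store_true",
                        help="random index category list")
    parser.add_argument("-c", dest="category", metavar="URL",
                        help="single category url")
    parser.add_argument("-e", dest="early_stop", action="store_true",
                        help="early stop (site crawl mode only)")
    # The help text asks for a site URL without a trailing "/";
    # this sketch strips it instead of rejecting the input.
    parser.add_argument("-s", dest="site", metavar="URL",
                        type=lambda url: url.rstrip("/"),
                        help="site url, stored without a trailing slash")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

Stripping the slash in the parser keeps later URL joins from producing a double slash, whichever form the user types.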


National Statistical Division Codes and Urban-Rural Classification Codes, 2022 Data [Crawler]【JSON+CSV Format】

April 21, 2023 · 5 comments



精品美女吧 Crawler【Windows】【23.04.16】

April 16, 2023 · 33 comments

精品美女吧 Crawler
Verson: 23.04.16
Blog: http://www.h4ck.org.cn
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
         -a <download all site images>
         -h <display help text, just this>
Option Arguments:
         -p <image download path>
         -r <random index category list>
         -c <single category url>
         -e <early stop, work in site crawl mode only>
         -s <site url eg: https://www.jpxgmn.net (no last backslash "/")>
****************************************************************************************************


requests SSLCertVerificationError

April 16, 2023 · 1 comment

Traceback (most recent call last):
  File "requests\adapters.py", line 439, in send
  File "urllib3\connectionpool.py", line 785, in urlopen
  File "urllib3\util\retry.py", line 592, in increment
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='www.jpmn8.cc', port=443): Max retries exceeded with url: / (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1124)')))
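
In the traceback above the real cause, `SSLCertVerificationError`, is buried inside `SSLError` and `MaxRetryError`. The sketch below uses only the standard library to show one way of detecting that nested cause and supplying an explicit CA bundle. It is an illustration, not the fix used in the post: the helper names `is_cert_verify_failure` and `fetch` and the `cafile` argument are hypothetical. In `requests`, the equivalent knob is the `verify` parameter, which accepts a path to a CA bundle.

```python
import ssl
import urllib.error
import urllib.request


def is_cert_verify_failure(exc):
    """Return True if a certificate verification error is nested anywhere in exc."""
    seen = set()
    stack = [exc]
    while stack:
        current = stack.pop()
        if id(current) in seen:
            continue
        seen.add(id(current))
        if isinstance(current, ssl.SSLCertVerificationError):
            return True
        # Wrappers keep the original error in different places:
        # __cause__/__context__, a .reason attribute, or the args tuple.
        candidates = [current.__cause__, current.__context__,
                      getattr(current, "reason", None), *current.args]
        stack.extend(c for c in candidates if isinstance(c, BaseException))
    return False


def fetch(url, cafile=None, timeout=10):
    """Fetch url, verifying the server against cafile (system defaults if None)."""
    context = ssl.create_default_context(cafile=cafile)
    try:
        with urllib.request.urlopen(url, context=context, timeout=timeout) as resp:
            return resp.read()
    except urllib.error.URLError as exc:
        if is_cert_verify_failure(exc):
            raise RuntimeError(
                f"certificate verification failed for {url}; "
                "try passing a CA bundle via cafile"
            ) from exc
        raise
```

Pointing the client at a known CA bundle keeps verification on, which is generally preferable to disabling it.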


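爱看美女网 Crawler【Synology Docker】【23.03.02】

March 16, 2023 · 8 comments

Compiling .py files with pyinstaller on a Synology NAS runs into all sorts of problems. The first is that binutils is missing; to install that toolkit you first need the ipkg package manager. Once you have determined the system's processor architecture, you can install the matching package manager with the following commands: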

wget http://ipkg.nslu2-linux.org/feeds/optware/syno-i686/cross/stable/syno-i686-bootstrap_1.2-7_i686.xsh
chmod +x syno-i686-bootstrap_1.2-7_i686.xsh
sh syno-i686-bootstrap_1.2-7_i686.xsh

Once the installation finishes, you can manage packages with ipkg:

ipkg install binutils
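
Before running a build it can help to confirm that the tools are actually reachable. The sketch below is an assumption-laden illustration rather than part of the original post: it reports the machine architecture and checks `PATH` for `ldd`, `objdump` and `objcopy`, which to my knowledge are the tools PyInstaller expects on Linux (the last two ship with binutils). The function name `missing_build_tools` is hypothetical.

```python
import platform
import shutil


def missing_build_tools(tools=("ldd", "objdump", "objcopy"), which=shutil.which):
    """Return the names in `tools` that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    # The architecture decides which ipkg bootstrap script applies.
    print("architecture:", platform.machine())
    missing = missing_build_tools()
    if missing:
        print("missing tools:", ", ".join(missing))
    else:
        print("all build tools found")
```

The `which` parameter exists only so the lookup can be swapped out in tests; in normal use the default `shutil.which` is enough.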


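Image Cleanup【Broken Images】

March 14, 2023 · 27 comments

Because there are quite a few crawlers, some of them do not handle network problems or broken image links while downloading, which leaves broken images behind. Some of them do handle it. Don't ask why the rest have no error checking; the answer is laziness.

Downloaded images can show the problem below. It is already visible in the preview, and opening one of these "images" actually shows a 404 or 502 error page. So I wrote a bit of cleanup code with two main functions:

1. Delete small files; how small is up to you, adjust the code yourself
2. If every file in a directory is bad, delete the directory as well after deleting the files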
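
The two steps above can be sketched as follows. This is a minimal illustration, not the author's script: the function name `remove_small_files` and the 10 KB default threshold are assumptions, and the threshold should be tuned to the images being cleaned.

```python
import os


def remove_small_files(root, min_bytes=10 * 1024):
    """Delete files under root smaller than min_bytes, then drop emptied folders.

    Returns (removed_files, removed_dirs) as lists of paths.
    """
    removed_files, removed_dirs = [], []
    # Walk bottom-up so each directory is inspected after its subdirectories.
    for dirpath, _dirnames, filenames in os.walk(root, topdown=False):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.getsize(path) < min_bytes:
                os.remove(path)
                removed_files.append(path)
        # An empty directory means nothing usable is left in it.
        # Note: directories that were already empty are removed as well.
        if dirpath != root and not os.listdir(dirpath):
            os.rmdir(dirpath)
            removed_dirs.append(dirpath)
    return removed_files, removed_dirs
```

Size alone is a crude test: an error page saved with an image extension is usually far smaller than a real photo, but a check on the file's content would be more reliable.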
