环境准备

python3.8
需要用到的第三方包

- @H_696_23@requests：通过http请求获取页面，官方文档
- @H_696_23@Beautiful Soup4：可以从HTML或XML文件中提取数据，官方文档

在终端中分别输入以下pip命令，安装它们

python -m pip install beautifulsoup4
python -m pip install requests

最后，代码附上。

import os
import time
import requests
from bs4 import BeautifulSoup

# 需要爬取的页数
gain_page = int(input("请输入你需要爬取的页数："))
# 根据页数进行逻辑判断
for i in range(1, gain_page + 1):
    if i == 1:
        url = "https://www.53pic.com/bizhi/dongman/"
    else:
        url = "https://www.53pic.com/bizhi/dongman/index_%s.html" % str(i)

    # print(url)    # 测试代码

    # ---------------提取主页源代码--------------- #
    # 向服务器请求数据
    main_page_info = requests.get(url)
    # 解决乱码问题
    main_page_info.encoding = "utf-8"
    main_page_text = main_page_info.text
    # print(main_page_text)

    # -------2、通过href拿到子页面内容，从子页面中找到图片下载地址   <img src=”“>------

    # 将主页源码交给BeautifulSoup处理
    handle_main = BeautifulSoup(main_page_text, "html.parser")
    # print(handle_main)
    # 缩小数据匹配范围
    son_link_list_a = handle_main.find_all(name="a", attrs={"class": "title-content"})
    # print(son_link_list)

    # 通过循环取出a标签中的href、标题
    for a_href_a in son_link_list_a:
        # print(a_href_a)
        href = "https://www.53pic.com" + a_href_a.get("href")
        title = a_href_a.get("title")
        # print(href, titlE)

        # 拿到子页面的页面源代码
        son_page_info = requests.get(href)
        # 解决中文乱码问题
        son_page_info.encoding = "utf-8"
        son_page_info_text = son_page_info.text
        # print(son_page_info_text)
        # 将子页面交给BeautifulSoup处理
        handle_son = BeautifulSoup(son_page_info_text, "html.parser")
        # 缩小子页面数据匹配范围
        download_link_p = handle_son.find_all(name="div", attrs={"id": "showimgXFL"})
        # print(download_link_p)
        for div_src_div in download_link_p:
            # print(div_src_div)
            # 查找img标签
            download_src_img = div_src_div.find("img")
            # 匹配src属性
            download_src = download_src_img.get("src")
            # 请求下载
            download = requests.get(download_srC)
            # print(download_srC)
            # 切换工作目录
            os.chdir(r"C:\Users\崔泽\Desktop\mig")
            with open("%s.jpg" % title, mode='wb+') as file:
                # 以二进制文件写入文件
                file.write(download.content)
                time.sleep(1)
            print("%s...下载成功！" % titlE)

大佬总结

以上是大佬教程为你收集整理的python 爬虫抓取高清美女壁纸源码附上全部内容，希望文章能够帮你解决python 爬虫抓取高清美女壁纸源码附上所遇到的程序开发问题。

如果觉得大佬教程网站内容还不错，欢迎将大佬教程推荐给程序员好友。

本图文内容来源于网友网络收集整理提供，作为学习参考使用，版权属于原作者。
如您有任何意见或建议可联系处理。小编QQ：384754419，请注明来意。

标签：

上一篇: 警惕！Python 中少为人知的 10 个... 下一篇:用python写一个自动生成春联的软...

猜你在找的Python相关文章

Anaconda 01_安装问题 2022-04-02
python将ansible配置转为json格式实例代码 2019-10-05
对Python进行数据分析_关于Package的安装问题 2019-10-05
Python入门_条件控制(详解) 2019-10-05
python数据类型_字符串常用操作(详解) 2019-10-05
matplotlib绘制符合论文要求的图片实例(必看篇) 2019-10-05
Python中easy_install 和 pip 的安装及使用 2019-10-05
Python常见异常分类与处理方法 2019-10-05
详解使用python的logging模块在stdout输出的两种方法 2019-10-05
Python计时相关操作详解【time,datetime】 2019-10-05

python 爬虫 抓取高清美女壁纸 源码附上

环境准备

大佬总结

python 爬虫抓取高清美女壁纸源码附上