顶级肉欲(出轨高h),亚洲国产日产无码精品,gucci是什么牌子

一、前言

?? ? 參考了作者布咯咯_rieuse的《爬取豆瓣電影中速度與激情8演員圖片》一文，原文地址：爬取豆瓣電影中速度與激情8演員圖片。于是開始學習、模仿、改進。把自己學習的過程在此整理，與大家一起分享。
???? 目標：爬取圖片，并用對應圖片的明星中文名字為文件名保存于電腦中。
???? 地址：戰狼2?

二、分析

本次爬取使用第三方的requests庫，同時也是使用了urllib.urlretrieve函數下載文件。
解析主要使用了BeautifulSoup，部分也使用了re正則表達式

1、導入需要的庫

import requests
from bs4 import BeautifulSoup

2、獲取地址，使用BS庫并使用lxml解析器

html=requests.get(url).content
soup=BeautifulSoup(html,'lxml')

3、抓取<title>標簽中的片名作為文件的保存目錄

movie=soup.title.string.split(' ')[0]

4、抓取演員和對應的圖片url

從下圖中分析可知中間的<div class="list-wrapper">對應的才是所有演員的信息，所以我們用BS抓取中間的部分，代碼如下：

tags=soup.find_all(class_='list-wrapper')? #BS遍歷所有的list-wrapper類
starts=[]
for tag in tags[1].find_all('li')? ? ? ? ? ? ? ? ? ? ? #使用list-wrapper[1] 獲取對應的演員的類，再次遍歷其下的li 標簽
? ? title=tag.a['title'].split(' ')[0]? ? ? ? ? ? ? ? ? ? #獲取演員名字，去除后面英文名字
? ? img_url=re.findall(r'https://img\d.doubanio.com/img/celebrity/medium/.*.jpg',str(tag))[0]? #正則表達式獲取圖片信息，正則表達式返回列表，使用[0]獲取數據
???? stars.append([title,img_url])???????????????? #追加拼裝數據

三、完整代碼：

######1、geturl.py?

#coding=utf-8
import requests
from bs4 import BeautifulSoup
import sys
reload(sys)
sys.setdefaultencoding('utf-8')
def geturl(url):
??? html=requests.get(url).content
??? soup=BeautifulSoup(html,'lxml')
??? return soup

######2、download_doupan.py

import geturl,re,urllib,os
soup=geturl.geturl('https://movie.douban.com/subject/20451290/celebrities')
movie=soup.title.string.split(' ')[0]
tags=soup.find_all(class_='list-wrapper')
stars=[]
for tag in tags[1].find_all('li'):
??? title=tag.a['title'].split(' ')[0]
??? img_url=re.findall(r'https://img\d.doubanio.com/img/celebrity/medium/.*.jpg',str(tag))[0]
??? stars.append([title,img_url])

if not os.path.exists(movie):
??? os.makedirs(movie)
??? for star in stars:
??????? filename=os.path.join(movie,star[0]+'.jpg')
??????? with open(filename,'w') as f:
??????????? urllib.urlretrieve(star[1],filename)

源碼鏈接Click Here

三个男躁一个女,国精产品一区一手机的秘密,麦子交换系列最经典十句话,欧美国产综合欧美视频

Python爬蟲日記一：爬取豆瓣電影中《戰狼2》演員圖片

Python爬蟲日記一：爬取豆瓣電影中《戰狼2》演員圖片

一、前言

二、分析

三、完整代碼：

推薦閱讀更多精彩內容

三个男躁一个女,国精产品一区一手机的秘密,麦子交换系列最经典十句话,欧美 国产 综合 欧美 视频

Python爬蟲日記一：爬取豆瓣電影中《戰狼2》演員圖片

一、前言

二、分析

三、完整代碼：

推薦閱讀更多精彩內容

三个男躁一个女,国精产品一区一手机的秘密,麦子交换系列最经典十句话,欧美国产综合欧美视频