转载于吾爱大佬https://www.52pojie.cn/thread-1658021-1-1.html
import requests from lxml import etree wps =[] url="https://www.163.com/dy/media/T1603594732083.html" heders = { 'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/98.0.4758.102 Safari/537.36' } rsp = requests.get(url,headers = heders) hot = rsp.content.decode('utf-8') html=etree.HTML(hot) today_url=html.xpath("//ul[@class='list_box cur']/li/a/@href")[0] rsp = requests.get(today_url,headers = heders) hot = rsp.content.decode('utf8') html=etree.HTML(hot) news_list = html.xpath('//div[@class="post_body"]/p[2]//text()') news_list = news_list[1:] for news in news_list: print(news)
实测:
东风随春归,发我枝上花。
水流花谢两无情,送尽东风过楚城。
6
茅檐人静,蓬窗灯暗,春晚连江风雨。
相见时难别亦难,东风无力百花残。
风度精神如彦辅,大鲜明。