BeautifulSoup " />

扫描二维码下载沐宇APP

沐宇

微信扫码使用沐宇小程序

沐宇

鎬庝箞浣跨敤BeautifulSoup澶勭悊鍒楄〃鏁版嵁

扬州沐宇科技
2024-05-14 12:54:16
BeautifulSoup

浣跨敤BeautifulSoup澶勭悊鍒楄〃鏁版嵁鐨勬楠ゅ涓嬶細

  1. 瀵煎叆BeautifulSoup搴擄細棣栧厛闇€瑕佸鍏eautifulSoup搴擄紝鍙互浣跨敤浠ヤ笅璇彞瀵煎叆锛?/li>
from bs4 import BeautifulSoup
  1. 鍒涘缓BeautifulSoup瀵硅薄锛氬皢瑕佸鐞嗙殑HTML鍐呭浼犻€掔粰BeautifulSoup瀵硅薄锛屽垱寤轰竴涓狟eautifulSoup瀵硅薄锛屽彲浠ヤ娇鐢ㄤ互涓嬭鍙ュ垱寤猴細
soup = BeautifulSoup(html_content, 'html.parser')

鍏朵腑锛宧tml_content鏄澶勭悊鐨凥TML鍐呭銆?/p>

  1. 鏌ユ壘鍒楄〃鏁版嵁锛氫娇鐢˙eautifulSoup鎻愪緵鐨勬柟娉曟潵鏌ユ壘鍜屾彁鍙栧垪琛ㄦ暟鎹紝姣斿find()鏂规硶鍜宖ind_all()鏂规硶銆備緥濡傦紝濡傛灉瑕佹彁鍙栨墍鏈夌殑鍒楄〃椤癸紝鍙互浣跨敤find_all()鏂规硶鏉ヨ幏鍙栨墍鏈夌殑鍒楄〃椤瑰厓绱狅紝濡備笅鎵€绀猴細
list_items = soup.find_all('li')
  1. 閬嶅巻鍒楄〃鏁版嵁锛氶亶鍘嗚幏鍙栧埌鐨勫垪琛ㄦ暟鎹紝杩涜杩涗竴姝ュ鐞嗐€備緥濡傦紝鍙互浣跨敤for寰幆鏉ラ亶鍘嗚幏鍙栧埌鐨勫垪琛ㄩ」锛屽涓嬫墍绀猴細
for item in list_items:
    print(item.text)

浠ヤ笂鏄娇鐢˙eautifulSoup澶勭悊鍒楄〃鏁版嵁鐨勫熀鏈楠わ紝鏍规嵁鍏蜂綋鐨勯渶姹傚拰鎯呭喌锛岃繕鍙互杩涗竴姝ュ鑾峰彇鍒扮殑鍒楄〃鏁版嵁杩涜澶勭悊鍜屾搷浣溿€?/p>

扫码添加客服微信