鎬庝箞浣跨敤BeautifulSoup澶勭悊鍒楄〃鏁版嵁
浣跨敤BeautifulSoup澶勭悊鍒楄〃鏁版嵁鐨勬楠ゅ涓嬶細
- 瀵煎叆BeautifulSoup搴擄細棣栧厛闇€瑕佸鍏eautifulSoup搴擄紝鍙互浣跨敤浠ヤ笅璇彞瀵煎叆锛?/li>
from bs4 import BeautifulSoup
- 鍒涘缓BeautifulSoup瀵硅薄锛氬皢瑕佸鐞嗙殑HTML鍐呭浼犻€掔粰BeautifulSoup瀵硅薄锛屽垱寤轰竴涓狟eautifulSoup瀵硅薄锛屽彲浠ヤ娇鐢ㄤ互涓嬭鍙ュ垱寤猴細
soup = BeautifulSoup(html_content, 'html.parser')
鍏朵腑锛宧tml_content鏄澶勭悊鐨凥TML鍐呭銆?/p>
- 鏌ユ壘鍒楄〃鏁版嵁锛氫娇鐢˙eautifulSoup鎻愪緵鐨勬柟娉曟潵鏌ユ壘鍜屾彁鍙栧垪琛ㄦ暟鎹紝姣斿find()鏂规硶鍜宖ind_all()鏂规硶銆備緥濡傦紝濡傛灉瑕佹彁鍙栨墍鏈夌殑鍒楄〃椤癸紝鍙互浣跨敤find_all()鏂规硶鏉ヨ幏鍙栨墍鏈夌殑鍒楄〃椤瑰厓绱狅紝濡備笅鎵€绀猴細
list_items = soup.find_all('li')
- 閬嶅巻鍒楄〃鏁版嵁锛氶亶鍘嗚幏鍙栧埌鐨勫垪琛ㄦ暟鎹紝杩涜杩涗竴姝ュ鐞嗐€備緥濡傦紝鍙互浣跨敤for寰幆鏉ラ亶鍘嗚幏鍙栧埌鐨勫垪琛ㄩ」锛屽涓嬫墍绀猴細
for item in list_items:
print(item.text)
浠ヤ笂鏄娇鐢˙eautifulSoup澶勭悊鍒楄〃鏁版嵁鐨勫熀鏈楠わ紝鏍规嵁鍏蜂綋鐨勯渶姹傚拰鎯呭喌锛岃繕鍙互杩涗竴姝ュ鑾峰彇鍒扮殑鍒楄〃鏁版嵁杩涜澶勭悊鍜屾搷浣溿€?/p>
相关问答