扫描二维码下载沐宇APP

沐宇

微信扫码使用沐宇小程序

沐宇

BeautifulSoup鎬庝箞浠庣綉椤典腑鎶撳彇鏁版嵁

扬州沐宇科技
2024-05-14 12:46:17
BeautifulSoup

浣跨敤BeautifulSoup浠庣綉椤典腑鎶撳彇鏁版嵁鐨勬楠ゅ涓嬶細

  1. 瀵煎叆BeautifulSoup鍜宺equests搴擄細
from bs4 import BeautifulSoup
import requests
  1. 浣跨敤requests搴撳彂閫佽姹傝幏鍙栫綉椤靛唴瀹癸細
url = 'https://example.com'
response = requests.get(url)
  1. 浣跨敤BeautifulSoup瑙f瀽缃戦〉鍐呭锛?/li>
soup = BeautifulSoup(response.text, 'html.parser')
  1. 浣跨敤BeautifulSoup鐨勬柟娉曟壘鍒版兂瑕佹姄鍙栫殑鏁版嵁锛?/li>
# 鎵惧埌鎵€鏈夌殑鏍囬
titles = soup.find_all('h2')

# 鎵惧埌鎵€鏈夌殑閾炬帴
links = soup.find_all('a')

# 鎵惧埌鐗瑰畾class鐨勫厓绱?/span>
specific_class = soup.find_all(class_='specific-class')
  1. 閬嶅巻鎵惧埌鐨勬暟鎹苟鎻愬彇鍑洪渶瑕佺殑鍐呭锛?/li>
for title in titles:
    print(title.text)

for link in links:
    print(link['href'])

for element in specific_class:
    print(element.text)

閫氳繃浠ヤ笂姝ラ锛屾偍鍙互浣跨敤BeautifulSoup浠庣綉椤典腑鎶撳彇鏁版嵁骞舵彁鍙栧嚭闇€瑕佺殑鍐呭銆?/p>

扫码添加客服微信