BeautifulSoup鎬庝箞浠庣綉椤典腑鎶撳彇鏁版嵁
浣跨敤BeautifulSoup浠庣綉椤典腑鎶撳彇鏁版嵁鐨勬楠ゅ涓嬶細
- 瀵煎叆BeautifulSoup鍜宺equests搴擄細
from bs4 import BeautifulSoup
import requests
- 浣跨敤requests搴撳彂閫佽姹傝幏鍙栫綉椤靛唴瀹癸細
url = 'https://example.com'
response = requests.get(url)
- 浣跨敤BeautifulSoup瑙f瀽缃戦〉鍐呭锛?/li>
soup = BeautifulSoup(response.text, 'html.parser')
- 浣跨敤BeautifulSoup鐨勬柟娉曟壘鍒版兂瑕佹姄鍙栫殑鏁版嵁锛?/li>
# 鎵惧埌鎵€鏈夌殑鏍囬
titles = soup.find_all('h2')
# 鎵惧埌鎵€鏈夌殑閾炬帴
links = soup.find_all('a')
# 鎵惧埌鐗瑰畾class鐨勫厓绱?/span>
specific_class = soup.find_all(class_='specific-class')
- 閬嶅巻鎵惧埌鐨勬暟鎹苟鎻愬彇鍑洪渶瑕佺殑鍐呭锛?/li>
for title in titles:
print(title.text)
for link in links:
print(link['href'])
for element in specific_class:
print(element.text)
閫氳繃浠ヤ笂姝ラ锛屾偍鍙互浣跨敤BeautifulSoup浠庣綉椤典腑鎶撳彇鏁版嵁骞舵彁鍙栧嚭闇€瑕佺殑鍐呭銆?/p>
相关问答