How to scrape article content with Python
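To scrape article content with Python, first choose a library for making HTTP requests. requests is a third-party package that has to be installed (for example with pip install requests), while urllib is part of the standard library and needs no installation. Next, study the structure and URLs of the target site to work out which page holds the article content you want.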
Then send an HTTP request to fetch the page's HTML, and extract the article content from it with BeautifulSoup, regular expressions, or a similar method. Finally, save the extracted content to a local file or process it further.
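As a rough illustration of the urllib and regular-expression route mentioned above, here is a minimal sketch that uses only the standard library. The function names fetch_html and extract_article, the User-Agent string, and the article-content class are assumptions made for illustration, not part of any library. The regex approach is fragile: it stops at the first closing div, so nested div elements inside the article will cut the result short.

```python
import html
import re
from urllib.request import Request, urlopen


def fetch_html(url, timeout=10):
    """Download a page with urllib and decode it to text."""
    # Some sites reject requests without a User-Agent; this value is a placeholder.
    request = Request(url, headers={"User-Agent": "article-scraper-example/0.1"})
    with urlopen(request, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract_article(page, css_class="article-content"):
    """Return the text of the first <div> whose class attribute contains css_class."""
    pattern = re.compile(
        r'<div[^>]*class="[^"]*\b' + re.escape(css_class) + r'\b[^"]*"[^>]*>(.*?)</div>',
        re.DOTALL | re.IGNORECASE,
    )
    match = pattern.search(page)
    if match is None:
        return ""
    text = re.sub(r"<[^>]+>", " ", match.group(1))  # replace inner tags with spaces
    text = html.unescape(text)                      # decode entities such as &amp;
    return " ".join(text.split())                   # collapse runs of whitespace


sample = (
    '<html><body><div class="article-content">'
    "<p>Hello &amp; welcome.</p><p>Second paragraph.</p>"
    "</div></body></html>"
)
print(extract_article(sample))
```

On a real page you would call extract_article(fetch_html(url)) instead of passing the inline sample. For anything beyond simple, flat markup, an HTML parser is usually the more robust choice.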
Here is a simple example showing how to scrape article content with Python:
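import requests
from bs4 import BeautifulSoup

url = 'https://example.com/article'
# A timeout keeps the request from hanging; raise_for_status() raises on 4xx/5xx responses.
response = requests.get(url, timeout=10)
response.raise_for_status()
html = response.text
soup = BeautifulSoup(html, 'html.parser')
# find() returns None when no matching tag exists, so check before calling get_text().
node = soup.find('div', class_='article-content')
article = node.get_text(strip=True) if node else ''
print(article)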
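In this example, we first use the requests library to send a GET request and fetch the HTML of the article page. We then parse the HTML with BeautifulSoup, find the tag that contains the article, and extract its text. Finally, the article text is printed. The URL and the article-content class are placeholders to adapt to the target page, and you can process or save the text further as needed.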
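For the saving step, one possible approach is sketched below. The helper name save_article and the output path are hypothetical, and the cleanup it applies (trimming each line and dropping blank ones) is just one reasonable choice for text pulled out of HTML.

```python
from pathlib import Path


def save_article(text, path, encoding="utf-8"):
    """Write the extracted article text to a local file and return its path."""
    target = Path(path)
    # Create missing parent folders so nested paths such as "articles/a.txt" work.
    target.parent.mkdir(parents=True, exist_ok=True)
    # Trim each line and drop blank lines left over from the HTML layout.
    lines = [line.strip() for line in text.splitlines()]
    cleaned = "\n".join(line for line in lines if line)
    target.write_text(cleaned + "\n", encoding=encoding)
    return target
```

A call such as save_article(article, "articles/example.txt") would store the text extracted in the example above. Passing an explicit encoding avoids depending on the platform default, which matters for non-ASCII articles.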