如何在报废时选择多个元素？

发布于 2025-02-07 12:55:38 字数 1001 浏览 1 评论 0原文

我正在研究有关《纽约时报》报道的新闻标题和内容。

这就是我写的：

import requests 
from bs4 import BeautifulSoup
import urllib.request as req 
import bs4
import pandas as pd
import numpy as np
import time
import warnings

url = 'https://www.nytimes.com/search?query=ukraine+war'

headers = {
   'User-Agent':
       'Mozilla/5.0 (Windows NT 10.0; Win64; x64)'
       ' AppleWebKit/537.36 (KHTML, like Gecko)'
       ' Chrome/102.0.5005.63 Safari/537.36 Edg/102.0.1245.33'
}

r = requests.get(url, headers=headers)

soup = BeautifulSoup(r.text, 'html5lib')

title = soup.find_all("h4", "p.css-16nhkrn")

for title in titles:
    title = title.text.strip()
    print(title)

from wordcloud import WordCloud, STOPWORDS
import matplotlib.pyplot as plt

text = title

wordcloud = WordCloud().generate(text)

plt.figure()
plt.imshow(wordcloud, interpolation="bilinear")
plt.axis("off")
plt.show()

我不确定为什么p class = CSS-16NHKRN未出现。我尝试了其他人，但失败了。

原文

I am doing research on news titles and contents reporting on New York Times.

This is what I have written:

import requests 
from bs4 import BeautifulSoup
import urllib.request as req 
import bs4
import pandas as pd
import numpy as np
import time
import warnings

url = 'https://www.nytimes.com/search?query=ukraine+war'

headers = {
   'User-Agent':
       'Mozilla/5.0 (Windows NT 10.0; Win64; x64)'
       ' AppleWebKit/537.36 (KHTML, like Gecko)'
       ' Chrome/102.0.5005.63 Safari/537.36 Edg/102.0.1245.33'
}

r = requests.get(url, headers=headers)

soup = BeautifulSoup(r.text, 'html5lib')

title = soup.find_all("h4", "p.css-16nhkrn")

for title in titles:
    title = title.text.strip()
    print(title)

from wordcloud import WordCloud, STOPWORDS
import matplotlib.pyplot as plt

text = title

wordcloud = WordCloud().generate(text)

plt.figure()
plt.imshow(wordcloud, interpolation="bilinear")
plt.axis("off")
plt.show()

I am not sure why p class = css-16nhkrn didn't appear. I have tried others but failed.

分享到QQ

分享到微博