我该如何自动扩展整个网页以用pandas（Python）刮擦？

发布于 2025-01-31 02:50:03 字数 663 浏览 1 评论 0原文

我正在尝试从 https://coinmarketcap.com/exchanges/exchanges/binance/binance/ ）。在页面上，有一个“负载更多”按钮。单击此点时，将显示更多的行，但URL不会更改。当我使用pd.read_html（url（>）将此URL传递给PANDAS时，它会拉出前100行，而无需其他。如何通过url或命令自动加载所有表？任何帮助都将受到赞赏。代码：

import json
import requests
import pandas as pd
from bs4 import BeautifulSoup
import lxml
import html5lib


url = "https://coinmarketcap.com/exchanges/binance/"


df = pd.read_html(url)
pd.set_option("display.max_rows", None, "display.max_columns", None)
print(df)

原文

I am trying to scrape html tables from https://coinmarketcap.com/exchanges/binance/ . On the page, there is a "load more" button. When this is clicked, more rows are displayed, but the URL doesn't change. When I pass this URL to pandas using pd.read_html(url(, it pulls the first 100 rows and nothing else. How do I auto load all the tables, either through the URL or through a command? Any help is appreciated.
Code:

import json
import requests
import pandas as pd
from bs4 import BeautifulSoup
import lxml
import html5lib


url = "https://coinmarketcap.com/exchanges/binance/"


df = pd.read_html(url)
pd.set_option("display.max_rows", None, "display.max_columns", None)
print(df)

分享到QQ

分享到微博