读取 BQUERY STREAMING（实时）API

发布于 2025-01-11 02:18:16 字数 779 浏览 0 评论 0原文

我有 BigQuery 数据仓库，它从 Google Analytics 获取数据。数据是实时传输的。现在我想使用 BigQuery 的 API 在数据到达（而不是之后）时获取这些数据。

我已经看到了 api，它允许您在将数据保存到 bigquery 后查询数据，例如：

from google.cloud import bigquery

# Construct a BigQuery client object.
client = bigquery.Client()

query = """
    SELECT name, SUM(number) as total_people
    FROM `bigquery-public-data.usa_names.usa_1910_2013`
    WHERE state = 'TX'
    GROUP BY name, state
    ORDER BY total_people DESC
    LIMIT 20
"""
query_job = client.query(query)  # Make an API request.

print("The query data:")
for row in query_job:
    # Row values can be accessed by field name or index.
    print("name={}, count={}".format(row[0], row["total_people"]))

有没有办法“监听”数据并将其中一些存储在云端？而不是让它被保存然后从bigquery查询？

谢谢

原文

I have BigQuery data warehouse which gets its data from Google Analytics.
the data is streamd - real time.
now I want to get this data as it arrives (and not after) to the bigquery using its API.

I have seen the api which lets you query the data after it saved into the bigquery,
for example:

from google.cloud import bigquery

# Construct a BigQuery client object.
client = bigquery.Client()

query = """
    SELECT name, SUM(number) as total_people
    FROM `bigquery-public-data.usa_names.usa_1910_2013`
    WHERE state = 'TX'
    GROUP BY name, state
    ORDER BY total_people DESC
    LIMIT 20
"""
query_job = client.query(query)  # Make an API request.

print("The query data:")
for row in query_job:
    # Row values can be accessed by field name or index.
    print("name={}, count={}".format(row[0], row["total_people"]))

Is there any way to "listen" to the data and store some of it on cloud?
rather than let it be saved and then query from the bigquery?

Thanks

分享到QQ

分享到微博