计算数据集范围内积分的最有效方法

发布于 2024-10-11 15:43:31 字数 443 浏览 9 评论 0原文

我有一个 10 行 x 20 列的数组。每列对应一个数据集，该数据集无法用任何类型的连续数学函数拟合（它是通过实验得出的一系列数字）。我想计算第4行和第8行之间每一列的积分，然后将获得的结果存储在一个新数组（20行x 1列）中。

我尝试过使用不同的 scipy.integrate 模块（例如quad，trpz，...）。

问题是，据我了解， scipy.integrate 必须应用于函数，并且我不确定如何将初始数组的每一列转换为函数。作为替代方案，我考虑计算第 4 行和第 8 行之间每列的平均值，然后将该数字乘以 4（即 8-4=4，x 间隔），然后将其存储到我的最终 20x1 数组中。问题是……嗯……我不知道如何计算给定范围内的平均值。我要问的问题是：

哪种方法更有效/更直接？
可以在像我所描述的那样的数据集上计算积分吗？
如何计算一系列行的平均值？

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

温柔少女心 2024-10-18 15:43:31

由于您只知道数据点，因此最好的选择是使用trapz（积分的梯形近似，基于您知道的数据点）。

您很可能不想将数据集转换为函数，而使用 trapz 则不需要这样做。

所以如果我理解正确的话，你想做这样的事情：

from numpy import *

# x-coordinates for data points
x = array([0, 0.4, 1.6, 1.9, 2, 4, 5, 9, 10])

# some random data: 3 whatever data sets (sharing the same x-coordinates)
y = zeros([len(x), 3])
y[:,0] = 123
y[:,1] = 1 + x
y[:,2] = cos(x/5.)
print y

# compute approximations for integral(dataset, x=0..10) for datasets i=0,1,2
yi = trapz(y, x[:,newaxis], axis=0)
# what happens here: x must be an array of the same shape as y
# newaxis tells numpy to add a new "virtual" axis to x, in effect saying that the
# x-coordinates are the same for each data set

# approximations of the integrals based the datasets
# (here we also know the exact values, so print them too)
print yi[0], 123*10
print yi[1], 10 + 10*10/2.
print yi[2], sin(10./5.)*5.

Since you know only the data points, the best choice is to use trapz (the trapezoidal approximation to the integral, based on the data points you know).

You most likely don't want to convert your data sets to functions, and with trapz you don't need to.

So if I understand correctly, you want to do something like this:

from numpy import *

# x-coordinates for data points
x = array([0, 0.4, 1.6, 1.9, 2, 4, 5, 9, 10])

# some random data: 3 whatever data sets (sharing the same x-coordinates)
y = zeros([len(x), 3])
y[:,0] = 123
y[:,1] = 1 + x
y[:,2] = cos(x/5.)
print y

# compute approximations for integral(dataset, x=0..10) for datasets i=0,1,2
yi = trapz(y, x[:,newaxis], axis=0)
# what happens here: x must be an array of the same shape as y
# newaxis tells numpy to add a new "virtual" axis to x, in effect saying that the
# x-coordinates are the same for each data set

# approximations of the integrals based the datasets
# (here we also know the exact values, so print them too)
print yi[0], 123*10
print yi[1], 10 + 10*10/2.
print yi[2], sin(10./5.)*5.

回复收藏 0 原文

司马昭之心 2024-10-18 15:43:31

要获取每列中条目 4 到 8（包括两端）的总和，请使用

a = numpy.arange(200).reshape(10, 20)
a[4:9].sum(axis=0)

(第一行只是创建所需形状的示例数组。)

To get the sum of the entries 4 to 8 (including both ends) in each column, use

a = numpy.arange(200).reshape(10, 20)
a[4:9].sum(axis=0)

(The first line is just to create an example array of the desired shape.)

回复收藏 0 原文

~没有更多了~

关于作者

梦中的蝴蝶

暂无简介

文章

26 人气

关注发私信

友情链接

文江博客

计算数据集范围内积分的最有效方法

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

夢野间

百度③文鱼

小草泠泠

zhuwenyan

weirdo

坚持沉默

友情链接

计算数据集范围内积分的最有效方法

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

夢野间

百度③文鱼

小草泠泠

zhuwenyan

weirdo

坚持沉默

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。