CachedRowSet slower than ResultSet?
In my Java code, I access an Oracle database table with a select statement. I receive a lot of rows (about 50,000), so the rs.next() calls need some time to process all of the rows.

using ResultSet, the processing of all rows (rs.next) takes about 30 secs

My goal is to speed up this process, so I changed the code and am now using a CachedRowSet:

using CachedRowSet, the processing of all rows takes about 35 secs

I don't understand why the CachedRowSet is slower than the normal ResultSet, because the CachedRowSet retrieves all data at once, while the ResultSet retrieves the data every time rs.next() is called.
Here is a part of the code:
try {
    stmt = masterCon.prepareStatement(sql);
    rs = stmt.executeQuery();
    // copy all rows from the ResultSet into the in-memory CachedRowSet
    CachedRowSet crset = new CachedRowSetImpl();
    crset.populate(rs);
    // iterate over the cached copy, not the (now exhausted) ResultSet
    while (crset.next()) {
        int countStar = crset.getInt("COUNT");
        ...
    }
} finally {
    //cleanup
}
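The plain-ResultSet version behind the 30-second figure is not shown; presumably it is just the same query iterated directly, roughly like this (mirroring the variables of the snippet above):

try {
    stmt = masterCon.prepareStatement(sql);
    rs = stmt.executeQuery();
    // iterate the ResultSet directly, with no intermediate copy
    while (rs.next()) {
        int countStar = rs.getInt("COUNT");
        ...
    }
} finally {
    //cleanup
}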
5 Answers
CachedRowSet caches the results in memory, i.e. you don't need the connection anymore. Therefore it is "slower" in the first place.
-> http://download.oracle.com/javase/1.5.0/docs/api/javax/sql/rowset/CachedRowSet.html
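A minimal sketch of that trade-off, mirroring the question's snippet (dataSource and sql are placeholders, and the standard RowSetProvider factory is used instead of the Sun-internal CachedRowSetImpl): populate() pays the full copy cost up front, and afterwards the rows remain readable with the connection already closed.

CachedRowSet crs;
try (Connection con = dataSource.getConnection();
     PreparedStatement stmt = con.prepareStatement(sql);
     ResultSet rs = stmt.executeQuery()) {
    crs = RowSetProvider.newFactory().createCachedRowSet();
    crs.populate(rs);      // the full copy into memory happens here, up front
}                          // connection, statement and ResultSet are closed...
while (crs.next()) {       // ...yet the cached rows are still readable
    int countStar = crs.getInt("COUNT");
}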
There is an issue with CachedRowSet coupled together with a Postgres JDBC driver. CachedRowSet needs to know the types of the columns so it knows which Java objects to create (god knows what else it fetches from the DB behind the covers!). It therefore makes more round trips to the DB to fetch column metadata. In very high volumes this becomes a real problem. If the DB is on a remote server, this is a real problem as well because of network latency.

We've been using CachedRowSet for years and just discovered this. We now implement our own CachedRowSet, as we never used any of its fancy stuff anyway. We do getString for all types and convert ourselves, as this seems the quickest way. This clearly wasn't an issue with fetch size, as the Postgres driver fetches everything by default.
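The answer does not show the replacement class; a minimal sketch of the idea, assuming a plain list-of-rows cache keyed by column label, where every value is read once via getString() and converted by the caller, might look like this:

import java.sql.ResultSet;
import java.sql.ResultSetMetaData;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical minimal stand-in for CachedRowSet: one metadata call, then
// every value is fetched with getString() and kept in memory as-is.
public class SimpleRowCache {
    private final List<Map<String, String>> rows = new ArrayList<>();

    public SimpleRowCache(ResultSet rs) throws SQLException {
        ResultSetMetaData md = rs.getMetaData();   // single metadata round trip
        int columns = md.getColumnCount();
        while (rs.next()) {
            Map<String, String> row = new HashMap<>(columns);
            for (int i = 1; i <= columns; i++) {
                row.put(md.getColumnLabel(i), rs.getString(i));
            }
            rows.add(row);
        }
    }

    public List<Map<String, String>> rows() {
        return rows;
    }
}

Callers then convert numeric columns themselves, e.g. Integer.parseInt(row.get("COUNT")).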
What makes you think that ResultSet will retrieve the data each time rs.next() is called? It's up to the implementation exactly how it works - and I wouldn't be surprised if it fetches a chunk at a time; quite possibly a fairly large chunk.

I suspect you're basically seeing the time it takes to copy all the data into the CachedRowSet and then access it all - basically you've got an extra copying operation for no purpose.
Using a normal ResultSet you get more optimization options with RowPrefetch and FetchSize.
These optimize the network transport chunks and the processing in the while loop, so rs.next() always has data to work with.
FetchSize has a default of 10 (in the latest Oracle versions), but as far as I know RowPrefetch is not set. That means the network transport is not optimized at all.
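A short, hedged sketch of those two knobs (the connection URL, credentials and the value 500 are illustrative; the defaultRowPrefetch property name follows Oracle's driver documentation, so verify it against the driver version you actually use):

Properties props = new Properties();
props.setProperty("user", "scott");              // placeholder credentials
props.setProperty("password", "tiger");
// Oracle-specific row prefetch hint (assumed property name, check your driver)
props.setProperty("defaultRowPrefetch", "500");

try (Connection con = DriverManager.getConnection("jdbc:oracle:thin:@//host:1521/service", props);
     PreparedStatement stmt = con.prepareStatement(sql)) {
    stmt.setFetchSize(500);                      // standard JDBC: rows per network round trip
    try (ResultSet rs = stmt.executeQuery()) {
        while (rs.next()) {
            // most calls read from the driver's buffer; a new round trip
            // to the database happens only about every 500 rows
        }
    }
}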
I've done some performance tests with/without CachedRowSet and with different fetchSize values (using Exasol DB and the latest 7.x.x JDBC driver). My tests and code are very straightforward.

First try (using a CachedRowSet):

Results:

Second try (populating a HashMap rather than a CachedRowSet):

Results:

So indeed, it looks like the CachedRowSet implementation has poor performance compared to using the ResultSet directly, but it provides a generic approach which allows different table structures to be used.

Disclaimer: I used only one type of data set (i.e. a table of 50K simple items). I suggest that you test this on your specific use case before deciding on your approach.
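The original test code and timings are not shown above; as a rough, hypothetical illustration of the two variants being compared (the query and the "id"/"name" columns are assumptions, not the answer's actual code):

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.util.HashMap;
import java.util.Map;
import javax.sql.rowset.CachedRowSet;
import javax.sql.rowset.RowSetProvider;

public class RowSetComparisonSketch {

    // Variant 1: copy everything into a CachedRowSet, then iterate the copy.
    static long viaCachedRowSet(Connection con, String sql) throws Exception {
        long start = System.nanoTime();
        try (PreparedStatement stmt = con.prepareStatement(sql);
             ResultSet rs = stmt.executeQuery()) {
            CachedRowSet crs = RowSetProvider.newFactory().createCachedRowSet();
            crs.populate(rs);
            while (crs.next()) {
                crs.getInt("id");
                crs.getString("name");
            }
        }
        return (System.nanoTime() - start) / 1_000_000;   // elapsed millis
    }

    // Variant 2: read the ResultSet directly into a HashMap.
    static long viaHashMap(Connection con, String sql) throws Exception {
        long start = System.nanoTime();
        Map<Integer, String> items = new HashMap<>();
        try (PreparedStatement stmt = con.prepareStatement(sql);
             ResultSet rs = stmt.executeQuery()) {
            while (rs.next()) {
                items.put(rs.getInt("id"), rs.getString("name"));
            }
        }
        return (System.nanoTime() - start) / 1_000_000;
    }
}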