TextRank 运行时间
我在java中实现了textrank,但它看起来很慢。有谁知道它的预期性能吗?
如果预计不会很慢,可能是以下任何一个问题:
1)似乎没有办法创建边缘并同时为其添加权重JGraphT 时间,所以我计算重量,如果它 > 0,我添加一条边。后来我重新计算权重以在循环边缘时添加它们。这是一个糟糕的主意吗?
2)我正在使用JGraphT。那是一个缓慢的图书馆吗?
3)我还能做些什么来加快速度吗?
I implemented textrank in java but it seems pretty slow. Does anyone know about its expected performance?
If it's not expected to be slow, could any of the following be the problem:
1) It didn't seem like there was a way to create an edge and add a weight to it at the same in JGraphT time so I calculate the weight and if it's > 0, I add an edge. I later recalculate the weights to add them while looping through the edges. Is that a terrible idea?
2) I'm using JGraphT. Is that a slow library?
3) Anything else I could do to make it faster?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
这取决于你所说的“相当慢”是什么意思。谷歌搜索了一下发现了这段:
(参见http://www.scribd.com/doc/51398390 /11/Evaluating-ef%EF%AC%81ciency 的上下文。)
因此,据此,我推断一个不错的 TextRank 实现应该能够从约 1 秒内约 500 个摘要。
It depends what you mean by "pretty slow". A bit of googling found this paragraph:
(See http://www.scribd.com/doc/51398390/11/Evaluating-ef%EF%AC%81ciency for the context.)
So from this, I infer that a decent TextRank implementation should be capable of extracting keywords from ~500 abstracts in ~1second.