优化文件搜索
我有一个庞大的(大约 10 000 人)数据库。这些人可以输入标准信息并上传文档(pdf、.doc 和 .docx)。 当我进行自由文本搜索时,它会搜索所有人员的信息字段及其上传的文档。搜索所有人民的文档需要花费很多时间,wicket 会抛出一个“WicketRuntimeException:1 分钟后,Pagemap null 仍然被以下锁定:Thread ...”
我的问题是是否可以创建一个搜索功能,可以搜索最多无需花费很长时间即可处理 10 000 个文档。
该项目运行 Java、Wicket、Hibernate 和 Java。春天。
I have a large ( around 10 000) database of people. These people can input standard information and upload documents (pdf, .doc & .docx).
When I do a freetext search, it searches through all of the peoples information fields AND their uploaded documents. Searching through ALL of the peoples documents takes so much time wicket throws an "WicketRuntimeException: After 1 minute the Pagemap null is still locked by: Thread ... "
My question is if it's possible to create a search function that can search through up to 10 000 documents without taking extremely long time.
The project is running Java, with Wicket, Hibernate & Spring.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您所描述的内容听起来像是直接适合 Apache Lucene。在确定现有轮子不能满足您的需求之前,不要重新发明轮子。
What you're describing sounds like a direct fit for Apache Lucene. Don't reinvent the wheel until you're sure that existing wheels don't meet your needs.