Lucene 和突出显示文本字体问题
我正在使用 Lucene HighLighter,并取得了成功。这是我的代码:
StringBuffer sb = new StringBuffer();
for (int t = 0; t < fields.length; t++) {
SimpleHTMLFormatter formatter = new SimpleHTMLFormatter(
"<span class=\"highlight\">", "</span>");
Highlighter highlighter = new Highlighter(formatter,
new QueryScorer(parser.parse(queryString)));
if (d.get(fields[t]) != null) {
hilites = highlighter.getBestFragments(analyzer, fields[t],
d.get(fields[t]), 3);
int l = hilites.length;
// System.out.println("hilites length: "+l);
if (l > 0) {
for (int x = 0; x < l; x++) {
sb.append(hilites[x]).append("...");
}
}
}
}
问题出在我的搜索结果/突出显示的文本上,字符是乱码。这是因为缺少字体吗?
这是我的突出显示文本:
**on Educational Materials ~ ATS Job Board ""OR~C'C" .. III DUES United States Full... ? SL[I!," Full Memberhsip - Domestic membership is for residents residing in the United States. Dues...**
注意时髦的文本!
任何帮助将不胜感激。
Im using Lucene HighLighter, with success. Here is my code:
StringBuffer sb = new StringBuffer();
for (int t = 0; t < fields.length; t++) {
SimpleHTMLFormatter formatter = new SimpleHTMLFormatter(
"<span class=\"highlight\">", "</span>");
Highlighter highlighter = new Highlighter(formatter,
new QueryScorer(parser.parse(queryString)));
if (d.get(fields[t]) != null) {
hilites = highlighter.getBestFragments(analyzer, fields[t],
d.get(fields[t]), 3);
int l = hilites.length;
// System.out.println("hilites length: "+l);
if (l > 0) {
for (int x = 0; x < l; x++) {
sb.append(hilites[x]).append("...");
}
}
}
}
The problem is on my search results/highlighted text, the characters are garbled. Is this due to missing fonts?
Here is my Highlight text:
**on Educational Materials ~ ATS Job Board ""OR~C'C" .. III DUES United States Full... ? SL[I!," Full Memberhsip - Domestic membership is for residents residing in the United States. Dues...**
Notice the funky text!
Any help would be greatly appreciated.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
“乱码问题”可能与 Lucene 无关,而是与 XML 编码有关。您是否将“contentType”设置为“text/html;charset=UTF-8”?
The 'Garbled Text Problem' is probably not related to Lucene, but XML encoding. Did you set the 'contentType' to "text/html;charset=UTF-8"?