使用 Gunning Fox 索引分析文本
在使用 Gunning Fox 索引进行可读性分析时。我必须计算以下值
- 平均句子长度=单词数/句子数
- 复杂单词的百分比=复杂单词的数量/单词数
- 雾指数= 0.4 *(平均句子长度+复杂单词的百分比)
I想知道删除重复项和停用词(即清理后)后是否会计算字数,还是仅计算文本中的总字数而不删除任何单词或清理?
感谢您的帮助!
While doing Analysis of readability using Gunning Fox index-. I have to calculate following values
- Average Sentence Length = the number of words / the number of sentences
- Percentage of Complex words = the number of complex words / the number of words
- Fog Index = 0.4 * (Average Sentence Length + Percentage of Complex words)
I want to know whether the number of words will be calculated after removing duplicates and stop words i.e. after cleaning or just the total no of words in the text without removing any words or cleaning?
Thanks for help!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
不,您不进行任何清洁或“停止词”拆卸。
您正在尝试计算阅读文本的容易性。停止单词仅与旧式信息检索有关。另外,请勿删除重复项。处理AS的文本,否则结果将是错误的。
如果您要删除停止词,那么文本将很难读取,因为有效的简短(即“简单”)单词将被删除。
No, you don't do any cleaning or 'stop-word' removal.
You are trying to calculate how easy it is to read the text. Stop words are only relevant for old-style information retrieval. Also, do not remove duplicates. Process the text as-is, otherwise the result will be wrong.
If you were to remove stopwords, the text would be more difficult to read, as effectively a lot of short (ie "easy") words will have been removed.