How do I get the grade level from the Fry Graph readability formula?
I'm working on an application (C#) that applies several readability formulas to a text, such as Gunning-Fog, Precise SMOG, and Flesch-Kincaid.
Now I need to implement the Fry-based Grade formula in my program. I understand the formula's logic: you take three 100-word samples, calculate the average number of sentences per 100 words and the average number of syllables per 100 words, and then plot those two values on a graph.
Here is a more detailed explanation of how this formula works.
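For concreteness, the averaging step of the procedure can be sketched as below. This is only an illustration in Python, not code from the application in question; the function name `fry_averages` and the sample counts are made up for the example.

```python
from statistics import mean

def fry_averages(samples):
    """Average the per-sample counts.

    samples: list of (sentences, syllables) pairs, one pair per 100-word sample.
    Returns (average sentences per 100 words, average syllables per 100 words).
    """
    if not samples:
        raise ValueError("at least one sample is required")
    avg_sentences = mean(sentences for sentences, _ in samples)
    avg_syllables = mean(syllables for _, syllables in samples)
    return avg_sentences, avg_syllables

# Hypothetical counts for three 100-word samples (illustrative values only).
samples = [(6.2, 140), (5.8, 146), (6.0, 143)]
avg_sentences, avg_syllables = fry_averages(samples)
```

The resulting pair is the point that then has to be located on the graph.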
I already have the averages, but I have no idea how to tell my program to "go check the graph, plot the values, and give me a level." I don't have to show the graph to the user; I only have to show them the level.
I was thinking that maybe I could keep all the values in memory, divided into levels, for example:
Level 1: values whose sentence average is between 10.0 and 25+, and whose syllable average is between 108 and 132.
Level 2: values whose sentence average is between 7.7 and 10.0, and ... so on.
But the problem is that, so far, the only place where I have found the values that define each level is the graph itself, and they can't be read off very accurately. If I take the approach described above and read the values from the graph, my level estimates will be too imprecise, and so the Fry-based Grade will not be accurate.
So, does anyone know where I can find exact values for the different levels of the Fry-based Grade, or can anyone help me think of a way to work around this?
Thanks
Comments (3)
Well, I'm not sure this is the most efficient solution, or the best one, but at least it does the job.
I gave up on the idea of having a math formula to get the levels. Maybe there is such a formula, but I couldn't find it.
So I took Fry's graph, with all the levels, painted each level a different color, and then loaded the image into my program using:
As you can see, after loading the image I use the GetPixel method to get the color at the specified coordinates. I had to do some conversion to get the equivalent pixels for a given value on the graph, since the scale of the graph does not match the pixel dimensions of the image.
In the end, I compare the color returned by GetPixel against the color of each level to find the Fry readability level of the text.
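The poster's own code is not preserved here. As a rough illustration of the conversion step only, the sketch below maps graph values to pixel coordinates by interpolating between calibrated tick marks, which also copes with an axis that is not evenly spaced. It is written in Python rather than C#, and every tick position, color, and name in it (`value_to_pixel`, `pixel_for`, `COLOR_TO_LEVEL`) is a hypothetical placeholder that would have to be measured from the actual colored image.

```python
def value_to_pixel(value, ticks):
    """Piecewise-linear interpolation between (axis value, pixel) tick marks.

    Values outside the calibrated range are clamped to the nearest tick.
    """
    ticks = sorted(ticks)
    lo_val, lo_px = ticks[0]
    if value <= lo_val:
        return lo_px
    for hi_val, hi_px in ticks[1:]:
        if value <= hi_val:
            t = (value - lo_val) / (hi_val - lo_val)
            return round(lo_px + t * (hi_px - lo_px))
        lo_val, lo_px = hi_val, hi_px
    return lo_px

# Hypothetical calibration: (axis value, pixel position) pairs read off the image.
SYLLABLE_TICKS = [(108, 50), (140, 370), (172, 690)]      # x axis
SENTENCE_TICKS = [(2.0, 560), (10.0, 300), (25.0, 40)]    # y axis; pixel y grows downward

# Hypothetical colors painted onto each level's region of the image.
COLOR_TO_LEVEL = {(255, 0, 0): 1, (0, 255, 0): 2}

def pixel_for(avg_sentences, avg_syllables):
    """Return the (x, y) pixel corresponding to a point on the graph."""
    x = value_to_pixel(avg_syllables, SYLLABLE_TICKS)
    y = value_to_pixel(avg_sentences, SENTENCE_TICKS)
    return x, y

def level_for_color(rgb):
    """Look up the level for a pixel color; None means an unpainted area."""
    return COLOR_TO_LEVEL.get(rgb)
```

The color at `pixel_for(...)` would then be read with the image library's pixel accessor (GetPixel in the poster's case) and passed to `level_for_color`.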
I hope this helps someone who faces the same problem.
Cheers.
You simply need to determine the formula for the graph, that is, a formula that accepts the number of sentences and the number of syllables and returns the level.
If you can't find the formula, you can determine it yourself. Estimate the linear equation for each of the lines on the graph. Also estimate the 'out-of-bounds' regions in the 'long words' and 'long sentences' areas.
Now, for each point, just determine the region in which it lies: which lines it is above and which lines it is below. This is fairly simple algebra; unfortunately, this is the best link I can find that describes how to do it.
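As a minimal sketch of that algebra (in Python, with made-up coordinates), a boundary line can be estimated from two points read off the graph, and a point can then be tested against it. Here x is syllables per 100 words and y is sentences per 100 words; the helper names `line_through` and `is_above` and all the numbers are assumptions for illustration, not values taken from the real Fry graph.

```python
def line_through(p1, p2):
    """Return (slope, intercept) of the non-vertical line through two points."""
    (x1, y1), (x2, y2) = p1, p2
    if x1 == x2:
        raise ValueError("vertical line: slope is undefined")
    slope = (y2 - y1) / (x2 - x1)
    return slope, y1 - slope * x1

def is_above(point, line):
    """True if the point lies strictly above the line y = slope * x + intercept."""
    x, y = point
    slope, intercept = line
    return y > slope * x + intercept

# Two hypothetical neighboring boundaries, each estimated from two graph points.
upper = line_through((120, 5.0), (140, 9.0))
lower = line_through((130, 5.0), (150, 9.0))

# A point is in the band between them if it is below the upper line
# and above the lower one.
point = (135, 7.0)
in_band = (not is_above(point, upper)) and is_above(point, lower)
```

Repeating the test for each pair of adjacent boundaries identifies the region, and therefore the level, that contains the point.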
I have made a first pass at solving this, which I thought I would share in case someone else comes looking in the future. I built on the answer above and created a generic list of linear equations that can be used to determine an approximate grade level. I first had to correct the values to make them more linear. This does not take the invalid areas into account, but I may revisit that.
The equation class:
Here is the FryCalculator:
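The original listings are not preserved here. What follows is not the poster's code but a small Python sketch of how such a pair of classes might look, assuming the boundary lines are ordered from the lowest grade to the highest and do not cross inside the graph. The coefficients are hypothetical placeholders, and neither the value correction mentioned above nor the invalid areas are handled.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class LinearEquation:
    """A boundary line y = slope * x + intercept on the graph."""
    slope: float
    intercept: float

    def y_at(self, x):
        return self.slope * x + self.intercept

class FryCalculator:
    def __init__(self, boundaries):
        # boundaries[i] separates grade i + 1 (on or above the line)
        # from grade i + 2 (below it).
        self.boundaries = list(boundaries)

    def grade(self, avg_syllables, avg_sentences):
        """Return the approximate grade for a point on the graph."""
        grade = 1
        for line in self.boundaries:
            if avg_sentences >= line.y_at(avg_syllables):
                break  # the point is on or above this boundary
            grade += 1
        return grade

# Hypothetical coefficients; real ones must be estimated from the graph.
calculator = FryCalculator([
    LinearEquation(0.2, -19.0),
    LinearEquation(0.2, -21.0),
    LinearEquation(0.2, -23.0),
])
```

With these placeholder lines, `calculator.grade(135, 7.0)` falls below the first boundary and above the second, so it lands in the second band.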