How do I convert a string to RTF in C#?
Question
How do I convert the string "Européen" to the RTF-formatted string "Europ\'e9en"?
[TestMethod]
public void Convert_A_Word_To_Rtf()
{
    // Arrange
    string word = "Européen";
    // Verbatim string: in a regular literal, "\'" would be read as an escaped quote.
    string expected = @"Europ\'e9en";
    string actual = string.Empty;

    // Act
    // actual = ... // How?

    // Assert
    Assert.AreEqual(expected, actual);
}
What I have found so far
RichTextBox
RichTextBox can be used for certain things. Example:
RichTextBox richTextBox = new RichTextBox();
richTextBox.Text = "Européen";
string rtfFormattedString = richTextBox.Rtf;
But then rtfFormattedString turns out to be the entire RTF-formatted document, not just the string "Europ\'e9en".
Stack Overflow
- Insert string with special characters into RTF
- How to output unicode string to RTF (using C#)
- Output RTF special characters to Unicode
- Convert Special Characters for RTF (iPhone)
I've also found a bunch of other resources on the web, but nothing quite solved my problem.
Answer
I had to add Trim() to remove the preceding space in result. Other than that, Brad Christie's solution seems to work.
I'll run with this solution for now, even though I have a bad gut feeling, since we have to Substring and Trim the heck out of RichTextBox to get an RTF-formatted string.
Test case:
[TestMethod]
public void Test_To_Verify_Brad_Christies_Stackoverflow_Answer()
{
    Assert.AreEqual(@"Europ\'e9en", "Européen".ConvertToRtf());
    Assert.AreEqual(@"d\'e9finitif", "définitif".ConvertToRtf());
    Assert.AreEqual(@"\'e0", "à".ConvertToRtf());
    Assert.AreEqual(@"H\'e4user", "Häuser".ConvertToRtf());
    Assert.AreEqual(@"T\'fcren", "Türen".ConvertToRtf());
    Assert.AreEqual(@"B\'f6den", "Böden".ConvertToRtf());
}
Logic as an extension method:
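public static class StringExtensions
{
    public static string ConvertToRtf(this string value)
    {
        // Requires a reference to System.Windows.Forms.
        using (RichTextBox richTextBox = new RichTextBox())
        {
            richTextBox.Text = value;
            string rtf = richTextBox.Rtf;

            // The body text starts right after the "\f0\fs17" control words
            // (8 characters) and ends at the last "\par".
            int offset = rtf.IndexOf(@"\f0\fs17") + 8; // offset = 118;
            int len = rtf.LastIndexOf(@"\par") - offset;

            // Trim() drops the delimiter space that follows the control words.
            return rtf.Substring(offset, len).Trim();
        }
    }
}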
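For comparison, the same escaping can be sketched without RichTextBox at all. This is not Brad Christie's code or the code from any answer on this page; it is a minimal sketch that assumes the target document declares an ANSI code page compatible with Latin-1 for the range 0xA0 to 0xFF (for example \ansicpg1252). The names RtfEscaper and EscapeForRtf are made up for illustration.

```csharp
using System;
using System.Globalization;
using System.Text;

// Hypothetical helper, not part of any library.
public static class RtfEscaper
{
    public static string EscapeForRtf(string value)
    {
        if (value == null) throw new ArgumentNullException(nameof(value));

        var sb = new StringBuilder(value.Length);
        foreach (char c in value)
        {
            if (c == '\\' || c == '{' || c == '}')
            {
                // These three characters have syntactic meaning in RTF.
                sb.Append('\\').Append(c);
            }
            else if (c >= 0x20 && c <= 0x7E)
            {
                // Printable ASCII is written as-is.
                sb.Append(c);
            }
            else if (c >= 0xA0 && c <= 0xFF)
            {
                // Assumption: the document code page agrees with Latin-1 here,
                // so the code point can be written as a \'hh hex escape.
                sb.Append(@"\'").Append(((int)c).ToString("x2", CultureInfo.InvariantCulture));
            }
            else
            {
                // Everything else (including control characters) becomes \uN?,
                // where N is a signed 16-bit decimal and '?' is the fallback
                // shown by readers that ignore \u. Line breaks are NOT turned
                // into \par by this sketch.
                short signed = unchecked((short)c);
                sb.Append(@"\u").Append(signed.ToString(CultureInfo.InvariantCulture)).Append('?');
            }
        }
        return sb.ToString();
    }
}
```

Under those assumptions, RtfEscaper.EscapeForRtf("Européen") yields Europ\'e9en, and characters outside Latin-1, such as the Polish "ł", come out as \u322?. Unlike the RichTextBox approach, the result does not depend on the header the control happens to generate.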
Doesn't RichTextBox always have the same header/footer? You could just read the content based on the offset location and continue using it to parse. (I think? Please correct me if I'm wrong.)
There are libraries available, but I've never had good luck with them personally (though I always found another method before fully exhausting the possibilities). In addition, most of the better ones usually involve a nominal fee.
EDIT
Kind of a hack, but this should get you through what you need to get through (I hope):
EDIT 2
The breakdown of the RTF control codes is as follows:
- \f0 - Use the 0-index font (the first font in the list, which is typically Microsoft Sans Serif, as noted in the font table in the header: {\fonttbl{\f0\fnil\fcharset0 Microsoft Sans Serif;}})
- \fs17 - Font formatting; specifies that the size is 17 (in half-points)
- \par - Specifies the end of a paragraph.
Hopefully that clears some things up. ;-)
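Because \fs17 depends on the control's default font size, a hard-coded search for "\f0\fs17" can miss on another machine or font setting. A slightly more tolerant variant of the offset idea is sketched below. It is not the code from this answer; it assumes the body is introduced by \f0 followed by \fs and some digits, and that the last \par closes it. RtfBodyExtractor and ExtractBody are hypothetical names, and the sample document in the usage note is an assumed shape of RichTextBox output, not captured output.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper, not part of any library.
public static class RtfBodyExtractor
{
    // "\f0\fs<digits>" plus the single delimiter space that may follow it.
    // The font table entry "\f0\fnil" does not match because of the "\fs<digits>" part.
    private static readonly Regex BodyStart = new Regex(@"\\f0\\fs\d+ ?");

    public static string ExtractBody(string rtf)
    {
        if (rtf == null) throw new ArgumentNullException(nameof(rtf));

        Match start = BodyStart.Match(rtf);
        if (!start.Success)
            throw new FormatException(@"No \f0\fsN control words found.");

        int offset = start.Index + start.Length;

        // The last "\par" marks the end of the final paragraph.
        int end = rtf.LastIndexOf(@"\par", StringComparison.Ordinal);
        if (end < offset)
            throw new FormatException(@"No closing \par found after the body start.");

        return rtf.Substring(offset, end - offset);
    }
}
```

Given a document such as {\rtf1\ansi\ansicpg1252\deff0{\fonttbl{\f0\fnil\fcharset0 Microsoft Sans Serif;}}\viewkind4\uc1\pard\f0\fs17 Europ\'e9en\par}, ExtractBody returns Europ\'e9en. Since the regular expression consumes the delimiter space itself, no Trim() is needed and leading spaces that belong to the text are preserved.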
I found a nice solution that actually uses the RichTextBox itself to do the conversion:
http://www.baltimoreconsulting.com/blog/development/easily-convert-a-string-to-rtf-in-net/
This is how I went:
I think using a RichTextBox is:
1) overkill
2) I don't like RichTextBox after spending days trying to make it work with an RTF document created in Word.
Here's an improved version of @Vladislav Zalesak's answer:
Below is an ugly example of converting a string to an RTF string:
I know it has been a while, but I hope this helps.
This code is working for me after trying every conversion code I could get my hands on:
titleText and contentText are plain text entered in regular TextBox controls.
rtb.Rtf now holds the RTF text.
The following code will save the RTF text and allow you to open the file, edit it, and then load it back into a RichTextBox:
Not the most elegant, but quite an optimal and fast method:
Encode characters (Polish ones) method: