在c/cocoa中读取并输出UTF-8字符串

发布于 2024-08-18 10:01:01 字数 570 浏览 4 评论 0原文

在 Objective-C/cocoa 应用程序中，我使用 c 函数打开一个文本文件，逐行读取它并在第三方函数中使用一些行。在伪代码中：

char *line = fgets(aFile);
library_function(line);  // This function calls for a utf-8 encoded char * string

这可以正常工作，直到输入文件包含特殊字符（例如重音符号或 UTF-8 BOM），此时库函数会输出损坏的字符。

但是，如果我这样做：

char *line = fgets(aFile);
NSString *stringObj = [NSString stringWithUTF8String:line];
library_function([stringObj UTF8String]);

那么一切都会正常工作并且字符串会正确输出。

那条 [NSString... 行在做什么，而我却没有这样做？我最初获取该行的方式是否有问题？或者完全是另外一回事？

原文

In an objective-c/cocoa app, I am using c functions to open a text file, read it line-by-line and use some lines in a third-party function. In psuedo-code:

char *line = fgets(aFile);
library_function(line);  // This function calls for a utf-8 encoded char * string

This works fine until the input file contains special characters (such as accents or the UTF-8 BOM) whereupon the library function outputs mangled characters.

However, if I do this:

char *line = fgets(aFile);
NSString *stringObj = [NSString stringWithUTF8String:line];
library_function([stringObj UTF8String]);

Then it all works fine and the string is outputted correctly.

What is that [NSString... line doing that I'm not?
Am I doing something wrong with how the line is fetched initially? Or is it something else entirely?

分享到QQ

分享到微博