Convert EBCDIC Char to Hex Value (AFP EBCDIC Data)

Posted 07-16 20:07

I'm working with some EBCDIC data that I need to parse to find some hex values. The problem is that I appear to be reading the file with the wrong encoding. I can see that my record begins with "!" (which is x5A in EBCDIC), but when I convert it to hex it comes back as x21, which is the ASCII value for "!".

I was hoping that there was a built-in method in the framework, but I'm afraid that I'm going to have to create a custom class to correctly map the EBCDIC character set.

Using fileInStream As New FileStream(inputFile, FileMode.Open, FileAccess.Read)
   Using bufferedInStream As New BufferedStream(fileInStream)
      Using reader As New StreamReader(bufferedInStream, Encoding.GetEncoding(37))
         While Not reader.EndOfStream
            Do While reader.Peek() >= 0
               Dim charArray(52) As Char
               reader.Read(charArray, 0, charArray.Length)

               For Each letter As Char In charArray
                  Dim value As Integer = Convert.ToInt16(letter)

                  Dim hexOut As String = [String].Format("{0:x}", value)
                  Debug.WriteLine(hexOut)
               Next
            Loop
         End While
      End Using
   End Using
End Using

Thanks!

倾听心声的旋律 2024-07-23 20:07:14

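You can do it like this:

Open the AFP file. Read the first 9 bytes.
Byte 0 should be 0xD3 or 0x5A. Bytes 1 and 2 hold the length of the SFI, which counts 8 of the 9 bytes you just read. The value is big-endian, so length = byte1 * 256 + byte2.
Bytes 3, 4 and 5 are the structured field identifier. If you are looking for printable text, look for PTX (Presentation Text Element), 0xD3 0xEE 0x9B. If the field is not a PTX, skip forward length - 8 bytes and read the next 9 bytes.
If you do find a PTX, read length - 8 bytes. Parsing the control sequences to get at the text is a little tricky. The first one starts with 0x2B 0xD3, followed by one byte for the length and another byte that says what kind of control sequence it is. If that type byte is odd, the next control sequence omits the 0x2B 0xD3 header and starts directly with its length byte. This is called "chaining" and was apparently introduced to drive programmers mad parsing this stuff.
From the length byte, skip forward length - 1 bytes and press on, or just look for the next 0x2B 0xD3. The last control sequence will not be chained, and everything after it up to the end of the PTX will be EBCDIC. Use Jon Skeet's library (thanks Jon) and look for the next PTX element.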
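
The field-walking part of these steps can be sketched in VB.NET as follows. This is a minimal sketch under the layout described in this answer, not a full AFP parser: `StructuredField`, `ReadFields` and `IsPtx` are names made up for illustration, and the sample bytes in `Main` are fabricated to exercise the code. It reads raw bytes with `BinaryReader` instead of a `StreamReader`, because a `StreamReader` built with code page 37 decodes every EBCDIC byte into a Unicode character, so 0x5A comes back as "!" (U+0021) and prints as x21.

```vbnet
Imports System
Imports System.Collections.Generic
Imports System.IO

Module AfpFieldWalker

    ' Hypothetical container: the 3-byte identifier and the field's data bytes.
    Public Class StructuredField
        Public Property Id As Byte()
        Public Property Data As Byte()
    End Class

    ' Reads 9 header bytes, then (length - 8) data bytes, until the stream ends.
    Public Function ReadFields(source As Stream) As List(Of StructuredField)
        Dim fields As New List(Of StructuredField)()
        Dim reader As New BinaryReader(source)

        Do
            Dim header As Byte() = reader.ReadBytes(9)
            If header.Length = 0 Then Exit Do
            If header.Length < 9 Then Throw New EndOfStreamException("Truncated field header.")

            ' Per the answer, byte 0 should be 0x5A or 0xD3.
            If header(0) <> &H5A AndAlso header(0) <> &HD3 Then
                Throw New InvalidDataException("Unexpected first byte.")
            End If

            ' Bytes 1-2: big-endian length that counts 8 of the 9 header bytes.
            Dim length As Integer = CInt(header(1)) * 256 + CInt(header(2))
            If length < 8 Then Throw New InvalidDataException("Field length below 8.")

            Dim data As Byte() = reader.ReadBytes(length - 8)
            If data.Length < length - 8 Then Throw New EndOfStreamException("Truncated field data.")

            fields.Add(New StructuredField With {
                .Id = New Byte() {header(3), header(4), header(5)},
                .Data = data})
        Loop

        Return fields
    End Function

    ' True when bytes 3-5 of the header were 0xD3 0xEE 0x9B (PTX).
    Public Function IsPtx(field As StructuredField) As Boolean
        Return field.Id(0) = &HD3 AndAlso field.Id(1) = &HEE AndAlso field.Id(2) = &H9B
    End Function

    Sub Main()
        ' Fabricated sample: one PTX field with length 10, so 2 data bytes.
        Dim sample As Byte() = {&H5A, &H0, &HA, &HD3, &HEE, &H9B, &H0, &H0, &H0, &HC1, &HC2}

        Using ms As New MemoryStream(sample)
            For Each f As StructuredField In ReadFields(ms)
                ' BitConverter.ToString shows the untouched byte values in hex.
                Console.WriteLine("{0} PTX={1} {2}",
                                  BitConverter.ToString(f.Id), IsPtx(f), BitConverter.ToString(f.Data))
            Next
        End Using
    End Sub

End Module
```

To run it against a real file, pass `File.OpenRead(inputFile)` to `ReadFields` in place of the `MemoryStream`.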

Sorry for rambling. It is doable, but not simple.
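
For the control-sequence step, a rough VB.NET sketch might look like the one below. It follows only the description in this answer and has not been checked against the PTOCA specification. It assumes the length byte counts itself, the type byte and any parameters, and that a PTX without a leading 0x2B 0xD3 has no chain to skip. `SkipControlSequences` and `DecodeTrailingText` are hypothetical names, the type bytes in the sample are made up, and the built-in code page 37 encoding stands in for the library the answer mentions.

```vbnet
Imports System
Imports System.Text

Module PtxTextSketch

    ' Returns the offset just past the control-sequence chain at the start of ptxData.
    Public Function SkipControlSequences(ptxData As Byte()) As Integer
        ' Assumption: without the 0x2B 0xD3 prefix there is nothing to skip.
        If ptxData.Length < 2 OrElse ptxData(0) <> &H2B OrElse ptxData(1) <> &HD3 Then
            Return 0
        End If

        Dim pos As Integer = 2   ' index of the first length byte
        Do
            If pos + 1 >= ptxData.Length Then Throw New FormatException("Truncated control sequence.")

            Dim length As Integer = ptxData(pos)
            Dim functionType As Integer = ptxData(pos + 1)
            If length < 2 Then Throw New FormatException("Control sequence length below 2.")

            ' Skipping length - 1 bytes past the length byte lands on pos + length.
            pos += length
            If pos > ptxData.Length Then Throw New FormatException("Control sequence overruns the PTX.")

            ' Even type byte: not chained, so the chain ends here.
            If functionType Mod 2 = 0 Then Exit Do
            ' Odd type byte: the next sequence starts directly at its length byte.
        Loop

        Return pos
    End Function

    ' Decodes whatever follows the chain as EBCDIC code page 37.
    Public Function DecodeTrailingText(ptxData As Byte()) As String
        Dim start As Integer = SkipControlSequences(ptxData)
        ' On .NET Core / .NET 5+, code page 37 is only available after
        ' Encoding.RegisterProvider(CodePagesEncodingProvider.Instance).
        Return Encoding.GetEncoding(37).GetString(ptxData, start, ptxData.Length - start)
    End Function

    Sub Main()
        ' Fabricated PTX data: a chained sequence (odd type), an unchained one (even type), then two text bytes.
        Dim sample As Byte() = {&H2B, &HD3, &H3, &HC7, &H0, &H2, &HC6, &HC8, &HC9}
        Console.WriteLine(SkipControlSequences(sample))   ' offset of the first text byte
        Console.WriteLine(DecodeTrailingText(sample))
    End Sub

End Module
```

Real PTX data can contain further 0x2B 0xD3 sequences after a run of text, which is why the answer also suggests scanning for the next 0x2B 0xD3. This sketch stops after the first chain.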
