使用 asyncore 读取套接字缓冲区

发布于 2024-08-12 08:38:27 字数 1049 浏览 6 评论 0原文

我是 Python 新手（尽管我已经使用 Java 编程多年），并且我正在开发一个简单的基于套接字的网络应用程序（只是为了好玩）。我的想法是，我的代码连接到远程 TCP 端点，然后侦听从服务器推送到客户端的任何数据，并对此执行一些解析。

从服务器推送的数据-> client 是 UTF-8 编码的文本，每行由 CRLF (\x0D\x0A) 分隔。您可能猜到了：这个想法是客户端连接到服务器（直到被用户取消），然后读取并解析进入的行。

我已经设法让它工作，但是，我没有确保我以正确的方式做这件事。因此，我的实际问题（要遵循的代码）：

这是在 Python 中执行此操作的正确方法吗（即，它真的这么简单吗）？
有关缓冲区/asyncore 的任何提示/技巧/有用资源（除了参考文档）？

目前，数据的读取和缓冲如下：

def handle_read(self):
    self.ibuffer = b""

    while True:
        self.ibuffer += self.recv(self.buffer_size)
        if ByteUtils.ends_with_crlf(self.ibuffer):
            self.logger.debug("Got full line including CRLF")
            break
        else:
            self.logger.debug("Buffer not full yet (%s)", self.ibuffer)

    self.logger.debug("Filled up the buffer with line")
    print(str(self.ibuffer, encoding="UTF-8"))

ByteUtils.ends_with_crlf 函数只是检查缓冲区的最后两个字节是否有 \x0D\x0A。第一个问题是主要问题（答案基于此），但任何其他想法/提示都值得赞赏。谢谢。

原文

I'm new to Python (I have been programming in Java for multiple years now though), and I am working on a simple socket-based networking application (just for fun). The idea is that my code connects to a remote TCP end-point and then listens for any data being pushed from the server to the client, and perform some parsing on this.

The data being pushed from server -> client is UTF-8 encoded text, and each line is delimited by CRLF (\x0D\x0A). You probably guessed: the idea is that the client connects to the server (until cancelled by the user), and then reads and parses the lines as they come in.

I've managed to get this to work, however, I'm not sure that I'm doing this quite the right way. So hence my actual questions (code to follow):

Is this the right way to do it in Python (ie. is it really this simple)?
Any tips/tricks/useful resources (apart from the reference documentation) regarding buffers/asyncore?

Currently, the data is being read and buffered as follows:

def handle_read(self):
    self.ibuffer = b""

    while True:
        self.ibuffer += self.recv(self.buffer_size)
        if ByteUtils.ends_with_crlf(self.ibuffer):
            self.logger.debug("Got full line including CRLF")
            break
        else:
            self.logger.debug("Buffer not full yet (%s)", self.ibuffer)

    self.logger.debug("Filled up the buffer with line")
    print(str(self.ibuffer, encoding="UTF-8"))

The ByteUtils.ends_with_crlf function simply checks the last two bytes of the buffer for \x0D\x0A. The first question is the main one (answer is based on this), but any other ideas/tips are appreciated. Thanks.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

上课铃就是安魂曲 2024-08-19 08:38:27

TCP 是一个流，并且不能保证您的缓冲区不会包含一条消息的结尾和下一条消息的开头。
因此，在缓冲区末尾检查 \n\r 不会在所有情况下都按预期工作。您必须检查流中的每个字节。

而且，我强烈建议您使用 Twisted 而不是 asyncore。
像这样的东西（凭记忆，可能无法开箱即用）：

from twisted.internet import reactor, protocol
from twisted.protocols.basic import LineReceiver


class MyHandler(LineReceiver):

    def lineReceived(self, line):
        print "Got line:", line


f = protocol.ClientFactory()
f.protocol = MyHandler
reactor.connectTCP("127.0.0.1", 4711, f)
reactor.run()

TCP is a stream, and you are not guaranteed that your buffer will not contain the end of one message and the beginning of the next.
So, checking for \n\r at the end of the buffer will not work as expected in all situations. You have to check each byte in the stream.

And, I would strongly recommend that you use Twisted instead of asyncore.
Something like this (from memory, might not work out of the box):

from twisted.internet import reactor, protocol
from twisted.protocols.basic import LineReceiver


class MyHandler(LineReceiver):

    def lineReceived(self, line):
        print "Got line:", line


f = protocol.ClientFactory()
f.protocol = MyHandler
reactor.connectTCP("127.0.0.1", 4711, f)
reactor.run()

回复收藏 0 原文