Live video stream on the server (PC) from images sent by the robot over UDP
Hmm. I found this which seems promising:
http://sourceforge.net/projects/mjpg-streamer/
OK, I will try to explain clearly and in detail what I am trying to do.
I have a small humanoid robot with a camera and a wifi stick (this is the robot). The wifi stick's average transfer rate is 1769 KB/s. The robot has a 500 MHz CPU and 256 MB RAM, so it is not enough for any serious computation (moreover, there are already a couple of modules running on the robot for motion, vision, sonar, speech, etc.).
I have a PC from which I control the robot. I am trying to have the robot walk around the room while I watch, on the PC, a live video stream of what the robot sees.
What I already have working: the robot walks as I want it to and takes images with the camera. The images are sent over UDP to the PC, where I receive them (I have verified this by saving the incoming images to disk).
The camera returns 640 x 480 px images in the YUV422 colorspace. I am sending the images with lossy compression (JPEG) because I am trying to get the best possible FPS on the PC. I am doing the JPEG compression on the robot with the PIL library.
My questions:
Could somebody please give me some ideas on how to turn the incoming JPEG images into a live video stream? I understand that I will need a video encoder for that. Which video encoder do you recommend? FFMPEG or something else? I am very new to video streaming, so I want to know what is best for this task. I'd prefer to write this in Python, so a video encoder or library with a Python API would be ideal. But I guess if the library has a good command-line API, it doesn't have to be in Python.
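For reference, the direction I am imagining: since the frames are already JPEGs, the incoming data is essentially an MJPEG stream, so for plain viewing I may not need a real encoder at all. If a properly encoded stream is needed, one idea would be to pipe the JPEGs into an ffmpeg subprocess, roughly like this (a sketch only; ffmpeg must be installed, and the options are illustrative, not tuned):

import subprocess

# Feed a concatenated stream of JPEGs to ffmpeg and let it encode them.
ffmpeg = subprocess.Popen(
    ["ffmpeg",
     "-f", "image2pipe",    # input is a pipe of individual images
     "-vcodec", "mjpeg",    # each image is a JPEG
     "-i", "-",             # read them from stdin
     "-vcodec", "libx264",  # re-encode, e.g. to H.264
     "-f", "mpegts",        # streamable container
     "out.ts"],
    stdin=subprocess.PIPE)

# inside the receive loop, write each complete JPEG datagram:
# ffmpeg.stdin.write(data)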
What is the best FPS I could get out of this, given the 1769 KB/s average wifi transfer rate and the dimensions of the images? Should I use a different compression than JPEG?
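As a sanity check on the bandwidth side, here is my back-of-envelope estimate (the per-frame size is an assumption, not a measurement):

# Rough FPS ceiling from bandwidth alone (assumed frame size).
bandwidth_kb_s = 1769           # average wifi transfer rate
jpeg_frame_kb = 25              # assumed 640x480 JPEG at medium quality
print bandwidth_kb_s / float(jpeg_frame_kb)   # ~70 fps ceiling

So if that assumption is anywhere close, bandwidth alone is probably not the bottleneck; the JPEG compression on the 500 MHz CPU more likely is.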
I will be happy to see any code examples. Links to articles explaining how to do this would be fine, too.
Some code samples. Here is how I am sending the JPEG images from the robot to the PC (shortened, simplified snippet). This runs on the robot:
# lots of code here
from socket import socket, AF_INET, SOCK_DGRAM
import StringIO
from PIL import Image

UDPSock = socket(AF_INET, SOCK_DGRAM)
while 1:
    image = camProxy.getImageLocal(nameId)
    size = (image[0], image[1])         # width, height
    data = image[6]                     # raw YUV pixel data
    im = Image.fromstring("YCbCr", size, data)
    s = StringIO.StringIO()             # in-memory buffer for the JPEG
    im.save(s, "JPEG")
    UDPSock.sendto(s.getvalue(), addr)  # one frame per datagram
    camProxy.releaseImage(nameId)
UDPSock.close()
# lots of code here
Here is how I am receiving the images. This runs on the PC:
# lots of code here
from socket import socket, AF_INET, SOCK_DGRAM

UDPSock = socket(AF_INET, SOCK_DGRAM)
UDPSock.bind(addr)
while 1:
    data, addr = UDPSock.recvfrom(buf)  # buf must exceed the largest frame
    # here I need to create a stream from the data
    # which contains a JPEG image
UDPSock.close()
# lots of code here
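To turn each received datagram back into an image, I believe something like this works with PIL, assuming each datagram carries exactly one complete JPEG (so each frame must fit within a single UDP datagram):

import StringIO
from PIL import Image

def decode_frame(data):
    s = StringIO.StringIO(data)  # wrap the received bytes in a file-like stream
    im = Image.open(s)           # PIL reads the JPEG from the stream
    im.load()                    # force a full decode while the buffer exists
    return im

# e.g. in the loop above:
# frame = decode_frame(data)
# frame.save("latest.jpg")       # or hand it to a display widget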
2 Answers
Checking out your first question: the solution here uses a non-streaming set of pictures, but it might still help. The example uses pyMedia.
Something along the lines of what you want.
If you need to edit a binary stream:
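(In Python 2, the standard way to treat raw bytes as a seekable, file-like binary stream is StringIO or cStringIO; a minimal sketch, where data is a received bytestring:)

import StringIO

buf = StringIO.StringIO(data)   # wrap the bytes in a file-like object
magic = buf.read(2)             # a JPEG starts with '\xff\xd8'
buf.seek(0)                     # rewind before passing the stream on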
Try pyffmpeg and test each available codec for the best performance. You probably need a very lightweight codec like Smoke or low-profile H.263 or x264, and you will probably need to drop the resolution to 320x240.
There is a trade-off between the latency of video encoding/decoding and the bandwidth used: you might find that dropping down to 160x120 with raw packets works for quick scene analysis, transmitting a full frame only periodically. You could also mix a raw, low-latency, low-resolution, high-update feed with a highly compressed, high-latency, high-resolution, low-update feed, as sketched below.
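To illustrate the mixed-feed idea, here is a hypothetical variant of the question's sender loop (it reuses camProxy, nameId, UDPSock and addr from that snippet); the interval, sizes and quality values are assumptions to tune, not recommendations:

FULL_FRAME_EVERY = 30  # assumed interval between full-resolution frames

frame_no = 0
while 1:
    image = camProxy.getImageLocal(nameId)
    im = Image.fromstring("YCbCr", (image[0], image[1]), image[6])
    if frame_no % FULL_FRAME_EVERY == 0:
        out, quality = im, 75                      # occasional full 640x480 frame
    else:
        out, quality = im.resize((160, 120)), 40   # cheap low-resolution frame
    s = StringIO.StringIO()
    out.save(s, "JPEG", quality=quality)
    UDPSock.sendto(s.getvalue(), addr)
    camProxy.releaseImage(nameId)
    frame_no += 1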