HTML5 video: how to record the PTS code or frame number of a paused frame so a subsequent viewer can jump to that precise frame?
I have read this post, whose answer lists four commercial players that implement frame-accurate seeking, and I will investigate them. However, we are trying to build a tool for labeling objects in a video, and we need to know precisely which frame a user has labeled so that a subsequent watcher can validate the label.
We ask users to watch until they see our desired object, then pause the video and navigate to the "Best Frame". We give them buttons that step back and forward by (1 second / average FPS) to move one frame at a time (unless our video has dropped frames, in which case they must press the button once for each dropped frame).
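For reference, the stepping described above can be sketched as plain arithmetic on the playback time. This is only a sketch under the assumption of a constant frame rate that we know from our own encode; `timeToFrame`, `frameToTime` and `stepTarget` are hypothetical helper names, and aiming at the middle of a frame's interval (rather than its start) is a guess at what makes seeks less sensitive to rounding.

```javascript
// Index of the frame displayed at `time` seconds, assuming a constant frame rate.
// The small epsilon guards against results like 37.9999999 at an exact boundary.
function timeToFrame(time, fps) {
  return Math.floor(time * fps + 1e-6);
}

// Seek target for a frame: the middle of its display interval, so that small
// rounding errors in the seek still land inside the intended frame.
function frameToTime(frame, fps) {
  return (frame + 0.5) / fps;
}

// Computes where to seek when the user steps `delta` frames (+1 or -1),
// clamped to the first and last frame of a video lasting `duration` seconds.
function stepTarget(currentTime, delta, fps, duration) {
  const lastFrame = Math.max(0, Math.ceil(duration * fps - 1e-6) - 1);
  const wanted = timeToFrame(currentTime, fps) + delta;
  const frame = Math.min(lastFrame, Math.max(0, wanted));
  return { frame, time: frameToTime(frame, fps) };
}

// In the page this would be wired to the buttons roughly like:
//   video.currentTime = stepTarget(video.currentTime, +1, 30, video.duration).time;
```

This does not handle dropped frames by itself; it only makes each button press move by exactly one nominal frame slot.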
[Bonus question: can we tell whether a step gave us a new frame or just the same frame with an updated time-in-video? If we got the same frame, we could loop until the first new frame so the user would not have to keep pressing the key!]
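One direction we could try for the bonus question is `HTMLVideoElement.requestVideoFrameCallback`, whose callback receives a metadata object with a `mediaTime` field (the presentation timestamp of the frame actually shown). The sketch below compares consecutive `mediaTime` values; `createFrameChangeDetector` and `watchFrames` are hypothetical names, and whether a callback fires at all when a seek lands on the same frame is something we would have to check per browser.

```javascript
// Returns a function that reports whether a presented frame differs from the
// previously reported one, judged by its presentation timestamp.
function createFrameChangeDetector() {
  let lastMediaTime = null;
  return function isNewFrame(metadata) {
    const changed = metadata.mediaTime !== lastMediaTime;
    lastMediaTime = metadata.mediaTime;
    return changed;
  };
}

// Re-registers the callback after every presented frame and tells the caller
// the frame's mediaTime and whether it is a different frame than the last one.
function watchFrames(video, onFrame) {
  const isNewFrame = createFrameChangeDetector();
  const tick = (now, metadata) => {
    onFrame(metadata.mediaTime, isNewFrame(metadata));
    video.requestVideoFrameCallback(tick);
  };
  video.requestVideoFrameCallback(tick);
}
```

With this in place, the step button could keep nudging `currentTime` forward while the most recently reported `mediaTime` stays unchanged, which would skip over dropped frames automatically.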
When they arrive at the best frame, we can record the time of that frame, but we are finding that the time often comes back with different milliseconds each time someone tags the exact same frame. So we cannot reliably have a subsequent user seek to the exact same frame to validate the first watcher's label. We use Canvas to let the first user draw a bounding box around our object of interest, but because our objects move quickly, we cannot reliably redraw the bounding box if the video is on the wrong frame.
Ideally we would record a sequential frame number; if that is not possible, can we record the PTS code instead? Is there any other way to identify the precise frame every time? We are building this for a controlled audience that we can train, so creative solutions are acceptable.
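To make the PTS idea concrete, here is a rough sketch of what we imagine: store the `mediaTime` reported by `requestVideoFrameCallback` with the label, and when validating, seek half a frame past it and check which frame was actually presented. The helper names `isSameFrame`, `seekTimeForPts` and `verifyLabel` are made up for illustration, `fps` is assumed to be the nominal frame rate of our encode, and we have not confirmed that every browser fires the callback after a seek while paused.

```javascript
// Two timestamps refer to the same frame when they differ by less than half
// of the nominal frame duration.
function isSameFrame(recordedPts, presentedPts, fps) {
  return Math.abs(recordedPts - presentedPts) < 0.5 / fps;
}

// Aim half a nominal frame past the recorded PTS so that rounding in the seek
// does not land on the previous frame.
function seekTimeForPts(pts, fps) {
  return pts + 0.5 / fps;
}

// Seeks to a stored label ({ pts, box }) and reports whether the frame that
// was presented is the one the first watcher tagged.
function verifyLabel(video, label, fps, onResult) {
  video.requestVideoFrameCallback((now, metadata) => {
    onResult(isSameFrame(label.pts, metadata.mediaTime, fps), metadata.mediaTime);
  });
  video.currentTime = seekTimeForPts(label.pts, fps);
}
```

If the check fails, the tool could step one frame in the appropriate direction and check again before drawing the bounding box.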
We are open to a "cheat" because we control every video in our workflow. If we can write "something" into the file that can be read through the HTML5 video element, we could seek to the approximate time minus 0.5 seconds and then read that "something" from the subsequent frames until it tells us we have an exact match.
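As an example of the kind of "something" we have in mind: a frame counter burned into the pixels as a row of black and white squares, which the page reads back by drawing the video onto a canvas. This is only a sketch; `BITS`, `BLOCK`, `encodeFrameStamp` and `decodeFrameStamp` are hypothetical, the encoder function below merely stands in for whatever would do the burn-in at encode time, and the block size would need to be large enough to survive compression.

```javascript
const BITS = 16;  // assumed width of the frame counter in bits
const BLOCK = 8;  // assumed size of each bit square in pixels

// Paints the counter into the top-left corner of an RGBA buffer, most
// significant bit first (white = 1, black = 0).
function encodeFrameStamp(rgba, width, frame) {
  for (let bit = 0; bit < BITS; bit++) {
    const value = (frame >> (BITS - 1 - bit)) & 1 ? 255 : 0;
    for (let y = 0; y < BLOCK; y++) {
      for (let x = 0; x < BLOCK; x++) {
        const i = (y * width + bit * BLOCK + x) * 4;
        rgba[i] = rgba[i + 1] = rgba[i + 2] = value;
        rgba[i + 3] = 255;
      }
    }
  }
}

// Reads the counter back by sampling the center pixel of each square and
// thresholding its brightness, which tolerates some compression noise.
function decodeFrameStamp(rgba, width) {
  let frame = 0;
  const cy = BLOCK >> 1;
  for (let bit = 0; bit < BITS; bit++) {
    const cx = bit * BLOCK + (BLOCK >> 1);
    const i = (cy * width + cx) * 4;
    const luma = (rgba[i] + rgba[i + 1] + rgba[i + 2]) / 3;
    frame = (frame << 1) | (luma > 127 ? 1 : 0);
  }
  return frame;
}

// In the page, roughly:
//   ctx.drawImage(video, 0, 0);
//   const img = ctx.getImageData(0, 0, BITS * BLOCK, BLOCK);
//   const frameNumber = decodeFrameStamp(img.data, img.width);
```

Note that `getImageData` only works if the video is same-origin or served with CORS headers, otherwise the canvas is tainted.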