PyAudio - 前几个录音块为零

发布于 2025-01-13 00:17:17 字数 2426 浏览 3 评论 0原文

我在尝试同步播放设备或从设备（在本例中为我的笔记本电脑扬声器和麦克风）录制音频时遇到了一些问题。

问题

我尝试使用Python模块来实现这个：“sounddevice”和“pyaudio”；但这两种实现都有一个奇怪的问题，即录制的音频的前几帧始终为零。还有其他人遇到过此类问题吗？这个问题似乎与所使用的块大小无关（即，其样本数量始终为零）。

我能做些什么来防止这种情况发生吗？

代码

import queue

import matplotlib.pyplot as plt
import numpy as np
import pyaudio
import soundfile as sf

FRAME_SIZE = 512
excitation, fs = sf.read("excitation.wav", dtype=np.float32)

# Instantiate PyAudio
p = pyaudio.PyAudio()
q = queue.Queue()

output_idx = 0
mic_buffer = np.zeros((excitation.shape[0] + FRAME_SIZE
                       - (excitation.shape[0] % FRAME_SIZE), 1))


def rec_play_callback(in_data, framelength, time_info, status):
    global output_idx

    # print status of playback in case of event
    if status:
        print(f"status: {status}")

    chunksize = min(excitation.shape[0] - output_idx, framelength)

    # write data to output buffer
    out_data = excitation[output_idx:output_idx + chunksize]
    # write input data to input buffer
    inputsamples = np.frombuffer(in_data, dtype=np.float32)

    if not np.sum(inputsamples):
        print("Empty frame detected")

    # send input data to buffer for main thread
    q.put(inputsamples)

    if chunksize < framelength:
        out_data[chunksize:] = 0
        return (out_data.tobytes(), pyaudio.paComplete)

    output_idx += chunksize
    return (out_data.tobytes(), pyaudio.paContinue)


# Define playback and record stream
stream = p.open(rate=fs,
                channels=1,
                frames_per_buffer=FRAME_SIZE,
                format=pyaudio.paFloat32,
                input=True,
                output=True,
                input_device_index=1,  # Macbook Pro microphone
                output_device_index=2,  # Macbook Pro speakers
                stream_callback=rec_play_callback)

stream.start_stream()

input_idx = 0
while stream.is_active():
    data = q.get(timeout=1)
    mic_buffer[input_idx:input_idx + FRAME_SIZE, 0] = data
    input_idx += FRAME_SIZE

stream.stop_stream()
stream.close()
p.terminate()

# Plot captured microphone signal
plt.plot(mic_buffer)
plt.show()

输出

检测到空帧

编辑：使用 CoreAudio 在 MacOS 上运行它。正如 @2e0byo 所指出的，这可能是相关的。

原文

I've been having some issues when trying to synchronously playback and record audio to/from a device, in this case, my laptop speakers and microphone.

The problem

I've tried to implement this using the Python modules: "sounddevice" and "pyaudio"; but both implementations have this weird issue where the first few frames of recorded audio are always zero. Has anyone else experienced this type of issue? This issue seems to be independent of the chunksize that is used (i.e., its always the same amount of samples being zero).

Is there anything I can do to prevent this from happening?

Code

import queue

import matplotlib.pyplot as plt
import numpy as np
import pyaudio
import soundfile as sf

FRAME_SIZE = 512
excitation, fs = sf.read("excitation.wav", dtype=np.float32)

# Instantiate PyAudio
p = pyaudio.PyAudio()
q = queue.Queue()

output_idx = 0
mic_buffer = np.zeros((excitation.shape[0] + FRAME_SIZE
                       - (excitation.shape[0] % FRAME_SIZE), 1))


def rec_play_callback(in_data, framelength, time_info, status):
    global output_idx

    # print status of playback in case of event
    if status:
        print(f"status: {status}")

    chunksize = min(excitation.shape[0] - output_idx, framelength)

    # write data to output buffer
    out_data = excitation[output_idx:output_idx + chunksize]
    # write input data to input buffer
    inputsamples = np.frombuffer(in_data, dtype=np.float32)

    if not np.sum(inputsamples):
        print("Empty frame detected")

    # send input data to buffer for main thread
    q.put(inputsamples)

    if chunksize < framelength:
        out_data[chunksize:] = 0
        return (out_data.tobytes(), pyaudio.paComplete)

    output_idx += chunksize
    return (out_data.tobytes(), pyaudio.paContinue)


# Define playback and record stream
stream = p.open(rate=fs,
                channels=1,
                frames_per_buffer=FRAME_SIZE,
                format=pyaudio.paFloat32,
                input=True,
                output=True,
                input_device_index=1,  # Macbook Pro microphone
                output_device_index=2,  # Macbook Pro speakers
                stream_callback=rec_play_callback)

stream.start_stream()

input_idx = 0
while stream.is_active():
    data = q.get(timeout=1)
    mic_buffer[input_idx:input_idx + FRAME_SIZE, 0] = data
    input_idx += FRAME_SIZE

stream.stop_stream()
stream.close()
p.terminate()

# Plot captured microphone signal
plt.plot(mic_buffer)
plt.show()

Output

Empty frame detected

Edit: running this on MacOS using CoreAudio. This might be relevant, as pointed out by @2e0byo.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

撩发小公举 2025-01-20 00:17:17

这是一个普遍问题，我们缺少对您的架构的完整了解。所以我们能做的就是指出一些一般概念。

在数字信号处理系统中，处理后的信号中经常存在前导空白和恒定延迟。这通常与缓冲区的大小和采样率有关。在某些系统中，您甚至可能不知道缓冲区的存在，例如作为用户级 API 无法访问的设备驱动程序的一部分。

要减少缓冲造成的偏移，您必须减小缓冲区或加快采样速度。然后，您的系统必须更频繁地处理较小的数据包，并且数据包大小或采样时钟的变化可能会影响您的信号处理，具体取决于您的信号内容和您正在进行的信号处理类型。因此，进行这些更改中的任何一个都会增加系统处理的每个数据的开销，并且还可能以其他方式影响性能。

我用于调试此类问题的一种方法是尝试找到设置偏移量的缓冲区，如果需要，可以跟踪源代码，然后查看是否可以调整大小或采样率并仍然实现所需的性能在吞吐量和准确性方面。

回复收藏 0 原文

燕归巢 2025-01-20 00:17:17

为了未来的自己。

找到了解决方法。在 stream.start_stream() 之后，添加

while not any(stream.read(1)):
    pass

If you value the one non-zero chunk that终止循环，存储它：

while not any(v := stream.read(1)):
    pass

之后，v 是 的一个实例bytes，长度取决于流格式。由于问题中它是 32 位浮点数，因此 len(v) 为 4。

For my future self.

Got a workaround. After stream.start_stream(), add

while not any(stream.read(1)):
    pass

If you value the one non-zero chunk that terminates the loop, store it:

while not any(v := stream.read(1)):
    pass

After that, v is an instance of bytes, the length depends on the stream format. As it's a 32-bit float in the question, len(v) is 4.

回复收藏 0 原文

~没有更多了~

关于作者

马蹄踏│碎落叶

暂无简介

文章

27 人气

关注发私信

友情链接

文江博客

PyAudio - 前几个录音块为零

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

李珊平

Quxin

范无咎

github_ZOJ2N8YxBm

若言

南…巷孤猫

友情链接

PyAudio - 前几个录音块为零

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

李珊平

Quxin

范无咎

github_ZOJ2N8YxBm

若言

南…巷孤猫

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。