现场工作录音用于声学分析:立体声或单声道?适当的收益?
我在语音学领域工作,通常需要记录人类的语音进行声学分析。我有两个我找不到答案的问题:
如果我在立体声频道中录制,则需要稍后转换为单声道以进行注释。因此,原则上的单声道信号足够好。是否应该使用立体声声音的原因(例如信号会更好?)
,我们也被警告说,增益水平应保持较小,以便记录水平不应超过最大值,这会导致信号cuttoff。但是,当录制文件显示出太低幅度(虽然仍然很清楚)时,我也受到了批评,因为这导致了较低的SNR。人们如何选择适当的增益水平?
I work in the field of phonetics and often need to record human speech for acoustic analysis. I have two questions that I couldn't find answers:
If I record in stereo channels, I need to convert to mono later on to proceed with annotation. So in principle mono signal is good enough. Are there reasons that stereo sound should be used (e.g. the signal would be better?)
Also, we were warned that the gain level should be kept small so that the recording level shouldn't exceed the maximum, which leads to signal cuttoff. However, I was also criticised when the recording file shows too low an amplitude (it's still very clear though), for that leads to a low SNR. How do people choose an appropriate gain level?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
涉及录制的行为,声音设计论坛可能是您最好的选择。
我认为,通过具有立体声信号,就频率分析而言,可能获得的任何东西。立体声更多是在3D空间中找到声音源。声音的来源是否在不同方向上发出不同的频率轮廓?环境在到立体声输入的两个路径的过程中是否有所滤波声音?如果答案“不是显着”,那么单声道应该没问题。
选择适当的增益水平主要是了解您的设备。理想情况下,您的录制设置将提供显示信号强度的反馈(通常是某种视觉仪表)。 “最好的”是(理论上)最大的水平,不会扭曲。因此,您必须知道在录制链的所有元素上发生了什么水平失真。
鉴于记录段上最大的峰值可能是一个异常值,因此可能会有一些挑剔。
As the act of recording is involved, the Sound Design forum might be your best bet.
I can't think anything that might be gained, in terms of frequency analysis, by having a stereo signal. Stereo is more about locating the source of a sound in 3D space. Does the source of sound emit different frequency profiles in different directions? Does the environment filter the sound differently over the course of the two paths to the stereo inputs? If the the answer is "not significantly" then mono should be fine.
Choosing an appropriate gain level is mostly a matter of knowing your equipment. Ideally, your recording setup will provide feedback (usually a visual meter of some sort) that shows the signal strength. The "best" would be (theoretically) the loudest level that does not distort. So you have to know at what level distortion happens on all the elements of the recording chain.
There can be some fudging on this, given that the loudest peak on a recorded segment may be an outlier.