将音频转换为文本

发布于 2024-09-28 02:39:38 字数 91 浏览 9 评论 0原文

我只是想知道 Java 或 C# 中是否有任何内置库或外部库允许我获取音频文件并解析它并从中提取文本。

我需要提出申请才能这样做，但我不知道从哪里开始。

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

失与倦＂ 2024-10-05 02:39:38

以下是一些选项：

回复收藏 0 原文

我们的影子 2024-10-05 02:39:38

这是使用 C# 和 System.Speech 的完整示例

代码可分为 2 个主要部分：

配置 SpeechRecognitionEngine 对象（及其所需元素）
处理 SpeechRecognized 和 SpeechHypothesized 事件。

第 1 步：配置 SpeechRecognitionEngine

_speechRecognitionEngine = new SpeechRecognitionEngine();
_speechRecognitionEngine.SetInputToDefaultAudioDevice();
_dictationGrammar = new DictationGrammar();
_speechRecognitionEngine.LoadGrammar(_dictationGrammar);
_speechRecognitionEngine.RecognizeAsync(RecognizeMode.Multiple);

此时，您的对象已准备好开始从麦克风转录音频。不过，您需要处理一些事件，才能真正访问结果。

第 2 步：处理 SpeechRecognitionEngine 事件

_speechRecognitionEngine.SpeechRecognized -= new EventHandler(SpeechRecognized);
_speechRecognitionEngine.SpeechHypothesized -= new EventHandler(SpeechHypothesizing);
_speechRecognitionEngine.SpeechRecognized += new EventHandler(SpeechRecognized);
_speechRecognitionEngine.SpeechHypothesized += new EventHandler(SpeechHypothesizing);
private void SpeechHypothesizing（对象发送者，
SpeechHypothesizedEventArgs e) {
///引擎实时结果
字符串 realTimeResults = e.Result.Text; }
private void SpeechRecognized（对象发送者，SpeechRecognizedEventArgs
e) {
///来自引擎的最终答案字符串finalAnswer =
e.结果.文本； }

就是这样。如果您想使用预先录制的 .wav 文件而不是麦克风，您可以使用

_speechRecognitionEngine.SetInputToWaveFile(pathToTargetWavFile);

而不是

_speechRecognitionEngine.SetInputToDefaultAudioDevice();

这些课程中有很多不同的选项，值得更详细地探索。

http://ellismis.com/2012/03/17/converting-or-transcribing-audio-to-text-using-c-and-net-system-speech/

Here is a complete example using C# and System.Speech

The code can be divided into 2 main parts:

configuring the SpeechRecognitionEngine object (and its required elements)
handling the SpeechRecognized and SpeechHypothesized events.

Step 1: Configuring the SpeechRecognitionEngine

_speechRecognitionEngine = new SpeechRecognitionEngine();
_speechRecognitionEngine.SetInputToDefaultAudioDevice();
_dictationGrammar = new DictationGrammar();
_speechRecognitionEngine.LoadGrammar(_dictationGrammar);
_speechRecognitionEngine.RecognizeAsync(RecognizeMode.Multiple);

At this point your object is ready to start transcribing audio from the microphone. You need to handle some events though, in order to actually get access to the results.

Step 2: Handling the SpeechRecognitionEngine Events

_speechRecognitionEngine.SpeechRecognized -= new EventHandler(SpeechRecognized);
_speechRecognitionEngine.SpeechHypothesized -= new EventHandler(SpeechHypothesizing);
_speechRecognitionEngine.SpeechRecognized += new EventHandler(SpeechRecognized);
_speechRecognitionEngine.SpeechHypothesized += new EventHandler(SpeechHypothesizing);
private void SpeechHypothesizing(object sender,
SpeechHypothesizedEventArgs e) {
///real-time results from the engine
string realTimeResults = e.Result.Text; }
private void SpeechRecognized(object sender, SpeechRecognizedEventArgs
e) {
///final answer from the engine string finalAnswer =
e.Result.Text; }

That’s it. If you want to use a pre-recorded .wav file instead of a microphone, you would use