SpeechRecognition - Web APIs 编辑
Experimental
This is an experimental technology
Check the Browser compatibility table carefully before using this in production.
The SpeechRecognition
interface of the Web Speech API is the controller interface for the recognition service; this also handles the SpeechRecognitionEvent
sent from the recognition service.
Note: On some browsers, like Chrome, using Speech Recognition on a web page involves a server-based recognition engine. Your audio is sent to a web service for recognition processing, so it won't work offline.
Constructor
SpeechRecognition.SpeechRecognition()
- Creates a new
SpeechRecognition
object.
Properties
SpeechRecognition
also inherits properties from its parent interface, EventTarget
.
SpeechRecognition.grammars
- Returns and sets a collection of
SpeechGrammar
objects that represent the grammars that will be understood by the currentSpeechRecognition
. SpeechRecognition.lang
- Returns and sets the language of the current
SpeechRecognition
. If not specified, this defaults to the HTMLlang
attribute value, or the user agent's language setting if that isn't set either. SpeechRecognition.continuous
- Controls whether continuous results are returned for each recognition, or only a single result. Defaults to single (
false
.) SpeechRecognition.interimResults
- Controls whether interim results should be returned (
true
) or not (false
.) Interim results are results that are not yet final (e.g. theSpeechRecognitionResult.isFinal
property isfalse
.) SpeechRecognition.maxAlternatives
- Sets the maximum number of
SpeechRecognitionAlternative
s provided per result. The default value is 1. SpeechRecognition.serviceURI
- Specifies the location of the speech recognition service used by the current
SpeechRecognition
to handle the actual recognition. The default is the user agent's default speech service.
Methods
SpeechRecognition
also inherits methods from its parent interface, EventTarget
.
SpeechRecognition.abort()
- Stops the speech recognition service from listening to incoming audio, and doesn't attempt to return a
SpeechRecognitionResult
. SpeechRecognition.start()
- Starts the speech recognition service listening to incoming audio with intent to recognize grammars associated with the current
SpeechRecognition
. SpeechRecognition.stop()
- Stops the speech recognition service from listening to incoming audio, and attempts to return a
SpeechRecognitionResult
using the audio captured so far.
Events
Listen to these events using addEventListener()
or by assigning an event listener to the oneventname
property of this interface.
audiostart
- Fired when the user agent has started to capture audio.
Also available via theonaudiostart
property. audioend
- Fired when the user agent has finished capturing audio.
Also available via theonaudioend
property. end
- Fired when the speech recognition service has disconnected.
Also available via theonend
property. error
- Fired when a speech recognition error occurs.
Also available via theonerror
property. nomatch
- Fired when the speech recognition service returns a final result with no significant recognition. This may involve some degree of recognition, which doesn't meet or exceed the
confidence
threshold.
Also available via theonnomatch
property. result
- Fired when the speech recognition service returns a result — a word or phrase has been positively recognized and this has been communicated back to the app.
Also available via theonresult
property. soundstart
- Fired when any sound — recognisable speech or not — has been detected.
Also available via theonsoundstart
property. soundend
- Fired when any sound — recognisable speech or not — has stopped being detected.
Also available via theonsoundend
property. speechstart
- Fired when sound that is recognised by the speech recognition service as speech has been detected.
Also available via theonspeechstart
property. speechend
- Fired when speech recognised by the speech recognition service has stopped being detected.
Also available via theonspeechend
property. start
- Fired when the speech recognition service has begun listening to incoming audio with intent to recognize grammars associated with the current
SpeechRecognition
.
Also available via theonstart
property.
Examples
In our simple Speech color changer example, we create a new SpeechRecognition
object instance using the SpeechRecognition()
constructor, create a new SpeechGrammarList
, and set it to be the grammar that will be recognised by the SpeechRecognition
instance using the SpeechRecognition.grammars
property.
After some other values have been defined, we then set it so that the recognition service starts when a click event occurs (see SpeechRecognition.start()
.) When a result has been successfully recognised, the SpeechRecognition.onresult
handler fires, we extract the color that was spoken from the event object, and then set the background color of the <html>
element to that color.
var grammar = '#JSGF V1.0; grammar colors; public <color> = aqua | azure | beige | bisque | black | blue | brown | chocolate | coral | crimson | cyan | fuchsia | ghostwhite | gold | goldenrod | gray | green | indigo | ivory | khaki | lavender | lime | linen | magenta | maroon | moccasin | navy | olive | orange | orchid | peru | pink | plum | purple | red | salmon | sienna | silver | snow | tan | teal | thistle | tomato | turquoise | violet | white | yellow ;'
var recognition = new SpeechRecognition();
var speechRecognitionList = new SpeechGrammarList();
speechRecognitionList.addFromString(grammar, 1);
recognition.grammars = speechRecognitionList;
recognition.continuous = false;
recognition.lang = 'en-US';
recognition.interimResults = false;
recognition.maxAlternatives = 1;
var diagnostic = document.querySelector('.output');
var bg = document.querySelector('html');
document.body.onclick = function() {
recognition.start();
console.log('Ready to receive a color command.');
}
recognition.onresult = function(event) {
var color = event.results[0][0].transcript;
diagnostic.textContent = 'Result received: ' + color;
bg.style.backgroundColor = color;
}
Specifications
Specification | Status | Comment |
---|---|---|
Web Speech API The definition of 'SpeechRecognition' in that specification. | Draft |
Browser compatibility
BCD tables only load in the browser
See also
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论