.. raw:: html .. raw:: html Model
- .. raw:: html | .. raw:: html Description
- .. raw:: html | .. raw:: html
.. raw::
- html .. raw:: html command_and_search ..
- raw:: html | .. raw:: html Best for short
- queries such as voice commands or voice search. .. raw:: html
- | .. raw:: html
.. raw:: html ..
- raw:: html phone_call .. raw:: html | ..
- raw:: html Best for audio that originated from a
- phone call (typically recorded at an 8khz sampling rate). ..
- raw:: html | .. raw:: html
.. raw:: html
- .. raw:: html video .. raw:: html |
- .. raw:: html Best for audio that originated from
- from video or includes multiple speakers. Ideally the audio is
- recorded at a 16khz or greater sampling rate. This is a
- premium model that costs more than the standard rate. ..
- raw:: html | .. raw:: html
.. raw:: html
- .. raw:: html default .. raw:: html |
- .. raw:: html Best for audio that is not one of the
- specific audio models. For example, long-form audio. Ideally
- the audio is high-fidelity, recorded at a 16khz or greater
- sampling rate. .. raw:: html | .. raw:: html
-
.. raw:: html
- use_enhanced:
- Set to true to use an enhanced model for speech recognition.
- If ``use_enhanced`` is set to true and the ``model`` field is
- not set, then an appropriate enhanced model is chosen if an
- enhanced model exists for the audio. If ``use_enhanced`` is
- true and an enhanced version of the specified model does not
- exist, then the speech is recognized using the standard
- version of the specified model.
- """,
- # @@protoc_insertion_point(class_scope:google.cloud.speech.v1.RecognitionConfig)
- },
-)
-_sym_db.RegisterMessage(RecognitionConfig)
-
-SpeakerDiarizationConfig = _reflection.GeneratedProtocolMessageType(
- "SpeakerDiarizationConfig",
- (_message.Message,),
- {
- "DESCRIPTOR": _SPEAKERDIARIZATIONCONFIG,
- "__module__": "google.cloud.speech_v1.proto.cloud_speech_pb2",
- "__doc__": """Config to enable speaker diarization.
-
- Attributes:
- enable_speaker_diarization:
- If ‘true’, enables speaker detection for each recognized word
- in the top alternative of the recognition result using a
- speaker_tag provided in the WordInfo.
- min_speaker_count:
- Minimum number of speakers in the conversation. This range
- gives you more flexibility by allowing the system to
- automatically determine the correct number of speakers. If not
- set, the default value is 2.
- max_speaker_count:
- Maximum number of speakers in the conversation. This range
- gives you more flexibility by allowing the system to
- automatically determine the correct number of speakers. If not
- set, the default value is 6.
- speaker_tag:
- Unused.
- """,
- # @@protoc_insertion_point(class_scope:google.cloud.speech.v1.SpeakerDiarizationConfig)
- },
-)
-_sym_db.RegisterMessage(SpeakerDiarizationConfig)
-
-RecognitionMetadata = _reflection.GeneratedProtocolMessageType(
- "RecognitionMetadata",
- (_message.Message,),
- {
- "DESCRIPTOR": _RECOGNITIONMETADATA,
- "__module__": "google.cloud.speech_v1.proto.cloud_speech_pb2",
- "__doc__": """Description of audio data to be recognized.
-
- Attributes:
- interaction_type:
- The use case most closely describing the audio content to be
- recognized.
- industry_naics_code_of_audio:
- The industry vertical to which this speech recognition request
- most closely applies. This is most indicative of the topics
- contained in the audio. Use the 6-digit NAICS code to identify
- the industry vertical - see https://www.naics.com/search/.
- microphone_distance:
- The audio type that most closely describes the audio being
- recognized.
- original_media_type:
- The original media the speech was recorded on.
- recording_device_type:
- The type of device the speech was recorded with.
- recording_device_name:
- The device used to make the recording. Examples ‘Nexus 5X’ or
- ‘Polycom SoundStation IP 6000’ or ‘POTS’ or ‘VoIP’ or
- ‘Cardioid Microphone’.
- original_mime_type:
- Mime type of the original audio file. For example
- ``audio/m4a``, ``audio/x-alaw-basic``, ``audio/mp3``,
- ``audio/3gpp``. A list of possible audio mime types is
- maintained at http://www.iana.org/assignments/media-
- types/media-types.xhtml#audio
- audio_topic:
- Description of the content. Eg. “Recordings of federal supreme
- court hearings from 2012”.
- """,
- # @@protoc_insertion_point(class_scope:google.cloud.speech.v1.RecognitionMetadata)
- },
-)
-_sym_db.RegisterMessage(RecognitionMetadata)
-
-SpeechContext = _reflection.GeneratedProtocolMessageType(
- "SpeechContext",
- (_message.Message,),
- {
- "DESCRIPTOR": _SPEECHCONTEXT,
- "__module__": "google.cloud.speech_v1.proto.cloud_speech_pb2",
- "__doc__": """Provides “hints” to the speech recognizer to favor specific words and
- phrases in the results.
-
- Attributes:
- phrases:
- A list of strings containing words and phrases “hints” so that
- the speech recognition is more likely to recognize them. This
- can be used to improve the accuracy for specific words and
- phrases, for example, if specific commands are typically
- spoken by the user. This can also be used to add additional
- words to the vocabulary of the recognizer. See `usage limits
-