Skip to content

Commit ee59cc2

Browse files
author
AWS
committed
Amazon Transcribe Streaming Service Update: This release adds support for automatic language identification in AWS Transcribe Call Analytics
1 parent 1bfba0c commit ee59cc2

File tree

2 files changed

+93
-2
lines changed

2 files changed

+93
-2
lines changed
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
{
2+
"type": "feature",
3+
"category": "Amazon Transcribe Streaming Service",
4+
"contributor": "",
5+
"description": "This release adds support for automatic language identification in AWS Transcribe Call Analytics"
6+
}

services/transcribestreaming/src/main/resources/codegen-resources/service-2.json

Lines changed: 87 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@
4545
{"shape":"ConflictException"},
4646
{"shape":"ServiceUnavailableException"}
4747
],
48-
"documentation":"<p>Starts a bidirectional HTTP/2 or WebSocket stream where audio is streamed to Amazon Transcribe and the transcription results are streamed to your application. Use this operation for <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/call-analytics.html\">Call Analytics</a> transcriptions.</p> <p>The following parameters are required:</p> <ul> <li> <p> <code>language-code</code> </p> </li> <li> <p> <code>media-encoding</code> </p> </li> <li> <p> <code>sample-rate</code> </p> </li> </ul> <p>For more information on streaming with Amazon Transcribe, see <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/streaming.html\">Transcribing streaming audio</a>.</p>"
48+
"documentation":"<p>Starts a bidirectional HTTP/2 or WebSocket stream where audio is streamed to Amazon Transcribe and the transcription results are streamed to your application. Use this operation for <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/call-analytics.html\">Call Analytics</a> transcriptions.</p> <p>The following parameters are required:</p> <ul> <li> <p> <code>language-code</code> or <code>identify-language</code> </p> </li> <li> <p> <code>media-encoding</code> </p> </li> <li> <p> <code>sample-rate</code> </p> </li> </ul> <p>For more information on streaming with Amazon Transcribe, see <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/streaming.html\">Transcribing streaming audio</a>.</p>"
4949
},
5050
"StartMedicalScribeStream":{
5151
"name":"StartMedicalScribeStream",
@@ -251,6 +251,24 @@
251251
"pt-BR"
252252
]
253253
},
254+
"CallAnalyticsLanguageIdentification":{
255+
"type":"list",
256+
"member":{"shape":"CallAnalyticsLanguageWithScore"}
257+
},
258+
"CallAnalyticsLanguageWithScore":{
259+
"type":"structure",
260+
"members":{
261+
"LanguageCode":{
262+
"shape":"CallAnalyticsLanguageCode",
263+
"documentation":"<p>The language code of the identified language.</p>"
264+
},
265+
"Score":{
266+
"shape":"Double",
267+
"documentation":"<p>The confidence score associated with the identified language code. Confidence scores are values between zero and one; larger values indicate a higher confidence in the identified language.</p>"
268+
}
269+
},
270+
"documentation":"<p>The language code that represents the language identified in your audio, including the associated confidence score.</p>"
271+
},
254272
"CallAnalyticsTranscriptResultStream":{
255273
"type":"structure",
256274
"members":{
@@ -1411,7 +1429,6 @@
14111429
"StartCallAnalyticsStreamTranscriptionRequest":{
14121430
"type":"structure",
14131431
"required":[
1414-
"LanguageCode",
14151432
"MediaSampleRateHertz",
14161433
"MediaEncoding",
14171434
"AudioStream"
@@ -1469,6 +1486,36 @@
14691486
"location":"header",
14701487
"locationName":"x-amzn-transcribe-language-model-name"
14711488
},
1489+
"IdentifyLanguage":{
1490+
"shape":"Boolean",
1491+
"documentation":"<p>Enables automatic language identification for your Call Analytics transcription.</p> <p>If you include <code>IdentifyLanguage</code>, you must include a list of language codes, using <code>LanguageOptions</code>, that you think may be present in your audio stream. You must provide a minimum of two language selections.</p> <p>You can also include a preferred language using <code>PreferredLanguage</code>. Adding a preferred language can help Amazon Transcribe identify the language faster than if you omit this parameter.</p> <p>Note that you must include either <code>LanguageCode</code> or <code>IdentifyLanguage</code> in your request. If you include both parameters, your transcription job fails.</p>",
1492+
"location":"header",
1493+
"locationName":"x-amzn-transcribe-identify-language"
1494+
},
1495+
"LanguageOptions":{
1496+
"shape":"LanguageOptions",
1497+
"documentation":"<p>Specify two or more language codes that represent the languages you think may be present in your media.</p> <p>Including language options can improve the accuracy of language identification.</p> <p>If you include <code>LanguageOptions</code> in your request, you must also include <code>IdentifyLanguage</code>.</p> <p>For a list of languages supported with Call Analytics streaming, refer to the <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/supported-languages.html\">Supported languages</a> table.</p> <important> <p>You can only include one language dialect per language per stream. For example, you cannot include <code>en-US</code> and <code>en-AU</code> in the same request.</p> </important>",
1498+
"location":"header",
1499+
"locationName":"x-amzn-transcribe-language-options"
1500+
},
1501+
"PreferredLanguage":{
1502+
"shape":"CallAnalyticsLanguageCode",
1503+
"documentation":"<p>Specify a preferred language from the subset of languages codes you specified in <code>LanguageOptions</code>.</p> <p>You can only use this parameter if you've included <code>IdentifyLanguage</code> and <code>LanguageOptions</code> in your request.</p>",
1504+
"location":"header",
1505+
"locationName":"x-amzn-transcribe-preferred-language"
1506+
},
1507+
"VocabularyNames":{
1508+
"shape":"VocabularyNames",
1509+
"documentation":"<p>Specify the names of the custom vocabularies that you want to use when processing your Call Analytics transcription. Note that vocabulary names are case sensitive.</p> <p>If the custom vocabulary's language doesn't match the identified media language, it won't be applied to the transcription.</p> <important> <p>This parameter is only intended for use <b>with</b> the <code>IdentifyLanguage</code> parameter. If you're <b>not</b> including <code>IdentifyLanguage</code> in your request and want to use a custom vocabulary with your transcription, use the <code>VocabularyName</code> parameter instead.</p> </important> <p>For more information, see <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/custom-vocabulary.html\">Custom vocabularies</a>.</p>",
1510+
"location":"header",
1511+
"locationName":"x-amzn-transcribe-vocabulary-names"
1512+
},
1513+
"VocabularyFilterNames":{
1514+
"shape":"VocabularyFilterNames",
1515+
"documentation":"<p>Specify the names of the custom vocabulary filters that you want to use when processing your Call Analytics transcription. Note that vocabulary filter names are case sensitive.</p> <p>These filters serve to customize the transcript output.</p> <important> <p>This parameter is only intended for use <b>with</b> the <code>IdentifyLanguage</code> parameter. If you're <b>not</b> including <code>IdentifyLanguage</code> in your request and want to use a custom vocabulary filter with your transcription, use the <code>VocabularyFilterName</code> parameter instead.</p> </important> <p>For more information, see <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/vocabulary-filtering.html\">Using vocabulary filtering with unwanted words</a>.</p>",
1516+
"location":"header",
1517+
"locationName":"x-amzn-transcribe-vocabulary-filter-names"
1518+
},
14721519
"EnablePartialResultsStabilization":{
14731520
"shape":"Boolean",
14741521
"documentation":"<p>Enables partial result stabilization for your transcription. Partial result stabilization can reduce latency in your output, but may impact accuracy. For more information, see <a href=\"https://docs.aws.amazon.com/transcribe/latest/dg/streaming.html#streaming-partial-result-stabilization\">Partial-result stabilization</a>.</p>",
@@ -1563,6 +1610,36 @@
15631610
"location":"header",
15641611
"locationName":"x-amzn-transcribe-language-model-name"
15651612
},
1613+
"IdentifyLanguage":{
1614+
"shape":"Boolean",
1615+
"documentation":"<p>Shows whether automatic language identification was enabled for your Call Analytics transcription.</p>",
1616+
"location":"header",
1617+
"locationName":"x-amzn-transcribe-identify-language"
1618+
},
1619+
"LanguageOptions":{
1620+
"shape":"LanguageOptions",
1621+
"documentation":"<p>Provides the language codes that you specified in your Call Analytics request.</p>",
1622+
"location":"header",
1623+
"locationName":"x-amzn-transcribe-language-options"
1624+
},
1625+
"PreferredLanguage":{
1626+
"shape":"CallAnalyticsLanguageCode",
1627+
"documentation":"<p>Provides the preferred language that you specified in your Call Analytics request.</p>",
1628+
"location":"header",
1629+
"locationName":"x-amzn-transcribe-preferred-language"
1630+
},
1631+
"VocabularyNames":{
1632+
"shape":"VocabularyNames",
1633+
"documentation":"<p>Provides the names of the custom vocabularies that you specified in your Call Analytics request.</p>",
1634+
"location":"header",
1635+
"locationName":"x-amzn-transcribe-vocabulary-names"
1636+
},
1637+
"VocabularyFilterNames":{
1638+
"shape":"VocabularyFilterNames",
1639+
"documentation":"<p>Provides the names of the custom vocabulary filters that you specified in your Call Analytics request.</p>",
1640+
"location":"header",
1641+
"locationName":"x-amzn-transcribe-vocabulary-filter-names"
1642+
},
15661643
"EnablePartialResultsStabilization":{
15671644
"shape":"Boolean",
15681645
"documentation":"<p>Shows whether partial results stabilization was enabled for your Call Analytics transcription.</p>",
@@ -2264,6 +2341,14 @@
22642341
"IssuesDetected":{
22652342
"shape":"IssuesDetected",
22662343
"documentation":"<p>Provides the issue that was detected in the specified segment.</p>"
2344+
},
2345+
"LanguageCode":{
2346+
"shape":"CallAnalyticsLanguageCode",
2347+
"documentation":"<p>The language code that represents the language spoken in your audio stream.</p>"
2348+
},
2349+
"LanguageIdentification":{
2350+
"shape":"CallAnalyticsLanguageIdentification",
2351+
"documentation":"<p>The language code of the dominant language identified in your stream.</p>"
22672352
}
22682353
},
22692354
"documentation":"<p>Contains set of transcription results from one or more audio segments, along with additional information about the parameters included in your request. For example, channel definitions, partial result stabilization, sentiment, and issue detection.</p>",

0 commit comments

Comments
 (0)