asr speech recognition example #2047

husenzhang · 2022-12-31T01:17:39Z

Description

Provides an example of ASR (automated speech recognition) model serving.
Input is a wav file; output text translation of that wav.

Type of change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)

Checklist:

Did you have fun?
Have you added tests that prove your fix is effective or that this feature works?
Has code been commented, particularly in hard-to-understand areas?
Have you made corresponding changes to the documentation?

codecov · 2023-01-04T00:45:03Z

Codecov Report

Merging #2047 (8581260) into master (656a30d) will not change coverage.
The diff coverage is n/a.

❗ Current head 8581260 differs from pull request most recent head cb1d39c. Consider uploading reports for the commit cb1d39c to get more accurate results

@@           Coverage Diff           @@
##           master    #2047   +/-   ##
=======================================
  Coverage   72.71%   72.71%           
=======================================
  Files          79       79           
  Lines        3742     3742           
  Branches       58       58           
=======================================
  Hits         2721     2721           
  Misses       1017     1017           
  Partials        4        4

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

examples/asr_rnnt_emformer/README.md

examples/asr_rnnt_emformer/requirements.txt

lxning · 2023-02-10T23:28:41Z

examples/asr_rnnt_emformer/handler.py

+        if isinstance(data, list):
+            data = data[0]
+        data = data.get("data") or data.get("body")


This is not able to handle batching. ref example

To be fair neither does our default handler , for an example I think this is fine

For audio models, batching may not make sense in all cases. Will be good to mention as a note in the readme that this example does not support batching and verify the config properties match

chauhang

Thanks for the PR, can be merged with few minor changes for torchserve version and readme updates

examples/asr_rnnt_emformer/00_save_jit_model.sh

examples/asr_rnnt_emformer/01_create_model_archive.sh

chauhang · 2023-08-24T19:46:43Z

examples/asr_rnnt_emformer/handler.py

+        if isinstance(data, list):
+            data = data[0]
+        data = data.get("data") or data.get("body")


For audio models, batching may not make sense in all cases. Will be good to mention as a note in the readme that this example does not support batching and verify the config properties match

examples/asr_rnnt_emformer/00_save_jit_model.sh

examples/asr_rnnt_emformer/01_create_model_archive.sh

feedback addressed

asr speech recognition example

052dab8

maaquib requested review from lxning and msaroufim January 4, 2023 00:18

Merge branch 'master' into master

9e5cf3e

maaquib reviewed Jan 9, 2023

View reviewed changes

examples/asr_rnnt_emformer/README.md Outdated Show resolved Hide resolved

maaquib and others added 6 commits January 9, 2023 13:25

Merge branch 'master' into master

22165ad

added details in README according to maaquib

e674b25

merge due to added details in README

7b51d4d

Merge branch 'master' into master

4d5c313

Merge branch 'master' into master

9eb49f5

Merge branch 'master' into master

4350fb1

lxning reviewed Feb 10, 2023

View reviewed changes

msaroufim approved these changes Jul 21, 2023

View reviewed changes

Merge branch 'master' into master

e75b39b

chauhang previously requested changes Aug 24, 2023

View reviewed changes

msaroufim added the example label Aug 25, 2023

msaroufim reviewed Aug 30, 2023

View reviewed changes

examples/asr_rnnt_emformer/00_save_jit_model.sh Outdated Show resolved Hide resolved

msaroufim reviewed Aug 30, 2023

View reviewed changes

examples/asr_rnnt_emformer/01_create_model_archive.sh Outdated Show resolved Hide resolved

Apply suggestions from code review

6ede9cf

msaroufim enabled auto-merge August 30, 2023 20:21

Merge branch 'master' into master

cb1d39c

msaroufim requested review from HamidShojanazeri and agunapal as code owners August 30, 2023 21:33

agunapal approved these changes Aug 31, 2023

View reviewed changes

msaroufim added this pull request to the merge queue Aug 31, 2023

Merged via the queue into pytorch:master with commit 242895c Aug 31, 2023
11 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

asr speech recognition example #2047

asr speech recognition example #2047

husenzhang commented Dec 31, 2022

codecov bot commented Jan 4, 2023 •

edited

Loading

lxning Feb 10, 2023

msaroufim Jul 21, 2023

chauhang Aug 24, 2023

chauhang left a comment

chauhang Aug 24, 2023

asr speech recognition example #2047

asr speech recognition example #2047

Conversation

husenzhang commented Dec 31, 2022

Description

Type of change

Checklist:

codecov bot commented Jan 4, 2023 • edited Loading

Codecov Report

lxning Feb 10, 2023

Choose a reason for hiding this comment

msaroufim Jul 21, 2023

Choose a reason for hiding this comment

chauhang Aug 24, 2023

Choose a reason for hiding this comment

chauhang left a comment

Choose a reason for hiding this comment

chauhang Aug 24, 2023

Choose a reason for hiding this comment

codecov bot commented Jan 4, 2023 •

edited

Loading