
TorchServe linux-aarch64 experimental support #3071

Merged 23 commits into master on May 3, 2024
Conversation

@agunapal (Collaborator) commented Apr 3, 2024

Description

TorchServe on linux aarch64 - Experimental

Plan

TorchServe has been tested and works on linux-aarch64 for some of the examples. Regression tests have not yet been run. This was tested on an Amazon Graviton 3 instance (m7g.4xlarge).

Installation

Currently, installation from PyPI or from source works. Conda binaries will be available once this PR is merged.

python ts_scripts/install_dependencies.py
pip install torchserve torch-model-archiver torch-workflow-archiver
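
As a quick sanity check (not from the PR), you can confirm from Python that the aarch64 CPU wheels were installed:

import platform
import torch

print(platform.machine())  # expect "aarch64" on Graviton
print(torch.__version__)   # CPU build of PyTorch for linux-aarch64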

Optimizations

You can also enable these optimizations on Graviton 3 for improved performance. More details can be found in this blog post.

export DNNL_DEFAULT_FPMATH_MODE=BF16
export LRU_CACHE_CAPACITY=1024
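
Exporting the variables in the shell that launches TorchServe is the usual route; as a hedged sketch, the same effect can be had from Python, as long as the variables are set before torch is imported so oneDNN picks them up:

import os

# Assumption: these must be in the worker's environment before torch is imported.
os.environ["DNNL_DEFAULT_FPMATH_MODE"] = "BF16"  # allow oneDNN to run fp32 matmuls with bfloat16 math
os.environ["LRU_CACHE_CAPACITY"] = "1024"        # bound the allocator's cached-block count

import torch  # imported after the variables are set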

Example

This example of text-to-speech synthesis was verified to work on Graviton 3.
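
As an illustration, a request against the example could look like the sketch below; the model name speecht5_tts, port, and input/output details are assumptions, not taken from the PR:

import requests

resp = requests.post(
    "http://localhost:8080/predictions/speecht5_tts",  # hypothetical model name
    data="Hello from Graviton 3",                      # raw text for the TTS handler
)
with open("speech.wav", "wb") as f:
    f.write(resp.content)                              # synthesized audio bytes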

Fixes #(issue)

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

Feature/Issue validation/testing

Please describe the Unit or Integration tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced.
Please also list any relevant details for your test configuration.

  • Test A
    Logs for Test A

  • Test B
    Logs for Test B

Checklist:

  • Did you have fun?
  • Have you added tests that prove your fix is effective or that this feature works?
  • Has code been commented, particularly in hard-to-understand areas?
  • Have you made corresponding changes to the documentation?

@agunapal agunapal mentioned this pull request Apr 3, 2024
@agunapal agunapal marked this pull request as ready for review April 4, 2024 21:41
@msaroufim (Member) commented:

Is there any strategy for how we'll test this in CI? If not, it might be best to mark this support as early preview or experimental, with the expectation that some things might break.

@agunapal (Collaborator, Author) commented Apr 5, 2024

> Is there any strategy for how we'll test this in CI? If not, it might be best to mark this support as early preview or experimental, with the expectation that some things might break.

Good point. I updated the plan (#3072) to include what needs to be implemented for CI and regression testing.

Once we remove the dependency on TorchText, we'll know what else remains.

@agunapal agunapal changed the title TorchServe linux-aarch64 support TorchServe linux-aarch64 experimental support Apr 5, 2024
@agunapal (Collaborator, Author) commented Apr 5, 2024

> Is there any strategy for how we'll test this in CI? If not, it might be best to mark this support as early preview or experimental, with the expectation that some things might break.

@msaroufim Updated the documentation to say it's experimental.

@chauhang (Contributor) left a comment

Thanks for adding the SpeechT5 example for Graviton. It would be good to also include the new WavLLM model added to https://github.com/microsoft/SpeechT5/tree/main/WavLLM. For the WaveGlow example updates, please update the README to indicate that the example also works on Graviton instances, not just on NVIDIA GPUs.

docs/linux_aarch64.md (review thread, resolved)
)
return output

def postprocess(self, inference_output):
Contributor:

How is the response being sent to the client side? It would be good to add support for streaming responses, and to make the /tmp location configurable in case the deployment server does not have a /tmp folder.
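
For reference, a minimal sketch of what a streaming response could look like with TorchServe's send_intermediate_predict_response helper; this is not part of the PR's handler, and synthesize_in_chunks is hypothetical:

from ts.protocol.otf_message_handler import send_intermediate_predict_response

def handle(self, data, context):
    # Stream audio chunks back over HTTP 1.1 chunked encoding as they are produced.
    for chunk in self.synthesize_in_chunks(data):  # hypothetical chunked synthesis
        send_intermediate_predict_response(
            [chunk], context.request_ids, "Intermediate response", 200, context
        )
    return ["done"]  # final response closes the stream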

Collaborator:

Yeah, the output path can be set in model-config.yaml.

Collaborator (Author):

Set the output_dir in the config file.
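
A minimal sketch of what that could look like, assuming a handler section in model-config.yaml (the exact key names here are assumptions, not the PR's code):

import os

# model-config.yaml (hypothetical keys):
#   handler:
#     output_dir: "/data/tts_output"

def initialize(self, ctx):
    cfg = getattr(ctx, "model_yaml_config", {}) or {}
    self.output_dir = cfg.get("handler", {}).get("output_dir", "/tmp")  # fall back to /tmp
    os.makedirs(self.output_dir, exist_ok=True)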

@@ -0,0 +1,48 @@
# Text to Speech synthesis with SpeechT5

This is an example showing text-to-speech synthesis using the SpeechT5 model. It has been verified to work on a (linux-aarch64) Graviton 3 instance.
Contributor:

Thanks for adding the speech synthesis example for Graviton. It would be good to see whether support for the new WavLLM model in https://github.com/microsoft/SpeechT5/tree/main/WavLLM, added to Microsoft SpeechT5, can also be included.

@agunapal agunapal requested a review from lxning April 18, 2024 20:40
Comment on lines 4 to 7
model: "./model"
vocoder: "./vocoder"
processor: "./processor"
speaker_embeddings: "./speaker_embeddings"
Collaborator:

The leading "./" can be removed.

Collaborator (Author):

Updated

Comment on lines +31 to +33
self.processor = SpeechT5Processor.from_pretrained(processor)
self.model = SpeechT5ForTextToSpeech.from_pretrained(model)
self.vocoder = SpeechT5HifiGan.from_pretrained(vocoder)
Collaborator:

Without the model_dir prefix, do these paths work correctly?

Collaborator (Author):

Updated. They work.
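
For reference, a hedged sketch of anchoring the artifact paths to the directory TorchServe extracts the model archive into, should relative paths ever misbehave:

import os
from transformers import (
    SpeechT5ForTextToSpeech,
    SpeechT5HifiGan,
    SpeechT5Processor,
)

def initialize(self, ctx):
    model_dir = ctx.system_properties.get("model_dir")  # archive extraction directory
    self.processor = SpeechT5Processor.from_pretrained(os.path.join(model_dir, "processor"))
    self.model = SpeechT5ForTextToSpeech.from_pretrained(os.path.join(model_dir, "model"))
    self.vocoder = SpeechT5HifiGan.from_pretrained(os.path.join(model_dir, "vocoder"))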


@agunapal agunapal requested a review from lxning May 3, 2024 17:46
@lxning lxning added this pull request to the merge queue May 3, 2024
Merged via the queue into master with commit 5c1682a May 3, 2024
10 of 12 checks passed
@agunapal agunapal deleted the feature/aarch64_support branch May 3, 2024 22:19