Update large_model_inference.md #2542

sekyondaMeta · 2023-08-28T16:11:09Z

Description

Adds DeepSpeed MII and Hugging Face Accelerate information to the large model inference (LMI) doc.

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
New feature (non-breaking change which adds functionality)
This change requires a documentation update

Feature/Issue validation/testing

Built pages locally.

Checklist:

Did you have fun?
Have you added tests that prove your fix is effective or that this feature works?
Has code been commented, particularly in hard-to-understand areas?
Have you made corresponding changes to the documentation?

codecov · 2023-08-28T16:30:39Z

Codecov Report

Merging #2542 (0c43413) into master (a599fa0) will not change coverage.
The diff coverage is n/a.

❗ Current head 0c43413 differs from pull request most recent head baf1cbd. Consider uploading reports for the commit baf1cbd to get more accurate results

@@           Coverage Diff           @@
##           master    #2542   +/-   ##
=======================================
  Coverage   72.64%   72.64%           
=======================================
  Files          79       79           
  Lines        3733     3733           
  Branches       58       58           
=======================================
  Hits         2712     2712           
  Misses       1017     1017           
  Partials        4        4

msaroufim · 2023-08-28T19:38:57Z

docs/large_model_inference.md

@@ -1,6 +1,13 @@
 # Serving large models with Torchserve

 This document explain how Torchserve supports large model serving, here large model refers to the models that are not able to fit into one gpu so they need be split in multiple partitions over multiple gpus.
+This page is split into the following sections:
+- [How it works](#how-it-works)


I remember there was some issue with relative links when we go to pytorch.org, so might wanna double check this works. If it does, feel free to dismiss this comment.

agunapal

Pending one query about low_cpu_mem_usage=True, rest looks fine

docs/large_model_inference.md

HamidShojanazeri

Left a suggestion

docs/large_model_inference.md

Update large_model_inference.md

3102bad

msaroufim approved these changes Aug 28, 2023

msaroufim reviewed Aug 28, 2023

agunapal reviewed Aug 29, 2023

HamidShojanazeri approved these changes Aug 30, 2023

HamidShojanazeri added 2 commits August 30, 2023 10:41

Update docs/large_model_inference.md

af787ea

Merge branch 'master' into lmiDeepspeedmii

baf1cbd

HamidShojanazeri enabled auto-merge August 30, 2023 17:42

HamidShojanazeri added this pull request to the merge queue Aug 30, 2023

Merged via the queue into pytorch:master with commit 30ff033 Aug 30, 2023
10 of 12 checks passed

sekyondaMeta deleted the lmiDeepspeedmii branch September 13, 2023 16:23
