Profile TorchServe Handler (preprocess vs inference vs post-process) #2470

Merged
merged 25 commits into master from feature/ts_benchmark_profile
Aug 24, 2023

Conversation

@agunapal (Collaborator) commented on Jul 18, 2023

Description

Update the benchmark script to show a split of preprocess, inference and postprocess times

  • Additional per-phase log, optionally enabled via config in model-config.yaml (see the config sketch below)
  • Additional metrics in the benchmark report, optionally enabled by config
  • Includes a working ResNet50 example
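The per-phase log is opt-in through the model's model-config.yaml. A minimal sketch of what that config might look like; the `handler.profile` key here is an assumption inferred from the PR description, so treat the bundled ResNet50 example as the authoritative reference:

```yaml
# model-config.yaml (sketch) -- turns on the extra preprocess/inference/
# postprocess timing log. The `profile` key name is an assumption; confirm
# against examples/benchmarking/resnet50/model-config.yaml in this PR.
handler:
  profile: true
```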

Fixes #(issue)

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

Feature/Issue validation/testing

Please describe the unit or integration tests you ran to verify your changes, with a summary of the relevant results, and provide instructions so they can be reproduced.
Please also list any relevant details of your test configuration.

TorchServe Benchmark on GPU

Date: 2023-07-18 00:56:21

TorchServe Version: 0.8.1

eager_mode_resnet50

| Version | Benchmark | Batch size | Batch delay | Workers | Model | Concurrency | Input | Requests | TS failed requests | TS throughput | TS latency P50 | TS latency P90 | TS latency P99 | TS latency mean | TS error rate | Model_p50 | Model_p90 | Model_p99 | handler_time_mean | predict_mean | waiting_time_mean | worker_thread_mean | cpu_percentage_mean | memory_percentage_mean | gpu_percentage_mean | gpu_memory_percentage_mean | gpu_memory_used_mean | backend_preprocess_mean | backend_inference_mean | backend_postprocess_mean |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.8.1 | AB | 1 | 100 | 4 | .mar | 100 | input | 10000 | 0 | 97.91 | 998 | 1259 | 1420 | 1021.319 | 0.0 | 19.04 | 24.84 | 26.19 | 38.7 | 38.83 | 974.83 | 0.42 | 50.0 | 35.11 | 18.0 | 28.51 | 6566.0 | 29.85 | 7.32 | 0.27 |
| 0.8.1 | AB | 2 | 100 | 4 | .mar | 100 | input | 10000 | 9998 | 94.47 | 1042 | 1258 | 1488 | 1058.591 | 99.98 | 38.15 | 56.68 | 62.99 | 81.12 | 81.36 | 966.6 | 2.16 | 0.0 | 34.35 | 10.5 | 27.11 | 6242.0 | 70.01 | 8.01 | 0.24 |
| 0.8.1 | AB | 4 | 100 | 4 | .mar | 100 | input | 10000 | 3 | 100.3 | 975 | 1213 | 1453 | 997.042 | 0.03 | 82.37 | 124.11 | 133.04 | 154.0 | 154.24 | 829.77 | 3.7 | 100.0 | 34.59 | 12.0 | 27.38 | 6304.0 | 141.65 | 8.01 | 0.29 |
| 0.8.1 | AB | 8 | 100 | 4 | .mar | 100 | input | 10000 | 9999 | 100.38 | 969 | 1263 | 1571 | 996.255 | 99.99 | 199.61 | 268.68 | 285.87 | 309.21 | 309.48 | 671.79 | 6.71 | 100.0 | 34.99 | 14.0 | 28.18 | 6490.0 | 293.13 | 10.72 | 0.28 |
| 0.8.1 | AB | 16 | 100 | 4 | .mar | 100 | input | 10000 | 9999 | 102.3 | 949 | 1380 | 1750 | 977.526 | 99.99 | 436.65 | 556.46 | 576.54 | 610.41 | 610.67 | 349.69 | 10.15 | 100.0 | 36.51 | 8.0 | 30.13 | 6938.0 | 585.39 | 19.59 | 0.26 |
| 0.8.1 | AB | 32 | 100 | 4 | .mar | 100 | input | 10000 | 9119 | 132.2 | 735 | 1157 | 1501 | 756.404 | 91.19 | 110.47 | 194.85 | 225.53 | 513.62 | 513.85 | 16.01 | 6.62 | 100.0 | 38.35 | 2.0 | 33.7 | 7760.0 | 484.35 | 24.86 | 0.26 |
| 0.8.1 | AB | 64 | 100 | 4 | .mar | 100 | input | 10000 | 9382 | 213.37 | 465 | 552 | 641 | 468.677 | 93.82 | 260.14 | 295.21 | 305.11 | 366.7 | 366.78 | 59.21 | 14.16 | 0.0 | 48.69 | 15.0 | 52.48 | 12084.0 | 312.83 | 52.37 | 0.34 |
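The rows above come from the ab-based benchmark driver bundled with TorchServe. As a hedged reproduction sketch for the batch-size-4 row: the flag names below follow `benchmarks/benchmark-ab.py` but should be verified against benchmarks/README.md, and the model archive and input paths are placeholders rather than paths taken from this PR:

```bash
# Hedged sketch: verify flag names against benchmarks/README.md.
# The .mar URL and input image below are placeholders.
python benchmarks/benchmark-ab.py \
  --url file:///path/to/resnet50.mar \
  --concurrency 100 \
  --requests 10000 \
  --batch_size 4 \
  --batch_delay 100 \
  --workers 4 \
  --input /path/to/kitten.jpg
```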

Checklist:

  • Did you have fun?
  • Have you added tests that prove your fix is effective or that this feature works?
  • Has code been commented, particularly in hard-to-understand areas?
  • Have you made corresponding changes to the documentation?

@agunapal agunapal changed the title (WIP) Profile TorchServe Handler (preprocess vs inference vs post-process) Profile TorchServe Handler (preprocess vs inference vs post-process) Jul 18, 2023
@agunapal agunapal changed the title Profile TorchServe Handler (preprocess vs inference vs post-process) (WIP)Profile TorchServe Handler (preprocess vs inference vs post-process) Jul 18, 2023
codecov bot commented on Jul 18, 2023

Codecov Report

Merging #2470 (106bcf9) into master (d47b14d) will decrease coverage by 0.23%.
The diff coverage is 53.84%.

❗ Current head 106bcf9 differs from pull request most recent head a531548. Consider uploading reports for the commit a531548 to get more accurate results

```diff
@@            Coverage Diff             @@
##           master    #2470      +/-   ##
==========================================
- Coverage   72.86%   72.64%   -0.23%
==========================================
  Files          78       79       +1
  Lines        3697     3733      +36
  Branches       58       58
==========================================
+ Hits         2694     2712      +18
- Misses        999     1017      +18
  Partials        4        4
```
| Files Changed | Coverage Δ |
|---|---|
| ts/handler_utils/timer.py | 32.00% <32.00%> (ø) |
| ...orch_handler/unit_tests/test_utils/mock_context.py | 86.95% <66.66%> (-3.05%) ⬇️ |
| ts/torch_handler/base_handler.py | 69.09% <100.00%> (+0.57%) ⬆️ |
| ts/torch_handler/image_classifier.py | 90.00% <100.00%> (+1.11%) ⬆️ |
| ts/torch_handler/vision_handler.py | 90.90% <100.00%> (+0.58%) ⬆️ |
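The new ts/handler_utils/timer.py is the piece that produces the backend_preprocess/inference/postprocess numbers by wrapping each handler phase. A minimal illustrative sketch, not the PR's exact code; the real implementation reports through TorchServe's metrics plumbing rather than printing:

```python
import time
from functools import wraps


def timed(func):
    """Measure the wall-clock duration of one handler phase
    (preprocess / inference / postprocess) in milliseconds."""

    @wraps(func)
    def wrapper(self, *args, **kwargs):
        start = time.perf_counter()
        result = func(self, *args, **kwargs)
        elapsed_ms = (time.perf_counter() - start) * 1000
        # Illustrative sink only: the actual decorator emits these values
        # so they surface as the backend_*_mean columns in the report.
        print(f"{func.__name__}_ms:{elapsed_ms:.2f}")
        return result

    return wrapper
```

In the PR, a decorator of this shape is applied to the base handler's phase methods, which is why base_handler.py and the vision handlers appear in the coverage table above.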


@agunapal agunapal changed the title (WIP)Profile TorchServe Handler (preprocess vs inference vs post-process) Profile TorchServe Handler (preprocess vs inference vs post-process) Jul 18, 2023
@msaroufim msaroufim self-requested a review July 21, 2023 20:06
Review comments (resolved):
  • examples/benchmarking/resnet50/README.md (2 threads)
  • ts/handler_utils/timer.py (outdated)
@agunapal agunapal requested a review from lxning July 24, 2023 21:59
@agunapal agunapal merged commit 03ad862 into master Aug 24, 2023
13 checks passed
@agunapal agunapal deleted the feature/ts_benchmark_profile branch August 24, 2023 22:24