Realtime performance question #73

lachose1 · 2020-08-17T15:48:04Z

Hello and first of all thanks for the great tool you have built.

I wanted to ask some questions, I run the pretrained model on a 1080 ti on a realtime video and obtain so-so performances :
res50_coco_256x192: 12 FPS
mobilenetv2_coco_256x192: 15 FPS

Am I missing something or is it just that the GPU isn't strong enough. Thanks!

innerlee · 2020-08-17T22:42:14Z

hi, which script was run? currently the inference code is not optimized yet. see #40. you may vote here #9 to help us prioritize

jin-s13 · 2020-08-18T02:00:04Z

Please set flip_test=False in the configs for higher speed.

lachose1 · 2020-08-19T15:34:10Z

hi, which script was run? currently the inference code is not optimized yet. see #40. you may vote here #9 to help us prioritize

Yes, I used pretty much the same settings as #40

Please set flip_test=False in the configs for higher speed.

This was already done, but thanks for clarifying. I guess you can close the thread if this is on the road map for the future, thanks a lot for your help!

innerlee · 2020-08-20T00:09:18Z

thanks! it would be great if people can help with profiling and identifying the bottleneck. here is some guide we wrote earlier (can safely omit the Chinese characters and guess the content):

fabro66 · 2021-01-15T02:15:05Z

Hi~
I found cv2.ellipse2Poly in top_dow.py/bottom_up.py highly slow the inference speed.
If I replace it with cv2.line, hrnet_w32_wholebody_256×192_dark will speed up from 3.0fps to 16.0fps on a 1060. There are other areas that can be further optimized.

Replace it with the following:

mmpose/mmpose/models/detectors/top_down.py

Line 256 in 9703521

for _, kpts in enumerate(pose_result):

    for kpts in pose_result:
        # draw each point on image
        if pose_kpt_color is not None:
            assert len(pose_kpt_color) == len(kpts)
            for kid, kpt in enumerate(kpts):
                x_coord, y_coord, kpt_score = int(kpt[0]), int(
                    kpt[1]), kpt[2]
                if kpt_score > kpt_score_thr:
                    r, g, b = pose_kpt_color[kid]
                    cv2.circle(img, (int(x_coord), int(y_coord)),
                               radius, (int(r), int(g), int(b)), -1)

        # draw limbs
        if skeleton is not None and pose_limb_color is not None:
            assert len(pose_limb_color) == len(skeleton)
            for sk_id, sk in enumerate(skeleton):
                pos1 = (int(kpts[sk[0] - 1, 0]), int(kpts[sk[0] - 1,
                                                          1]))
                pos2 = (int(kpts[sk[1] - 1, 0]), int(kpts[sk[1] - 1,
                                                          1]))
                if (pos1[0] > 0 and pos1[0] < img_w and pos1[1] > 0
                        and pos1[1] < img_h and pos2[0] > 0
                        and pos2[0] < img_w and pos2[1] > 0
                        and pos2[1] < img_h
                        and kpts[sk[0] - 1, 2] > kpt_score_thr
                        and kpts[sk[1] - 1, 2] > kpt_score_thr):

                    r, g, b = pose_limb_color[sk_id]
                    cv2.line(img, pos1, pos2, (int(r), int(g), int(b)), thickness=thickness)

jin-s13 · 2021-01-15T02:32:24Z

Thanks for reporting @fabro66. Will try this out.

lucasjinreal · 2021-01-22T07:26:50Z

@fabro66 How many whole post-processing time did u measured for now?

fabro66 · 2021-01-22T13:08:03Z

@jinfagang I did not test the time it takes for post-processing. I just replace cv2.ellipse2Poly with cv2.line to speed up inference.

lucasjinreal · 2021-01-24T12:52:46Z

@fabro66 Did u able to run realtime with a detector (not from GT boxes). Such as with yolov5 and a pose model.

fabro66 · 2021-01-24T13:05:04Z

@jinfagang It can reach 16fps on a GTX1060 when I combine hrnet_w32_wholebody_256×192_dark and yolov3 (from mmdetection) to estimate whole-body keypoints.

lucasjinreal · 2021-01-25T06:25:26Z

@fabro66 I tested with yolov5s detector and shufflenetv2 pose on coco, the speed is about 7fps in 2 person on GTX1080ti.

What's the reason why it's slow?

innerlee · 2021-01-25T06:30:37Z

If you have interest, please try to profile it and post the result, something like #344 (comment)

lucasjinreal · 2021-01-25T06:56:13Z

@innerlee I dont know how to using cprofile. Did u guys get same performance when test a normal video with more than 2 person in it?

lucasjinreal · 2021-01-25T06:56:23Z

@innerlee I dont know how to using cprofile. Did u guys get same performance when test a normal video with more than 2 person in it?

innerlee · 2021-01-25T07:35:40Z

Step1. use cProfile to run the script for a period of time, say, 30 seconds.

Step2. visualize the result by snakeviz

Refer to the instruction in #73 (comment) for more details

lucasjinreal · 2021-01-25T09:16:44Z

@innerlee thanks. Do u have any insights about it?

innerlee · 2021-01-25T09:29:30Z

Please expand shared_transformation section, by clicking on it

lucasjinreal · 2021-01-25T09:41:03Z

@innerlee

innerlee · 2021-01-25T09:47:17Z

Please click on the shared_transform.py, and it should print more details on the bottom level.

Crop the image is not what I meant

lucasjinreal · 2021-01-25T11:54:34Z

@innerlee ok.....

innerlee · 2021-01-26T11:07:47Z

@jinfagang If you haven't deleted the profiling record, please post the result so that the bottleneck is visualized

haseeb33 · 2022-07-26T22:44:18Z

@jinfagang Can you please share the config file for yolov5 for mmdet_model. I trained a yolov5s model independently and I want to use it as a detection model for pose estimation inference but I am having difficulties modifying the config file.
Thanks in advance.

lucasjinreal · 2022-07-27T02:35:24Z

@haseeb33 you can take a look at this repo: https://github.com/jinfagang/yolov7 it provides various yolo model with pure python instead yaml config files. Also support a e2e keypoints model

haseeb33 · 2022-07-27T03:23:13Z

@jinfagang Thank you very much!

jin-s13 added status/duplicate issue/PR already exists enhancement New feature or request labels Aug 18, 2020

innerlee added community/help wanted extra attention is needed and removed status/duplicate issue/PR already exists labels Aug 20, 2020

innerlee mentioned this issue Dec 4, 2020

Using GPU Gaussian blur at DarkPose unbiased decoding & megvii #332

Closed

innerlee mentioned this issue Mar 11, 2021

Accelerating the inference of the trained model for COCO-WholeBody #520

Closed

Tau-J closed this as completed Apr 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Realtime performance question #73

Realtime performance question #73

lachose1 commented Aug 17, 2020

innerlee commented Aug 17, 2020 •

edited

Loading

jin-s13 commented Aug 18, 2020

lachose1 commented Aug 19, 2020

innerlee commented Aug 20, 2020 •

edited

Loading

fabro66 commented Jan 15, 2021 •

edited

Loading

jin-s13 commented Jan 15, 2021

lucasjinreal commented Jan 22, 2021

fabro66 commented Jan 22, 2021

lucasjinreal commented Jan 24, 2021

fabro66 commented Jan 24, 2021 •

edited

Loading

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 26, 2021

haseeb33 commented Jul 26, 2022

lucasjinreal commented Jul 27, 2022

haseeb33 commented Jul 27, 2022

Realtime performance question #73

Realtime performance question #73

Comments

lachose1 commented Aug 17, 2020

innerlee commented Aug 17, 2020 • edited Loading

jin-s13 commented Aug 18, 2020

lachose1 commented Aug 19, 2020

innerlee commented Aug 20, 2020 • edited Loading

fabro66 commented Jan 15, 2021 • edited Loading

jin-s13 commented Jan 15, 2021

lucasjinreal commented Jan 22, 2021

fabro66 commented Jan 22, 2021

lucasjinreal commented Jan 24, 2021

fabro66 commented Jan 24, 2021 • edited Loading

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 25, 2021

lucasjinreal commented Jan 25, 2021

innerlee commented Jan 26, 2021

haseeb33 commented Jul 26, 2022

lucasjinreal commented Jul 27, 2022

haseeb33 commented Jul 27, 2022

innerlee commented Aug 17, 2020 •

edited

Loading

innerlee commented Aug 20, 2020 •

edited

Loading

fabro66 commented Jan 15, 2021 •

edited

Loading

fabro66 commented Jan 24, 2021 •

edited

Loading