About style transferring #27

Open
nldhuyen0047 opened this issue Sep 5, 2024 · 8 comments

Comments

@nldhuyen0047

Hi, thanks for your excellent work.

I have a question about style transfer. I would like to transfer the style of a target image onto a content image. At inference time I tested both Contour and LineArt, but the results are not good.

For the data I use at inference, the content image is a cube with no details on it (I would like to add details to it), and the target image is a house with details and colors. With LineArt, the colors and some of the details that should be added to the content image are not filled in; with Contour, the image changes too much and becomes quite messy.

Could you please suggest some ways to improve the results?

Thank you so much.

@Jeoyal
Contributor

Jeoyal commented Sep 5, 2024

Hi @nldhuyen0047, thank you for your interest in our work. I noticed that you previously mentioned an issue related to model training. Are the inference results based on your own trained model?
Also, could you post the content image, style image, and the generated results?

@zhihongz

zhihongz commented Sep 6, 2024

I met the same problem. For example, when the content image and the style image are the same, the output is supposed to be the same image as well (some works introduce an identity loss to guarantee this). But with StyleShot, the output differs greatly from the input image.
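(For concreteness, here is a minimal sketch of the kind of identity loss such works use, in PyTorch. The `generator` call signature is a hypothetical placeholder for exposition, not StyleShot's API.)

```python
import torch.nn.functional as F

def identity_loss(generator, image):
    # When the same image is fed as both content and style, an identity
    # term penalizes any deviation of the output from the input.
    output = generator(content=image, style=image)  # hypothetical signature
    return F.l1_loss(output, image)
```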

@Jeoyal
Contributor

Jeoyal commented Sep 6, 2024

Hi @zhihongz, thank you for your interest in our work. I randomly tested some cases with the same content and style images, and the outputs are the same image as well; here are the samples:
[Three sample screenshots attached: outputs match the inputs.]

I think the difference comes from the resolution of the inputs. The style image is center-cropped to 512×512, while the content image is not preprocessed, which might lead to differences.
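(For anyone hitting this: one way to rule out the resolution mismatch is to give the content image the same preprocessing before inference. A minimal sketch with torchvision; the 512×512 center crop comes from the comment above, the file names are assumptions.)

```python
from PIL import Image
from torchvision import transforms

# Match the style branch's preprocessing: scale the shorter side to 512,
# then take the central 512x512 crop so both inputs see the same region.
preprocess = transforms.Compose([
    transforms.Resize(512),
    transforms.CenterCrop(512),
])

content = preprocess(Image.open("content.png").convert("RGB"))
content.save("content_512.png")  # use this as the content input
```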

@zhihongz

zhihongz commented Sep 6, 2024

Emm, the output looks similar to the input, but there are obvious differences. For example, the color of the cat in your last screenshot changes to blue.

@zhihongz

zhihongz commented Sep 6, 2024

When you test with natural images, the differences become even larger. You can examine this picture:

[Attached: a natural-image example showing a large input/output difference.]

@Jeoyal
Contributor

Jeoyal commented Sep 6, 2024

In StyleShot, styles are integrated into style embeddings by a specially designed style-aware encoder and then incorporated into the diffusion model through a cross-attention module. This highly aggregated style information is not suited to pixel-level reconstruction.
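(A toy illustration of that injection path, single-head and heavily simplified; all names and shapes here are assumptions for exposition, not StyleShot's actual modules.)

```python
import torch
import torch.nn as nn

class StyleCrossAttention(nn.Module):
    """Toy cross-attention: diffusion U-Net features attend to a small
    set of style tokens produced by a style-aware encoder."""
    def __init__(self, dim, style_dim):
        super().__init__()
        self.to_q = nn.Linear(dim, dim)
        self.to_k = nn.Linear(style_dim, dim)
        self.to_v = nn.Linear(style_dim, dim)

    def forward(self, latent_tokens, style_tokens):
        # latent_tokens: (B, N, dim); style_tokens: (B, M, style_dim).
        # M is small, so the injected signal carries aggregated style,
        # not the pixel detail needed for exact reconstruction.
        q = self.to_q(latent_tokens)
        k = self.to_k(style_tokens)
        v = self.to_v(style_tokens)
        attn = torch.softmax(q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5, dim=-1)
        return latent_tokens + attn @ v
```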

@zhihongz

zhihongz commented Sep 6, 2024

Got it, thanks for your patient and quick reply.

@nldhuyen0047
Author

> In StyleShot, styles are integrated into style embeddings by a specially designed style-aware encoder and then incorporated into the diffusion model through a cross-attention module. This highly aggregated style information is not suited to pixel-level reconstruction.

Sorry for my late reply.

I had encountered some problems with training the model.

I got it. Thank you very much.
