From Generation to Translation #7

Open
riven314 opened this issue Jul 27, 2020 · 5 comments
Labels
question Further information is requested

Comments

@riven314

riven314 commented Jul 27, 2020

This is very interesting work!
Do you think this model can be easily adapted to tackle the SVG translation problem (i.e. translating an SVG from one type to another)?
Do you have any related literature in mind for this line of work?
Would love to hear your thoughts!

@alexandre01
Owner

Hey Alex,
Yes for sure, we even had this kind of application in mind while developing DeepSVG. Instead of training for input reconstruction, just replace the output target with the desired translation.

One such example could be SVG "beautification". Given a dataset of clean SVGs, randomly jitter each one and feed it as input to the model, whose goal is now to reconstruct the clean SVG. Graphic designers could maybe use this to automatically clean up their vector drawings...
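
The jitter-then-reconstruct idea above can be sketched as follows. This is only an illustration, not DeepSVG's actual data pipeline: the path representation (a list of `(command, coordinates)` tuples with absolute `M`/`L`/`C`/`Z` commands), the helper names `jitter_path`, `to_path_data` and `make_pair`, and the Gaussian noise level `sigma` are all assumptions made for this example.

```python
import random

# Assumed representation: a path is a list of (command, coordinates) tuples,
# e.g. ("L", [10.0, 5.0]), using only absolute M/L/C/Z commands.

def jitter_path(path, sigma=0.5, rng=None):
    """Return a copy of `path` with Gaussian noise added to every coordinate."""
    if rng is None:
        rng = random.Random()
    return [(cmd, [c + rng.gauss(0.0, sigma) for c in coords])
            for cmd, coords in path]

def to_path_data(path):
    """Serialize a path to the string used in an SVG `d` attribute."""
    return " ".join(" ".join([cmd] + [f"{c:.2f}" for c in coords])
                    for cmd, coords in path)

def make_pair(clean_path, sigma=0.5, rng=None):
    """Return (noisy_input, clean_target) for a denoising-style training set."""
    return jitter_path(clean_path, sigma, rng), clean_path

if __name__ == "__main__":
    square = [("M", [0.0, 0.0]), ("L", [10.0, 0.0]),
              ("L", [10.0, 10.0]), ("L", [0.0, 10.0]), ("Z", [])]
    noisy, target = make_pair(square, sigma=0.5, rng=random.Random(0))
    print("input: ", to_path_data(noisy))
    print("target:", to_path_data(target))
```

The noisy path would be the model input and the untouched path the reconstruction target. Jittering only the coordinates keeps the command sequence identical, which is the simplest kind of corruption; dropping or splitting segments would be a harder variant.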

What kind of SVG translation do you have in mind?

@riven314
Author

riven314 commented Jul 28, 2020

"Beautification" is a good idea! I have two ideas in mind:

  1. Transferring an emoji from one style to another (e.g. author style, color, etc.).
  2. Treating a layout as an SVG object (a layout is a composition of different shapes), I want to transform a layout from one kind to another (keeping the shapes identical and simply applying geometric transformations to them). I want to see whether the model can capture the spatial pattern, and whether interpolating between two samples gives a smooth transition and nice disentanglement between shapes.

@alexandre01
Owner

Great!

  • While 1. sounds potentially feasible, I see the problem that you won't have enough training data, so the model will just overfit without the capacity to generalize.
  • 2. sounds like a great idea! On our side, we imagined HTML/CSS generation as a potential application/extension of DeepSVG, treating the different 'div' sections as square boxes. But what you described even has the advantage that you can generate as much data as you want. And I'm pretty sure this Transformer-based architecture is capable of finding these spatial relationships :)
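
To illustrate the "generate as much data as you want" point for the layout idea, here is a minimal sketch of a synthetic pair generator. Everything in it is an assumption made for this example: the box format `(x, y, w, h)`, the canvas size `CANVAS`, the size ranges, and the choice of a horizontal mirror as the geometric transformation. It is not part of DeepSVG.

```python
import random

CANVAS = 256  # assumed square canvas size, in pixels

def random_layout(n_boxes, rng):
    """Sample `n_boxes` axis-aligned boxes (x, y, w, h) that fit in the canvas."""
    boxes = []
    for _ in range(n_boxes):
        w = rng.randint(16, 96)
        h = rng.randint(16, 96)
        x = rng.randint(0, CANVAS - w)
        y = rng.randint(0, CANVAS - h)
        boxes.append((x, y, w, h))
    return boxes

def mirror_horizontally(boxes):
    """Example geometric transformation: flip the layout left to right."""
    return [(CANVAS - x - w, y, w, h) for x, y, w, h in boxes]

def make_dataset(n_samples, n_boxes=4, seed=0):
    """Return a list of (source_layout, target_layout) pairs."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n_samples):
        source = random_layout(n_boxes, rng)
        pairs.append((source, mirror_horizontally(source)))
    return pairs

if __name__ == "__main__":
    for source, target in make_dataset(2):
        print(source, "->", target)
```

Because the pairs are generated procedurally, the dataset size is limited only by compute, and any other deterministic transformation (rotation, reordering, alignment) could be swapped in for `mirror_horizontally`.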

@riven314
Author

riven314 commented Jul 28, 2020

Thanks for your feedback! I am actually at the brainstorming stage, thinking about which direction is more rewarding and faster to iterate on haha

  • For 1, I also foresee limited data being an obstacle, especially since an author has usually created only a limited number of emojis. Do you think generic transfer learning could apply to this model (e.g. fine-tuning the decoder part with a few examples)?
  • For 2, this is a model capacity I am really interested in, because it could enable the model to be applied in many areas such as layout design! To quickly validate the concept, I think RICO (a mobile UI layout dataset) is a good dataset for that. I am thinking of creating pseudo pairs (artificially transforming each layout into a particular form) in order to convert the dataset into one for the translation problem. To extend that further, the model could also take into account the image for each placeholder. Do you think the current model would be able to encode a shape plus its enclosed image?
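
The pseudo-pair idea for layouts could look roughly like the sketch below. It assumes each UI element is given as a bounding box `[x1, y1, x2, y2]` in pixels, and it uses snapping to a grid as an arbitrary stand-in for "a particular form". The helper names, the grid step and the single-`<path>`-per-box SVG encoding are all hypothetical choices for illustration, not something the dataset or DeepSVG prescribes.

```python
def box_to_path(bounds):
    """Encode a bounding box [x1, y1, x2, y2] as closed SVG path data."""
    x1, y1, x2, y2 = bounds
    return f"M {x1} {y1} L {x2} {y1} L {x2} {y2} L {x1} {y2} Z"

def snap_to_grid(bounds, step=8):
    """Round every coordinate to the nearest multiple of `step`."""
    return [step * round(v / step) for v in bounds]

def layout_to_svg(boxes, width, height):
    """Render a list of bounding boxes as one SVG document, one path per box."""
    paths = "".join(f'<path d="{box_to_path(b)}"/>' for b in boxes)
    return (f'<svg xmlns="http://www.w3.org/2000/svg" '
            f'viewBox="0 0 {width} {height}">{paths}</svg>')

def make_pseudo_pair(boxes, width, height, step=8):
    """Return (source_svg, target_svg): the raw layout and its snapped version."""
    target = [snap_to_grid(b, step) for b in boxes]
    return layout_to_svg(boxes, width, height), layout_to_svg(target, width, height)

if __name__ == "__main__":
    boxes = [[3, 13, 30, 61], [35, 12, 100, 58]]
    source_svg, target_svg = make_pseudo_pair(boxes, width=128, height=128)
    print(source_svg)
    print(target_svg)
```

Each box becomes a four-segment closed path, so both sides of the pair stay in the same command vocabulary. Encoding the image enclosed by each box is not covered here.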

alexandre01 added the question label Jul 29, 2020

@tsaxena

tsaxena commented Dec 18, 2020

This is an interesting conversation, @riven314 and @alexandre01. I am actually using the Rico dataset currently, but the focus of my work is classification. @alexandre01, do you think this work could easily be retrained on UI component SVGs for classification?
