Conditional Vector Graphics Generation for Music Cover Images
Generative Adversarial Networks (GAN) have motivated a rapid growth of the domain of computer image synthesis. As almost all the existing image synthesis algorithms consider an image as a pixel matrix, the high-resolution image synthesis is complicated.A good alternative can be vector images. Howeve...
Saved in:
| Main Authors | , , , |
|---|---|
| Format | Journal Article |
| Language | English |
| Published |
15.05.2022
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.48550/arxiv.2205.07301 |
Cover
| Abstract | Generative Adversarial Networks (GAN) have motivated a rapid growth of the
domain of computer image synthesis. As almost all the existing image synthesis
algorithms consider an image as a pixel matrix, the high-resolution image
synthesis is complicated.A good alternative can be vector images. However, they
belong to the highly sophisticated parametric space, which is a restriction for
solving the task of synthesizing vector graphics by GANs. In this paper, we
consider a specific application domain that softens this restriction
dramatically allowing the usage of vector image synthesis.
Music cover images should meet the requirements of Internet streaming
services and printing standards, which imply high resolution of graphic
materials without any additional requirements on the content of such images.
Existing music cover image generation services do not analyze tracks
themselves; however, some services mostly consider only genre tags. To generate
music covers as vector images that reflect the music and consist of simple
geometric objects, we suggest a GAN-based algorithm called CoverGAN. The
assessment of resulting images is based on their correspondence to the music
compared with AttnGAN and DALL-E text-to-image generation according to title or
lyrics. Moreover, the significance of the patterns found by CoverGAN has been
evaluated in terms of the correspondence of the generated cover images to the
musical tracks. Listeners evaluate the music covers generated by the proposed
algorithm as quite satisfactory and corresponding to the tracks. Music cover
images generation code and demo are available at
https://github.com/IzhanVarsky/CoverGAN. |
|---|---|
| AbstractList | Generative Adversarial Networks (GAN) have motivated a rapid growth of the
domain of computer image synthesis. As almost all the existing image synthesis
algorithms consider an image as a pixel matrix, the high-resolution image
synthesis is complicated.A good alternative can be vector images. However, they
belong to the highly sophisticated parametric space, which is a restriction for
solving the task of synthesizing vector graphics by GANs. In this paper, we
consider a specific application domain that softens this restriction
dramatically allowing the usage of vector image synthesis.
Music cover images should meet the requirements of Internet streaming
services and printing standards, which imply high resolution of graphic
materials without any additional requirements on the content of such images.
Existing music cover image generation services do not analyze tracks
themselves; however, some services mostly consider only genre tags. To generate
music covers as vector images that reflect the music and consist of simple
geometric objects, we suggest a GAN-based algorithm called CoverGAN. The
assessment of resulting images is based on their correspondence to the music
compared with AttnGAN and DALL-E text-to-image generation according to title or
lyrics. Moreover, the significance of the patterns found by CoverGAN has been
evaluated in terms of the correspondence of the generated cover images to the
musical tracks. Listeners evaluate the music covers generated by the proposed
algorithm as quite satisfactory and corresponding to the tracks. Music cover
images generation code and demo are available at
https://github.com/IzhanVarsky/CoverGAN. |
| Author | Efimova, Valeria Jarsky, Ivan Bizyaev, Ilya Filchenkov, Andrey |
| Author_xml | – sequence: 1 givenname: Valeria surname: Efimova fullname: Efimova, Valeria – sequence: 2 givenname: Ivan surname: Jarsky fullname: Jarsky, Ivan – sequence: 3 givenname: Ilya surname: Bizyaev fullname: Bizyaev, Ilya – sequence: 4 givenname: Andrey surname: Filchenkov fullname: Filchenkov, Andrey |
| BackLink | https://doi.org/10.48550/arXiv.2205.07301$$DView paper in arXiv |
| BookMark | eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIyMNUzMDc2MORksHXOz0vJLMnMz0vMUQhLTS7JL1JwL0osyMhMLlZwT81LLUoESSqkAcV9S4szkxWc88tSixQ8cxPTU4t5GFjTEnOKU3mhNDeDvJtriLOHLtii-IKizNzEosp4kIXxYAuNCasAAGLoNww |
| ContentType | Journal Article |
| Copyright | http://creativecommons.org/licenses/by-nc-sa/4.0 |
| Copyright_xml | – notice: http://creativecommons.org/licenses/by-nc-sa/4.0 |
| DBID | AKY GOX |
| DOI | 10.48550/arxiv.2205.07301 |
| DatabaseName | arXiv Computer Science arXiv.org |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| ExternalDocumentID | 2205_07301 |
| GroupedDBID | AKY GOX |
| ID | FETCH-arxiv_primary_2205_073013 |
| IEDL.DBID | GOX |
| IngestDate | Tue Jul 22 23:14:04 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-arxiv_primary_2205_073013 |
| OpenAccessLink | https://arxiv.org/abs/2205.07301 |
| ParticipantIDs | arxiv_primary_2205_07301 |
| PublicationCentury | 2000 |
| PublicationDate | 2022-05-15 |
| PublicationDateYYYYMMDD | 2022-05-15 |
| PublicationDate_xml | – month: 05 year: 2022 text: 2022-05-15 day: 15 |
| PublicationDecade | 2020 |
| PublicationYear | 2022 |
| Score | 3.594815 |
| SecondaryResourceType | preprint |
| Snippet | Generative Adversarial Networks (GAN) have motivated a rapid growth of the
domain of computer image synthesis. As almost all the existing image synthesis... |
| SourceID | arxiv |
| SourceType | Open Access Repository |
| SubjectTerms | Computer Science - Computer Vision and Pattern Recognition Computer Science - Graphics Computer Science - Sound |
| Title | Conditional Vector Graphics Generation for Music Cover Images |
| URI | https://arxiv.org/abs/2205.07301 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQSTVNsTRJA7bcUhJNjHRNkiwSdS0Sk02BEQKsnQ2TzA1SwDvkfP3MPEJNvCJMI5gYFGB7YRKLKjLLIOcDJxXrg3aB6oETITMDM7ChANrM6x8BmZwEH8UFVY9QB2xjgoWQKgk3QQZ-aOtOwRESHUIMTKl5Igy2zvmgeWHwmJtCGHiUXMEddE50ZnKxAuTYZ5CkArD5qAC-dlnBGbSuUsEzF5jXi0UZ5N1cQ5w9dMEWxhdAToeIB7klHuwWYzEGFmAfPlWCQcHMCNhMMDIHtndAO18NEi2MEs3SzBJTU0A3rhibpUkySOAyRQq3lDQDlxFoNT7oMFFTGQaWkqLSVFlgHVmSJAcOKAAk5Wmu |
| linkProvider | Cornell University |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Conditional+Vector+Graphics+Generation+for+Music+Cover+Images&rft.au=Efimova%2C+Valeria&rft.au=Jarsky%2C+Ivan&rft.au=Bizyaev%2C+Ilya&rft.au=Filchenkov%2C+Andrey&rft.date=2022-05-15&rft_id=info:doi/10.48550%2Farxiv.2205.07301&rft.externalDocID=2205_07301 |