Conditional Vector Graphics Generation for Music Cover Images

Generative Adversarial Networks (GAN) have motivated a rapid growth of the domain of computer image synthesis. As almost all the existing image synthesis algorithms consider an image as a pixel matrix, the high-resolution image synthesis is complicated.A good alternative can be vector images. Howeve...

Full description

Saved in:
Bibliographic Details
Main Authors Efimova, Valeria, Jarsky, Ivan, Bizyaev, Ilya, Filchenkov, Andrey
Format Journal Article
LanguageEnglish
Published 15.05.2022
Subjects
Online AccessGet full text
DOI10.48550/arxiv.2205.07301

Cover

Abstract Generative Adversarial Networks (GAN) have motivated a rapid growth of the domain of computer image synthesis. As almost all the existing image synthesis algorithms consider an image as a pixel matrix, the high-resolution image synthesis is complicated.A good alternative can be vector images. However, they belong to the highly sophisticated parametric space, which is a restriction for solving the task of synthesizing vector graphics by GANs. In this paper, we consider a specific application domain that softens this restriction dramatically allowing the usage of vector image synthesis. Music cover images should meet the requirements of Internet streaming services and printing standards, which imply high resolution of graphic materials without any additional requirements on the content of such images. Existing music cover image generation services do not analyze tracks themselves; however, some services mostly consider only genre tags. To generate music covers as vector images that reflect the music and consist of simple geometric objects, we suggest a GAN-based algorithm called CoverGAN. The assessment of resulting images is based on their correspondence to the music compared with AttnGAN and DALL-E text-to-image generation according to title or lyrics. Moreover, the significance of the patterns found by CoverGAN has been evaluated in terms of the correspondence of the generated cover images to the musical tracks. Listeners evaluate the music covers generated by the proposed algorithm as quite satisfactory and corresponding to the tracks. Music cover images generation code and demo are available at https://github.com/IzhanVarsky/CoverGAN.
AbstractList Generative Adversarial Networks (GAN) have motivated a rapid growth of the domain of computer image synthesis. As almost all the existing image synthesis algorithms consider an image as a pixel matrix, the high-resolution image synthesis is complicated.A good alternative can be vector images. However, they belong to the highly sophisticated parametric space, which is a restriction for solving the task of synthesizing vector graphics by GANs. In this paper, we consider a specific application domain that softens this restriction dramatically allowing the usage of vector image synthesis. Music cover images should meet the requirements of Internet streaming services and printing standards, which imply high resolution of graphic materials without any additional requirements on the content of such images. Existing music cover image generation services do not analyze tracks themselves; however, some services mostly consider only genre tags. To generate music covers as vector images that reflect the music and consist of simple geometric objects, we suggest a GAN-based algorithm called CoverGAN. The assessment of resulting images is based on their correspondence to the music compared with AttnGAN and DALL-E text-to-image generation according to title or lyrics. Moreover, the significance of the patterns found by CoverGAN has been evaluated in terms of the correspondence of the generated cover images to the musical tracks. Listeners evaluate the music covers generated by the proposed algorithm as quite satisfactory and corresponding to the tracks. Music cover images generation code and demo are available at https://github.com/IzhanVarsky/CoverGAN.
Author Efimova, Valeria
Jarsky, Ivan
Bizyaev, Ilya
Filchenkov, Andrey
Author_xml – sequence: 1
  givenname: Valeria
  surname: Efimova
  fullname: Efimova, Valeria
– sequence: 2
  givenname: Ivan
  surname: Jarsky
  fullname: Jarsky, Ivan
– sequence: 3
  givenname: Ilya
  surname: Bizyaev
  fullname: Bizyaev, Ilya
– sequence: 4
  givenname: Andrey
  surname: Filchenkov
  fullname: Filchenkov, Andrey
BackLink https://doi.org/10.48550/arXiv.2205.07301$$DView paper in arXiv
BookMark eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIyMNUzMDc2MORksHXOz0vJLMnMz0vMUQhLTS7JL1JwL0osyMhMLlZwT81LLUoESSqkAcV9S4szkxWc88tSixQ8cxPTU4t5GFjTEnOKU3mhNDeDvJtriLOHLtii-IKizNzEosp4kIXxYAuNCasAAGLoNww
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by-nc-sa/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by-nc-sa/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2205.07301
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2205_07301
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2205_073013
IEDL.DBID GOX
IngestDate Tue Jul 22 23:14:04 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2205_073013
OpenAccessLink https://arxiv.org/abs/2205.07301
ParticipantIDs arxiv_primary_2205_07301
PublicationCentury 2000
PublicationDate 2022-05-15
PublicationDateYYYYMMDD 2022-05-15
PublicationDate_xml – month: 05
  year: 2022
  text: 2022-05-15
  day: 15
PublicationDecade 2020
PublicationYear 2022
Score 3.594815
SecondaryResourceType preprint
Snippet Generative Adversarial Networks (GAN) have motivated a rapid growth of the domain of computer image synthesis. As almost all the existing image synthesis...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Computer Science - Graphics
Computer Science - Sound
Title Conditional Vector Graphics Generation for Music Cover Images
URI https://arxiv.org/abs/2205.07301
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQSTVNsTRJA7bcUhJNjHRNkiwSdS0Sk02BEQKsnQ2TzA1SwDvkfP3MPEJNvCJMI5gYFGB7YRKLKjLLIOcDJxXrg3aB6oETITMDM7ChANrM6x8BmZwEH8UFVY9QB2xjgoWQKgk3QQZ-aOtOwRESHUIMTKl5Igy2zvmgeWHwmJtCGHiUXMEddE50ZnKxAuTYZ5CkArD5qAC-dlnBGbSuUsEzF5jXi0UZ5N1cQ5w9dMEWxhdAToeIB7klHuwWYzEGFmAfPlWCQcHMCNhMMDIHtndAO18NEi2MEs3SzBJTU0A3rhibpUkySOAyRQq3lDQDlxFoNT7oMFFTGQaWkqLSVFlgHVmSJAcOKAAk5Wmu
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Conditional+Vector+Graphics+Generation+for+Music+Cover+Images&rft.au=Efimova%2C+Valeria&rft.au=Jarsky%2C+Ivan&rft.au=Bizyaev%2C+Ilya&rft.au=Filchenkov%2C+Andrey&rft.date=2022-05-15&rft_id=info:doi/10.48550%2Farxiv.2205.07301&rft.externalDocID=2205_07301