Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning

Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper,...

Full description

Saved in:
Bibliographic Details
Published inAdvances in Artificial Intelligence and Applied Cognitive Computing pp. 17 - 28
Main Authors Amirian, Soheyla, Rasheed, Khaled, Taha, Thiab R., Arabnia, Hamid R.
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing 2021
SeriesTransactions on Computational Science and Computational Intelligence
Subjects
Online AccessGet full text
ISBN9783030702953
3030702952
ISSN2569-7072
2569-7080
DOI10.1007/978-3-030-70296-0_2

Cover

Abstract Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper, we are proposing an architecture that utilizes image/video captioning methods and Natural Language Processing systems to generate a title and a concise abstract for a video. Such a system can potentially be utilized in many application domains, including, the cinema industry, video search engines, security surveillance, video databases/warehouses, data centers, and others. The proposed system functions and operates as followed: it reads a video; representative image frames are identified and selected; the image frames are captioned; NLP is applied to all generated captions together with text summarization; and finally, a title and an abstract are generated for the video. All functions are performed automatically. Preliminary results are provided in this paper using publicly available datasets. This paper is not concerned about the efficiency of the system at the execution time. We hope to be able to address execution efficiency issues in our subsequent publications.
AbstractList Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper, we are proposing an architecture that utilizes image/video captioning methods and Natural Language Processing systems to generate a title and a concise abstract for a video. Such a system can potentially be utilized in many application domains, including, the cinema industry, video search engines, security surveillance, video databases/warehouses, data centers, and others. The proposed system functions and operates as followed: it reads a video; representative image frames are identified and selected; the image frames are captioned; NLP is applied to all generated captions together with text summarization; and finally, a title and an abstract are generated for the video. All functions are performed automatically. Preliminary results are provided in this paper using publicly available datasets. This paper is not concerned about the efficiency of the system at the execution time. We hope to be able to address execution efficiency issues in our subsequent publications.
Author Amirian, Soheyla
Arabnia, Hamid R.
Rasheed, Khaled
Taha, Thiab R.
Author_xml – sequence: 1
  givenname: Soheyla
  surname: Amirian
  fullname: Amirian, Soheyla
  email: amirian@uga.edu
– sequence: 2
  givenname: Khaled
  surname: Rasheed
  fullname: Rasheed, Khaled
– sequence: 3
  givenname: Thiab R.
  surname: Taha
  fullname: Taha, Thiab R.
– sequence: 4
  givenname: Hamid R.
  surname: Arabnia
  fullname: Arabnia, Hamid R.
BookMark eNpVkM1OwzAQhA0UiVL6BFz8AoG1Hf_kWBVakCq4FK6Wna6RocRRHHh-TEFInHZ3ZrQjfedk0qUOCblkcMUA9HWjTSUqEFBp4I2qwPIjMi-qKNpBgmMy5VI1JWDg5J8nxeTP0_yMzHN-BQCumVHaTMnD4mNM726MLV1jh0PZUkdToDeY2yH2Y_xEuo3jHjMNaaDPcYeJLvexz_Qpx-6lBLGnG3RDV64LchrcPuP8d87IdnW7Xd5Vm8f1_XKxqTLjeqx0rUzLW4lOtK5uDHovd0Ia09ZGKuMa9DqY0HgewHFWS87YzgelvUemUMwI-3mb-6G04mB9Sm_ZMrDfyGwBYIUtCOyBjy3IxBeICVvD
ContentType Book Chapter
Copyright Springer Nature Switzerland AG 2021
Copyright_xml – notice: Springer Nature Switzerland AG 2021
DOI 10.1007/978-3-030-70296-0_2
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9783030702960
3030702960
EISSN 2569-7080
Editor Ferens, Ken
Olivas Varela, José Angel
Tinetti, Fernando G.
Arabnia, Hamid R.
Kozerenko, Elena B.
de la Fuente, David
Editor_xml – sequence: 1
  givenname: Hamid R.
  surname: Arabnia
  fullname: Arabnia, Hamid R.
  email: hra@uga.edu
– sequence: 2
  givenname: Ken
  surname: Ferens
  fullname: Ferens, Ken
  email: ken.ferens@ad.umanitoba.ca
– sequence: 3
  givenname: David
  surname: de la Fuente
  fullname: de la Fuente, David
  email: david@uniovi.es
– sequence: 4
  givenname: Elena B.
  surname: Kozerenko
  fullname: Kozerenko, Elena B.
  email: elenakozerenko@yahoo.com
– sequence: 5
  givenname: José Angel
  surname: Olivas Varela
  fullname: Olivas Varela, José Angel
  email: joseangel.olivas@uclm.es
– sequence: 6
  givenname: Fernando G.
  surname: Tinetti
  fullname: Tinetti, Fernando G.
  email: fernando@info.unlp.edu.ar
EndPage 28
GroupedDBID 38.
AABBV
AABLV
ABLLD
ABNDO
ACWLQ
AEJLV
AEKFX
AELOD
AIYYB
ALMA_UNASSIGNED_HOLDINGS
BAHJK
BBABE
CZZ
DBWEY
I4C
IEZ
OCUHQ
ORHYB
SBO
TPJZQ
Z5O
Z7R
Z7S
Z7U
Z7V
Z7W
Z7X
Z7Y
Z7Z
Z81
Z83
Z84
Z85
Z87
Z88
ID FETCH-LOGICAL-s127t-7468c2c5ea3ca498ebb5d3588c48568a9eb7f8f9b2f0a2145211dbf67bbe16e3
ISBN 9783030702953
3030702952
ISSN 2569-7072
IngestDate Tue Jul 29 20:17:14 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-s127t-7468c2c5ea3ca498ebb5d3588c48568a9eb7f8f9b2f0a2145211dbf67bbe16e3
PageCount 12
ParticipantIDs springer_books_10_1007_978_3_030_70296_0_2
PublicationCentury 2000
PublicationDate 2021
PublicationDateYYYYMMDD 2021-01-01
PublicationDate_xml – year: 2021
  text: 2021
PublicationDecade 2020
PublicationPlace Cham
PublicationPlace_xml – name: Cham
PublicationSeriesTitle Transactions on Computational Science and Computational Intelligence
PublicationSeriesTitleAlternate Transactions Computational Science Computational Intelligence
PublicationSubtitle Proceedings from ICAI’20 and ACC’20
PublicationTitle Advances in Artificial Intelligence and Applied Cognitive Computing
PublicationYear 2021
Publisher Springer International Publishing
Publisher_xml – name: Springer International Publishing
RelatedPersons Arabnia, Hamid
RelatedPersons_xml – sequence: 1
  givenname: Hamid
  surname: Arabnia
  fullname: Arabnia, Hamid
SSID ssj0002718678
Score 1.6438258
Snippet Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert...
SourceID springer
SourceType Publisher
StartPage 17
SubjectTerms Deep learning
LSTM
NLP
Text summarization
Video captioning
Title Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
URI http://link.springer.com/10.1007/978-3-030-70296-0_2
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9swDBay7LLtsif2hg47LXBgyw_JxyLrEBRDD0VW9GZINo0EWOOicQZsf3d_ZJRo2U66S5eDEchGxJCfJYoiPzH2yQB-IgRvmkIdJCW-c3mswkAAOhOyMlmpXZbvebb8npxdpVeTyZ9R1tK-NfPy9z_rSv7HqtiGdrVVsvewbP-j2IDf0b54RQvj9cj5PQyzUnox7d67fNaTW5fyQ8QZI45NGxX3fuaiTxSikxz8nGWtfb1BaSkS2qzh149-rL7QuzV08dA1ziXVsNKnnaLVeqPN7GI-AEebLSXgLvX1pupvdULv24ZIYonv2vuruPyl4eunPWoIn3Q0EbPLTQXNbGG3K2aU3PAF4MaTwpL8VtOwo0mXqjTcDgj9SR_q9EOYVcjhnbG-xiEQER2FQHwI9CiIOsTxDtbMsRvmRE4cxTTUot-XBzKkQ4TmMG6jc6a64Z3KTDtHgYra70xB46wT7CqwfWVBWKCf8AB7n7KHJ6dn3y77QKCQllPQnZzopbA1SF5KQSxRg9Q9dRaxIx91cmdD3_lJq6fsia2d4baoBXX0jE1g-5w9HnFevmDnPQT4AAHe1HwEAU4Q4AgB7iDAHQS4gwC3EOAeAi_Z6uvparEMukM9gl0kZBvIJFOlKFPQcamTXIExaRWnSpWJSjOlczCyVnVuRB1qS6MvoqgydSaNgSiD-BWbbpstvGYcfWOl0RvXAmSSSsekVZqownkzFFWevWGfvSoK-5buCk_RjXor4gL1Vji9Fai3t_d5-B17NMDwPZu2t3v4gL5paz521v0Lb6uL0Q
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Advances+in+Artificial+Intelligence+and+Applied+Cognitive+Computing&rft.au=Amirian%2C+Soheyla&rft.au=Rasheed%2C+Khaled&rft.au=Taha%2C+Thiab+R.&rft.au=Arabnia%2C+Hamid+R.&rft.atitle=Automatic+Generation+of+Descriptive+Titles+for+Video+Clips+Using+Deep+Learning&rft.series=Transactions+on+Computational+Science+and+Computational+Intelligence&rft.date=2021-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783030702953&rft.issn=2569-7072&rft.eissn=2569-7080&rft.spage=17&rft.epage=28&rft_id=info:doi/10.1007%2F978-3-030-70296-0_2
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2569-7072&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2569-7072&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2569-7072&client=summon