Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning

Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper,...

Full description

Saved in:

Bibliographic Details
Published in	Advances in Artificial Intelligence and Applied Cognitive Computing pp. 17 - 28
Main Authors	Amirian, Soheyla, Rasheed, Khaled, Taha, Thiab R., Arabnia, Hamid R.
Format	Book Chapter
Language	English
Published	Cham Springer International Publishing 2021
Series	Transactions on Computational Science and Computational Intelligence
Subjects	Deep learning LSTM NLP Text summarization Video captioning
Online Access	Get full text
ISBN	9783030702953 3030702952
ISSN	2569-7072 2569-7080
DOI	10.1007/978-3-030-70296-0_2

Cover

Abstract	Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper, we are proposing an architecture that utilizes image/video captioning methods and Natural Language Processing systems to generate a title and a concise abstract for a video. Such a system can potentially be utilized in many application domains, including, the cinema industry, video search engines, security surveillance, video databases/warehouses, data centers, and others. The proposed system functions and operates as followed: it reads a video; representative image frames are identified and selected; the image frames are captioned; NLP is applied to all generated captions together with text summarization; and finally, a title and an abstract are generated for the video. All functions are performed automatically. Preliminary results are provided in this paper using publicly available datasets. This paper is not concerned about the efficiency of the system at the execution time. We hope to be able to address execution efficiency issues in our subsequent publications.
AbstractList	Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper, we are proposing an architecture that utilizes image/video captioning methods and Natural Language Processing systems to generate a title and a concise abstract for a video. Such a system can potentially be utilized in many application domains, including, the cinema industry, video search engines, security surveillance, video databases/warehouses, data centers, and others. The proposed system functions and operates as followed: it reads a video; representative image frames are identified and selected; the image frames are captioned; NLP is applied to all generated captions together with text summarization; and finally, a title and an abstract are generated for the video. All functions are performed automatically. Preliminary results are provided in this paper using publicly available datasets. This paper is not concerned about the efficiency of the system at the execution time. We hope to be able to address execution efficiency issues in our subsequent publications.
Author	Amirian, Soheyla Arabnia, Hamid R. Rasheed, Khaled Taha, Thiab R.
Author_xml	– sequence: 1 givenname: Soheyla surname: Amirian fullname: Amirian, Soheyla email: amirian@uga.edu – sequence: 2 givenname: Khaled surname: Rasheed fullname: Rasheed, Khaled – sequence: 3 givenname: Thiab R. surname: Taha fullname: Taha, Thiab R. – sequence: 4 givenname: Hamid R. surname: Arabnia fullname: Arabnia, Hamid R.
BookMark	eNpVkM1OwzAQhA0UiVL6BFz8AoG1Hf_kWBVakCq4FK6Wna6RocRRHHh-TEFInHZ3ZrQjfedk0qUOCblkcMUA9HWjTSUqEFBp4I2qwPIjMi-qKNpBgmMy5VI1JWDg5J8nxeTP0_yMzHN-BQCumVHaTMnD4mNM726MLV1jh0PZUkdToDeY2yH2Y_xEuo3jHjMNaaDPcYeJLvexz_Qpx-6lBLGnG3RDV64LchrcPuP8d87IdnW7Xd5Vm8f1_XKxqTLjeqx0rUzLW4lOtK5uDHovd0Ia09ZGKuMa9DqY0HgewHFWS87YzgelvUemUMwI-3mb-6G04mB9Sm_ZMrDfyGwBYIUtCOyBjy3IxBeICVvD
ContentType	Book Chapter
Copyright	Springer Nature Switzerland AG 2021
Copyright_xml	– notice: Springer Nature Switzerland AG 2021
DOI	10.1007/978-3-030-70296-0_2
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISBN	9783030702960 3030702960
EISSN	2569-7080
Editor	Ferens, Ken Olivas Varela, José Angel Tinetti, Fernando G. Arabnia, Hamid R. Kozerenko, Elena B. de la Fuente, David
Editor_xml	– sequence: 1 givenname: Hamid R. surname: Arabnia fullname: Arabnia, Hamid R. email: hra@uga.edu – sequence: 2 givenname: Ken surname: Ferens fullname: Ferens, Ken email: ken.ferens@ad.umanitoba.ca – sequence: 3 givenname: David surname: de la Fuente fullname: de la Fuente, David email: david@uniovi.es – sequence: 4 givenname: Elena B. surname: Kozerenko fullname: Kozerenko, Elena B. email: elenakozerenko@yahoo.com – sequence: 5 givenname: José Angel surname: Olivas Varela fullname: Olivas Varela, José Angel email: joseangel.olivas@uclm.es – sequence: 6 givenname: Fernando G. surname: Tinetti fullname: Tinetti, Fernando G. email: fernando@info.unlp.edu.ar
EndPage	28
GroupedDBID	38. AABBV AABLV ABLLD ABNDO ACWLQ AEJLV AEKFX AELOD AIYYB ALMA_UNASSIGNED_HOLDINGS BAHJK BBABE CZZ DBWEY I4C IEZ OCUHQ ORHYB SBO TPJZQ Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z83 Z84 Z85 Z87 Z88
ID	FETCH-LOGICAL-s127t-7468c2c5ea3ca498ebb5d3588c48568a9eb7f8f9b2f0a2145211dbf67bbe16e3
ISBN	9783030702953 3030702952
ISSN	2569-7072
IngestDate	Tue Jul 29 20:17:14 EDT 2025
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-s127t-7468c2c5ea3ca498ebb5d3588c48568a9eb7f8f9b2f0a2145211dbf67bbe16e3
PageCount	12
ParticipantIDs	springer_books_10_1007_978_3_030_70296_0_2
PublicationCentury	2000
PublicationDate	2021
PublicationDateYYYYMMDD	2021-01-01
PublicationDate_xml	– year: 2021 text: 2021
PublicationDecade	2020
PublicationPlace	Cham
PublicationPlace_xml	– name: Cham
PublicationSeriesTitle	Transactions on Computational Science and Computational Intelligence
PublicationSeriesTitleAlternate	Transactions Computational Science Computational Intelligence
PublicationSubtitle	Proceedings from ICAI’20 and ACC’20
PublicationTitle	Advances in Artificial Intelligence and Applied Cognitive Computing
PublicationYear	2021
Publisher	Springer International Publishing
Publisher_xml	– name: Springer International Publishing
RelatedPersons	Arabnia, Hamid
RelatedPersons_xml	– sequence: 1 givenname: Hamid surname: Arabnia fullname: Arabnia, Hamid
SSID	ssj0002718678
Score	1.6438258
Snippet	Over the last decade, the use of Deep Learning in many applications produced results that are comparable to and in some cases surpassing human expert...
SourceID	springer
SourceType	Publisher
StartPage	17
SubjectTerms	Deep learning LSTM NLP Text summarization Video captioning
Title	Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
URI	http://link.springer.com/10.1007/978-3-030-70296-0_2
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9swDBay7LLtsif2hg47LXBgyw_JxyLrEBRDD0VW9GZINo0EWOOicQZsf3d_ZJRo2U66S5eDEchGxJCfJYoiPzH2yQB-IgRvmkIdJCW-c3mswkAAOhOyMlmpXZbvebb8npxdpVeTyZ9R1tK-NfPy9z_rSv7HqtiGdrVVsvewbP-j2IDf0b54RQvj9cj5PQyzUnox7d67fNaTW5fyQ8QZI45NGxX3fuaiTxSikxz8nGWtfb1BaSkS2qzh149-rL7QuzV08dA1ziXVsNKnnaLVeqPN7GI-AEebLSXgLvX1pupvdULv24ZIYonv2vuruPyl4eunPWoIn3Q0EbPLTQXNbGG3K2aU3PAF4MaTwpL8VtOwo0mXqjTcDgj9SR_q9EOYVcjhnbG-xiEQER2FQHwI9CiIOsTxDtbMsRvmRE4cxTTUot-XBzKkQ4TmMG6jc6a64Z3KTDtHgYra70xB46wT7CqwfWVBWKCf8AB7n7KHJ6dn3y77QKCQllPQnZzopbA1SF5KQSxRg9Q9dRaxIx91cmdD3_lJq6fsia2d4baoBXX0jE1g-5w9HnFevmDnPQT4AAHe1HwEAU4Q4AgB7iDAHQS4gwC3EOAeAi_Z6uvparEMukM9gl0kZBvIJFOlKFPQcamTXIExaRWnSpWJSjOlczCyVnVuRB1qS6MvoqgydSaNgSiD-BWbbpstvGYcfWOl0RvXAmSSSsekVZqownkzFFWevWGfvSoK-5buCk_RjXor4gL1Vji9Fai3t_d5-B17NMDwPZu2t3v4gL5paz521v0Lb6uL0Q
linkProvider	Library Specific Holdings
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Advances+in+Artificial+Intelligence+and+Applied+Cognitive+Computing&rft.au=Amirian%2C+Soheyla&rft.au=Rasheed%2C+Khaled&rft.au=Taha%2C+Thiab+R.&rft.au=Arabnia%2C+Hamid+R.&rft.atitle=Automatic+Generation+of+Descriptive+Titles+for+Video+Clips+Using+Deep+Learning&rft.series=Transactions+on+Computational+Science+and+Computational+Intelligence&rft.date=2021-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783030702953&rft.issn=2569-7072&rft.eissn=2569-7080&rft.spage=17&rft.epage=28&rft_id=info:doi/10.1007%2F978-3-030-70296-0_2
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2569-7072&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2569-7072&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2569-7072&client=summon