Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Over the last decade, the use of Deep Learning in many applications has produced results that are comparable to, and in some cases surpass, human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper,...
| Published in | Advances in Artificial Intelligence and Applied Cognitive Computing pp. 17 - 28 |
|---|---|
| Main Authors | Amirian, Soheyla; Rasheed, Khaled; Taha, Thiab R.; Arabnia, Hamid R. |
| Format | Book Chapter |
| Language | English |
| Published | Cham: Springer International Publishing, 2021 |
| Series | Transactions on Computational Science and Computational Intelligence |
| Subjects | Deep learning; LSTM; NLP; Text summarization; Video captioning |
| Online Access | Get full text |
| ISBN | 9783030702953 3030702952 |
| ISSN | 2569-7072 2569-7080 |
| DOI | 10.1007/978-3-030-70296-0_2 |
| Abstract | Over the last decade, the use of Deep Learning in many applications has produced results that are comparable to, and in some cases surpass, human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, and many others. In this paper, we propose an architecture that uses image/video captioning methods and Natural Language Processing (NLP) systems to generate a title and a concise abstract for a video. Such a system could be used in many application domains, including the cinema industry, video search engines, security surveillance, video databases/warehouses, data centers, and others. The proposed system operates as follows: it reads a video; representative image frames are identified and selected; the image frames are captioned; NLP is applied to all generated captions together with text summarization; and finally, a title and an abstract are generated for the video. All functions are performed automatically. Preliminary results are provided in this paper using publicly available datasets. This paper is not concerned with the efficiency of the system at execution time. We hope to address execution efficiency in subsequent publications. |
|---|---|
| Author | Amirian, Soheyla; Arabnia, Hamid R.; Rasheed, Khaled; Taha, Thiab R. |
| Author_xml | – sequence: 1 givenname: Soheyla surname: Amirian fullname: Amirian, Soheyla email: amirian@uga.edu – sequence: 2 givenname: Khaled surname: Rasheed fullname: Rasheed, Khaled – sequence: 3 givenname: Thiab R. surname: Taha fullname: Taha, Thiab R. – sequence: 4 givenname: Hamid R. surname: Arabnia fullname: Arabnia, Hamid R. |
| ContentType | Book Chapter |
| Copyright | Springer Nature Switzerland AG 2021 |
| Copyright_xml | – notice: Springer Nature Switzerland AG 2021 |
| DOI | 10.1007/978-3-030-70296-0_2 |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISBN | 9783030702960 3030702960 |
| EISSN | 2569-7080 |
| Editor | Ferens, Ken; Olivas Varela, José Angel; Tinetti, Fernando G.; Arabnia, Hamid R.; Kozerenko, Elena B.; de la Fuente, David |
| Editor_xml | – sequence: 1 givenname: Hamid R. surname: Arabnia fullname: Arabnia, Hamid R. email: hra@uga.edu – sequence: 2 givenname: Ken surname: Ferens fullname: Ferens, Ken email: ken.ferens@ad.umanitoba.ca – sequence: 3 givenname: David surname: de la Fuente fullname: de la Fuente, David email: david@uniovi.es – sequence: 4 givenname: Elena B. surname: Kozerenko fullname: Kozerenko, Elena B. email: elenakozerenko@yahoo.com – sequence: 5 givenname: José Angel surname: Olivas Varela fullname: Olivas Varela, José Angel email: joseangel.olivas@uclm.es – sequence: 6 givenname: Fernando G. surname: Tinetti fullname: Tinetti, Fernando G. email: fernando@info.unlp.edu.ar |
| EndPage | 28 |
| GroupedDBID | 38. AABBV AABLV ABLLD ABNDO ACWLQ AEJLV AEKFX AELOD AIYYB ALMA_UNASSIGNED_HOLDINGS BAHJK BBABE CZZ DBWEY I4C IEZ OCUHQ ORHYB SBO TPJZQ Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z83 Z84 Z85 Z87 Z88 |
| ID | FETCH-LOGICAL-s127t-7468c2c5ea3ca498ebb5d3588c48568a9eb7f8f9b2f0a2145211dbf67bbe16e3 |
| ISBN | 9783030702953 3030702952 |
| ISSN | 2569-7072 |
| IngestDate | Tue Jul 29 20:17:14 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-s127t-7468c2c5ea3ca498ebb5d3588c48568a9eb7f8f9b2f0a2145211dbf67bbe16e3 |
| PageCount | 12 |
| ParticipantIDs | springer_books_10_1007_978_3_030_70296_0_2 |
| PublicationCentury | 2000 |
| PublicationDate | 2021 |
| PublicationDateYYYYMMDD | 2021-01-01 |
| PublicationDate_xml | – year: 2021 text: 2021 |
| PublicationDecade | 2020 |
| PublicationPlace | Cham |
| PublicationPlace_xml | – name: Cham |
| PublicationSeriesTitle | Transactions on Computational Science and Computational Intelligence |
| PublicationSeriesTitleAlternate | Transactions Computational Science Computational Intelligence |
| PublicationSubtitle | Proceedings from ICAI’20 and ACC’20 |
| PublicationTitle | Advances in Artificial Intelligence and Applied Cognitive Computing |
| PublicationYear | 2021 |
| Publisher | Springer International Publishing |
| Publisher_xml | – name: Springer International Publishing |
| RelatedPersons | Arabnia, Hamid |
| RelatedPersons_xml | – sequence: 1 givenname: Hamid surname: Arabnia fullname: Arabnia, Hamid |
| SSID | ssj0002718678 |
| Score | 1.6438258 |
| Snippet | Over the last decade, the use of Deep Learning in many applications has produced results that are comparable to, and in some cases surpass, human expert... |
| SourceID | springer |
| SourceType | Publisher |
| StartPage | 17 |
| SubjectTerms | Deep learning LSTM NLP Text summarization Video captioning |
| Title | Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning |
| URI | http://link.springer.com/10.1007/978-3-030-70296-0_2 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
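
The steps listed in the abstract (read a video, select representative frames, caption them, summarize the captions, produce a title and an abstract) can be sketched in Python. This is only an illustrative sketch under stated assumptions, not the authors' system: evenly spaced sampling stands in for representative-frame selection, a simple word-frequency extractive ranker stands in for the NLP and text-summarization stage, and `caption_frame` is a hypothetical callable that a real image-captioning model would have to supply. All function names here are assumptions introduced for illustration.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple

# Small illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"a", "an", "the", "is", "are", "on", "in", "of", "and", "with", "to"}


def select_frames(num_frames: int, k: int) -> List[int]:
    """Pick up to k evenly spaced frame indices.

    Assumption: uniform sampling is a stand-in for whatever
    representative-frame selection the real system performs.
    """
    if num_frames <= 0 or k <= 0:
        return []
    k = min(k, num_frames)
    step = num_frames / k
    return [int(i * step) for i in range(k)]


def content_words(caption: str) -> List[str]:
    """Lowercase, split on whitespace, and drop stopwords."""
    return [w for w in caption.lower().split() if w not in STOPWORDS]


def summarize(captions: Sequence[str], max_sentences: int = 2) -> Tuple[str, str]:
    """Return (title, abstract) from a list of frame captions.

    Each distinct caption is scored by the mean frequency of its content
    words across all captions; the top caption becomes the title and the
    top `max_sentences` captions, in original order, form the abstract.
    """
    if not captions:
        return "", ""
    freq = Counter(w for c in captions for w in content_words(c))
    unique = list(dict.fromkeys(captions))  # drop repeats, keep first-seen order

    def score(caption: str) -> float:
        words = content_words(caption)
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    # sorted() is stable, so ties keep first-seen order.
    ranked = sorted(range(len(unique)), key=lambda i: score(unique[i]), reverse=True)
    title = unique[ranked[0]]
    keep = sorted(ranked[:max_sentences])
    abstract = ". ".join(unique[i] for i in keep) + "."
    return title, abstract


def describe_video(frames: Sequence[object],
                   caption_frame: Callable[[object], str],
                   k: int = 4) -> Tuple[str, str]:
    """Run the whole sketch: sample frames, caption them, summarize."""
    captions = [caption_frame(frames[i]) for i in select_frames(len(frames), k)]
    return summarize(captions)
```

To try the sketch without a captioning model, pre-written caption strings can play the role of frames, with an identity function passed as `caption_frame`. Swapping in a real captioner or an abstractive summarizer would change only `caption_frame` and `summarize`; the overall flow stays the same.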