Implementing Machine Learning for AI-Powered Solutions in Robotics, Computer Vision, and Natural Language Processing
| Published in | 2025 Global Conference in Emerging Technology (GINOTECH) pp. 1 - 6 |
|---|---|
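| Main Authors | Kasu, Venugopala Reddy; Malamuthu, Bakkiyaraj Kanthimathi; Kumar, B. Shravan; Pandi, V. Samuthira; Sivajothi, E.; Deepika, D S |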
| Format | Conference Proceeding |
| Language | English |
| Published | IEEE, 09.05.2025 |
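| Subjects | Autonomous Systems; Computer vision; Convolutional Neural Networks (CNNs); Human-Robot Interaction (HRI); Natural Language Processing (NLP); Reinforcement Learning (RL); Transformer Models; Vision Transformers (ViTs) |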
| DOI | 10.1109/GINOTECH63460.2025.11077035 |
| Abstract | Machine Learning (ML) is a relatively recent development that has had a profound impact on Robotics, Computer Vision (CV), and Natural Language Processing (NLP) by enabling the automation of intelligent processing, perception, and human-computer interaction. ML-driven AI solutions are crucial in these areas for real-time decision-making, pattern recognition, and autonomous operations; traditional rule-based methods tend to lack the adaptability and scalability to meet the demands of these fields. Reinforcement learning (RL) and deep learning have increasingly improved robotic perception, motion planning, and manipulation, enabling more adaptive autonomous systems. Convolutional neural networks (CNNs) and transformer-based architectures have revolutionized computer vision, with applications ranging from object detection and image segmentation to facial recognition, allowing for advanced visual analysis across sectors such as healthcare, security, and self-driving cars. In NLP, the introduction of transformers like BERT and GPT has transformed the field by enabling more context-aware and human-like language models, leading to drastic improvements in areas like speech recognition, sentiment analysis, and machine translation. This study examines the application of ML techniques across these domains, evaluating their implications for efficiency, accuracy, and scalability, along with challenges such as data quality, model interpretability, and computational constraints. Deep Q-Networks (DQN) have advanced robotics, Vision Transformers (ViT) computer vision, and large language models (LLMs) natural language understanding (NLU); a comparative study analyses the progress and limitations of these state-of-the-art ML models. The results highlight the promise of multi-modal AI systems, where machine learning algorithms work together to improve robotic navigation, visual perception, and natural language understanding. Extending prior studies of intelligent automation, this research points the way to the development of new adaptive, human-centric, and autonomous AI innovations in diverse sectors. |
| Author | Kasu, Venugopala Reddy; Sivajothi, E.; Deepika, D S; Malamuthu, Bakkiyaraj Kanthimathi; Kumar, B. Shravan; Pandi, V. Samuthira |
| Author_xml | – sequence: 1; givenname: Venugopala Reddy; surname: Kasu; fullname: Kasu, Venugopala Reddy; email: venugopal.15583@gmail.com; organization: Sr Functional Architect, Celina, Texas, 75009 – sequence: 2; givenname: Bakkiyaraj Kanthimathi; surname: Malamuthu; fullname: Malamuthu, Bakkiyaraj Kanthimathi; email: Bhagykm@gmail.com; organization: Morgan Stanley Services Inc, Financial Crimes Technology, New York City, New York, 10019 – sequence: 3; givenname: B. Shravan; surname: Kumar; fullname: Kumar, B. Shravan; email: bkshravankumar81@gmail.com; organization: JB Institute of Engineering & Technology, Department of Electronics and Communication, Telangana – sequence: 4; givenname: V. Samuthira; surname: Pandi; fullname: Pandi, V. Samuthira; email: samuthirapandime@gmail.com; organization: Chennai Institute of Technology, Centre for Advanced Wireless Integrated Technology, Chennai, Tamil Nadu, India – sequence: 5; givenname: E.; surname: Sivajothi; fullname: Sivajothi, E.; email: drsivajothie@veltech.edu.in; organization: Dr. Sagunthala R&D Institute of Science and Technology, Vel Tech Rangarajan, Department of Computer Science and Engineering – sequence: 6; givenname: D S; surname: Deepika; fullname: Deepika, D S; email: deepika.it@rmd.ac.in; organization: R.M.D. Engineering College, Department of Information Technology, Kavaraipettai, Tamil Nadu |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Xplore POP ALL; IEEE Xplore All Conference Proceedings; IEEE Xplore; IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore Digital Library url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798331507718 9798331507756 |
| EndPage | 6 |
| ExternalDocumentID | 11077035 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-LOGICAL-i93t-f0c8ea7687a898c48616e2e901603f14fcde9f959b0f18349c2d52042dcb7ffa3 |
| IEDL.DBID | RIE |
| IngestDate | Wed Jul 23 05:50:31 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i93t-f0c8ea7687a898c48616e2e901603f14fcde9f959b0f18349c2d52042dcb7ffa3 |
| PageCount | 6 |
| ParticipantIDs | ieee_primary_11077035 |
| PublicationCentury | 2000 |
| PublicationDate | 2025-May-9 |
| PublicationDateYYYYMMDD | 2025-05-09 |
| PublicationDate_xml | – month: 05 year: 2025 text: 2025-May-9 day: 09 |
| PublicationDecade | 2020 |
| PublicationTitle | 2025 Global Conference in Emerging Technology (GINOTECH) |
| PublicationTitleAbbrev | GINOTECH |
| PublicationYear | 2025 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 1.9109288 |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Autonomous Systems; Computational modeling; Computer architecture; Computer vision; Convolutional Neural Networks (CNNs); Data models; Human-Robot Interaction (HRI); Natural Language Processing (NLP); Real-time systems; Reinforcement learning; Reinforcement Learning (RL); Scalability; Sentiment analysis; Service robots; Transformer Models; Transformers; Vision Transformers (ViTs) |
| Title | Implementing Machine Learning for AI-Powered Solutions in Robotics, Computer Vision, and Natural Language Processing |
| URI | https://ieeexplore.ieee.org/document/11077035 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |