Implementing Machine Learning for AI-Powered Solutions in Robotics, Computer Vision, and Natural Language Processing

Machine Learning (ML) is a relatively recent development that has had a profound impact on Robotics, Computer Vision (CV) and Natural Language Processing (NLP) by allowing automation of intelligent processing, perception and human-computer interaction. ML-driven AI solutions are crucial in these are...

Full description

Saved in:
Bibliographic Details
Published in2025 Global Conference in Emerging Technology (GINOTECH) pp. 1 - 6
Main Authors Kasu, Venugopala Reddy, Malamuthu, Bakkiyaraj Kanthimathi, Kumar, B. Shravan, Pandi, V. Samuthira, Sivajothi, E., Deepika, D S
Format Conference Proceeding
LanguageEnglish
Published IEEE 09.05.2025
Subjects
Online AccessGet full text
DOI10.1109/GINOTECH63460.2025.11077035

Cover

Abstract Machine Learning (ML) is a relatively recent development that has had a profound impact on Robotics, Computer Vision (CV) and Natural Language Processing (NLP) by allowing automation of intelligent processing, perception and human-computer interaction. ML-driven AI solutions are crucial in these areas for real-time decision-making, pattern recognition, and autonomous operations; traditional rule-based methods tend to lack the adaptability and scalability to meet the demands of these fields. RL and deep learning have increasingly contributed to the improvement of robotic perception, motion planning, and manipulation in robotics, enabling better and more adaptive autonomous systems. Convolutional neural networks (CNNs) and transformer-based architectures have revolutionized computer vision, with applications ranging from object detection and image segmentation to facial recognition, allowing for advanced visual analysis across sectors such as healthcare, security, and self-driving cars. In NLP, the introduction of transformers like BERT and GPT has transformed the space by enabling more context-aware and human-like AI-driven language models, leading to drastic improvements in areas like speech recognition, sentiment analysis, and machine translation. This study delves into the application of ML techniques across these domains, evaluating the implications on efficiency, accuracy, and scalability along with challenges like data quality, model interpretability, and computational constraints. DQN has made advances for robotics, ViT in the domain of computer vision, and LLMs in the subject of NLU. A comparative study to analyse the progress and limitations of such state-of-the-art ML models. The results highlight the promise of multi-modal AI systems, where machine learning algorithms work in conjunction to improve robotic navigation, as well as visual perception and natural language understanding. Extending prior studies of intelligent automation, this research points the way to the development of new adaptive, human-centric, and autonomous AI innovations in diverse sectors.
AbstractList Machine Learning (ML) is a relatively recent development that has had a profound impact on Robotics, Computer Vision (CV) and Natural Language Processing (NLP) by allowing automation of intelligent processing, perception and human-computer interaction. ML-driven AI solutions are crucial in these areas for real-time decision-making, pattern recognition, and autonomous operations; traditional rule-based methods tend to lack the adaptability and scalability to meet the demands of these fields. RL and deep learning have increasingly contributed to the improvement of robotic perception, motion planning, and manipulation in robotics, enabling better and more adaptive autonomous systems. Convolutional neural networks (CNNs) and transformer-based architectures have revolutionized computer vision, with applications ranging from object detection and image segmentation to facial recognition, allowing for advanced visual analysis across sectors such as healthcare, security, and self-driving cars. In NLP, the introduction of transformers like BERT and GPT has transformed the space by enabling more context-aware and human-like AI-driven language models, leading to drastic improvements in areas like speech recognition, sentiment analysis, and machine translation. This study delves into the application of ML techniques across these domains, evaluating the implications on efficiency, accuracy, and scalability along with challenges like data quality, model interpretability, and computational constraints. DQN has made advances for robotics, ViT in the domain of computer vision, and LLMs in the subject of NLU. A comparative study to analyse the progress and limitations of such state-of-the-art ML models. The results highlight the promise of multi-modal AI systems, where machine learning algorithms work in conjunction to improve robotic navigation, as well as visual perception and natural language understanding. Extending prior studies of intelligent automation, this research points the way to the development of new adaptive, human-centric, and autonomous AI innovations in diverse sectors.
Author Kasu, Venugopala Reddy
Sivajothi, E.
Deepika, D S
Malamuthu, Bakkiyaraj Kanthimathi
Kumar, B. Shravan
Pandi, V. Samuthira
Author_xml – sequence: 1
  givenname: Venugopala Reddy
  surname: Kasu
  fullname: Kasu, Venugopala Reddy
  email: venugopal.15583@gmail.com
  organization: Sr Functional Architect,Celina,Texas,75009
– sequence: 2
  givenname: Bakkiyaraj Kanthimathi
  surname: Malamuthu
  fullname: Malamuthu, Bakkiyaraj Kanthimathi
  email: Bhagykm@gmail.com
  organization: Morgan Stanley Services Inc,Financial Crimes Technology,New York City,New York,10019
– sequence: 3
  givenname: B. Shravan
  surname: Kumar
  fullname: Kumar, B. Shravan
  email: bkshravankumar81@gmail.com
  organization: JB Institute of Engineering & Technology,Department of Electronics and Communication,Telangana
– sequence: 4
  givenname: V. Samuthira
  surname: Pandi
  fullname: Pandi, V. Samuthira
  email: samuthirapandime@gmail.com
  organization: Chennai Institute of Technology,Centre for Advanced Wireless Integrated Technology,Chennai,Tamil Nadu,India
– sequence: 5
  givenname: E.
  surname: Sivajothi
  fullname: Sivajothi, E.
  email: drsivajothie@veltech.edu.in
  organization: Dr. Sagunthala R&D Institute of Science and Technology,Vel Tech Rangarajan,Department of Computer Science and Engineering
– sequence: 6
  givenname: D S
  surname: Deepika
  fullname: Deepika, D S
  email: deepika.it@rmd.ac.in
  organization: R.M.D. Engineering College,Department of Information Technology,Kavaraipettai,Tamil Nadu
BookMark eNo1kE9LwzAchiPoQee-gYeA13UmTf8kx1HmVqjb0OJ1pOkvM9AmI00Rv_061NMLzwvP4XlAt9ZZQOiZkiWlRLxsyt2-XhfbjCUZWcYkTq88zwlLb9Bc5IIzRtMJUH6PQtmfO-jBBmNP-E2qL2MBVyC9vQLtPF6V0cF9g4cWf7huDMbZARuL313jglHDAheuP48BPP40w_QusLQt3skwetnhStrTKE-AD94pGIZJ-4jutOwGmP_tDNWv67rYRtV-UxarKjKChUgTxUHmGc8lF1wlPKMZxCAIzQjTNNGqBaFFKhqiKWeJUHGbxiSJW9XkWks2Q0-_WgMAx7M3vfQ_x_8W7AJ45VvV
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/GINOTECH63460.2025.11077035
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore Digital Library
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798331507718
9798331507756
EndPage 6
ExternalDocumentID 11077035
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i93t-f0c8ea7687a898c48616e2e901603f14fcde9f959b0f18349c2d52042dcb7ffa3
IEDL.DBID RIE
IngestDate Wed Jul 23 05:50:31 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i93t-f0c8ea7687a898c48616e2e901603f14fcde9f959b0f18349c2d52042dcb7ffa3
PageCount 6
ParticipantIDs ieee_primary_11077035
PublicationCentury 2000
PublicationDate 2025-May-9
PublicationDateYYYYMMDD 2025-05-09
PublicationDate_xml – month: 05
  year: 2025
  text: 2025-May-9
  day: 09
PublicationDecade 2020
PublicationTitle 2025 Global Conference in Emerging Technology (GINOTECH)
PublicationTitleAbbrev GINOTECH
PublicationYear 2025
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.9109288
Snippet Machine Learning (ML) is a relatively recent development that has had a profound impact on Robotics, Computer Vision (CV) and Natural Language Processing (NLP)...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Autonomous Systems
Computational modeling
Computer architecture
Computer vision
Convolutional Neural Networks (CNNs)
Data models
Human-Robot Interaction (HRI)
Natural Language Processing (NLP)
Real-time systems
Reinforcement learning
Reinforcement Learning (RL)
Scalability
Sentiment analysis
Service robots
Transformer Models
Transformers
Vision Transformers (ViTs)
Title Implementing Machine Learning for AI-Powered Solutions in Robotics, Computer Vision, and Natural Language Processing
URI https://ieeexplore.ieee.org/document/11077035
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8MwHA9uB_Gk4sQ3AT0uXdc8lhxlbG7i6pApu408xxA6ke7ipzfJWkVB8FYKaUse_B_9PQC4wYKL1FCGmLEEEWIsEkR3EeOCptg4LFVgI09yNnom93M6r8jqkQtjrY3gM5uEy_gv36z1JrTKOqFW8TuUNkCjx9mWrLULrivdzM7dOH-cDfojhglLfemX0aQe8cM7JYaO4T7I65duESOvyaZUif74pcf47686AK1vlh6cfsWfQ7BjiyNQRrnfiAEqlnASoZIWViqqS-hTVHg7RtPgjWYN_GqKwVUBn9ZqHUSb27C2eoAvkXnehrIwMJdRogM-VB1OWHEM_GNbYDYczPojVDkroJXAJXKp5lb6QqMnueCacNZlNrMiek67LnHaWOEEFSp1_sgToTNDM3-8jVY95yQ-Bs1iXdgTAH1GgbH0aSKVmFAlOBF-qJa-sNZaKH0KWmHGFm9b7YxFPVlnf9w_B3th4SKkUFyAZvm-sZc-7JfqKi73J5szrw4
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LS8MwGA86QT2pOPFtQI_L7Jqka44yNjvd6pAqu408xxBake7iX2-StRMFwVsJpA15kO_7-nsAcINZzAJFIxQpTRAhSiNGZAdFMaMBVgZz4djI4zRKXsjDlE4rsrrnwmitPfhMt92j_5evCrl0pbJbl6vYHUo3wRYlhNAVXWsbXFfKmbf3w_Qp6_eSCJMosMlfSNt1nx_uKf7yGOyBtP7sCjPy1l6Woi0_fyky_ntc-6D5zdODk_UNdAA2dH4ISi_461FA-RyOPVhSw0pHdQ5tkArvhmji3NG0guuyGFzk8LkQhZNtbsHa7AG-eu55C_JcwZR7kQ44qmqcsGIZ2Nc2QTboZ70EVd4KaMFwiUwgY81tqtHlMYsliaNOpEPNvOu06RAjlWaGUSYCYw89YTJUNLQHXEnRNYbjI9DIi1wfA2hjCoy5DRQpx4QKFhNmu0puU2spmZAnoOlmbPa-Us-Y1ZN1-kf7FdhJsvFoNhqmj2dg1y2iBxiyc9AoP5b6wgYBpbj0S_8F8RqyWw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2025+Global+Conference+in+Emerging+Technology+%28GINOTECH%29&rft.atitle=Implementing+Machine+Learning+for+AI-Powered+Solutions+in+Robotics%2C+Computer+Vision%2C+and+Natural+Language+Processing&rft.au=Kasu%2C+Venugopala+Reddy&rft.au=Malamuthu%2C+Bakkiyaraj+Kanthimathi&rft.au=Kumar%2C+B.+Shravan&rft.au=Pandi%2C+V.+Samuthira&rft.date=2025-05-09&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FGINOTECH63460.2025.11077035&rft.externalDocID=11077035