editor@ijircst.com

|

+91 8299 564 278

ISSN: 2347 - 5552

International Journal of Innovative Research in Computer Science and Technology (IJIRCST)

International Journal of Innovative Research in Computer Science and Technology- Volume 13, Issue 3, 2025

Pages: 35-46

Real-Time Vision-Based Indian Sign Language Translation Using Deep Learning Techniques

Subham Pandey, Sumaiya Tahseen, Rohit Pathak, Hina Parveen, Maruti Maurya


Download PDF

Abstract:

This work proposes a vision-based approach to real-time sign language translation for Indian Sign Language (ISL). The system uses state-of-the-art deep learning architectures such as CNN (Convolutional Neural Networks), LSTM (Long Short-Term Memory) networks, and Transformer-based encoder-decoder models for gesture recognition in both isolated and continuous forms. Data preprocessing techniques such as DTW (Dynamic Time Warping) were applied to augment and normalize gesture sequences from custom ISL and public ASL datasets. The model performance was quantitatively evaluated using precision, recall, F1-score, BLEU, ROUGE, CER(character error rate) and WER (word error rate). A Transformer-based model outperformed the achieving a BLEU score of 0.74 and a classification accuracy of 96.1%. The developed desktop application enables real-time ISL-to-English translation at 18 FPS without requiring external sensors, while ablation studies validate the benefits of multimodal fusion and pose-language alignment. This work demonstrates a robust, scalable approach to non-intrusive sign language translation, advancing accessibility for the DHH community.

Keywords:

Transformer-based Encoder-Decoder, Spatiotemporal Gesture Modeling, Indian Sign Language (ISL), Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), Dynamic Time Warping (DTW), Real-time Sign Language Translation

DOI URL:- https://doi.org/10.55524/ijircst.2025.13.3.6

© kvscsjournal.org . All Rights Reserved.