Gesture Controlled Music Player Using Hand Gestures

10 May

Authors: Professor S.P.Gunjal, Pranali Yemale, Jeel Makwana, Tanushree Kshirsagar

Abstract: This paper presents an AI-powered music player that enhances user interaction by integrating gesture recognition and voice-based control with conventional playback mechanisms. The system is developed as a full-stack web application using React for the frontend, Flask for the backend, and MongoDB for data management. It incorporates secure user authentication through JWT and Bcrypt to ensure safe access. The application supports dual music modes, enabling users to play locally uploaded songs stored on Cloudinary or stream music via Spotify integration. A key contribution of this work is the implementation of real-time hand gesture recognition using MediaPipe, allowing users to control playback functions such as play, pause, track navigation, and volume adjustment without physical contact. Additionally, a voice assistant based on the Web Speech API facilitates natural language commands for seamless interaction. The system intelligently switches between local and online sources, ensuring uninterrupted playback. By combining computer vision, speech recognition, and API-based streaming, the proposed solution offers an innovative and user-friendly approach to interactive media systems, highlighting the potential of multimodal interfaces in modern applications.

DOI: https://doi.org/10.5281/zenodo.20111184