000 03310nam a22003257a 4500
003 OSt
005 20250717142539.0
008 250715b |||||||| |||| 00| 0 eng d
040 _aTUPM
_bEnglish
_cTUPM
_dTUPM
_erda
050 _aBTH TK 870
_bC33 2025
100 _aCabello, Marvin T.
_eauthor
245 _aAugmented reality deaf assistance with real-time speech-to-text and text-to-speech translation using raspberry pi-based smart glasses and a mobile application/
_cMarvin T. Cabello, Aeron Joshua F. Dela Cruz, Jhon Jaree F. Genavia, Nathan Miguel G. Icaro, and Dale Joshua S. Ricardo.--
260 _aManila:
_bTechnological University of the Philippines,
_c2025.
300 _aviii, 124pages:
_c29cm.
336 _2rdacontent
337 _2rdamedia
338 _2rdacarrier
500 _aBachelor's thesis
502 _aCollege of Engineering.--
_bBachelor of science in electronics engineering:
_cTechnological University of the Philippines,
_d2025.
504 _aIncludes bibliographic references and index.
520 _aHumans are socially dependent beings. They rely on each other for survival, emotional support, and personal as well as collective growth. To form and nurture these connections, we developed communication – particularly through verbal sounds. The modern world is flooded with different voices from radios, televisions, mobile phones, and other individuals. These voices are necessary tools for learning. However, it is important to acknowledge that not everyone experiences sound in the same manner. Hard of hearing or Deaf individuals perceive communication differently through sign language and lip reading, and with less than 20% of the world population experiencing hearing loss, it is inevitable that there is a gap in communication for hard of hearing individuals and individuals with typical hearing. This study aims to bridge that communication gap by developing an Augmented Reality Glasses with Real-Time Speech-to-Text Translation. The wearable glasses contain a Mini USB Microphone to take the vocal inputs, a Raspberry Pi Zero 2 W microcomputer that uses Google's Speech-to-Text API to process input, an FLCOS micro display that outputs the text and a magnifying glass that enlarges the text to a readable size without obstructing vision. Findings so far have enabled Deaf or hard of hearing individuals to converse in live conversations at an average of <2 seconds transcription delay per word in mostly controlled environments. With less than 20% Word Error Rate (WER) as expected in Google's own API, the system relied on clear and controlled speech to be transcribed properly. Limitations were primarily due to hardware constraints, the Raspberry Pi Zero 2 W's 512MB RAM was a significant performance bottleneck and lack of direct support for higher end input and output devices lowered the performance ceiling. However, this only highlighted the device's pathways for future improvements as technology advances.
650 _aAugmented reality
650 _aDeaf assistance
650 _aSpeech translation
700 _aDela Cruz, Aeron Joshua F.
_eauthor
700 _aGenavia, Jhon Jaree F.
_eauthor
700 _aIcaro, Nathan Miguel G.
_eauthor
700 _aRicardo, Dale Joshua S.
_eauthor
942 _2lcc
_cBTH COE
_n0
999 _c30380
_d30380