MARC View

000			03310nam a22003257a 4500
003			OSt
005			20250717142539.0
008			250715b \|\|\|\|\|\|\|\| \|\|\|\| 00\| 0 eng d
040			_aTUPM _bEnglish _cTUPM _dTUPM _erda
050			_aBTH TK 870 _bC33 2025
100			_aCabello, Marvin T. _eauthor
245			_aAugmented reality deaf assistance with real-time speech-to-text and text-to-speech translation using raspberry pi-based smart glasses and a mobile application/ _cMarvin T. Cabello, Aeron Joshua F. Dela Cruz, Jhon Jaree F. Genavia, Nathan Miguel G. Icaro, and Dale Joshua S. Ricardo.--
260			_aManila: _bTechnological University of the Philippines, _c2025.
300			_aviii, 124pages: _c29cm.
336			_2rdacontent
337			_2rdamedia
338			_2rdacarrier
500			_aBachelor's thesis
502			_aCollege of Engineering.-- _bBachelor of science in electronics engineering: _cTechnological University of the Philippines, _d2025.
504			_aIncludes bibliographic references and index.
520			_aHumans are socially dependent beings. They rely on each other for survival, emotional support, and personal as well as collective growth. To form and nurture these connections, we developed communication – particularly through verbal sounds. The modern world is flooded with different voices from radios, televisions, mobile phones, and other individuals. These voices are necessary tools for learning. However, it is important to acknowledge that not everyone experiences sound in the same manner. Hard of hearing or Deaf individuals perceive communication differently through sign language and lip reading, and with less than 20% of the world population experiencing hearing loss, it is inevitable that there is a gap in communication for hard of hearing individuals and individuals with typical hearing. This study aims to bridge that communication gap by developing an Augmented Reality Glasses with Real-Time Speech-to-Text Translation. The wearable glasses contain a Mini USB Microphone to take the vocal inputs, a Raspberry Pi Zero 2 W microcomputer that uses Google's Speech-to-Text API to process input, an FLCOS micro display that outputs the text and a magnifying glass that enlarges the text to a readable size without obstructing vision. Findings so far have enabled Deaf or hard of hearing individuals to converse in live conversations at an average of <2 seconds transcription delay per word in mostly controlled environments. With less than 20% Word Error Rate (WER) as expected in Google's own API, the system relied on clear and controlled speech to be transcribed properly. Limitations were primarily due to hardware constraints, the Raspberry Pi Zero 2 W's 512MB RAM was a significant performance bottleneck and lack of direct support for higher end input and output devices lowered the performance ceiling. However, this only highlighted the device's pathways for future improvements as technology advances.
650			_aAugmented reality
650			_aDeaf assistance
650			_aSpeech translation
700			_aDela Cruz, Aeron Joshua F. _eauthor
700			_aGenavia, Jhon Jaree F. _eauthor
700			_aIcaro, Nathan Miguel G. _eauthor
700			_aRicardo, Dale Joshua S. _eauthor
942			_2lcc _cBTH COE _n0
999			_c30380 _d30380