Future. Discover. Together.
We are looking for a Working Student for the Capture and Display Systems (CDS) group, within the Vision and Imaging Technologies department. Recent advances in Multimodal Emotion Recognition Models (MER) have provided researchers and end users with remarkable artificial intelligence tools that can be used for various tasks in different sectors. In healthcare, MER aids therapists by analyzing patients' facial expressions, vocal tone, and biometrics, facilitating the customization of treatment plans. MER can be used to enhance eLearning platforms by assessing students' emotional engagement and adapting content, improving comprehension and educational outcomes. For law enforcement and security personnel, MER tools integrated into Virtual Reality (VR) training can simulate high-stress situations, monitoring emotional cues and offering real-time feedback on de-escalation techniques to enhance safety during encounters. Become a part of our team and join us on our journey of research and innovation!
What you will do
Development of a real-time capable Multimodal Neural Network with primary focus on Emotion Recognition in Speech and integration into a Multisensory AI tool for use in interactive Virtual Reality training environments. The goal of the tool is to analyze and monitor the emotional state of the user over time, returning the results in real time.
Research, development and implementation of deep neural networks to analyze emotion in speech (audio), sentiment analysis (text) as well as non-linguistic utterances (audio)
Develop a real-time capable Multimodal Emotion Recognition Model combining contextual features
Integration of Multimodal Speech Emotion Recognition tool into a real-time capable Multisensory AI system (developed and maintained by Fraunhofer HHI)
What you bring to the table
Master Student in Computer Science or a comparable course of study
Good Python, Java, C++, Open CV programming skills
Deep theoretical understanding of machine learning (ML) as well as deep neural networks, experience with speech emotion recognition and multimodal systems is advantageous
Experience with Lab Streaming Layer, Audio DSP is advantageous, but not essential
What you can expect
Fascinating challenges in a scientific and entrepreneurial setting
Attractive salary
Modern and excellently equipped workspace in central location
Great and cooperative working atmosphere in an international team
Flexible working hours
Opportunities to work from home
The position is initially limited to 6 months. An extension is explicitly desired.
The monthly working time is 80 hours. This position is also available on a part-time basis. We value and promote the diversity of our employees' skills and therefore welcome all applications - regardless of age, gender, nationality, ethnic and social origin, religion, ideology, disability, sexual orientation and identity. Severely disabled persons are given preference in the event of equal suitability.
With its focus on developing key technologies that are vital for the future and enabling the commercial utilization of this work by business and industry, Fraunhofer plays a central role in the innovation process. As a pioneer and catalyst for groundbreaking developments and scientific excellence, Fraunhofer helps shape society now and in the future.
Interested? Apply online now. We look forward to getting to know you!
Thomas Koch
Moana Holenstein
moana.holenstein@hhi.fraunhofer.de
Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute HHI
Requisition Number: 71268 Application Deadline: 03/31/2024