University of Cambridge
Dr Linda Bakkouche
Speech 路 AI 路 Applied Linguistics 路 Data Science

Dr Linda Bakkouche

Research Associate | University of Cambridge
Developing the next generation of AI-powered speech and language learning technologies.

I investigate how individual differences in cognition, auditory processing, experience and neural speech encoding shape second-language speech perception and pronunciation learning. My work also examines AI-generated speech, voice cloning, text-to-speech systems and human responses to synthetic voices.

University of Cambridge Cambridge Phonetics Laboratory Brain, Language and Bilingualism Group

About

I investigate how humans learn, perceive and interact with speech in an increasingly AI-driven world. As a Research Associate at the University of Cambridge, I combine speech science, artificial intelligence, cognitive psychology and data science to understand why some individuals become highly successful language learners while others face persistent challenges. My research sits at the intersection of phonetics, second language acquisition, artificial intelligence, speech technology and data science.

I combine behavioural experiments, acoustic analysis, EEG/FFR methods, statistical modelling and machine learning to understand how learners acquire new speech patterns and how humans evaluate increasingly realistic synthetic voices.

Research themes

AI & Speech Technologies

AI voice cloning and TTS

Evaluation of synthetic voices, text-to-speech systems, prosody, naturalness, similarity and human-AI communication.

L2 Speech Learning

Pronunciation and perception

Research on second-language speech perception, production, high variability phonetic training and learner variability.

Individual Differences

Cognition and auditory processing

Auditory acuity, attention, phonological short-term memory, declarative memory, procedural learning and L2 experience.

Neural Mechanisms

EEG and FFR

Neural encoding of speech and its relationship with perception, production and pronunciation-learning success.

Acoustic Phonetics

Speech analysis

Formant dynamics, VOT, prosody, speech timing and automated acoustic-feature extraction.

Data Science

Predictive modelling

Mixed-effects models, machine learning, Python, R, SQL, PyTorch, scikit-learn, LightGBM and SHAP.

Latest publications and research outputs

1
What determines the success of AI voice-cloned speech? Prosodic and acoustic evidence on three TTS systems
Phonetica
2026
2
Finding the Human Voice in AI: Insights on the Perception of AI Voice Clones from Naturalness and Similarity Ratings
Interspeech 2025
2025
3
Effects of auditory processing, memory, and experience on early and later stages of second language speech learning
Second Language Research
2025
4
Influence of Proficiency and L2 Experience on Dynamic Spectral Cue Utilization in L2 Vowel Perception and Production
Interspeech 2025
2025
5
Mastering Voice Onset Time (VOT) in L2 Learning: The Role of Cognition, Perception, and Experience
Cambridge Open Engage
2024

馃 Flagship Project: SoundCoach

AI-Powered Personalised Pronunciation Learning

SoundCoach is an adaptive pronunciation-learning platform that combines speech technology, artificial intelligence and second-language acquisition research. Unlike conventional pronunciation tools, SoundCoach tailors feedback using auditory processing, attention, memory and language-learning profiles to create personalised learning pathways.

Flagship Projects and Research

AI-supported pronunciation learning

SoundCoach

SoundCoach is an adaptive AI-supported pronunciation-training platform that combines speech technology, artificial intelligence and second-language acquisition research. It delivers personalised feedback tailored to each learner, combining artificial intelligence, speech technology, learner performance data, and individual learning needs to create adaptive and highly personalised pronunciation-training pathways. The platform continuously adapts its feedback to support more effective, engaging, and efficient language learning.

Neural speech learning

EEG/FFR speech learning

This research uses EEG and Frequency-Following Responses to investigate neural mechanisms underlying second-language speech learning. By combining neural recordings with behavioural measures of perception and production, the project examines how auditory processing, cognition and language experience shape pronunciation-learning success.

AI voice technology

AI voice cloning and prosody

Research on synthetic speech, voice cloning, naturalness, speaker similarity and the acoustic/prosodic determinants of successful text-to-speech output.

L2 phonetics

L2 vowel and consonant learning

Experimental work on speech perception, production, segmental contrasts, dynamic spectral cues and high variability phonetic training.

Teaching and supervision

I teach and support students in phonetics, phonology, applied linguistics, speech sciences, language assessment and research methods. I also advise dissertation and research projects involving experimental design, quantitative analysis, acoustic phonetics and second language acquisition.

Data science and technical skills

Programming

Python, C++, R, MATLAB, SQL, HTML, CSS, JavaScript and Docker.

Machine learning

PyTorch, scikit-learn, LightGBM, SHAP, model evaluation and predictive analytics.

Speech processing

Praat, Parselmouth, openSMILE, Montreal Forced Aligner, Whisper-based pipelines and acoustic feature extraction.

Contact and profiles

Cambridge Phonetics Laboratory
Profile page
Brain, Language and Bilingualism Group
Research group page