NLP Research Engineer · Speech

Abdou Aziz Diop

PhD Researcher · Speech-to-Speech Translation · Dakar, Senegal

I build speech and language technology for low-resource African languages — with a focus on automatic speech recognition, neural machine translation, and self-supervised learning for speech.

Currently a PhD researcher at Université Cheikh Anta Diop (UCAD), working on end-to-end Speech-to-Speech Translation for Wolof, Pulaar, and other West African languages. Previously Lead Data Scientist at LAfricaMobile, where I led a team building NLU/NLG and end-to-end TTS systems for local languages.

Open to research collaborations and roles building production speech systems for under-served languages.

Abdou Aziz Diop
01 — News

Recent updates

2026
Working on a unified end-to-end S2ST architecture for Wolof–French and Pulaar–French language pairs.
2025
Released a fine-tuned Whisper checkpoint for Wolof — among the first publicly available Wolof ASR models. open-source
2024
Co-author on AfriQA — cross-lingual open-retrieval QA across 10 African languages, presented at EMNLP 2023 Findings.
2023
Started PhD in Speech-to-Speech Translation at UCAD, Dakar.
2022
Released the open-source Wolof library on PyTorch + Transformers — text classification, NMT, and ASR for Wolof.
2021
YACINE, our Wolof voice assistant, won 1st prize at the Cheikh Anta Diop Day organized by IFAN. 1st prize
02 — Experience

Where I've worked

2023 — present

PhD Researcher · Speech-to-Speech Translation

Université Cheikh Anta Diop · Dakar, Senegal

  • End-to-end and cascaded S2ST systems for Wolof, Pulaar, and other West African languages.
  • Self-supervised learning for ASR — pre-training and fine-tuning Wav2Vec2 / Whisper for low-resource settings.
  • Teaching Assistant for Machine Learning, Deep Learning, and NLP courses.
2022 — present

Lead Data Scientist

LAfricaMobile · Dakar, Senegal

  • Lead the data science team on NLU/NLG and end-to-end speech systems for local languages.
  • Designed and shipped an end-to-end TTS system for a Senegalese language — dataset, model, deployment.
  • Production ML infrastructure: model serving, monitoring, and continuous evaluation.
2021 — 2022

NLP Research Engineer · Wolof Project Co-Lead

Omdena · Remote

  • Co-led an Omdena chapter project on text classification and NLP for Wolof.
  • Built and open-sourced the Wolof library for ASR, NMT, and classification.
2020 — 2022

Data Scientist · Software Engineer

Various roles · Senegal

  • Compliance modelling for banking clients (project Ganeyi Compliance).
  • Mail / "Affranchissement" module as a software engineer.
03 — Research

Selected publications

Towards End-to-End Speech-to-Speech Translation for Low-Resource African Languages

Work in progress · 2026

Comparing direct and cascaded S2ST architectures for Wolof–French and Pulaar–French, with a focus on data-efficient pre-training strategies and evaluation under limited parallel speech.

S2ST low-resource African languages

Fine-Tuning Whisper for Wolof: A Low-Resource ASR Study

Preprint · 2025

Empirical study of Whisper fine-tuning regimes for Wolof — examining data scale, augmentation, and dialectal generalization. Released open checkpoints and a 120h transcribed Wolof corpus.

ASR Whisper Wolof

Can Language Models Teach Wav2Vec to Learn Representations?

Research note · 2023

Proof-of-concept on knowledge distillation from a frozen BERT teacher into a Wav2Vec2 student — exploring whether textual contextual representations can guide acoustic self-supervised learning when paired data is scarce.

SSL distillation Wav2Vec2

Rank-Based Cryptography: Encryption & Signature

M.Sc. Thesis, Mathematics · UCAD · 2018

Master's thesis on rank-metric codes and their application to post-quantum encryption and digital signature schemes.

cryptography coding theory
Full publication list on Google Scholar
04 — Projects

Open-source work

Wolof ★ 31

Python · PyTorch · 🤗 Transformers

An NLP toolkit for the Wolof language: text classification, neural machine translation, and automatic speech recognition. Designed to be simple, easy to use, and a starting point for low-resource African NLP.

NLP ASR NMT

YACINE — Wolof Voice Assistant

2021 · 1st prize, Cheikh Anta Diop Day (IFAN)

A voice assistant in Wolof, in the spirit of Siri / Alexa, with built-in code-switching detection and correction. Designed to make digital tools accessible — including for non-literate users — in agriculture, fishing, education, and healthcare.

ASR TTS code-switching

Translatotron

Jupyter · TensorFlow / PyTorch

An implementation study of direct speech-to-speech translation with a sequence-to-sequence model — re-creating the Translatotron architecture as a foundation for African-language S2ST experiments.

S2ST seq2seq

NAC-ASR

Python · Research prototype

A study of Neural Audio Codecs as discrete representations for ASR — investigating whether quantized codec tokens (à la EnCodec / SoundStream) can serve as efficient inputs for downstream speech recognition.

ASR audio codecs tokenization
More on GitHub
05 — Stack

Technical strengths

Speech

Whisper · Wav2Vec2 · HuBERT · ASR · TTS · S2ST · Self-supervised pre-training · Neural audio codecs

NLP

Transformers · NMT · Tokenization · Language modelling · Multilingual evaluation · Low-resource methods

Engineering & MLOps

Python · PyTorch · 🤗 Transformers / Datasets · FastAPI · Docker · MLflow · W&B · AWS

Foundations

Mathematics · Coding theory · Statistical learning · Information systems · Software engineering

Languages spoken

Wolof · French · English · Arabic (intermediate)

Teaching

Machine Learning · Deep Learning · NLP · TA at UCAD