George Close

Machine Learning Researcher · Speech & Audio

Dr. George Close

Building large-scale speech models that listen, understand, and speak like people do.

I am a machine learning researcher specialising in speech and audio. I am currently a Member of Technical Staff on the Audio team at Zyphra, where I help build large-scale text-to-speech and audio foundation models — including the ZONOS family of TTS systems — working across model architecture, data pipelines and evaluation.

I take ideas from research all the way to production: training and shipping models at scale on large GPU clusters, and grounding them in how humans actually perceive sound. This builds on my PhD in Computer Science from the University of Sheffield (perceptually-motivated speech enhancement and quality assessment) and a first-class BSc from Cardiff University, with published work spanning speech enhancement, speech quality prediction and ASR.

I am especially interested in the place where large speech models meet human perception — building systems that don't just score well on metrics, but genuinely sound right to people. Always happy to talk speech, audio and TTS: george@zyphra.com.

18 Publications
10 First Author
161 Citations
8 h-index

Interests

  • Text To Speech (TTS) & speech synthesis
  • Audio foundation models
  • Self-Supervised Speech Representations
  • Speech Enhancement / Noise Reduction
  • Speech Quality / Intelligibility machine perception and prediction
  • Automatic Speech Recognition (ASR)
  • Human perception of digital audio
  • Neural systems for hearing aids & edge devices
  • Deepfake detection / adversarial attacks

Technical Skills

  • Large-scale model training on multi-GPU clusters
  • Python — PyTorch, SpeechBrain, NVIDIA NeMo, SciPy
  • HuggingFace ecosystem / OpenAI API
  • Distributed training & data pipelines for audio
  • Linux / Bash scripting
  • Git / GitHub / GitLab
  • C++, Java, SQL, MATLAB

Experience & Qualifications

  1. Sep 2025 — Present Member of Technical Staff @ Zyphra
    San Francisco, CA, USA Research and development on ZONOS, a large-scale text-to-speech model - spanning model architecture, data pipelines and evaluation.
  2. Nov 2024 — Sep 2025 Speech Data Scientist @ ConnexAI
    Manchester, UK Built and deployed production speech-processing systems, including data filtering for in-the-wild speech corpora and non-intrusive speech quality prediction.
  3. May 2024 — Aug 2024 Yamaha Research and Development (Internship)
    Hamamatsu, Japan Research internship applying machine learning to audio and music signal processing within an industrial R&D team. Project focused on modeling human perception of music spatiality.
  4. Oct 2020 — Jan 2025 PhD Computer Science + Graduate Teaching Assistant
    University of Sheffield, UK
    Thesis: Perceptually Motivated Speech Enhancement Researched neural speech enhancement guided by human perception — using speech quality metrics and self-supervised representations as loss functions. Authored 10+ first-author papers and taught undergraduate courses as a GTA.
  5. Aug 2017 — Aug 2020 BSc Computer Science (First Class Honours)
    Cardiff University, UK
    Thesis: Majel — Voice control for Command Line Interfaces Graduated with First Class Honours, with a final-year project building a speech-driven interface for the command line.

Papers & Publications

I am an author on 18 papers, of which 10 I am first author. These have amassed 161 citations with an h-index of 8. Full list on Google Scholar.

Talks & Presentations