
Phonetic Linguist - Singing Voice Data Annotation for AI
- Remote
- London, Greater London, United Kingdom
- Toronto, Ontario, Canada
- Sydney, New South Wales, Australia
- Boston, Massachusetts, United States
- New York, New York, United States
- +4 more
- Annotation
Work on breakthrough AI voice synthesis technology, annotating singing recordings with millisecond precision for next-generation vocal AI models.
Job description
We are seeking a highly qualified Phonetic Linguist to join our team working on cutting-edge AI voice synthesis technology. This role involves precise annotation of professional singing recordings to create training data for advanced voice synthesis models.
The position requires segmenting singing audio into individual phonemes with millisecond-level accuracy using spectrogram analysis. You will work with recordings across multiple languages, identifying and labeling vocal events such as breaths, glottal stops, vocal fry, and silences.
This is a remote contract position offering the opportunity to work with proprietary annotation technology while contributing to breakthrough developments in AI-powered vocal synthesis. The role involves collaboration with an international team and handling confidential voice synthesis research.
Key responsibilities include:
Segmenting singing recordings into 5-30 second phrases with precise timing
Correcting automated phoneme predictions using visual and auditory analysis
Labeling special vocal elements (breathing, silences, glottal stops)
Handling cross-linguistic pronunciation variations in musical contexts
Maintaining consistent quality standards across large datasets
Working with proprietary web-based annotation platforms
Technical requirements:
Reliable high-speed internet connection
Professional audio equipment for precise listening
Quiet workspace suitable for detailed audio analysis
Availability for 20-30 hours per week
Willingness to sign an NDA and comply with confidentiality requirements
Job requirements
Essential qualifications:
Master's or PhD in Linguistics, Phonetics, or a related field
Expert knowledge of the International Phonetic Alphabet (IPA)
Proven experience with acoustic analysis and annotation software (Praat, ELAN)
Strong background in phonetic transcription and spectrogram interpretation
Experience with time-aligned annotation workflows
Native or near-native English proficiency
Additional language skills (Japanese, Mandarin, Korean, or Spanish preferred)
Preferred qualifications:
Experience with speech synthesis or voice technology projects
Understanding of vocal techniques and singing styles
Background in computational linguistics or audio signal processing
Previous work with AI training data annotation
Research experience in acoustic phonetics or speech science