Phonetic Linguist - Singing Voice Data Annotation for AI

Remote
- London, Greater London, United Kingdom
- Toronto, Ontario, Canada
- Sydney, New South Wales, Australia
- Boston, Massachusetts, United States
- New York, New York, United States
+4 more
Annotation

Work on breakthrough AI voice synthesis technology, annotating singing recordings with millisecond precision for next-generation vocal AI models.

Job description

We are seeking a highly qualified Phonetic Linguist to join our team working on cutting-edge AI voice synthesis technology. This role involves precise annotation of professional singing recordings to create training data for advanced voice synthesis models.

The position requires segmenting singing audio into individual phonemes with millisecond-level accuracy using spectrogram analysis. You will work with recordings across multiple languages, identifying and labeling vocal events such as breaths, glottal stops, vocal fry, and silences.

This is a remote contract position offering the opportunity to work with proprietary annotation technology while contributing to breakthrough developments in AI-powered vocal synthesis. The role involves collaboration with an international team and handling confidential voice synthesis research.

Key responsibilities include:

- Segmenting singing recordings into 5-30 second phrases with precise timing
- Correcting automated phoneme predictions using visual and auditory analysis
- Labeling special vocal elements (breathing, silences, glottal stops)
- Handling cross-linguistic pronunciation variations in musical contexts
- Maintaining consistent quality standards across large datasets
- Working with proprietary web-based annotation platforms

Technical requirements:

- Reliable high-speed internet connection
- Professional audio equipment for precise listening
- Quiet workspace suitable for detailed audio analysis
- Availability for 20-30 hours per week
- Willingness to sign an NDA and comply with confidentiality requirements

Job requirements

Essential qualifications:

- Master's or PhD in Linguistics, Phonetics, or related field
- Expert knowledge of International Phonetic Alphabet (IPA)
- Proven experience with acoustic analysis software (Praat, ELAN)
- Strong background in phonetic transcription and spectrogram interpretation
- Experience with time-aligned annotation workflows
- Native or near-native English proficiency
- Additional language skills (Japanese, Mandarin, Korean, Spanish preferred)

Preferred qualifications:

- Experience with speech synthesis or voice technology projects
- Understanding of vocal techniques and singing styles
- Background in computational linguistics or audio signal processing
- Previous work with AI training data annotation
- Research experience in acoustic phonetics or speech science

