Skip to content

Phonemizer

The phonemizer allows simple phonemisation of words and texts in many languages. It is based on four backends: espeak, espeak-mbrola, festival and segments.

Phonemizer is available as an Apptainer container on Apocrita.

Usage

To run the default installed version of Phonemizer, simply load the phonemizer module:

$ module load phonemizer
$ phonemizer --version
phonemizer-VERSION
available backends: espeak-ng, espeak-mbrola, festival, segments

For full usage documentation, run phonemizer --help.

The phonemizer Pip package has been installed inside the Phonemizer container, which can be used by launching the python program after loading the phonemizer module, as shown below:

$ module load phonemizer
$ python
>>> import phonemizer
>>> phonemizer.__version__
'X.Y.Z'

Example jobs

Serial jobs

Here is an example job running running on 1 core and 1GB memory:

#!/bin/bash
#$ -cwd
#$ -j y
#$ -pe smp 1
#$ -l h_rt=1:0:0
#$ -l h_vmem=1G

module load phonemizer

# Phonemize phrase using English (GB) dialect within the espeak backend engine
echo 'The quick brown fox jumps over the lazy dog' \
  | phonemizer -l en-gb -b espeak

Here is an output this job will produce:

ðə kwɪk bɹaʊn fɒks dʒʌmps əʊvə ðə leɪzi dɒɡ

Here is an example job running running using the phonemizer Python package inside the container, on 1 core and 1GB memory:

#!/bin/bash
#$ -cwd
#$ -j y
#$ -pe smp 1
#$ -l h_rt=1:0:0
#$ -l h_vmem=1G

module load phonemizer

python -c "from phonemizer import phonemize; \
        print(phonemize(['line1', 'line2']))"

Here is an output this job will produce:

['laɪn wʌn ', 'laɪn tuː ']

Reference