This is a collection of scripts and data I used when working on my dissertation.
- recording_procedure.praat – this is the script I used for collecting the audio
data (it’s a mess like most Praat scripts I’ve written, but it was good enough
to do what I wanted)
survey.sur
– PsyToolkit survey for collecting metadata and running the
experimentrating_experiment.psy
– The experiment itselfscale_l_to_r.txt
– Code that draws a sliding scale in
rating_experiment.psy
with left-to-right orientationscale_r_to_l.txt
– Code that draws a sliding scale in
rating_experiment.psy
with right-to-left orientationsounds.txt
– List of audio files used in the experiment (audio files are not
included due to license limitations)- bitmaps/ – Collection of pictures used in
rating_experiment.psy
- tables/ – Collection of sounds used in
rating_experiment.psy
. Each table has
three columns: variable name, file name, duration
measure_EER.py
– script that draws DET curves and calculates EER values from
thesid_results_table_individual_files_filtered.csv
results table. Simply
runpython measure_EER.py
. The script creates a folder named
EER_measurements which contains temporary files to verify that same and
different speaker labels are correct.sid_results_table_individual_files_filtered.csv
– different-sex recordings
have been filtered out
phonexia_lid_wrapper.sh
– wrapper for the PhonexiaLID
technology (this is
mainly here to show which subset of languages was used in the analysis)speechbrain_lid_wrapper.py
– wrapper for the SpeechBrainLID
technology it
subclasses the EncoderClassifier to extract scores for English and Czech apart
from the best-matching language (the audio files are not included due to
license limitations so the script does not really produce the results!)
sid4_wrapper.sh
– wrapper for the PhonexiaSID4
technology. A simple helper
script which extracts voiceprints from all wave files in one directory and
then compares all voiceprints with each other. It also saves the amount of
“net speech” in each voiceprint to a file.speechbrain_sid_wrapper.py
– a wrapper for SpeechBrain’sSpeakerRecognition
.
It has options for computing embeddings from audio files or loading previously
created embeddings. It subclassesSpeakerRecognition
in order to be able to
perform verification on embeddings instead of on audio files.
The script creates embeddings from all audio files in a directory and then
performs speaker verification either on all combinations or all permutations
of the embeddings set.
Leave a Reply