Explore the blog
Browse practical insights on transcription, captioning, and speech data — with the newest posts first.
Showing 12 posts on this page (527 total)
Why Is Gender Balance Necessary in Datasets to Reduce Bias?
By Way With Words Team
The need for gender balance in speech datasets extends beyond technical quality—it is a matter of social justice, ethics, and law.
Read article
How Does Audio Session Length Training Impact Speech Datasets?
By Way With Words Team
This article explores the dimensions of audio session length training, why it matters, and how to balance short and long recordings.
Read article
What Metrics Define Data Sufficiency in Speech Collection?
By Way With Words Team
What Metrics Define Data Sufficiency in Speech Collection? What are the Quantitative and Qualitative Metrics of Data Sufficiency? The success of any speech...
Read article
How Do You Plan a Multilingual Speech Project Like a Pro?
By Way With Words Team
A multilingual speech project requires a structured approach that moves beyond simply recording voices in different languages.
Read article
Speaker Diarisation: Challenges and Solutions in Datasets
By Way With Words Team
Speaker diarisation, sometimes called audio diarisation or multi-speaker voice tagging, is the process of partitioning an audio stream by identity of the speaker.
Read article
Why is Speaker Consent Vital in Voice Data Gathering?
By Way With Words Team
This article explores why speaker consent is vital in data gathering and examines how legal, ethical, and operational frameworks converge to shape best practices in this area.
Read article
What Recording Environment Produces Optimal Speech Quality?
By Way With Words Team
This article explores what an optimal recording environment looks like, why it matters so much, and how to set it up in practice.
Read article
How Do Synthetic Voices Affect Data Quality in Training?
By Way With Words Team
Synthetic voices has become a powerful tool for speech technology development to accelerate dataset creation and expand coverage to low-resource languages.
Read article
Reliable Voice Diary Speech Data Use in Behavioural Studies
By Way With Words Team
This article explores why researchers use voice diary speech data in behavioural research, how data is collected, and what analytical techniques are applied.
Read article
Prompted vs Freeform Speech: Best Applications in Data Collection
By Way With Words Team
This article explores the differences between prompted vs freeform speech, their advantages and limitations, and how they are best applied in data collection projects.
Read article
How Valuable is Call Centre Speech Data for Research?
By Way With Words Team
From conversational AI to sentiment analysis, the applications of call centre speech data are vast and transformative.
Read article
What Tools Are Used for Mobile Speech Data Gathering?
By Way With Words Team
This article explores the tools and practices used for mobile speech data gathering, and key considerations around data security and limitations.
Read article