Way With Words' South African English Speech Collection Dataset product image in hero

Way With Words' South African English Speech Collection Dataset

WayWithWords
4.4(2)Badge iconVerified Data Provider
#
Column1
Column2
Column3
Column4
Column7
Column8
Column9
1 xxxxxxxxxx Xxxxxxxxx xxxxxx xxxxxxxxxx Xxxxx Xxxxxx Xxxxxxxxxx
2 Xxxxxx Xxxxxxxxx Xxxxxxxxxx xxxxxxxxx Xxxxxxxxx xxxxxxxxx Xxxxxxx
3 xxxxxx Xxxxx xxxxxxxxxx xxxxxx Xxxxxxxxxx xxxxxx Xxxxx
4 Xxxxxx xxxxx xxxxxxxx xxxxxxx Xxxxx Xxxxxxxx xxxxxxxxxx
5 xxxxxx Xxxxxxxxx xxxxxx Xxxxxxxxx Xxxxxxxxx xxxxxxxxxx Xxxxxx
6 Xxxxx xxxxxx xxxxxxx xxxxxxx Xxxxx xxxxxx Xxxxxxxxxx
7 xxxxxxxx xxxxxx Xxxxx Xxxxxxx xxxxxx Xxxxxxxx Xxxxxxx
8 Xxxxx xxxxxx xxxxxxxxxx Xxxxx xxxxxxxxxx xxxxxxxxx Xxxxxxx
9 xxxxxxxx xxxxxxxx Xxxxxxxxxx Xxxxxxxx Xxxxxxxx xxxxxxxxx Xxxxxxxxxx
10 Xxxxxx Xxxxxxxxx xxxxx xxxxxxx xxxxxxxxx Xxxxxx Xxxxxxx
... Xxxxxxxxx xxxxxxxxx xxxxxxxxx Xxxxx xxxxxxxx Xxxxxxx xxxxxxxxx
Sign In To Preview Data
Volume
50
Hours
Data Quality
99%
Accurate
Avail. Formats
.json, .xml, and .csv
File
Coverage
1
Country

Data Dictionary

[Sample] language-en_za.csv
Attribute Type Example Mapping
Column1
String file_name
Column2
String segment_name
Column3
String duration
Column4
String speaker
Column7
String start
Column8
String end
Column9
String transcript

Description

50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 63 participants from all South African provinces: Western Cape, Eastern Cape, KwaZulu-Natal, Mpumalanga, Limpopo, North-West, Northern Cape, Free State, and Gauteng.
Thank you for your interest in Way With Words' off-the-shelf Speech Collection Dataset in South African English. This collection features 63 participants in the age range of 18 - 69. Participants were sourced from all nine provinces of South Africa (Western Cape, Eastern Cape, KwaZulu-Natal, Mpumalanga, Limpopo, North-West, Northern Cape, Free State, and Gauteng) with a gender split across recorded hours of 50% female and 50% male participants. 27% of participants have completed high school, 24% of participants are at an undergraduate level, 1% of participants have a certificate qualification, 5% of participants have a diploma qualification and 43% have obtained graduate degrees. This dataset is equally split across four domains: Insurance, Retail, Debt Collection, and Travel.

Country Coverage

Africa (1)
South Africa

Volume

50 Hours

Pricing

Free sample available
License Starts at
One-off purchase Available
Monthly License Not available
Yearly License Not available
Usage-based Available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

Self-reported by the provider
99%
Accurate

Delivery

Methods
S3 Bucket
SFTP
Frequency
daily
weekly
Format
.json
.xml
.csv
.xls
.txt

Use Cases

Machine Learning (ML)
Natural Language Processing (NLP)
Speech Recognition

Categories

Related Products

50 Hours
99% Accurate
South Africa covered
50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 46 participants from Western Cape, No...
100K hours per month
99.5% word accuracy
130 countries covered
Nexdata is equipped with professional recording equipment and has resources pool of 70+ countries and regions, and provide various types of speech recognitio...
5K Videos
100% Quality
249 countries covered
We offer face anti-spoofing dataset designed to combat deceptive attacks on facial recognition systems, such as deepfakes and imprinted images. Our dataset i...
10K recordings
95% accuracy
64 countries covered
Authentic and spoofed faces recorded with different mobile phone cameras, showcasing both men and women, with and without glasses, under indoor and outdoor l...

Frequently asked questions

What is Way With Words’ South African English Speech Collection Dataset?

50 hours of simulated, unscripted agent-caller dialogue. Domains include: Insurance, Retail, Debt Collection, Travel. 63 participants from all South African provinces: Western Cape, Eastern Cape, KwaZulu-Natal, Mpumalanga, Limpopo, North-West, Northern Cape, Free State, and Gauteng.

What is Way With Words’ South African English Speech Collection Dataset used for?

This product has 3 key use cases. WayWithWords recommends using the data for Machine Learning (ML), Natural Language Processing (NLP), and Speech Recognition. Global businesses and organizations buy AI Training Data from WayWithWords to fuel their analytics and enrichment.

Who can use Way With Words’ South African English Speech Collection Dataset?

This product is best suited if you’re a Small Business looking for AI Training Data. Get in touch with WayWithWords to see what their data can do for your business and find out which integrations they provide.

Which countries does Way With Words’ South African English Speech Collection Dataset cover?

This product includes data covering 1 country like South Africa. WayWithWords is headquartered in United Kingdom.

How much does Way With Words’ South African English Speech Collection Dataset cost?

Pricing information for Way With Words’ South African English Speech Collection Dataset is available by getting in contact with WayWithWords. Connect with WayWithWords to get a quote and arrange custom pricing models based on your data requirements.

How can I get Way With Words’ South African English Speech Collection Dataset?

Businesses can buy AI Training Data from WayWithWords and get the data via S3 Bucket and SFTP. Depending on your data requirements and subscription budget, WayWithWords can deliver this product in .json, .xml, .csv, .xls, and .txt format.

What is the data quality of Way With Words’ South African English Speech Collection Dataset?

WayWithWords has reported that this product has the following quality and accuracy assurances: 99% Accurate. You can compare and assess the data quality of WayWithWords using Datarade’s data marketplace. WayWithWords has received 2 reviews from clients.

What are similar products to Way With Words’ South African English Speech Collection Dataset?

This product has 3 related products. These alternatives include Way With Words’ Afrikaans Speech Collection Dataset, Nexdata Speech Recognition Data Collection Services 100+ Languages Resources Audio Data Speech Recognition Data Machine Learning (ML) Data, and TagX - 5000+ Face Anti Spoofing Data Anti Spoofing Detection Face Recognition Fraud Detection KYC authentication Global coverage. You can compare the best AI Training Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Pricing available upon request
License Starts at
One-off purchase Available
Monthly License Not available
Yearly License Not available
Usage-based Available