English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations

FileMarket
Verified Data Provider
Sample preview columns: ID, Gender, Country, City, Language, Age, Audio Length, Validated (row values are masked in the listing)
Volume: 1K Hours
Data Quality: 97% Data Accuracy
Avail. Format: .wav File
Coverage: 6 Countries

Data Dictionary

[Sample] Sample Central America Accent Dataset

Attribute       Type      Example
ID              String    EL 4012
Gender          String    ******
Country         String    El Salvador
City            String    Santa Ana
Language        String    English
Age             Integer   ##
Audio Length    String    30:02
Validated       Boolean   t
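
The data dictionary above can be turned into a small loader for the metadata CSV. The following is a minimal sketch, not the provider's tooling. It assumes that the CSV column headers match the attribute names listed above, that Audio Length is formatted as MM:SS (or HH:MM:SS), and that Validated uses "t" for true. The `SpeakerRecord` class and all function names are hypothetical, and the gender and age values in the sample row are made up because the listing masks them.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class SpeakerRecord:
    speaker_id: str
    gender: str
    country: str
    city: str
    language: str
    age: int
    audio_seconds: int
    validated: bool


def parse_audio_length(value: str) -> int:
    """Convert 'MM:SS' or 'HH:MM:SS' to seconds (the format is an assumption)."""
    seconds = 0
    for part in value.strip().split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def parse_row(row: dict) -> SpeakerRecord:
    """Map one CSV row onto typed fields, following the data dictionary."""
    return SpeakerRecord(
        speaker_id=row["ID"],
        gender=row["Gender"],
        country=row["Country"],
        city=row["City"],
        language=row["Language"],
        age=int(row["Age"]),
        audio_seconds=parse_audio_length(row["Audio Length"]),
        # Assumption: "t" (or "true") marks a validated recording.
        validated=row["Validated"].strip().lower() in {"t", "true"},
    )


def load_metadata(handle) -> list:
    """Read every row from an open CSV file object."""
    return [parse_row(row) for row in csv.DictReader(handle)]


# Illustrative row: ID, country, city, language, length, and validated flag
# come from the sample above; gender and age are placeholders.
SAMPLE_CSV = """ID,Gender,Country,City,Language,Age,Audio Length,Validated
EL 4012,female,El Salvador,Santa Ana,English,30,30:02,t
"""

records = load_metadata(io.StringIO(SAMPLE_CSV))
print(records[0].audio_seconds)  # 1802
```

Storing the length in seconds makes it straightforward to sum total audio per country or to filter out recordings that have not been validated.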

Description

High-quality English speech dataset with authentic accents from Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. Perfect for AI training, accent recognition, and speech research.
The Central America English Accent Speech Dataset features real conversations from native and bilingual English speakers across Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. This curated collection provides authentic English speech with distinct regional accents, recorded in natural conversational settings.

The dataset is ideal for:
- AI speech training and accent detection
- Automatic speech recognition (ASR) model development
- Natural language processing (NLP) applications
- Conversational AI, chatbots, and voice assistants

Key Features:
✅ Native and bilingual speakers with verified metadata (age, gender, country)
✅ Clean audio, human-validated for accent clarity
✅ Over 1,000 hours of recordings from 2,000 speakers
✅ Comprehensive CSV metadata with accent labels
✅ Licensed for commercial AI training and research use

Country Coverage

North America (5)
Costa Rica
Dominican Republic
El Salvador
Guatemala
Mexico
South America (1)
Colombia

Volume

1,000 Hours

Pricing

10% discount if you buy via Datarade
Revenue share is available at 20%
License: Starts at
One-off purchase: Not available
Monthly License: Not available
Yearly License: Not available
Usage-based: $18 / hour (discounted from $20 / hour)
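
As a rough illustration of the usage-based pricing, the sketch below estimates the cost of licensing a given number of audio hours. It uses the $20 per hour list price and the 10% Datarade discount from this listing. It assumes strictly linear pricing with no volume tiers or minimum order, which the listing does not confirm, so actual quotes may differ. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

LIST_PRICE_PER_HOUR = Decimal("20")   # USD per audio hour, from the listing
DATARADE_DISCOUNT = Decimal("0.10")   # 10% discount, from the listing


def estimate_cost(hours, discounted=True):
    """Return the estimated USD cost for a number of licensed audio hours.

    Assumes linear pricing; custom quotes may not follow this.
    """
    rate = LIST_PRICE_PER_HOUR
    if discounted:
        rate = rate * (Decimal("1") - DATARADE_DISCOUNT)
    return (Decimal(str(hours)) * rate).quantize(Decimal("0.01"))


print(estimate_cost(1000))         # 18000.00
print(estimate_cost(1000, False))  # 20000.00
```

Under these assumptions, the full 1,000 hours would come to USD 18,000 at the discounted rate.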

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

97% Data Accuracy (self-reported by the provider)

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Websocket
FIX API
Netty
Compressed File
Snowflake Share
Google BigQuery
Google Cloud Storage
Azure Blob Storage
Databricks Delta Share
MCP Server
RAG API
Frequency
on-demand
Format
.wav
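
Because the audio is delivered as .wav files, it can be useful to check each file's header before training. The listing does not state the sample rate, bit depth, or channel count, so the sketch below only reads whatever the file reports. It uses Python's standard `wave` module, which handles uncompressed PCM files; the helper name `describe_wav` is hypothetical, and the 16 kHz mono test signal is generated purely to exercise the function and says nothing about the dataset itself.

```python
import io
import wave


def describe_wav(source):
    """Return basic header properties of a PCM .wav file (path or file object)."""
    with wave.open(source, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": rate,
            "duration_seconds": frames / rate,
        }


# Build one second of 16-bit mono silence in memory as a stand-in file.
buffer = io.BytesIO()
with wave.open(buffer, "wb") as out:
    out.setnchannels(1)
    out.setsampwidth(2)
    out.setframerate(16000)
    out.writeframes(b"\x00\x00" * 16000)
buffer.seek(0)

info = describe_wav(buffer)
print(info)
```

Comparing `duration_seconds` against the Audio Length column in the metadata is one way to spot truncated or mismatched files after download.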

Frequently asked questions

What is English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations?

High-quality English speech dataset with authentic accents from Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. Perfect for AI training, accent recognition, and speech research.

What is English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations used for?

This product has 2 key use cases. FileMarket recommends using the data for Speech Recognition and LLM Training. Global businesses and organizations buy Natural Language Processing (NLP) Data from FileMarket to fuel their analytics and enrichment.

Who can use English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with FileMarket to see what their data can do for your business and find out which integrations they provide.

Which countries does English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations cover?

This product includes data covering 6 countries: Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. FileMarket is headquartered in the United States of America.

How much does English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations cost?

Pricing for this product starts at USD 20 per hour of audio. FileMarket offers a 10% discount when you buy through Datarade, which brings the rate to USD 18 per hour. Connect with FileMarket to get a quote and arrange custom pricing models based on your data requirements.

How can I get English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations?

Businesses can buy Natural Language Processing (NLP) Data from FileMarket and get the data via S3 Bucket, SFTP, Email, UI Export, REST API, SOAP API, Streaming API, Feed API, Websocket, FIX API, Netty, Compressed File, Snowflake Share, Google BigQuery, Google Cloud Storage, Azure Blob Storage, Databricks Delta Share, MCP Server, and RAG API. Depending on your data requirements and subscription budget, FileMarket can deliver this product in .wav format.

What is the data quality of English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations?

FileMarket has reported that this product has the following quality and accuracy assurances: 97% Data Accuracy. You can compare and assess the data quality of FileMarket using Datarade’s data marketplace.

What are similar products to English Accent Speech Dataset (Central America) — Authentic Local Speaker Conversations?

This product has 3 related products. These alternatives include 8kHz Conversational Speech Data 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data, Global English Speech with Accent Conversational Dataset — Multi-Region Validated Speech with Gender, Age & Metadata for AI & NLP Training, and Machine Learning (ML) Data 800M+ B2B Profiles AI-Ready for Deep Learning (DL), NLP & LLM Training. You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.
