Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations

FileMarket
No reviews yet
Verified Data Provider
Volume: 1K Hours
Available Format: .wav File
Coverage: 6 Countries

Data Dictionary

[Sample] Sample Latin America English Accent Dataset
Attribute | Type | Example | Mapping
ID | String | EL 4012 |
Gender | String | ****** |
Country | String | El Salvador |
City | String | Santa Ana |
Language | String | English |
Age | Integer | ## |
Audio Length | String | 30:02 |
Validated | Boolean | t |
Samples | String | https://drive.google.com/drive/folders/1yhGn-kT8EVn5_TWft... |
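The data dictionary above can be read as a schema for the CSV metadata file. The following is a minimal sketch of loading such a file in Python, not the provider's own tooling. It assumes the CSV column names match the attribute names exactly, that `Audio Length` is `MM:SS` or `HH:MM:SS`, and that `t` encodes a true value in `Validated`. The gender, age, and URL values in the sample row are hypothetical placeholders, since the listing masks or truncates them.

```python
import csv
import io
from dataclasses import dataclass
from typing import List


@dataclass
class SpeakerRecord:
    id: str
    gender: str
    country: str
    city: str
    language: str
    age: int
    audio_seconds: int
    validated: bool
    samples: str


def parse_length(text: str) -> int:
    """Convert 'MM:SS' or 'HH:MM:SS' to seconds (the format is an assumption)."""
    seconds = 0
    for part in text.strip().split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def parse_row(row: dict) -> SpeakerRecord:
    """Map one CSV row onto the typed record described by the data dictionary."""
    return SpeakerRecord(
        id=row["ID"],
        gender=row["Gender"],
        country=row["Country"],
        city=row["City"],
        language=row["Language"],
        age=int(row["Age"]),
        audio_seconds=parse_length(row["Audio Length"]),
        # Assumption: 't' / 'true' mean validated; anything else means not validated.
        validated=row["Validated"].strip().lower() in {"t", "true"},
        samples=row["Samples"],
    )


def load_records(csv_text: str) -> List[SpeakerRecord]:
    return [parse_row(r) for r in csv.DictReader(io.StringIO(csv_text))]


# Hypothetical sample row: gender, age, and URL are placeholders, not real data.
SAMPLE_CSV = """ID,Gender,Country,City,Language,Age,Audio Length,Validated,Samples
EL 4012,F,El Salvador,Santa Ana,English,30,30:02,t,https://example.com/samples
"""

if __name__ == "__main__":
    for record in load_records(SAMPLE_CSV):
        print(record)
```

Storing the audio length as integer seconds makes it straightforward to sum total hours per country or filter out short recordings.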

Description

High-quality English speech dataset with authentic accents from Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. Perfect for AI training, accent recognition, and speech research.
The Latin American English Accent Speech Dataset features real conversations from native and bilingual English speakers across Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. This curated collection provides authentic English speech with distinct regional accents, recorded in natural conversational settings.

The dataset is ideal for:
AI speech training and accent detection
Automatic speech recognition (ASR) model development
Natural language processing (NLP) applications
Conversational AI, chatbots, and voice assistants

Key Features:
✅ Native and bilingual speakers with verified metadata (age, gender, country)
✅ Clean audio, human-validated for accent clarity
✅ Over 1,000 hours of recordings from 2,000 speakers
✅ Comprehensive CSV metadata with accent labels
✅ Licensed for commercial AI training and research use

Country Coverage

North America (5)
Costa Rica
Dominican Republic
El Salvador
Guatemala
Mexico
South America (1)
Colombia

Volume

1,000 Hours

Pricing

10% discount if you buy via Datarade
Revenue share is available at 20%
License Starts at
One-off purchase: $19,800 / purchase (list price $22,000)
Monthly License: Available
Yearly License: Available
Usage-based: $19.80 / hour (list price $22)
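The discounted figures follow directly from the 10% Datarade discount. The sketch below reproduces the arithmetic under stated assumptions: usage-based pricing is billed per hour of audio, and the discount applies uniformly to both license types. The function names are illustrative and do not correspond to any FileMarket or Datarade API.

```python
from decimal import Decimal

LIST_PRICE_PER_HOUR = Decimal("22")    # USD, usage-based list price
LIST_PRICE_ONE_OFF = Decimal("22000")  # USD, one-off list price
DATARADE_DISCOUNT = Decimal("0.10")    # 10% discount via Datarade


def discounted(price: Decimal, discount: Decimal = DATARADE_DISCOUNT) -> Decimal:
    """Apply a fractional discount and round to cents."""
    return (price * (1 - discount)).quantize(Decimal("0.01"))


def usage_cost(hours: int) -> Decimal:
    """Cost of licensing `hours` of audio at the discounted hourly rate.

    Assumption: billing is linear in hours, with no volume tiers.
    """
    return discounted(LIST_PRICE_PER_HOUR) * hours


if __name__ == "__main__":
    print(discounted(LIST_PRICE_PER_HOUR))  # discounted hourly rate
    print(discounted(LIST_PRICE_ONE_OFF))   # discounted one-off price
    print(usage_cost(250))                  # e.g. a 250-hour subset
```

Under these assumptions, licensing all 1,000 hours at the hourly rate costs the same as the one-off purchase, so the usage-based option mainly matters when only a subset of the audio is needed.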

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
Compressed File
Google Cloud Storage
Azure Blob Storage
REST API
SOAP API
Streaming API
Feed API
Websocket
FIX API
Netty
Snowflake Share
Google BigQuery
Databricks Delta Share
MCP Server
RAG API
Frequency
on-demand
Format
.wav
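Because delivery is in .wav format, a buyer may want to check each file's duration against the `Audio Length` value in the metadata. Here is a minimal sketch using Python's standard `wave` module. The listing does not state a sample rate, channel count, or bit depth, so the 16 kHz mono 16-bit values used to build the test file are assumptions for illustration only.

```python
import io
import wave


def wav_info(source) -> dict:
    """Read basic properties from a WAV file path or binary file object."""
    with wave.open(source, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_seconds": frames / rate,
        }


def make_silent_wav(seconds: int, rate: int = 16000) -> io.BytesIO:
    """Build an in-memory mono 16-bit WAV of silence, used only for the demo.

    The 16 kHz rate is an assumed value, not a property of the dataset.
    """
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * rate * seconds)
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(wav_info(make_silent_wav(2)))
```

In practice `wav_info` would be called with the path of a delivered file, and the resulting duration compared with the metadata entry for that recording.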

Use Cases

Categories

Related Products

Frequently asked questions

What is Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations?

High-quality English speech dataset with authentic accents from Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. Perfect for AI training, accent recognition, and speech research.

What is Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations used for?

This product has 2 key use cases. FileMarket recommends using the data for Speech Recognition and LLM Training. Global businesses and organizations buy Natural Language Processing (NLP) Data from FileMarket to fuel their analytics and enrichment.

Who can use Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations?

This product is best suited if you’re a Small Business, Medium-sized Business, or Enterprise looking for Natural Language Processing (NLP) Data. Get in touch with FileMarket to see what their data can do for your business and find out which integrations they provide.

Which countries does Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations cover?

This product covers 6 countries: Mexico, Colombia, Dominican Republic, Costa Rica, Guatemala, and El Salvador. FileMarket is headquartered in the United States of America.

How much does Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations cost?

Pricing for Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations starts at USD 22 per hour. FileMarket offers a 10% discount when you buy data from them through Datarade. Connect with FileMarket to get a quote and arrange custom pricing models based on your data requirements.

How can I get Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations?

Businesses can buy Natural Language Processing (NLP) Data from FileMarket and get the data via S3 Bucket, SFTP, Email, UI Export, Compressed File, Google Cloud Storage, Azure Blob Storage, REST API, SOAP API, Streaming API, Feed API, Websocket, FIX API, Netty, Snowflake Share, Google BigQuery, Databricks Delta Share, MCP Server, and RAG API. Depending on your data requirements and subscription budget, FileMarket can deliver this product in .wav format.

What is the data quality of Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations?

You can compare and assess the data quality of FileMarket using Datarade’s data marketplace.

What are similar products to Latin American English Accent Speech Dataset — Authentic Local Speaker Conversations?

This product has 3 related products. These alternatives include:
Global English Speech with Accent Conversational Dataset — Multi-Region Validated Speech with Gender, Age & Metadata for AI & NLP Training
8kHz Conversational Speech Data 15,000 Hours Audio Data Speech Recognition Data Machine Learning (ML) Data
Machine Learning (ML) Data 800M+ B2B Profiles AI-Ready for Deep Learning (DL), NLP & LLM Training
You can compare the best Natural Language Processing (NLP) Data providers and products via Datarade’s data marketplace and get the right data for your use case.

FileMarket

Unique Audio and Multimedia Datasets for AI

Verified Provider
100% Response rate
