Best Transcription Datasets & Databases

Easily explore, compare & preview top Transcription Datasets via Datarade.

Filter by

Free sample preview25

Country Coverage

United States of America31

United Kingdom30

Spain27

+ 247 more

Attributes

Language Name7

Stock Ticker4

Hashed Email Address4

+ 4 more

Use case

Speech Recognition28

Artificial Intelligence (AI)22

LLM Training16

+ 20 more

Data Provider

StageZero14

Nexdata10

WiserBrand.com6

+ 11 more

Delivery Method

Email43

REST API23

S3 Bucket23

+ 8 more

52 Transcription Data Datasets

and 58 more countries

and 22 more countries

and 58 more countries

and 245 more countries

and 109 more countries

and 16 more countries

and 235 more countries

Can't find the data you're looking for?

Let data providers come to you by posting your request

/postings/new?utm_content=search_results_page&utm_medium=platform&utm_source=datarade

More Transcription Data Products

Discover related transcription data products.

$5,000$4,500 / purchase

Pricing available upon request

$5,000$4,500 / purchase

$5,000$4,500 / purchase

Pricing available upon request

Pricing available upon request

$5,000$4,500 / purchase

$5,000$4,500 / purchase

Best Transcription Datasets & Databases

Global Consumer Review Data | Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies

Global Transcription Services and Companies

AI Training Data | US Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies

China Investor Relations Activities Analytics (CIRA) | Investor Relation Event Data | China earnings call transcript | Alternative Data | Daily Update

TVEyes Global Podcast Transcript Data API

Earnings Call Transcripts - 7,000+ Companies Covered

Speech ML / DL Data | On demand Hours of Text-To-Speech (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers 180+ Countries

Audio Annotation Services | AI-assisted Labeling |Speech Data | AI Training Data | Natural Language Processing (NLP) Data

LATAM Data Suite | 1.8M+ Sentences | Natural Language Processing (NLP) Data | TTS | Dictionary Display | Translation Data | LATAM Coverage

FileMarket | 20,000 Voice Memos | Multilingual Training Data for Conversational AI | Machine Learning (ML) Data

Can't find the data you're looking for?

More Transcription Data Products

AI Training Data | US Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies

Audio Annotation Services | AI-assisted Labeling |Speech Data | AI Training Data | Natural Language Processing (NLP) Data

Mixed Speech Data |5,000 Hours |Code-switching|Audio Data| Speech Recognition Data| AI Datasets

Consumer Marketing Data | Unique Consumer Sentiment Data: Transcription of the calls to the companies

Customer Feedback Data | Customer Experience Data | Unique Consumer Sentiment Data: Transcription of the calls to the companies

Customer Service Call Dataset [Multisector] – Annotated support transcripts for training AI and improving CX

Norwegian audio dataset for speech recognition 20 hours (1/5)

Speech ML / DL Data | On demand, Scripted Conversations (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers from 180+ Countries

Call Transcription Dataset [USA] – Real customer conversations for CX, NLP, and AI training

Real-world Casual Conversation and Monologue Speech Data | 20,000 Hours | Spontaneous Speech |Audio Data

TVEyes Global Podcast Transcript Data API

Mixed Speech Data |5,000 Hours |Code-switching|Audio Data| Speech Recognition Data| AI Datasets

Speech ML / DL Data | On demand, Scripted Conversations (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers from 180+ Countries

Speech ML/ DL Data - Spontaneous Conversations On-Demand - Accent & Dialect Focus

Speech ML / DL Data | On demand Hours of Text-To-Speech (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers 180+ Countries

Speech ML / DL Data | On demand Hours of Spontaneous Conversations (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers 180+ Countries

Customer Service Call Dataset [Multisector] – Annotated support transcripts for training AI and improving CX

Call Transcription Dataset [USA] – Real customer conversations for CX, NLP, and AI training

Best Transcription Datasets & Databases

Global Consumer Review Data | Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies

Global Transcription Services and Companies

AI Training Data | US Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies

China Investor Relations Activities Analytics (CIRA) | Investor Relation Event Data | China earnings call transcript | Alternative Data | Daily Update

TVEyes Global Podcast Transcript Data API

Earnings Call Transcripts - 7,000+ Companies Covered

Speech ML / DL Data | On demand Hours of Text-To-Speech (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers 180+ Countries

Audio Annotation Services | AI-assisted Labeling |Speech Data | AI Training Data | Natural Language Processing (NLP) Data

LATAM Data Suite | 1.8M+ Sentences | Natural Language Processing (NLP) Data | TTS | Dictionary Display | Translation Data | LATAM Coverage

FileMarket | 20,000 Voice Memos | Multilingual Training Data for Conversational AI | Machine Learning (ML) Data

Can't find the data you're looking for?

More Transcription Data Products

AI Training Data | US Transcription Data| Unique Consumer Sentiment Data: Transcription of the calls to the companies

Audio Annotation Services | AI-assisted Labeling |Speech Data | AI Training Data | Natural Language Processing (NLP) Data

Mixed Speech Data |5,000 Hours |Code-switching|Audio Data| Speech Recognition Data| AI Datasets

Consumer Marketing Data | Unique Consumer Sentiment Data: Transcription of the calls to the companies

Customer Feedback Data | Customer Experience Data | Unique Consumer Sentiment Data: Transcription of the calls to the companies

Customer Service Call Dataset [Multisector] – Annotated support transcripts for training AI and improving CX

Norwegian audio dataset for speech recognition 20 hours (1/5)

Speech ML / DL Data | On demand, Scripted Conversations (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers from 180+ Countries

Call Transcription Dataset [USA] – Real customer conversations for CX, NLP, and AI training

Real-world Casual Conversation and Monologue Speech Data | 20,000 Hours | Spontaneous Speech |Audio Data

TVEyes Global Podcast Transcript Data API

Mixed Speech Data |5,000 Hours |Code-switching|Audio Data| Speech Recognition Data| AI Datasets

Speech ML / DL Data | On demand, Scripted Conversations (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers from 180+ Countries

Speech ML/ DL Data - Spontaneous Conversations On-Demand - Accent & Dialect Focus

Speech ML / DL Data | On demand Hours of Text-To-Speech (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers 180+ Countries

Speech ML / DL Data | On demand Hours of Spontaneous Conversations (Hard-to-Source Languages) | GDPR, CCPA Compliant | Native Speakers 180+ Countries

Customer Service Call Dataset [Multisector] – Annotated support transcripts for training AI and improving CX

Call Transcription Dataset [USA] – Real customer conversations for CX, NLP, and AI training

Stay updated with Datarade