
Multilingual Conversational AI Training Data (Text & Audio)
A dataset by ShAIp
Pricing available upon request
Volume
20,000 hours of audio
Use Cases
Artificial Intelligence (AI)
Machine Learning (ML)
Natural Language Processing (NLP)
Data-Efficient Machine Learning
Geography
Africa (58)
Algeria
Angola
Benin
Botswana
Burkina Faso
Burundi
Cabo Verde
Cameroon
Central African Republic
Chad
Comoros
Congo
Congo (Democratic Republic of the)
Côte d'Ivoire
Djibouti
Egypt
Equatorial Guinea
Eritrea
Ethiopia
Gabon
Gambia
Ghana
Guinea
Guinea-Bissau
Kenya
Lesotho
Liberia
Libya
Madagascar
Malawi
Mali
Mauritania
Mauritius
Mayotte
Morocco
Mozambique
Namibia
Niger
Nigeria
Rwanda
Réunion
Saint Helena, Ascension and Tristan da Cunha
Sao Tome and Principe
Senegal
Seychelles
Sierra Leone
Somalia
South Africa
South Sudan
Sudan
Swaziland
Tanzania, United Republic of
Togo
Tunisia
Uganda
Western Sahara
Zambia
Zimbabwe
Asia (51)
Afghanistan
Armenia
Azerbaijan
Bahrain
Bangladesh
Bhutan
Brunei Darussalam
Cambodia
China
Cyprus
Georgia
Hong Kong
India
Indonesia
Iran (Islamic Republic of)
Iraq
Israel
Japan
Jordan
Kazakhstan
Korea (Democratic People's Republic of)
Korea (Republic of)
Kuwait
Kyrgyzstan
Lao People's Democratic Republic
Lebanon
Macao
Malaysia
Maldives
Mongolia
Myanmar
Nepal
Oman
Pakistan
Palestine, State of
Philippines
Qatar
Saudi Arabia
Singapore
Sri Lanka
Syrian Arab Republic
Taiwan, Province of China
Tajikistan
Thailand
Timor-Leste
Turkey
Turkmenistan
United Arab Emirates
Uzbekistan
Vietnam
Yemen
Europe (51)
Albania
Andorra
Austria
Belarus
Belgium
Bosnia and Herzegovina
Bulgaria
Croatia
Czech Republic
Denmark
Estonia
Faroe Islands
Finland
France
Germany
Gibraltar
Greece
Guernsey
Holy See
Hungary
Iceland
Ireland
Isle of Man
Italy
Jersey
Latvia
Liechtenstein
Lithuania
Luxembourg
Macedonia (the former Yugoslav Republic of)
Malta
Moldova (Republic of)
Monaco
Montenegro
Netherlands
Norway
Poland
Portugal
Romania
Russian Federation
San Marino
Serbia
Slovakia
Slovenia
Spain
Svalbard and Jan Mayen
Sweden
Switzerland
Ukraine
United Kingdom
Åland Islands
North America (13)
Belize
Bermuda
Canada
Costa Rica
El Salvador
Greenland
Guatemala
Honduras
Mexico
Nicaragua
Panama
Saint Pierre and Miquelon
United States of America
South America (42)
Anguilla
Antigua and Barbuda
Argentina
Aruba
Bahamas
Barbados
Bolivia (Plurinational State of)
Bonaire, Sint Eustatius and Saba
Brazil
Cayman Islands
Chile
Colombia
Cuba
Curaçao
Dominica
Dominican Republic
Ecuador
Falkland Islands (Malvinas)
French Guiana
Grenada
Guadeloupe
Guyana
Haiti
Jamaica
Martinique
Montserrat
Paraguay
Peru
Puerto Rico
Saint Barthélemy
Saint Kitts and Nevis
Saint Lucia
Saint Martin (French part)
Saint Vincent and the Grenadines
Sint Maarten (Dutch part)
Suriname
Trinidad and Tobago
Turks and Caicos Islands
Uruguay
Venezuela (Bolivarian Republic of)
Virgin Islands (British)
Virgin Islands (U.S.)
Product Description
Drawing on our experience with conversational AI, we helped the client source, curate, and transcribe the data required to train their AI-enabled speech model. We provided audio data collection and transcription services tailored to their requirements, with fully customized intents, utterances, and demographic distribution.
Languages Supported
Afrikaans, Arabic, Bengali, Chinese Mandarin, Danish, Hebrew, Hindi, Indonesian, Kannada, Malay, Marathi, Swahili, Swedish, Telugu, Thai, Vietnamese, New Zealand English, South African English, Hindi-English (Hinglish), Singaporean English (Singlish), Indian English, Australian English, UK English, US English, US Spanish
Suitable Company Sizes
Small Business
Medium-sized Business
Enterprise
Pricing
Free sample available
ShAIp has not published pricing information for this product yet.
Quality
Self-reported by the provider
Delivery
Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Frequency
secondly
minutely
hourly
daily
weekly
monthly
quarterly
yearly
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt