Nexdata | OCR Data | 500,000 Images | Computer Vision Data | Invoice Data | AI Training Data

Nexdata
No reviews yet
Verified Data Provider
Volume: 500K images
Data Quality: 97% accuracy
Available Formats: .bin, .json, and .xml files
Coverage: 62 countries
History: 5 years

Data Dictionary

[Sample] Nexdata-OCR Data.csv
Attribute | Type | Example
Product Name | String | Data size
OCR Data | String | 50,000 images

[Sample] Nexdata-OCR Data.csv
Attribute | Type | Example
Dataset Name | String | 57,645 Images - Vertical OCR Data in Text Scenes
Samples | String | https://www.nexdata.ai/dataset/1226?source=Datarade

Description

This off-the-shelf OCR data covers natural scene images, handwriting, bills and documents, test papers, and more. The AI Training Data covers 20 languages, multiple natural scenes, and multiple photographic angles.
1. Specifications
Data size: 500,000 images
Collecting environment: shop plaques, stop boards, posters, tickets, road signs, comics, cover pictures, prompts/reminders, warnings, packing instructions, menus, building signs, etc.
Diversity: 20 languages, multiple natural scenes, multiple photographic angles (looking up, looking down, eye level)
Device: cellphone, camera
Image parameters: images are in .jpg format; annotation files are in .json format
Annotation content: line-level quadrilateral bounding box annotation and transcription of the text
Accuracy: a bounding box annotation is qualified when the error of each vertex of the quadrilateral is within 5 pixels; bounding box accuracy is at least 97%; text transcription accuracy is at least 97%

2. About Nexdata
Nexdata owns 1,000,000 hours of off-the-shelf speech recognition data, 800TB of Annotated Imagery Data, and about 2 billion pieces of Natural Language Processing (NLP) Data. This ready-to-go AI & ML Training Data supports instant delivery and can quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/ocr?source=Datarade
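
The accuracy criterion in the specifications (every vertex of a quadrilateral bounding box within 5 pixels) could be checked along the lines of the sketch below. This is only an illustration under stated assumptions: the listing does not say how vertex error is measured, so Euclidean distance to a reference vertex is assumed here, and the names is_qualified and box_accuracy, the sample coordinates, and the in-memory list-of-tuples layout are hypothetical rather than part of Nexdata's annotation format.

```python
from math import hypot

# Per-vertex error bound quoted in the specifications (pixels).
TOLERANCE_PX = 5.0


def is_qualified(annotated, reference, tolerance_px=TOLERANCE_PX):
    """Return True if every annotated vertex is within tolerance_px of its reference vertex.

    Assumption: error is the Euclidean distance between matching vertices,
    and both quadrilaterals list their four vertices in the same order.
    """
    if len(annotated) != 4 or len(reference) != 4:
        raise ValueError("a quadrilateral needs exactly four vertices")
    return all(
        hypot(ax - rx, ay - ry) <= tolerance_px
        for (ax, ay), (rx, ry) in zip(annotated, reference)
    )


def box_accuracy(pairs, tolerance_px=TOLERANCE_PX):
    """Return the fraction of (annotated, reference) pairs that are qualified."""
    if not pairs:
        raise ValueError("need at least one annotated/reference pair")
    qualified = sum(1 for a, r in pairs if is_qualified(a, r, tolerance_px))
    return qualified / len(pairs)


# Hypothetical sample data: one reference box and two candidate annotations.
reference = [(0, 0), (100, 0), (100, 20), (0, 20)]
close = [(3, 4), (100, 0), (100, 20), (0, 20)]  # first vertex is exactly 5 px off
far = [(6, 0), (100, 0), (100, 20), (0, 20)]    # first vertex is 6 px off

accuracy = box_accuracy([(close, reference), (far, reference)])
```

Under these assumptions, a delivered batch would meet the stated bounding box threshold when box_accuracy returns at least 0.97.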

Country Coverage

Africa (8)
Algeria
Egypt
Kenya
Libya
Morocco
Nigeria
South Africa
Tunisia
Asia (16)
China
Hong Kong
India
Indonesia
Japan
Korea (Republic of)
Malaysia
Pakistan
Qatar
Saudi Arabia
Singapore
Taiwan
Thailand
Turkey
United Arab Emirates
Vietnam
Europe (23)
Austria
Belarus
Belgium
Czech Republic
Denmark
Finland
France
Germany
Greece
Hungary
Ireland
Italy
Liechtenstein
Luxembourg
Netherlands
Norway
Poland
Portugal
Russian Federation
Spain
Sweden
Ukraine
United Kingdom
North America (3)
Canada
Mexico
United States of America
Oceania (2)
Australia
New Zealand
South America (10)
Argentina
Brazil
Chile
Colombia
Ecuador
Paraguay
Peru
Puerto Rico
Uruguay
Venezuela (Bolivarian Republic of)

History

5 years of historical data

Volume

500,000 images

Pricing

Free sample available
One-off purchase: starts at $5,000 / purchase
Monthly License: Not available
Yearly License: Not available
Usage-based: Not available

Suitable Company Sizes

Small Business
Medium-sized Business
Enterprise

Quality

97% accuracy (self-reported by the provider)

Delivery

Methods
S3 Bucket
SFTP
Email
UI Export
REST API
SOAP API
Streaming API
Feed API
Frequency
secondly
minutely
hourly
daily
weekly
monthly
quarterly
yearly
real-time
on-demand
Format
.bin
.json
.xml
.csv
.xls
.sql
.txt

Use Cases

Artificial Intelligence (AI)
Machine Learning (ML)
Deep Learning
Handwriting Image
Optical Character Recognition

Related Products

500K images per month
98% accuracy
116 countries covered
Nexdata supports multi-scene and multi-language OCR data collection services for Machine Learning (ML) Data, such as handwriting, invoice data, natural scene...
50K images
97% accuracy
160 countries covered
Pre-collected OCR datasets include images of natural scenes, handwritten texts, bills and documents, and test papers. The AI training data spans 20 languages...
5K Videos
100% Quality
249 countries covered
We offer face anti-spoofing dataset designed to combat deceptive attacks on facial recognition systems, such as deepfakes and imprinted images. Our dataset i...
420M MAU
95% Match rate
248 countries covered
We provide POI Data, which can be used to train AI & ML Models on 14M physical locations globally, and unlock a wide range of use cases, from marketing to publi...

Frequently asked questions

What is Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data?

This off-the-shelf OCR data covers natural scene images, handwriting, bills and documents, test papers, and more. The AI Training Data covers 20 languages, multiple natural scenes, and multiple photographic angles.

What is Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data used for?

This product has 5 key use cases. Nexdata recommends using the data for Artificial Intelligence (AI), Machine Learning (ML), Deep Learning, Handwriting Image, and Optical Character Recognition. Global businesses and organizations buy Annotated Imagery Data from Nexdata to fuel their analytics and enrichment.

Who can use Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data?

This product is best suited if you’re a Medium-sized Business, Enterprise, or Small Business looking for Annotated Imagery Data. Get in touch with Nexdata to see what their data can do for your business and find out which integrations they provide.

How far back does the data in Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data go?

This product has 5 years of historical coverage. It can be delivered on a secondly, minutely, hourly, daily, weekly, monthly, quarterly, yearly, real-time, and on-demand basis.

Which countries does Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data cover?

This product includes data covering 62 countries, such as the USA, China, Japan, Germany, and India. Nexdata is headquartered in the United States of America.

How much does Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data cost?

Pricing for Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data starts at USD 5,000 per purchase. Connect with Nexdata to get a quote and arrange custom pricing models based on your data requirements.

How can I get Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data?

Businesses can buy Annotated Imagery Data from Nexdata and get the data via S3 Bucket, SFTP, Email, UI Export, REST API, SOAP API, Streaming API, and Feed API. Depending on your data requirements and subscription budget, Nexdata can deliver this product in .bin, .json, .xml, .csv, .xls, .sql, and .txt formats.

What is the data quality of Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data?

Nexdata has reported that this product has the following quality and accuracy assurances: 97% Accuracy. You can compare and assess the data quality of Nexdata using Datarade’s data marketplace.

What are similar products to Nexdata OCR Data 500,000 Images Computer Vision Data Invoice Data AI Training Data?

This product has 3 related products. These alternatives include Nexdata OCR Data Collection Services 100+ Languages Resources Computer Vision Data Image Collection for Machine Learning (ML) Data, FileMarket Text Recognition Data 50,000 Images Computer Vision Data AI Model Training Data Textual data Annotated Imagery Data, and TagX - 5000+ Face Anti Spoofing Data Anti Spoofing Detection Face Recognition Fraud Detection KYC authentication Global coverage. You can compare the best Annotated Imagery Data providers and products via Datarade’s data marketplace and get the right data for your use case.

Nexdata

Sharpen Your AI with Better Data

Verified Provider
4h Avg. response time
100% Response rate