ALTO (XML) files containing OCR text (most have been proofread)
dc.contributor.author | Royal Danish Library | |
dc.date.accessioned | 2021-09-09T16:00:11Z | |
dc.date.available | 2021-09-09T16:00:11Z | |
dc.date.issued | 2021-10-21 | |
dc.description.abstract | Data files in ALTO format from the Royal Danish Library's digitalization of the collection Freedom of Press Writings (Danish: Trykkefrihedens Skrifter). The ALTO files contain OCR text that for most of the files have undergone proofreading. The positions of identified lines and words on the corresponding facsimiles are indicated with pixel values. | en |
dc.identifier.uri | https://loar.kb.dk/handle/1902/7791 | |
dc.identifier.uri | http://dx.doi.org/10.21994/loar7597 | |
dc.publisher | Royal Danish Library | en |
dc.rights | CC Public Domain | * |
dc.rights.uri | https://creativecommons.org/publicdomain/mark/1.0/deed.en | * |
dc.subject | ALTO (XML) format | en |
dc.title | ALTO (XML) files containing OCR text (most have been proofread) | en |
dc.type | Dataset | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- ALTO-20211021.zip
- Size:
- 192.15 MB
- Format:
- Unknown data format
- Description:
- 28.645 ALTO files (UNIX End-of-Lines, UTF-8 charset)
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 4.44 KB
- Format:
- Item-specific license agreed upon to submission
- Description: