Newspapers 1850-1883

A collection of OCR text from digitised newspapers, stored in CSV files.

Each CSV file contains the following fields:

  • recordID: Unique identifier of the scanned newspaper article. To find the article in Mediestream, search for recordID: "recordID", replacing the quoted recordID with the identifier value.
  • timestamp: The date the newspaper was printed.
  • editionID: The ID of the newspaper.
  • newspaper_page: The scanned newspaper page.
  • fulltext_org: Text generated by OCR of the scanned article.
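
The field list above can be read with a few lines of Python. This is a minimal sketch, not a description of how the files were produced. It assumes a comma-delimited file with a header row that uses the field names listed above. The sample row, including its date format and identifier values, is made up for illustration, and the helper names read_articles and mediestream_query are hypothetical.

```python
import csv
import io

# Field names as listed above. A header row carrying these names is an assumption.
EXPECTED_COLUMNS = ["recordID", "timestamp", "editionID", "newspaper_page", "fulltext_org"]


def read_articles(handle):
    """Yield one dict per article row from an open CSV file handle."""
    reader = csv.DictReader(handle)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    for row in reader:
        yield row


def mediestream_query(record_id):
    """Build the search string described for recordID above."""
    return f'recordID: "{record_id}"'


# Made-up sample data; real identifiers and date formats may differ.
SAMPLE = (
    "recordID,timestamp,editionID,newspaper_page,fulltext_org\n"
    'example-record-1,1850-01-01,example-edition,1,"Sample OCR text, with a comma."\n'
)

articles = list(read_articles(io.StringIO(SAMPLE)))
print(articles[0]["fulltext_org"])
print(mediestream_query(articles[0]["recordID"]))
```

For a file on disk, the same function could be called with open(path, newline="", encoding="utf-8"). The UTF-8 encoding is likewise an assumption and may need adjusting.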