Is collection data enriched automatically?

Some of the collections in the Media Suite have been enriched automatically, and data enrichments are added as search layers in addition to the institutional metadata of the collections. For the most part, data enrichments are created using Automatic Speech Recognition (ASR) or Optical Character Recognition (OCR).

Automatic Speech Recognition (ASR)

ASR is a process applied to recorded, digitised audio materials that allows for converting speech into text. The Media Suite applies this process to the entire audiovisual collection of The Netherlands Insitute for Sound and Vision (NISV), including both television and radio resources. Beginning in July 2018, the textual outputs are gradually made available and updated in order to improve search and navigation of the Media Suite’s resources.

The most complete, up-to date automatic speech recognition reports (and other statistics of the NISV collections) are available at the NISV collection statistics website . Here you will find interactive and up-to-date visualizations of the progress of ASR as it is being processed. The numbers in these visualizations reflect the entire collection of NISV, showing the amount of digitized items with a carrier and, from those, the amount of items that have ASR.

For more information about the Media Suite’s ASR processing, you may consult the publication “Speech Recognition and Scholarly Research” by Roeland Ordelman and Arjan van Hessen.

Optical character recognition (OCR)

This type of automatic enrichment is available for the following collections:

NB :

  • If you would like to use our data enrichment visualizations for a publication, you can use the direct link to the graph , zoom in to the timeline, and use the camera icon above the graph to download the result. Please don’t forget to cite the chart properly by including:

    • Description of the image or title of the image (as it appears in each graphic’s caption, or adding more detail if necessary to interpret the graphic)

    • Publisher: The Netherlands Institute for Sound and Vision

    • Editor: Mari Wigham and Willem Melder

    • Edition or version (charts in this website are updated automatically, thus, the date of update is the same as the date of download/copy)

    • Access information (Website’s URL and/or graphic URL)