Cultural heritage institutions do not only have physical documents, such as books, but also possess a goldmine of data that is waiting to have its full potential unlocked.
Since 2002, the National Library of Luxembourg (BnL) has been digitising a large variety of historical documents such as newspapers, monographs, manuscripts, postcards and even posters. To facilitate access of its collections to the widest possible audience, as well as to provide tools for education and research, the National Library of Luxembourg published part of its digitised newspapers and metadata as open datasets. All available data, APIs and tools can be found at:
The digitised newspapers have very precise metadata (XML format) and contain the full text, article-segmentation information (tagged articles, images, tables, advertisements, …), original scanned high resolution images (300ppi TIFF). Every block of information, from individual words to complete paragraphs or articles, have page coordinates. All digitised materials are also available for viewing through on a-z.lu and eluxemburgensia.lu.