U.S. Library of Congress Processes over 16 Million Historic Newspaper Pages Using AI

jwitsoe · August 21, 2022, 11:49pm

Originally published at: U.S. Library of Congress Processes over 16 Million Historic Newspaper Pages Using AI | NVIDIA Technical Blog

Digitizing millions of historical documents and newspapers is a challenging task. To help speed up the process, the U.S. Library of Congress developed a GPU-accelerated, deep learning model to automatically extract, categorize, and caption over 16 million pages of historic American newspapers published between 1789 and 1963. The work, which is being made publicly available…