Originally published at: U.S. Library of Congress Processes over 16 Million Historic Newspaper Pages Using AI | NVIDIA Technical Blog
Digitizing millions of historical documents and newspapers is a challenging task. To help speed up the process, the U.S. Library of Congress developed a GPU-accelerated, deep learning model to automatically extract, categorize, and caption over 16 million pages of historic American newspapers published between 1789 and 1963. The work, which is being made publicly available…