I
searched high and low for a product that
would handle PDF text with embedded images … I
used the dtSearch hit-highlighting feature
to highlight words seemingly right on the
scans.
|
|
FultonHistory.com
has scanned in over one hundred and fifty years
of newspapers into searchable PDF. Visitors
to this publicly-available site can use dtSearch
to search the entire collection—with highlighted
hits appearing right on the scanned newspaper
pictures. The site covers over 1.5 million
searchable newspapers of the Upstate and Central
New York region, spanning a large number of
different newspapers.

|
Because
of the potential for OCR errors when scanning
old newspapers, dtSearch's fuzzy searching
is really important.
|
|
“I
searched high and low for a product that would
handle PDF text with embedded images,” explains
Tom Tryniski, the architect of the site. “I
converted microfilm of the newspapers to 'hidden
text' PDFs, and then used dtSearch to search
the resulting PDFs. I used the dtSearch
hit-highlighting feature to highlight words
seemingly right on the scans.” Added
Mr. Tryniski: “Because of the potential
for OCR errors when scanning old newspapers,
dtSearch's fuzzy searching is really important.”
“I
did a search for the Titanic, and instantly pulled
up actual copies of newspapers from the original
disaster in 1912,” said
David Thede, president, dtSearch. “The
results even spanned more modern coverage of
the discovery of the underwater wreck. To
instantly pull up such an historical reservoir
is amazing.”
To search 150+ years
of newspapers, as well as view other historical
photos and postcards, please visit fultonhistory.com.
Or contact Mr. Tryniski directly at tryniski@fultonhistory.com.
|