Another new tool, for finding news images

If there’s a genealogist alive who doesn’t love newspapers, The Legal Genealogist hasn’t encountered such.

And if there’s a newspaper-loving genealogist alive in the United States who doesn’t love Chronicling America, well, there’s clearly something wrong with you.

I mean, seriously, what’s not to love about an online collection of 17,013,539 pages from 3,263 newspapers starting in the 1780s — all available for free?

This treasure from the Library of Congress, with the full title of Chronicling America: Historic American Newspapers, is a go-to site for all kinds of researchers. It’s the outcome of “a long-term effort to develop an Internet-based, searchable database of U.S. newspapers with descriptive information and select digitization of historic pages” and is “produced by the National Digital Newspaper Program (NDNP). NDNP, a partnership between the National Endowment for the Humanities (NEH) and the Library of Congress (LC).”¹

The site has always had an advanced search tool to find text from the newspapers it digitizes, and though it can be a little quirky, it works pretty well. You can search by state or by newspaper, choose a year or years or a date range, select a language, and do proximity searches for words or phrases.²

What you haven’t been able to do is search for images and photographs.

Until now.

Launched as part of the 2020 Innovator in Residence Program at the Library of Congress, there’s now Newspaper Navigator, a project by Benjamin Charles Germain Lee and designed “to extract visual content from 16+ million pages of digitized historic American newspapers in Chronicling America and … re-imagine how you, members of the American public, can search the visual content using machine learning techniques. The first phase (extracting the visual content) was completed in April, 2020, and resulted in the full Newspaper Navigator dataset… The second phase of Newspaper Navigator consists of the application … All Newspaper Navigator photos in the app and dataset are in the public domain!”³

At the moment there are 1.56 million historic newspaper photos and sketches included — and it’s a lot of fun.

Of course, the first search term I entered was “court.” ⁴ And up popped the first 100 of 22,084 images between 1900 and 1963 where the term court appears somewhere in the OCR data for an image.

Now… it’s not perfect. You get things like “courtesy” and “courtyard” and “fore court” — that last one turning up in one of many OCR entries that read like hieroglyphics. It’s from the 14 October 1900 Indiana Tribune and reads in part: “In 332cTi:fiS.:Ki2-, iSS” “ST- v ”. ‘i-.. ‘1 — 5 -‘: . , ; “-N -, Lc 5 i . ‘ ; : ?”,’ 4 5″‘ . – -l . : —– – -r- :- :v:v -t-i.- r-‘v -” ‘ ‘ – r.– –‘. – i y M r” .vtit’ rr f lUJT.tVv’-5. “: -‘.fitj–“‘”.. .-As V: ‘vV’r.-. ‘i”‘”?r’.”j!” -l- ‘v-‘ -” I -A;. ;y …” Almost the only part that can be read is “NORTHAVEST STFtOM füHEi COURT : FROM PHOTOGRAPH TAKE.N “AUGUST i5-3 I&OO.”⁵

But think about it — just one word that is enough — and the Indiana Tribune was a German language newspaper, printed in the Fraktur script, chock full of umlauts and more complications. And I know that because — having clicked on that image — I was given options to download the image, view the full issue, learn about the newspaper and cite the image I found. And when I viewed the full image, this first-generation American of German heritage found it pretty easy to read the Fraktur script.

So… what kinds of goodies can you find this way? Oh my. There’s just so much. Let me give you just one example:

Julia Schreiber

This is “Miss Julia Schreiber, one of the leading models for the artists of San Francisco,” who in 1900 married a young blacksmith and, six hours later, was planning how to get a divorce. Why did she marry him at all? “Because she was dared.”⁶

Now I mean really… don’t you want that story for your files? With that image to go along with it?

And while you look at that image, look at the rest of the newspaper. “Mother” Mary Jones, a leader of the anthracite coal strikers, is on page 1 along with the Duke of Abruzzi, headed for the North Pole. A millionaire who took in a friend who then burglarized his house — and the “friend” — are on page 2. A swimmer who saved a boat adrift who was suing for salvage is on page 3. And so on.

So now not just the newspapers.

The newspapers and their images.

Free from the Library of Congress, and its Innovator in Residence program, and the new Newspaper Navigator.

Check it out.

Cite/link to this post: Judy G. Russell, “Newspapers and their images,” The Legal Genealogist (https://www.legalgenealogist.com/blog : posted 17 Sep 2020).

SOURCES

Image: “9/6/1900 The evening world,” Library of Congress, Newspaper Navigator Dataset: Extracted Visual Content from Chronicling America.

“About Chronicling America,” Library of Congress, Chronicling America (https://chroniclingamerica.loc.gov/ : accessed 17 Sep 2020). ↩
See ibid., “Advanced Search.” See also ibid., “Help: Advanced Searching in Chronicling America.” ↩
“About,” Newspaper Navigator, LC Labs, Library of Congress (https://news-navigator.labs.loc.gov : accessed 17 Sep 2020). ↩
Seriously? You wondered? ↩
Image, “10/14/1900 Indiana Tribune,” from the Library of Congress, Newspaper Navigator Dataset: Extracted Visual Content from Chronicling America (https://news-navigator.labs.loc.gov/ : accessed 17 Sep 2020.) That citation form, by the way, is mostly generated automatically by the tool itself. ↩
“On A Dare, Model Weds Blacksmith,” The (New York) Evening World, 6 September 1900, p. 4, cols. 4-5; digital images, Library of Congress, Chronicling America (https://chroniclingamerica.loc.gov/ : accessed 17 Sep 2020). ↩

7 Comments

Harold McClendon Jr on September 17, 2020 at 9:57 am

Thanks for bringing this to my attention. I look forward to trying it out. For my education, why did you include footnote number 4?
- Judy G. Russell on September 17, 2020 at 10:00 am
  
  Because I have a weird sense of humor. In case you hadn’t noticed. 🙂 🙂 🙂
  - Theresa on September 19, 2020 at 11:08 am
    
    oh my. I needed that chuckle!
Matt Cross on September 18, 2020 at 9:19 am

A genealogist who doesn’t love newspapers? Thanks for the rabbit hole, Judy.
Teresa on September 22, 2020 at 10:46 pm

So the T&C allow people to display the images on the web? This is so different from FindMyPast’s British Newspaper Archive collection…I do transcriptions to use on my blog.
- Judy G. Russell on September 22, 2020 at 11:10 pm
  
  Yes. The terms of use at the Library of Congress are terrific (see Thank you, LOC!) and the newspapers themselves are out of copyright.
  - Teresa on September 23, 2020 at 12:57 pm
    
    Judy,
    
    Thanks for getting back to me. Wonderful the LC has such generous T&C. I just wish my ancestors were in US papers…

Newspapers and their images

7 Comments

Submit a Comment Cancel reply

Categories

Archives