AAS 203rd Meeting, January 2004
Session 20 Accessing Data Bases
Poster, Monday, January 5, 2004, 9:20am-6:30pm, Hanover Hall

[20.06] Searching the full text in the ADS article service

E.A. Henneken, G. Eichhorn, M.J. Kurtz, A. Accomazzi, C.S. Grant, S.S. Murray (Harvard-Smithsonian CfA.)

The NASA Astrophysics Data System (ADS) provides free worldwide access to a searchable bibliographic database containing over 3.5 million records and to a full-text archive of 2.5 million scanned pages. We have recently extracted the text from these scanned pages using Optical Character Recognition (OCR) software. Although OCR does not reproduce the text of the original papers with complete accuracy, it has allowed us to build a search interface for this body of full text. With it, ADS users can locate any page in the archive that contains a particular word or combination of words. The interface is still under development, and we encourage users to test it and send us feedback. An added benefit of the OCR process is that we can often automatically extract and process reference lists from the scanned papers, thereby adding to the ADS citation database.
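
As an illustration only, page-level word search of the kind described above can be sketched with a small inverted index that maps each word to the set of pages on which it occurs. This is not the ADS implementation: the function names, the page identifiers, and the sample text are hypothetical, and the tokenizer is a deliberately simple assumption. The second sample page contains a typical OCR misreading ("rn" for "m") to show why imperfect OCR output can cause a page to be missed.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map each word to the set of page identifiers that contain it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for word in tokenize(text):
            index[word].add(page_id)
    return index


def search(index, query):
    """Return the pages that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    # Start from the pages matching the first word, then intersect
    # with the pages matching each remaining word.
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical OCR output keyed by a made-up page identifier.
pages = {
    "paper-A/p1": "Spectra of the globular cluster were obtained",
    "paper-A/p2": "The cluster rnetallicity is low",  # OCR misread "m" as "rn"
    "paper-B/p7": "Globular cluster metallicity and age",
}
index = build_index(pages)

print(sorted(search(index, "globular cluster")))
print(sorted(search(index, "cluster metallicity")))
```

In this sketch the query "cluster metallicity" finds only the page from paper B, because the misread word on the second page of paper A never enters the index under its correct spelling.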

The ADS is funded by NASA Grant NCC5-189.

