Downloading PubMed Data

PubMed comprises more than 30 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full-text content from PubMed Central and publisher websites. PubMed XML is available from:

FTP download

Once a year, NLM releases a complete (baseline) set of PubMed citation records in XML format for download. Incremental update files are then released daily and include new, revised, and deleted citations. The PubMed DTD states any changes to the structure and allowed elements from year to year.

NCBI E-utilities API

The Entrez Programming Utilities (E-utilities) consist of eight server-side programs that provide a stable interface into the Entrez query and database system at the National Center for Biotechnology Information (NCBI).

Using this documentation

This site provides annotations and examples for all elements and attributes defined in the current PubMed DTD.

For each Element we include:

  • A description or other notes regarding the data included in the element
  • Content Model describing the expected contents -- this includes syntax and any parent or child elements
  • Valid attributes
  • Sample XML

For each Attribute we include:

  • Associated elements
  • Allowed values, where specified

The menu on the left sidebar expands to show a list of all Elements and Attributes in alphabetical order. On each page the names of Elements and Attributes are hyperlinked for easy navigation within this tool.

If you have any questions, please contact:

  • National Center for Biotechnology Information
  • info@ncbi.nlm.nih.gov