The NLM Archiving and Interchange Tag Set is now being maintained as part of the NISO JATS project. Please see http://jats.nlm.nih.gov/archiving/ for the most current information.


The Journal Archiving and Interchange Tag Set defines elements and attributes that describe the content and metadata of journal articles, including research and non-research articles, letters, editorials, and book and product reviews. The Tag Set allows for descriptions of the full article content or just the article header metadata.

The intent of the Archiving Tag Set is to preserve the intellectual content of journals independent of the form in which that content was originally delivered. This Tag Set enables an archive to capture structural and semantic components of existing material without modeling any particular sequence or textual format.

It was planned that Archiving could be used for conversion from a variety of journal source Tag Sets, with the intent of providing a single format:

In order to enable description of the content used by the wide array of publishers, repositories, aggregators, etc., the Tag Set uses many loose structures, including some elements with nearly all content structures optional. Many attribute values in the Tag Set are data character values, accommodating any source values. Because some article components are prescriptive in nature, article metadata for example, the Archiving Tag Set includes a few completely generic structures for capturing semantic tagging that is not available natively in the Tag Suite. Although publication order cannot always be preserved, particularly within the metadata, the Archiving Tag Set works harder than any of the other NLM Tag Sets in this Suite to allow almost any publication arrangement and to allow retagging as renaming without rearrangement during conversion.

The Archiving Tag Set has a distinct focus on conversion from multiple sources. That focus has made this Tag Set a large and inclusive one. Many elements have been created explicitly so that information tagged by publishers would not be discarded when they converted material from another Tag Set to this one (or one created from this Suite). Care has also been taken to provide several mechanisms (frequently, information classing attributes) to preserve the intellectual content of a document structure when that structure is converted from another Tag Set or schema to this one, even when there is no exact element equivalent of the structure.

The exact replication of the look and feel of any particular journal has not been a consideration. Therefore, many purely formatting mechanisms have not been included. At the same time, Archiving is intended to preserve observed content, without resorting to stylesheets or generation of textual elements. For that reason, labels, numbers, and symbols of tables, figures, sidebars, and the like can be recorded as elements, as can the punctuation and spaces inside bibliographic references and lists.


The complete documentation for each version of this Tag Set is available in the Tag Library. The structure and suggested usage of the Tag Library is described in the How to Use (Read Me First) section. Please see the individual version's page for access to the appropriate Tag Library.

Frequently Asked Questions

A Frequently Asked Questions page is available.

Getting the Files

All of the Tag Set files are available by anonymous FTP: ftp://ftp.ncbi.nih.gov/pub/archive_dtd/archiving/

Each schema is also available through the web at stable URIs which are listed on the individual version's page.


Please see the JATS Archiving Tag Set page for the current Tag Set version information.

The previous Tag Set versions are still available:


Please submit all questions or comments to the JATS-List, hosted by Mulberry Technologies.

Related Tag Sets

The Journal Publishing Tag Set, created from the Tag Suite, is more prescriptive than the Archiving Tag Set. It is optimized for use by publishers and archives interested in regularizing their data.

The Article Authoring Tag Set, also created from the Tag Suite, is optimized for authoring original journal articles. It is the most limited Tag Set derived from the Suite that is offered by NLM.

The NCBI Book Tag Set was designed to accommodate tagging for books as part of the NCBI Bookshelf project.

Individuals wanting to submit citations and abstracts for inclusion in PubMed/MEDLINE should use the PubMed Journal Article DTD. See the Information for Publishers re: XML Tagged Data on the PubMed web site.


All of the tools described in this section are available at http://dtd.nlm.nih.gov/tools/.

XML Information

Links to general information on XML, XSLT, Unicode™, and XLink are available on the XML Resources page.

National Center for Biotechnology Information
U.S. National Library of Medicine
8600 Rockville Pike, Bethesda, MD 20894
Copyright, Disclaimer, Privacy, Accessibility

U.S. National Institutes of HealthU.S. Department of Health and Human ServicesUSA.gov

Last updated: September 17, 2012