Vocabulary for Cancer Research
Controlled terminologies and ontologies provide the underlying foundation for data integration, sharing and re-use, and knowledge management. This common vocabulary is an essential part of making data consistent and interoperable. These efforts promote harmonization and shared standards across NCI's informatics infrastructure.
NCI manages and distributes cancer vocabulary through NCI Enterprise Vocabulary Services (EVS). The EVS team creates and maintains the vocabulary, made up of several different terminologies and ontologies that are organized and connected in different ways. This process involves working with many partners to develop, license, and publish terminology and to jointly develop software tools.
You'll find newly added terms and analysis tools which reflect the needs of the cancer community.
Submit a New Term
Your feedback on the content or structure of existing terms and contributions for new terms will help us determine what to add. NCI, EVS, and other partner terminology products may use your suggestions.
Terminologies and Ontologies
Consistent vocabulary, in the form of individual pieces of terminology or mapped concept ontologies, is the core of EVS.
NCI EVS provides services and resources, including the NCI Thesaurus and NCI Metathesaurus, that facilitate the use and standardization of terminology across the Institute and the larger biomedical community.
NCI Thesaurus
NCI’s core reference terminology and biomedical ontology are collected in the NCI Thesaurus (NCIt). NCI publishes the NCIt monthly. A growing number of NCI and other systems use NCIt. It covers more than 192,000 concepts that you can use in coding clinical care, translational and basic research, and public information and administrative activities. The NCIt provides 154,000 textual definitions; 623,000 synonyms; over 630,000 inter-concept relationships; other information on more than 40,000 cancers and related diseases; 26,000 single agents and related substances; combination therapies; a Federal Consolidated Health Informatics (CHI) standard anatomy section; and a wide range of other topics related to cancer and biomedical research. NCIt is a broadly shared coding and semantic infrastructure resource – more than two-thirds of NCIt concepts include content explicitly tagged by one or more EVS partners.
NCIt is available for download in a variety of formats. For information on third-party access, visit EVS Web Downloads.
Other terminology sets and standards also are available, with frequent additions. Four commonly used methods are below, along with information on specific licenses and terms for usage.
Download:
You can find NCIt archived material in the NCIt Archive.
NCI Metathesaurus
The NCI Metathesaurus (NCIm) is a comprehensive biomedical terminology database that provides a broad, concept-based mapping of terms from over 101 biomedical terminologies, with 7,500,000 terms mapped to 3,200,000 concepts representing their shared meanings. NCI uses many terminologies from NCIm. These terminologies pertain to clinical care, translational and basic research, and public information and administrative activities (including most public domain terminologies from the National Library of Medicine's UMLS Metathesaurus, as well as a growing number of other cancer-related and biomedical terminologies). Check NCIm for updates twice yearly.
Reference Terminologies and Ontologies
EVS currently maintains more than 25 standalone terminologies and ontologies of special interest to NCI and the research community. EVS has helped create and harmonize several of these terminologies, which originate from other agencies and standards development organizations such as the U.S. Food and Drug Administration and the Clinical Data Interchange Standards Consortium.
Terminology Value Sets
Value sets are a pre-curated, standard set of meanings you can use for biomedical coding. EVS and its partners have developed and now maintain more than 2,000 user-specific and generalized value sets.
Several of the key value sets are:
Clinical Data Interchange Standards Consortium Terminology
The Clinical Data Interchange Standards Consortium is an international non-profit organization that develops and supports global data standards for medical research.
Food and Drug Administration Terminology
EVS works with the U.S. Food and Drug Administration to develop and support controlled terminology in several areas. EVS stores within NCIt more than 27,000 FDA terms and codes and tags and publishes them as value sets.
NCIt Neoplasm Core Terminology
The NCIt Neoplasm Core value set provides a core reference set of NCIt neoplasm classification concepts that facilitate consistent coding, analysis, and data sharing across a broad range of NCI and related resources. Linked to online EVS resources, these files provide a comprehensive collection of key terms, definitions, simplified hierarchies, mappings to dozens of other terminologies, and molecular characteristics.
NCI Clinical Trials Reporting Program Terminology
EVS provides semantic support to NCI’s Clinical Trials Reporting Program to enable consistent and accurate clinical trial abstraction, coding, reporting, and portfolio management for internal NCI usage and effective clinical trial search for patients and providers. EVS develops, maintains, and publishes more than 28,000 concepts covering diseases, drugs, interventions, biomarkers, and demographics.
National Council for Prescription Drug Programs Terminology
The National Council for Prescription Drug Programs Terminology creates and promotes the transfer of data related to medications, supplies, and services through the development of standards and industry guidance. It uses NCIt in two of its standards.
National Institute of Child Health and Human Development Terminology
The National Institute of Child Health and Human Development Terminology and EVS worked with numerous contributors from national and international academic, clinical, and research institutions to provide standardized terminology for coding pediatric clinical trials and other research activities. You can view or download this terminology from NCIt as value sets.
Terminology Mappings
Terminology Mappings are curated, paired mappings between several supported terminologies to support data translation and cross-referencing. You'll find newly created mappings based on the interests and needs of the research communities.
You can search, browse, and download Terminology Mappings through the Mappings tab in the NCI Term Browser.
Accessing NCI Terminology Content
Browsers
NCI Term Browser
The NCI Term Browser publishes all terminologies that EVS hosts in an integrated environment, providing search support, cross-links, and a user-friendly interface. The NCI Term Browser provides access to:
- the International Classification of Diseases, Ninth Revision
- Clinical Modification (ICD-9-CM)
- International Classification of Diseases, Tenth Revision
- Clinical Modification (ICD-10-CM)
- the Common Terminology Criteria for Adverse Events (CTCAE)
- the Medical Dictionary for Regulatory Activities (MedDRA)
- the Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT)
- the Medication Reference Terminology (MED-RT)
- the Gene Ontology (GO)
- many other terminologies and ontologies that NCI and its partners use.
You can also find cross-terminology mappings and more than 2,000 terminology set coding standards.
Back-end Services and Open Source Software
LexEVS
LexEVS is the EVS terminology server. It comprises a collection of software and services for loading, publishing, and providing access to vocabulary and ontology resources. The Mayo Clinic developed LexEVS as an open-source tool with NCI support. Many NCI and external applications, including the Cancer Data Standards Registry and Repository, use the server's application programming interfaces (APIs).
LexEVS Downloads contains the necessary source code, web services, java client.jar files, programming interfaces, and documentation needed to use LexEVS services.
EVSRESTAPI is an API that EVS offers to a native triple-stored backend terminology server with Elasticsearch indexes. It allows searches that capture the complete logical semantics of underlying terminologies.
NCI Protégé
NCI Protégé is the primary EVS editing software application. It is based on Stanford University's open-source Protégé tool. NCI developed Protégé plug-ins to meet EVS requirements and business rules, then contributed the code back to the community to further foster Protégé adoption.
You can find a full list of tools and downloads on the EVS website.
The EVS Wiki provides a greater level of technical detail about terminology tools and resources.