News

Keep up with the latest news from the NCI Center for Biomedical Informatics and Information Technology (CBIIT) and the data science communities.

More than 70,000 CT scans from the National Lung Screening Trial (NLST) are now publicly available (no data access request needed). Read more to learn how to access this data through NCI resources.

In a recent podcast, NCI leaders from CBIIT and the Small Business Innovation Research Development Center shared how technological developments have enhanced cancer research and have helped usher in new diagnostics, treatments, and patient care.

A few of NCI’s Division of Cancer Biology grantees recently released publications on topics such as machine learning and artificial intelligence. These research results hold clues to how we research and develop various cancer treatments.

The new terminology for female reproductive neoplasms in NCI’s Thesaurus aligns with the latest World Health Organization standards. Other updates support CDISC’s standards and are intended to further facilitate the collection, management, and analysis of research data from human clinical trials and other studies.

Staff from CBIIT and NCI, alongside partners from NIH, FDA, and a consortium of scientists from across the world, joined forces to create reference samples and data call sets to help the cancer community further decipher cancer-related gene mutations. Their findings were recently published in Nature Biotechnology.

The NCI Cancer Research Data Commons’ Imaging Data Commons (IDC) has updated to include more features and 16 terabytes of medical imaging data files for cancer researchers and imaging informaticists.

Dr. Jill Barnholtz-Sloan, CBIIT’s associate director of Informatics and Data Science and DCEG senior investigator, together with research colleagues, used a direct data matching approach to compare brain tumors in U.S. Veteran and non-Veteran populations. The study indicates that direct and deterministic data matching approaches have the potential to compare the distribution of tumors, treatment trajectories, and clinical outcomes of other cancers and rare diseases among these populations.

The NCI Surveillance Research Program is hosting several webinars in October and November on resources for the analysis of SEER data and other cancer surveillance data. These webinars will include information on SEER*Stat, statistics, variables, methods, and software.

A new publication using NCI funding and resources shows that a machine learning model, called Panoptes, allowed cancer researchers to reliably predict subtypes of endometrial cancer. Such “computational pathology” offers a useful framework for supporting human pathologists, trimming the labor needed to interpret histological findings to under 4 minutes per slide, and eliminating the time and cost of genetic sequencing.

Interested in making data discoverable to the larger research community? Share your perspective with NIH, who wants to know how to improve data searchability and discovery. Cancer researchers, data submitters/generators, data users, and technology providers should respond by December 3, 2021.