“There’s No Easy Way Out: Making Data FAIR Requires Better Metadata. Information Technology Can Help.”
The research community now expects data sets to be findable, accessible, interoperable, and reusable (FAIR). Scientists seem to assert reflexively that their data are FAIR even more than they claim that their data are conclusive. But creating FAIR data is hard, and the process depends on authoring metadata that adheres to community standards for reporting the underlying data sets. Tools such as CEDAR can help investigators create FAIR data when the data are first archived in repositories.
Dr. Mark Musen will discuss his group's new work that addresses the challenge of “scrubbing” legacy metadata to infer what might have been meant by metadata authors who have not adhered to the standards needed to make their data FAIR.
This webinar is part of the monthly NIH Data Sharing and Reuse Seminar Series hosted by the NIH Office of Data Science and Strategy.
Dr. Mark Musen is professor of biomedical informatics and biomedical data science at Stanford University where he also serves as the director of the Stanford Center for Biomedical Informatics Research.
- HLA Class II Across The Cancer Genome Atlas Cancer DatasetOctober 27, 2021Cloud Computing Workshop: Cancer Genomics Cloud (CGC)October 28, 2021Social Determinants of Health (SDOH)November 03, 2021Cloud Computing Workshop: FireCloudNovember 04, 2021NCI Childhood Cancer Data Initiative Annual Symposium 2021November 09, 2021