Title: Statistics and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study"

Type: Dataset

Tobias Weber, Michael Fromm, Nelson Tavares de Sousa (2019): Statistics and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study". Zenodo. Dataset. https://zenodo.org/record/3841797

Authors: Tobias Weber (Leibniz Supercomputing Centre); Michael Fromm (Database Systems Group, Ludwig-Maximilians-Universität München); Nelson Tavares de Sousa (Software Engineering Group, Kiel University)

Summary

Automated classification of research data metadata by discipline(s) of research can be used in scientometric research, by repository service providers, and in research data aggregation services. Openly available metadata from the DataCite index for research data were used to compile a large training and evaluation set comprising 609,524 records. This publication contains the aggregated data for the paper, along with the evaluation data of all model and hyper-parameter training and test runs.

More information

  • DOI: 10.5281/zenodo.3841797

Subjects

  • supervised machine learning, multi-label classification, research data, text processing, data science, disciplines of research
  • url: https://dewey.info/

Dates

  • Publication date: 2019
  • Issued: October 15, 2019

Format

electronic resource

Related items

  • IsVersionOf: https://doi.org/10.5281/zenodo.3490467
  • IsPartOf: https://zenodo.org/communities/zenodo