To faciliate the processing of large files, please delete any columns without data and submit the file in CSV format.
Validates a replayable spreadsheet (.xlsx or .csv) against the modsulator template and checks for other problems with the metadata.
The validator classes issues into failures, errors, warnings, and information. Failures prevent the spreadsheet from being parsed for validation; errors mean system or schema metadata requirements are not met; warnings mean metadata recommendations are not met; information means the data suggests an error may be present, or is missing from a non-essential field.
Validation criteria
Failure
File extension is not .xlsx or .csv
The file cannot be opened
Spreadsheet includes invalid characters or encoding
Header row does not begin with "druid" and "sourceId"
Error
Duplicate headers
Missing main title column header (ti1:title)
Subject value header is present without the corresponding type header
Subject type header is present without the corresponding value header
Data row without a druid
Blank row within the data
Druid does not match valid alphanumeric pattern
Line breaks or control characters in cell values
Duplicate druids
Cell value is a spreadsheet formula error (#NA, #REF!, #VALUE?, #NAME?)
Main title column (ti1:title) is blank
More than one keyDate is declared for a record
Date format does not match stated encoding
Subject value without a declared type
MODS controlled field values do not match specified vocabulary
Title type (tiX:type)
Name usage (na1:usage)
Name type (na1:type)
Type of resource (ty1:typeOfResource)
Manuscript (ty1:manuscript)
All dates: keyDate, qualifier, point, encoding
Issuance (orX:issuance)
Subject types (snX:p1:nameType, suX:pX:type)
Warning
Type of resource column header is missing (ty1:typeOfResource)
Type of resource value is missing
For Excel spreadsheets, the cell format is not text or numeric
Cell value is 0
No keyDate is declared
Date range point values missing if two dates are given
Date keyDate, qualifier, point, and/or encoding field have value but the corresponding date field does not
Subject type has value but the corresponding subject does not
Missing value in the lo:purl field
Information
Headers appearing in spreadsheet are not in modsulator XML template
For Excel spreadsheets, the cell format is numeric