Joshua Kitlas Metadata Element Set Version 1 (JKMES Version 1)

User guide || Element set specifications || Schema || Metadata records || Reflection and summary

User guide

The schema documents herein are in development state. They are published for demonstration and for the ongoing schema development within the Joshua Kitlas Metadata Element Set.

Introduction

The goal of this user guide is to give an overview of the function and structure of the Joshua Kitlas Metadata Element Set (JKMES) and guide users and organizations that are looking to use this schema.

The function of the JKMES is to present a schema to aid users and organizations in defining resources for manipulating data and developing information visualizations. Far beyond just websites and software, the collection of information visualization tools may include open source software packages, white papers, RSS feeds, websites, Twitter feeds, and blogs. There are no distinct areas from which to gather elements for such a collection, but rather an increasingly wide range of them.

Users of the JKMES schema can range from the casual observer to a corporation looking to catalog their assortment of data manipulation and information visualization resources.

Function

The function of the JKMES is primarily to import the Dublin Core schema (http://dublincore.org/documents/dcmi-terms/) and define JKMES-specific terms and classes. The JKMES can be viewed as an extension of the Dublin Core schema. JKMES also calls upon the Metadata Object Description Schema (http://www.loc.gov/standards/mods/mods-outline.html) for terms, but to a much lesser degree.

Structure

The structure of the JKMES contains the following 13 elements:

subject, type, description, creator, version, publisher, placeOfPublication, resource, standardizedIdentifier, date, notes, language, rights

Element Set Specifications

Term Name title
Label Title
URI http://purl.org/dc/terms/title
Definition Title of resource.
Requirement Mandatory
Comment Value not controlled and should appear in title case.
EXAMPLE: Adobe Photoshop

 

Term Name subject
Label Subject
URI http://purl.org/dc/terms/description
Definition The space the element occupies in the physical or virtual world.
Requirement Mandatory
Comment Value is controlled.
Refer to LC Authority (http://authorities.loc.gov/)
Value style should correspond with LOC authority record.

 

Term Name type
Label Type
URI http://purl.org/dc/terms/type
Definition A more specific descriptor of the type of the resource.
Requirement Mandatory, Repeatable
Comment

When available, use another layer of description to further clarify the resource, such as, ‘Informational publication’

Value is controlled.
Refer to LC Authority (http://authorities.loc.gov/)
Value style should correspond with LOC authority record.

 

Term Name description
Label Description
URI http://purl.org/dc/terms/description
Definition Brief overview or summary of the resource.
Requirement Mandatory
Comment

Value is not controlled.
Free text to be determined first by element (if it is a website use the site’s ‘About Us’, as a description) and if no element description is available then text is at discretion of value author.

Can include URL pointing to resource.

 

Term Name creator
Label Creator
URI http://purl.org/dc/terms/creator
Definition The entity or entities associated with the creation of the resource.
Requirement Mandatory, Repeatable
Comment

Value is controlled.
Refer to LC Authority (http://authorities.loc.gov/)
Value style should correspond with LOC authority record.

EXAMPLE:
Corporate name: Herbert Hoover Memorial Building (Stanford, Calif.)
Personal name: Herbert, James

 

Term Name version
Label Version
URI http://purl.org/dc/terms/hasVersion
Definition The version, edition, or iteration of the resource.
Requirement Optional
Comment

Value is not controlled.

Value will vary depending on resource being cataloged. A piece of software or a computer program might appear as ‘v1.2.1’ or ‘version 1.2.1’ depending on manufacturer. Manufacturer style should be represented. Documentation may be listed as ‘123v1.pdf’ or ‘123.v.1.pdf’. Similar to the previous example, manufacturer or creator style should be adopted.

 

Term Name publisher
Label Publisher
URI http://purl.org/dc/terms/publisher
Definition The entity responsible for making the resource available.
Requirement Optional
Comment

Value is controlled.
Refer to LC Authority (http://authorities.loc.gov/)
Value style should correspond with LOC authority record.

EXAMPLE:
Corporate name heading:
Random House Audio Publishing

 

Term Name placeOfPublication
Label Place of publication
URI http://purl.org/dc/terms/coverage
Definition Location of where the resource was published.
Requirement Optional
Comment Value is controlled.
Refer to AACR2 (10.3.3. Recording the place of publication)
Value style should correspond with AACR2 record.

 

Term Name resource
Label Resource
URI http://www.loc.gov/mods/v3/resource
Definition The nature or genre of the resource.
Requirement Mandatory
Comment

A descriptive categorization of the resource, such as, ‘Social media’

Value is controlled.
Refer to LC Authority (http://authorities.loc.gov/)
Value style should correspond with LOC authority record.

 

Term Name standardizedIdentifier
Label Standardized Identifier
URI http://purl.org/dc/terms/identifier
Definition A numeric or alphanumeric identifier
Requirement Optional, Repeatable
Comment

Value is not controlled.
The control is defined by the identifier.

Use abbreviated definition (ISBN, DOI, etc.) followed by (:) and then followed by the identifier. E.g. -
ISBN-13: 978-9197270502

 

Term Name date
Label Date
URI http://purl.org/dc/terms/date
Definition A point or time associated with the resource.
Requirement Optional
Comment Value is controlled.
Refer to ISO 8601 (http://www.iso.org/iso/date_and_time_format#how-it-works).
Highest level of specificity using the below format:
Year: YYYY (eg 1997)
Year and month:  YYYY-MM (eg 1997-07)  Complete date: YYYY-MM-DD (eg 1997-07-16)

 

Term Name notes
Label Notes
URI http://www.loc.gov/mods/v3/notes
Definition Comments, other than the description, used to identify the object.
Requirement Optional, Repeatable
Comment Value is not controlled.
The control is defined by the author.
Free text following good grammar and style.

 

Term Name language
Label Language
URI http://purl.org/dc/terms/language
Definition The language of the resource.
Requirement Optional
Comment

Value is controlled. Refer to ISO 639-3 (http://www.sil.org/iso639%2D3/codes.asp).

Example: lang=”eng”

 

Term Name rights
Label Rights
URI http://purl.org/dc/terms/rights
Definition Information about rights held in and over the resource.
Requirement Optional
Comment Value is controlled. Refer to Dublin Core (http://dublincore.org/documents/dces/).
Typically, rights information includes a statement about various property rights associated with the resource, including intellectual property rights.

Schema

The schema file can be viewed by downloading it HERE.

The file path is http://kitlas.com/assets/metadata/jkmes.xsd

Metadata records

googlerefine Google Refine

The schema file can be viewed by downloading it HERE. (Google CHROME is NOT ideal for in-browser viewing.)

The file path is http://kitlas.com/assets/metadata/records/google-refine.xml

scraperwiki ScraperWiki

The schema file can be viewed by downloading it HERE. (Google CHROME is NOT ideal for in-browser viewing.)

The file path is http://kitlas.com/assets/metadata/records/scraper-wiki.xml

stanford CS229 Machine Learning Autumn 2010

The schema file can be viewed by downloading it HERE. (Google CHROME is NOT ideal for in-browser viewing.)

The file path is http://kitlas.com/assets/metadata/records/machine-learning.xml

tableaupublicTableau Public

The schema file can be viewed by downloading it HERE. (Google CHROME is NOT ideal for in-browser viewing.)

The file path is http://kitlas.com/assets/metadata/records/tableau-public.xml

Reflection and summary

I made a number of tweaks to my schema over the past months and with more time, I would probably make even more edits. It is now clear to me why schemas (as well as software and other resources) are constantly having updates – it is a lot of work and things change quickly!

The thing that frustrated me the most was the 'rights' element. As evidenced by my records, some of the segments were extremely long. Though technically correct, the thoroughness of this section came to dominate some of the records.

Two of the most unexpected and surprising insights I gained doing this project were that:

  1. The developers of Dublin Core and MODs (as well as many other schemas I am sure) were incredibly thorough and thoughtful. Much more so than I originally thought
  2. Less is more - especially when it comes to design

In building the schema, I found that the Dublin Core and MODs developers were a lot more deliberate and detailed than I thought after taking an original casual look. I have great respect for the systematic and exhaustive nature with which they developed the schemas. After 'getting under the hood', it was clear that the Dublin Core metadata schema could be used by a variety of developers cataloging a variety of resources. The schema is practically unlimited in its use and function.

In terms of design, I had quite an 'a-ha' moment. I like design and making things look nice. One thing that always bothered me was the plain, vanilla nature of the Dublin Core and LOC Authority sites. After putting together my schema and web page, I opted to follow the same style as Dublin Core – limited images, no fancy text or CSS, and everything presented in a rather unflattering way. The thing is, this way WORKS. Pages do not hang, text does not render incorrectly, and so on. This is why the blog of computer programmers and others at the top of the technological food chain are so plain – it is the best option for getting your message across on the web with the least amount of noise.

I am not sure how much I would change things if I were to start over from scratch. Building a schema is an iterative process that requires changes and assessments at each stage. I am sure that, even with the new knowledge I have, building a different schema would have different challenges, questions, and answers.