Machine-readable data

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Machine-readable data is data (or metadata) which is in a format that can be understood by a computer.

There are two types; human-readable data that is marked up so that it can also be read by machines (examples; microformats, RDFa, HTML) or data file formats intended principally for processing by machines (RDF, XML, JSON).

Machine readable is not synonymous with digitally accessible. A digitally accessible document may be online, making it easier for a human to access it via a computer, but unless the relevant data is available in a machine readable format, it will be much harder to use the computer to extract, transform and process that data.[1]

For purposes of implementation of the Government Performance and Results Act (GPRA) Modernization Act, the Office of Management and Budget (OMB) defines "machine readable" as follows: "Format in a standard computer language (not English text) that can be read automatically by a web browser or computer system. (e.g.; xml). Traditional word processing documents and portable document format (PDF) files are easily read by humans but typically are difficult for machines to interpret. Other formats such as extensible markup language (XML), (JSON), or spreadsheets with header columns that can be exported as comma separated values (CSV) are machine readable formats. As HTML is a structural markup language, discreetly labeling parts of the document, computers are able to gather document components to assemble Tables of Content, outlines, literature search bibliographies, etc. It is possible to make traditional word processing documents and other formats machine readable but the documents must include enhanced structural elements."[2]


  1. ^ "A Primer on Machine Readability for Online Documents and Data". 2012-09-24. Retrieved 2015-02-27. 
  2. ^ OMB Circular A-11, Part 6, Preparation and Submission of Strategic Plans, Annual Performance Plans, and Annual Program Performance Reports

See also