<?xml version="1.0" encoding="UTF-8"?>
<?asciidoc-toc maxdepth="3"?>
<?asciidoc-numbered?>
<book xmlns="http://docbook.org/ns/docbook" xmlns:xl="http://www.w3.org/1999/xlink" version="5.0" xml:lang="en" xml:id="mzidentml:-exchange-format-for-peptides-and-proteins-identified-from-mass-spectra">
<info>
<title>mzIdentML</title>
<subtitle>exchange format for peptides and proteins identified from mass spectra</subtitle>
<date>2026-04-24</date>
</info>
<chapter xml:id="preface">
<title>Preface</title>
<simpara><emphasis>Status of This Document</emphasis></simpara>
<simpara>This document presents a final specification for the mzIdentML data format developed by the HUPO Proteomics Standards Initiative.
Distribution is unlimited.</simpara>
<simpara><emphasis>Version of This Document</emphasis></simpara>
<simpara>Date created: June 24, 2024</simpara>
<simpara>Last updated: Fri Apr 24 15:41:03 UTC 2026</simpara>
<simpara>Based on commit: <link xl:href="https://github.com/HUPO-PSI/mzIdentML/commit/e537e39e404653ca0fcbd1d7c07280123a6ce8fa">e537e39e404653ca0fcbd1d7c07280123a6ce8fa</link> - <link xl:href="https://github.com/HUPO-PSI/mzIdentML/commits/master/specification_document/specdoc1_3/asciidoc/mzidentml.adoc">Commit History</link></simpara>
<simpara>The current version of this document is: version 1.3.0 final, June 2024</simpara>
<simpara>The latest (draft) version of this document may be found at <link xl:href="https://github.com/HUPO-PSI/mzIdentML">https://github.com/HUPO-PSI/mzIdentML</link>.</simpara>
<simpara><emphasis>Type of This Document</emphasis></simpara>
<simpara>This document is a <emphasis>recommendation</emphasis> for a common, community-driven standard data exchange format in proteomics.</simpara>
<simpara><emphasis>Authors</emphasis></simpara>
<simpara>Please see <xref linkend="authors-and-contributors"/> for details on the authors and editors of this document.</simpara>
</chapter>
<chapter xml:id="abstract">
<title>Abstract</title>
<simpara>The Human Proteome Organisation (HUPO) Proteomics Standards Initiative (PSI) defines community standards for data representation in proteomics to facilitate data comparison, exchange and verification.
The Proteomics Informatics Working Group is developing standards for describing the results of identification and quantitation processes for proteins, peptides and protein modifications from mass spectrometry.
This document defines an XML schema that can be used to describe the outputs of proteomics search engines or similar software for peptide/protein identification from mass spectrometry (MS) data.</simpara>
</chapter>
<chapter xml:id="introduction">
<title>Introduction</title>
<section xml:id="background">
<title>Background</title>
<simpara>This document addresses the systematic description of (poly)peptide identification and characterisation based upon mass spectrometry (MS).
A large number of different proteomics search engines are available that produce output in a variety of different formats.
It is intended that mzIdentML will provide a common standard format for identification results to support a range of scenarios encountered in proteome informatics. mzIdentML has been developed with a view to supporting the following general tasks (more specific use cases are provided in <xref linkend="use-cases-for-mzidentml"/>.):</simpara>
<orderedlist numeration="arabic">
<listitem>
<simpara><emphasis>The discovery of relevant results,</emphasis> so that, for example, data sets in a database that use a particular technique or combination of techniques can be identified and studied by experimentalists during experiment design or data analysis.</simpara>
</listitem>
<listitem>
<simpara><emphasis>The sharing of best practice</emphasis>, so that, for example, analyses that have been particularly successful at identifying a certain group of peptides/proteins can be interpreted by consumers of the data.</simpara>
</listitem>
<listitem>
<simpara><emphasis>The evaluation of results</emphasis>, so that, for example, sufficient information is provided about how a particular analysis was performed to allow the results to be critically evaluated.</simpara>
</listitem>
<listitem>
<simpara><emphasis>The sharing of data sets,</emphasis> so that, for example, public repositories can import or export data, or multi-site projects can share results to support integrated analysis.</simpara>
</listitem>
<listitem>
<simpara><emphasis>The creation of a format for input to analysis software</emphasis>, for example, allowing software to be designed that provides a meta-score over the output from several search engines.</simpara>
</listitem>
<listitem>
<simpara><emphasis>An internal format for pipeline analysis software, for example, allowing analysis software to store intermediate results from different stages of an identification pipeline, prior to the final results being assembled in a single mzIdentML file.</emphasis></simpara>
</listitem>
</orderedlist>
<simpara>This document presents a specification, not a tutorial.
As such, the presentation of technical details is deliberately direct.
The role of the text is to describe the model and justify design decisions made.
The document does not discuss how the models should be used in practice, consider tool support for data capture or storage, or provide comprehensive examples of the models in use.
Tutorial material can be located on the PSI website (see <link xl:href="http://www.psidev.info/mzidentml">http://www.psidev.info/mzidentml</link>).</simpara>
</section>
<section xml:id="document-structure">
<title>Document Structure</title>
<simpara>The remainder of this document is structured as follows.
<xref linkend="use-cases-for-mzidentml"/> lists use cases mzIdentML is designed to support.
<xref linkend="concepts-and-terminology"/> describes the terminology used.
<xref linkend="relationship-to-other-specifications"/> describes how the specification presented here relates to other specifications, both those that it extends and those that it is intended to complement.
<xref linkend="model-in-xml-schema"/> contains the documentation for the XML schema which is generated automatically and several parts of the schema are documented in more detail in <xref linkend="specific-comments-on-schema"/>. Conclusions are presented in <xref linkend="conclusions"/>.</simpara>
</section>
</chapter>
<chapter xml:id="use-cases-for-mzidentml">
<title>Use Cases for mzIdentML</title>
<simpara>The following use cases have driven the development of the mzIdentML data model and XML schema, and are used to define the scope of the format in the current version.</simpara>
<simpara>It should be possible to create a tool that loads an mzIdentML document and enables users to examine results from an MS or MSn analysis.
As of mzIdentML version 1.3.0, there is support for aggregating evidence from multiple MS levels by using the encoding for identifications based on multiple spectra given in <xref linkend="identifications-based-on-multiple-mass-spectra"/>. There should be sufficient information for the tool to generate output reports that conform to the requirements made by journals for publication and that conform to the relevant MIAPE guidelines.
For example:<?asciidoc-br?>
· For a Peptide Mass Fingerprint (PMF) search, it should be possible to display the spectrum and show the matches of the peaks to the relevant peptides.<?asciidoc-br?>
· For an MS/MS search, it should be possible to locate which spectrum matched to which peptide in the original file.</simpara>
<orderedlist numeration="arabic">
<listitem>
<simpara>There should be sufficient information stored in the instance document to enable a user to run the same or a similar search on the same or another search engine.
This means that all search parameters should be described in sufficient detail and that sufficient information is available to determine which database (if any) the data were searched against.
The peak lists data do not need to be included in the instance document, but do need to be suitably referenced.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to save the results of searching a decoy database in the same instance document as the results from the target database.
It should then be possible to write a viewer application that enables a user to investigate the effect of changing, for example, a threshold value on the false discovery rate.
This would only be possible if results that are generally considered lower quality from the search are also saved in the mzIdentML document (rather than just top matches) and if the results from the decoy search are also saved.
It would only be possible to do this at the peptide level for an MS/MS search, because changing thresholds would normally have some effect on the protein grouping algorithm.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to save manual or automated annotation of proteins/peptides in an instance document.
A third-party tool could be used to save annotations and validations of identified proteins/peptides to an existing instance document.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to save the results from a search of a metabolically labelled sample.
For example, with a 14N/15N experiment, two separate sets of amino acid masses are used, and it must be possible to tell which masses were used for each peptide result.</simpara>
</listitem>
<listitem>
<simpara>For a search of multiple peak lists, it should be possible to identify the spectrum that matched a particular peptide or protein reported by the search engine.
For example, in an LC-MS/MS run, it should be possible to refer back to the spectrum in the peak list file that was searched and from there, if the information is available, to be able to determine the retention time of the spectrum.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to search a file to retrieve all molecules that have a specified modification.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to store the results of a search of spectra against other spectra – <emphasis>i.e.</emphasis>, a spectral library search.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to store the results of a top-down search, <emphasis>i.e.</emphasis>, analysis of complete proteins.</simpara>
</listitem>
<listitem>
<simpara>Support should be provided for storing fragmentation data so that for example viewers could display which ions in the input data match predicted ion fragment masses.</simpara>
</listitem>
<listitem>
<simpara>There should be support for storing the results of searches of peptides against nucleic acid databases, including the information about which translation frame the matches were found in.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to combine the results from multiple search engines into one mzIdentML document.
For example, the peptide spectrum matches (PSMs) from two or more different search engines could be combined using a third tool to give one set of protein results.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to store <emphasis>de novo</emphasis> peptide sequencing results, to the extent that it will be possible to enumerate and record all possible matches found by a <emphasis>de novo</emphasis> technique.
However, we anticipate that this can produce large files.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to store the results of MS/MS crosslinking approaches, whereby two peptides ed using chemical reagents or biologically occurring modifications have been identified (newly added in mzIdentML 1.2).
New use cases for crosslinking data have been added in mzIdentML 1.3. From this version, this information is available in the mzIdentML crosslinking extension document, and not in this main specification document.</simpara>
</listitem>
<listitem>
<simpara>It should be possible to store at a basic level of detail the molecular interaction data that can be inferred from crosslinking approaches (newly added in mzIdentML 1.2).</simpara>
</listitem>
<listitem>
<simpara>It should be possible to represent statistical values or scores associated with the positions of modifications on a peptide chain (newly added in mzIdentML 1.2).</simpara>
</listitem>
<listitem>
<simpara>It should be possible to represent statistical values or scores associated with peptide identifications, formed from groups of redundant peptide-spectrum matches (PSMs) reporting on the same peptide (newly added in mzIdentML 1.2).</simpara>
</listitem>
<listitem>
<simpara>It should be possible to capture the output of proteogenomics analyses such as the mapping of peptides to gene models and chromosomes (newly added in mzIdentML 1.2).</simpara>
</listitem>
</orderedlist>
<simpara>There should be limited support for:</simpara>
<orderedlist numeration="arabic">
<listitem>
<simpara>Sequence tagging, in which short sequences defined by a <emphasis>de novo</emphasis> process are used to characterize spectra.
The final results from a sequence-tag-filtered search can be stored in mzIdentML, but the details of tag generation and filtering cannot.
The following use cases <phrase role="underline">will not be</phrase> supported in version 1.2 of mzIdentML:</simpara>
</listitem>
</orderedlist>
<orderedlist numeration="arabic">
<listitem>
<simpara>It should be possible to store relative and absolute quantitation information at the peptide and protein level using all the popular techniques – this is captured in mzQuantML and also in mzTab.</simpara>
</listitem>
<listitem>
<simpara>Support for complex workflows where multiple data processing algorithms are combined in a pipeline, <emphasis>i.e.</emphasis>, only “final” results are represented in mzIdentML v1.2 in one protein list.</simpara>
</listitem>
</orderedlist>
</chapter>
<chapter xml:id="concepts-and-terminology">
<title>Concepts and Terminology</title>
<simpara>This document assumes familiarity with XML Schema notation (<link xl:href="http://www.w3.org/XML/Schema">www.w3.org/XML/Schema</link>).
The key words “MUST,” “MUST NOT,” “REQUIRED,” “SHALL,” “SHALL NOT,” “SHOULD,” “SHOULD NOT,” “RECOMMENDED,” “MAY,” and “OPTIONAL” are to be interpreted as described in RFC-2119 (<link xl:href="http://www.ietf.org/rfc/rfc2119.txt">http://www.ietf.org/rfc/rfc2119.txt</link>).</simpara>
</chapter>
<chapter xml:id="relationship-to-other-specifications">
<title>Relationship to Other Specifications</title>
<simpara>The specification described in this document is not being developed in isolation; indeed, it is designed to be complementary to, and thus used in conjunction with, several existing and emerging models.
Related specifications include the following:</simpara>
<orderedlist numeration="arabic">
<listitem>
<simpara><emphasis>MIAPE MSI</emphasis> (<link xl:href="http://www.psidev.info/groups/miape">http://www.psidev.info/groups/miape</link>) The Minimum Information About a Proteomics Experiment: Mass spectrometry Informatics document defines a checklist of information that should be reported about such a study.</simpara>
</listitem>
<listitem>
<simpara><emphasis>mzML</emphasis> (<link xl:href="http://www.psidev.info/mzml">http://www.psidev.info/mzml</link>). mzML is the PSI standard for capturing mass spectra / peak lists resulting from MS in proteomics.
It is RECOMMENDED that mzIdentML should be used in conjunction with mzML, although it is possible to use mzIdentML with other formats of mass spectra.
This document does not assume familiarity with mzML <xref linkend="martens2011"/>.</simpara>
</listitem>
<listitem>
<simpara><emphasis>mzQuantML</emphasis> (<link xl:href="http://www.psidev.info/mzquantml">http://www.psidev.info/mzquantml</link>). mzQuantML is the PSI standard for capturing quantitative proteomics data from MS <xref linkend="walzer2013"/> mzQuantML files that report quantitative data MAY reference mzIdentML files containing the detailed identification data.</simpara>
</listitem>
<listitem>
<simpara><emphasis>mzTab</emphasis> (<link xl:href="http://www.psidev.info/mztab">http://www.psidev.info/mztab</link>). mzTab is the PSI standard that can represent identification and quantification results at different levels of detail, in a tab-delimited format <xref linkend="griss20014"/>. mzTab results can represent a subset of all the information included in a mzidentML file.
However, in mzTab 1.0 while there is support for some of the new features represented in mzidentML 1.2 (e.g. ambiguity in the modification position), other features are not explicitly supported (e.g. crosslinking). mzTab files MAY reference mzIdentML files.</simpara>
</listitem>
<listitem>
<simpara><emphasis>PSI-MI XML / MITAB</emphasis> (<link xl:href="http://www.psidev.info/groups/molecular-interactions">http://www.psidev.info/groups/molecular-interactions</link>).
The PSI has developed specifications for molecular interaction evidence in XML format (PSI-MI) and tab-separated (MITAB).
When MS crosslinking data is stored in mzIdentML, including evidence for protein interactions, the file can act in a complementary manner (e.g. providing source data) to a PSI-MI <xref linkend="kerrien2007"/> or MITAB file.</simpara>
</listitem>
</orderedlist>
<section xml:id="the-psi-mass-spectrometry-controlled-vocabulary-cv">
<title>The PSI Mass Spectrometry Controlled Vocabulary (CV)</title>
<simpara>The PSI-MS controlled vocabulary (CV) <xref linkend="mayer2013"/> is intended to provide terms for annotation of mzML, mzIdentML, mzQuantML and other PSI standard file formats.
The CV has been generated by collection of terms from software vendors and academic groups working in the area of MS and proteome informatics.
Some terms describe attributes that must be coupled with a numerical value attribute in the &lt;cvParam&gt; element (e.g. MS:1001191 “p-value”) and optionally a unit for that value (e.g. MS:1001117, “theoretical mass”, units = dalton).
The terms that require a value are denoted by having a “datatype” key-value pair in the CV itself: MS:1001172 "mascot:expectation value" value-type:xsd:double.
Terms that need to be qualified with units are denoted by having a “has_units” key in the CV itself (relationship: has_units: UO:0000221 ! dalton).
The details of which terms are allowed or required in a given schema section is reported in the mapping file (<xref linkend="validation-of-controlled-vocabulary-terms"/>).</simpara>
<simpara>As recommended by the PSI CV guidelines, psi-ms.obo should be dynamically maintained <emphasis>via</emphasis> the <link xl:href="mailto:psidev-ms-vocab@lists.sourceforge.net">psidev-ms-vocab@lists.sourceforge.net</link> mailing list that allows any user to request new terms, in agreement with the community involved.
Once a consensus is reached among the community the new terms are added within a few business days.</simpara>
<simpara>In general, modifications SHOULD be sourced from Unimod (<link xl:href="http://www.unimod.org/obo/unimod.obo">http://www.unimod.org/obo/unimod.obo</link>) where possible.
For encoding crosslinking results, the XLMOD-CV SHOULD be used (<link xl:href="https://raw.githubusercontent.com/HUPO-PSI/mzIdentML/master/cv/XLMOD.obo">https://raw.githubusercontent.com/HUPO-PSI/mzIdentML/master/cv/XLMOD.obo</link>), unless suitable terms can be obtained from Unimod at a later date.</simpara>
<simpara>The following ontologies or controlled vocabularies specified below may also be suitable or required in certain instances:</simpara>
<itemizedlist>
<listitem>
<simpara>Unit Ontology (<link xl:href="http://bioportal.bioontology.org/ontologies/UO?p=classes&amp;conceptid=root">http://bioportal.bioontology.org/ontologies/UO?p=classes&amp;conceptid=root</link>).</simpara>
</listitem>
<listitem>
<simpara>ChEBI (<link xl:href="http://www.ebi.ac.uk/chebi/">http://www.ebi.ac.uk/chebi/</link>).</simpara>
</listitem>
<listitem>
<simpara>For describing sample types, any suitable and stable ontologies MAY be used.</simpara>
</listitem>
</itemizedlist>
<simpara>The PSI Protein modifications CV (<link xl:href="http://psidev.cvs.sourceforge.net/viewvc/psidev/psi/mod/data/PSI-MOD.obo">http://psidev.cvs.sourceforge.net/viewvc/psidev/psi/mod/data/PSI-MOD.obo</link>) is now DEPRECATED for use in mzIdentML.
It is RECOMMENDED to use Unimod wherever possible.</simpara>
</section>
<section xml:id="validation-of-controlled-vocabulary-terms">
<title>Validation of controlled vocabulary terms</title>
<simpara>The correct usage of controlled vocabulary terms within mzIdentML is governed by the use of a mapping file that defines each XML location (XPath) where a &lt;cvParam&gt; instance can be used, and the allowed terms from the PSI-MS, or other CVs.
The mapping file is read and interpreted by validation software, checking that the data annotation is consistent.
The mapping file needs to be checked and updated when the structure of the CV is changed, and in some instances when new terms are added to the CV.
XML paths are associated with CV terms along with a requirement level (MAY, SHOULD or MUST) defining what should be reported by validation software if one of the mapped terms is not provided in an instance document.
Syntactic and semantic validation SHOULD be checked using the official Java-based validator available from <link xl:href="http://www.psidev.info/mzidentml">http://www.psidev.info/mzidentml</link>.</simpara>
</section>
<section xml:id="changes-from-version-1.1.0-to-version-1.2.0">
<title>Changes from version 1.1.0 to version 1.2.0</title>
<simpara>The primary update requiring the change from version 1.1.0 to version 1.2.0 is in the inclusion of guidelines for encoding protein group results (<xref linkend="protein-grouping-encoding"/>).
Several examples referenced throughout the document are annotated with version 1.1.0. In these cases, it can be assumed that these files are also valid 1.2.0 files, since they do not include protein inference results.
Other minor changes have been made to the specification since version 1.1.0, with regards to the encoding of specific workflows – notably searches where pre-fractionation has been performed (<xref linkend="pre-fractionation-of-samples-prior-to-ms-and-splitting-of-searches"/>), searches employing multiple search engines (<xref linkend="multiple-database-search-engines"/>), <emphasis>de novo</emphasis> sequencing (<xref linkend="support-for-de-novo-sequencing-approaches"/>) and spectral library searches (<xref linkend="spectral-library-searches"/>).</simpara>
<simpara>Several new CV terms are now mandatory within the &lt;SpectrumIdentificationProtocol&gt; element - enabling the new features to be differentiated and recognised automatically by processing software, as follows. <emphasis role="strong">1…<emphasis>n</emphasis> of the following terms MUST be present:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="3">
<colspec colname="col_1" colwidth="33.3333*"/>
<colspec colname="col_2" colwidth="33.3333*"/>
<colspec colname="col_3" colwidth="33.3334*"/>
<tbody>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">CV term name</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara><emphasis role="strong">Accession</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara><emphasis role="strong">Reference to section within this document</emphasis></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">peptide-level scoring</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002490</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="peptide-level-scores-and-statistical-measures"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">modification localization scoring</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002491</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="encoding-modification-localisation-scores"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">consensus scoring</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002492</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="multiple-database-search-engines"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">sample pre-fractionation</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002493</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="pre-fractionation-of-samples-prior-to-ms-and-splitting-of-searches"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">crosslinking search</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002494</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="encoding-results-of-crosslinking-searches"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">de novo search</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1001010</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="support-for-de-novo-sequencing-approaches"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">spectral library search</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1001031</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="spectral-library-searches"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">proteogenomics search</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002635</simpara></entry>
<entry align="left" valign="top"><simpara><xref linkend="encoding-proteogenomics-annotation-data"/></simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">no special processing</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>MS:1002495</simpara></entry>
<entry align="left" valign="top"><simpara>Used to indicate that none of the above (new) features have been included in the file.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara>Table 1 New CV terms now mandatory (1&#8230;&#8203;<emphasis>n</emphasis> terms MUST be present) within the &lt;SpectrumIdentificationProtocol&gt; element in mzIdentML 1.2. Terms “de novo search” and “spectral library search” MUST appear under the &lt;SearchType&gt; element.
All other terms MUST appear under the &lt;AdditionalSearchParams&gt; element.</simpara>
</section>
<section xml:id="changes-from-version-1.2.0-to-1.3.0">
<title>Changes from version 1.2.0 to 1.3.0</title>
<simpara>The core of this specification document remains unchanged from 1.2.0, but version 1.3.0 now supports extensions for additional features or use cases, described in additional extension documents located in the same folder.
At the time of writing, an extension for crosslinking data exists and an extension for glycopeptides is in progress, but others may take place in the future.</simpara>
<simpara>It is important to highlight that extension documents cannot introduce any changes to the mzIdentML schema.
As such they are primarily about introducing and describing the use of CV terms.</simpara>
<simpara>Implementers only need to incorporate the extensions if supporting the specific extra features or use cases described there.
A document signals which extensions it requires by including &lt;cvParam&gt; elements that are children of the term MS:1003373, "mzIdentML extension version", immediately after the &lt;cvList&gt; element inside the &lt;MzIdentML&gt; element.</simpara>
<simpara>This is the only change to the XML schema definition for mzIdentML 1.3.0: &lt;cvParam&gt; elements can be included immediately after the &lt;cvList&gt; element inside the top level &lt;MzIdetnML&gt; element, to permit declaring the version of extension documents.</simpara>
<simpara>A new section has also been added to explain how to encode identifications coming from multiple spectra (<xref linkend="identifications-based-on-multiple-mass-spectra"/>).
The “combined spectra” type of input file format from version 1.2.0 has been retired and is not part of the 1.3.0 specification.</simpara>
<simpara>Additionally, two new CV terms have been introduced to provide an optional mechanism for linking the &lt;Modification&gt; elements inside &lt;Peptide&gt; elements to &lt;SearchModification&gt; elements (<xref linkend="linking-searchmodification-elements-to-modification-elements"/>).
These are:</simpara>
<itemizedlist>
<listitem>
<simpara>“search modification id” (MS:1003392),</simpara>
</listitem>
<listitem>
<simpara>“search modification id ref” (MS:1003393).</simpara>
</listitem>
</itemizedlist>
<simpara>Furthermore, different typos and small details have been refined throughout the text.
As an example, the hyphenated term “cross-linking” (used in version 1.2.0) has now been de-hyphenated throughout this 1.3.0 updated specification document.</simpara>
<simpara>Finally, in the <xref linkend="support-for-de-novo-sequencing-approaches"/> (“Support for de novo sequencing approaches”), a clarification has been added providing more details about how to represent <emphasis>de novo</emphasis> searches, including the use of the CV param “<emphasis>de novo</emphasis> search or no database used” (MS:1000394).</simpara>
</section>
</chapter>
<chapter xml:id="format-scope-and-specific-use-cases">
<title>Format scope and specific use cases</title>
<section xml:id="">
<title></title>
<section xml:id="handling-updates-to-the-controlled-vocabulary">
<title>Handling updates to the controlled vocabulary</title>
<simpara>In brief, when a new term is required, the file producers must contact the CV working group <emphasis>via</emphasis> e-mail (<link xl:href="http://psidev-ms-vocab@lists.sourceforge.net">psidev-ms-vocab@lists.sourceforge.net</link>) and request the new term.
It is anticipated that problems may arise if a consumer of the file encounters a new CV term and they are not working from the latest version of the CV file.
It has been decided that rather than aim for a workaround to this issue, it can be expected that data file consumers must ensure that the OBO file is up-to-date.</simpara>
</section>
<section xml:id="identifying-the-input-spectrum-for-each-result">
<title>Identifying the input spectrum for each result</title>
<simpara>A &lt;SpectrumIdentificationResult&gt; is linked to the source spectrum (in an external file) from which the identifications are made by way of a reference in the spectrumID attribute and <emphasis>via</emphasis> the &lt;SpectraData&gt; element which stores the URL of the file in the location attribute.
It is advantageous if there is a consistent system for identifying spectra in different file formats.
The following table is implemented in the PSI-MS CV for providing consistent identifiers for different spectrum file formats.
A CV term MUST be imported into the &lt;SpectraData&gt; element to demonstrate which system for identifying input spectra is being used in the spectrumID attribute of &lt;SpectrumIdentificationResult&gt;.</simpara>
<simpara><emphasis>It is encouraged but not mandatory that a valid mzIdentML file is accompanied by the set of spectra that were searched.</emphasis> It is acknowledged that in many cases it will be useful to have an mzIdentML file and the input spectra together, there are practical problems processing such data depending on the spectrum file format (e.g. in case of proprietary formats), and cases where an mzIdentML is useful even if the spectra data is not available.</simpara>
<simpara><emphasis role="strong">Update from version 1.2.0:</emphasis></simpara>
<simpara>Version 1.1.0 of the specification document states “the CV holds the definite specification for legal encodings of spectrumID values”.
In version 1.2, the only legal ways of referencing a spectrum identification format are provided below in Table 1. Any new spectral formats that cannot fit into this schema require an update to this document.</simpara>
<simpara><emphasis role="strong">Update included in version 1.3.0:</emphasis></simpara>
<simpara><emphasis>All CV terms containing combined spectra input types have been deprecated.</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="25*"/>
<colspec colname="col_3" colwidth="25*"/>
<colspec colname="col_4" colwidth="25*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">ID</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Term</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Comment</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>MS:1000768</simpara></entry>
<entry align="left" valign="top"><simpara>Thermo nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>controllerType=xsd:nonNegativeInteger controllerNumber=xsd:positiveInteger scan=xsd:positiveInteger</simpara></entry>
<entry align="left" valign="top"><simpara>controller=0 is usually the mass spectrometer. Space-separated values.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000769</simpara></entry>
<entry align="left" valign="top"><simpara>Waters nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>function=xsd:positiveInteger process=xsd:nonNegativeInteger scan=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"><simpara>Space-separated values.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000770</simpara></entry>
<entry align="left" valign="top"><simpara>WIFF nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>sample=xsd:nonNegativeInteger period=xsd:nonNegativeInteger cycle=xsd:nonNegativeInteger experiment=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"><simpara>Space-separated values.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000771</simpara></entry>
<entry align="left" valign="top"><simpara>Bruker/Agilent YEP nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>scan=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000772</simpara></entry>
<entry align="left" valign="top"><simpara>Bruker BAF nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>scan=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000773</simpara></entry>
<entry align="left" valign="top"><simpara>Bruker FID nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>file=xsd:IDREF</simpara></entry>
<entry align="left" valign="top"><simpara>The nativeID must be the same as the source file ID.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000774</simpara></entry>
<entry align="left" valign="top"><simpara>multiple peak list nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>index=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"><simpara>Used for referencing peak list files with multiple spectra, i.e. MGF, PKL, merged DTA files. Index is the spectrum number in the file, starting from 0.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000775</simpara></entry>
<entry align="left" valign="top"><simpara>single peak list nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>file=xsd:IDREF</simpara></entry>
<entry align="left" valign="top"><simpara>The nativeID must be the same as the source file ID. Used for referencing peak list files with one spectrum per file, typically in a folder of PKL or DTAs, where each sourceFileRef is different.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000776</simpara></entry>
<entry align="left" valign="top"><simpara>scan number only nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>scan=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"><simpara>Used for referencing mzXML, or a DTA folder where native scan numbers can be derived.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1000777</simpara></entry>
<entry align="left" valign="top"><simpara>spectrum identifier nativeID format</simpara></entry>
<entry align="left" valign="top"><simpara>spectrum=xsd:nonNegativeInteger</simpara></entry>
<entry align="left" valign="top"><simpara>Used for referencing mzData. The spectrum ID attribute is referenced.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>MS:1001530</simpara></entry>
<entry align="left" valign="top"><simpara>mzML unique identifier</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>Used for referencing mzML. The value of the spectrum ID attribute is referenced directly.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara>Table 2. CV terms and rules implemented in the PSI-MS CV for formulating the “nativeID” to identify spectra in different file formats.</simpara>
<simpara>In mzIdentML, the spectrum ID attribute should be constructed following the data type specification in Table 2. As an example, to reference the third spectrum (index=2) in an MGF (Mascot Generic Format) file:</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationResult id="Res1" spectrumID="index=2" SpectraData_ref="InputSpectra1"&gt;
    ...
    &lt;SpectraData location="local/mgf/merge.mgf" id="SD_1"&gt;
        &lt;FileFormat&gt;
            &lt;cvParam accession="MS:1001062" name="Mascot MGF file" cvRef="PSI-MS"/&gt;
        &lt;/FileFormat&gt;
        &lt;SpectrumIDFormat&gt;
            &lt;cvParam accession="MS:1000774" name="multiple peak list nativeID format" cvRef="PSI-MS"/&gt;
        &lt;/SpectrumIDFormat&gt;
    &lt;/SpectraData&gt;
&lt;/SpectrumIdentificationResult&gt;</programlisting>
</section>
</section>
<section xml:id="comments-on-specific-use-cases">
<title>Comments on Specific use cases</title>
<simpara>Many special use cases for mzIdentML were considered during its development.
Each of these use cases has a corresponding example file that exercises the relevant part of the schema and provides a reference implementation example (see supporting documentation).
Authors of software that create mzIdentML are encouraged to examine the examples that accompany this format release before implementing the writer.
Further, such authors are encouraged to use the validator before releasing any new writer code and working with the PSI PI Working Group to resolve any issues.
In the subsections below, we comment on a few of the notable use cases that were considered – in particular those newly added in mzIdentML 1.2.</simpara>
<section xml:id="protein-grouping-encoding">
<title>Protein grouping encoding</title>
<simpara>This section is newly inserted in the mzIdentML version 1.2.0 specifications.
In version 1.1.0, CV terms had been proposed for representing set relationships between different proteins within groups, but there was not a requirement that particular terms were used.
A given data structure from software could be mapped onto the hierarchy &lt;ProteinAmbiguityGroup&gt; and &lt;ProteinDetectionHypothesis&gt; in mzIdentML in different ways, leading to difficulties for data consumers.
As such, a working group has now agreed a more rigid encoding detailed as follows and in <xref linkend="seymour2014"/>.</simpara>
<orderedlist numeration="arabic">
<listitem>
<simpara>As in mzIdentML version 1.1, a single protein accession that has been cited by software is captured in mzIdentML in &lt;ProteinDetectionHypothesis&gt; (PDH).</simpara>
<orderedlist numeration="loweralpha">
<listitem>
<simpara>A PDH MAY contain scores or statistical values produced by the export software, encoded as CV terms.</simpara>
</listitem>
</orderedlist>
</listitem>
<listitem>
<simpara>A “protein group”, representing a “biological entity” for which the software claims independent evidence is present, MUST be mapped onto &lt;ProteinAmbiguityGroup&gt; (PAG).</simpara>
<orderedlist numeration="loweralpha">
<listitem>
<simpara>A PAG MAY have additional scores produced by the export software, encoded as CV terms.</simpara>
</listitem>
</orderedlist>
</listitem>
<listitem>
<simpara>The reporting of protein identification thresholds is now mapped onto PAGs.
There is no desire to change the core XML Schema Document (XSD) for mzIdentML and as such, a new CV term “protein group passes threshold” value= “xsd:boolean” MUST be present on every PAG (MS:1002415).
If no thresholding has been done by the software, all protein groups MUST be annotated as “protein group passes threshold” value= “true”.</simpara>
<orderedlist numeration="loweralpha">
<listitem>
<simpara>The attribute <emphasis>passThreshold</emphasis> = “true|false” remains present on PDH and MAY be used if software packages wish to report a two-level hierarchy of thresholds applied.
However, it is not expected that consuming software will use this attribute to determine which proteins have been reported as identified.</simpara>
</listitem>
</orderedlist>
</listitem>
<listitem>
<simpara>The &lt;ProteinDetectionList&gt; MUST contain the CV term “count of identified proteins” value= “xsd:integer” (MS:1002404).
The value MUST be derived from the count of PAGs passing the threshold reported in the file and will be checked by validation software.</simpara>
</listitem>
<listitem>
<simpara>Few software packages report “protein clusters” at present, but for those packages that wish to report clusters, a CV term “cluster identifier” value = “xsd:integer” SHOULD be used (MS:1002407).
The integer identifier MUST be shared by all PAGs belonging to the same cluster.
An optional term “count of identified clusters” value = “xsd:integer” (MS:1002406) MAY be annotated on the &lt;ProteinDetectionList&gt;.</simpara>
</listitem>
<listitem>
<simpara>Every PDH MUST be annotated as either a “leading protein” (MS:1002401) or a “non-leading protein” (MS:1002402), as defined in Table 2, within a PAG.
This recommendation thus makes it explicit for consuming software whether one or more proteins have stronger evidence than others in the group (see Table 2 for examples).</simpara>
<orderedlist numeration="loweralpha">
<listitem>
<simpara>An additional term, “group representative” (MS:1002403) MAY be used to annotate one PDH, which is also flagged as a “leading protein”, if the export software wishes to enforce that only one of potential several “leading proteins” will be interpreted by the consuming software as the representative of the group, for example acting as a tiebreaker.</simpara>
</listitem>
<listitem>
<simpara>If the export software does not explicitly flag one protein as the “group representative”, it is assumed that if consuming software requires a single accession to represent the group, an arbitrary choice will be made (among “leading proteins” only if these exist).</simpara>
</listitem>
</orderedlist>
</listitem>
<listitem>
<simpara>Any PDHs MAY be annotated with terms present in the CV for spectrum/sequence same-set, spectrum/sequence subset, spectrum/sequence subsumable, marginally distinguished and so on (Table 2).</simpara>
<orderedlist numeration="loweralpha">
<listitem>
<simpara>A PDH MAY be annotated with more than one of these terms if appropriate to describe the complex set relationships that exist within a group.</simpara>
</listitem>
<listitem>
<simpara>Developers of software packages MAY propose additional terms for describing group membership of PDHs, which will be incorporated into the CV.</simpara>
</listitem>
<listitem>
<simpara>The associated value for these CV terms MAY be used to annotate which PDH(s) are the super/same-set of the annotated PDH.</simpara>
</listitem>
<listitem>
<simpara>There is no expectation that consuming software should be aware of these terms, but they may be useful in internal pipeline or visualization software packages that are specifically designed to work with this terminology set.</simpara>
</listitem>
</orderedlist>
</listitem>
<listitem>
<simpara>Some PDHs could be mapped to more than one PAG, for example where proteins are multiply subsumed.
To capture these cases, multiple PDHs in different PAGs MAY reference the same &lt;DBSequence&gt;.</simpara>
</listitem>
</orderedlist>
<simpara>The semantic validation software has been updated to encode these rules and report errors (“MUST” rule), warnings (“SHOULD” rule) or informational messages (“MAY” rule).</simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="5">
<colspec colname="col_1" colwidth="14.2857*"/>
<colspec colname="col_2" colwidth="14.2857*"/>
<colspec colname="col_3" colwidth="14.2857*"/>
<colspec colname="col_4" colwidth="14.2857*"/>
<colspec colname="col_5" colwidth="42.8572*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">mzIdentML context</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">CV term</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Values</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Require-ment level</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Description</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-List</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>count of identified proteins</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:integer</simpara></entry>
<entry align="left" valign="top"><simpara>MUST</simpara></entry>
<entry align="left" valign="top"><simpara>The value reported MUST equal the number of PAGs with “protein group passes threshold” value = “true”</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-List</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>count of identified clusters</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:integer</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>If protein clusters have been reported in the file, the exporter may choose to annotate the ProteinDetectionList with the number identified above threshold.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinAmbiguity-Group</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>number of distinct protein sequences</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:integer</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>The number of distinct protein sequences among the PDHs in the group. For example, if there are two PDHs with different identifiers that have identical full length sequences, the value would be 1.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinAmbiguity-Group</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>cluster identifier</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:integer</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier applied to protein groups to indicate that they are linked by shared peptides.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>leading protein</simpara>
<simpara>OR</simpara>
<simpara>non-leading protein</simpara></entry>
<entry align="left" valign="top"><simpara>-</simpara></entry>
<entry align="left" valign="top"><simpara>MUST</simpara>
<simpara>OR</simpara>
<simpara>MUST</simpara></entry>
<entry align="left" valign="top"><simpara>Every PDH in each PAG MUST be flagged as a leading protein or a non-leading protein and each PAG MUST contain at least one leading protein, but MAY contain more than one. A “leading protein” is defined as a protein that has the strongest or near strongest (further explained in Table 3) set of evidence for being present in the sample studied, amongst the grouped protein accessions. A “non-leading protein” is defined as a protein that has (substantially) less evidence than other proteins within the same group, and is thus less likely to have been present in the sample studied.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>group representative</simpara></entry>
<entry align="left" valign="top"><simpara>-</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>Each PAG MAY contain zero or one PDH flagged as the group representative, if the software wishes to flag a preference (often arbitrary or for example based on alphabetical ordering) amongst the leading proteins. The group representative term can thus be viewed a “tiebreaker” if the export software wishes to make this distinction.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinAmbiguity-Group</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>protein group passes threshold</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:Boolean</simpara></entry>
<entry align="left" valign="top"><simpara>MUST</simpara></entry>
<entry align="left" valign="top"><simpara>Each PAG MUST be annotated with a Boolean CV term indicating whether the PAG has passed the threshold reported in the ProteinDetectionProtocol.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>sequence same-set protein</simpara></entry>
<entry align="left" valign="top"><simpara>xsd: “list_of_ strings” space separated list of PDH IDs that are same-set.</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>A protein that is indistinguishable or equivalent to another protein in the group, having matches to an identical set of peptide sequences.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>spectrum same-set protein</simpara></entry>
<entry align="left" valign="top"><simpara>xsd: “list_of_ strings” space separated list of PDH IDs that are same-set.</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>A protein that is indistinguishable or equivalent to another protein in the group, having PSMs derived from the same set of spectra.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>sequence sub-set protein</simpara></entry>
<entry align="left" valign="top"><simpara>xsd: “list_of_ strings” space separated list of PDH IDs that are super-set.</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>A protein for which the matched peptide sequences are a subset of the matched peptide sequences for another protein in the group.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>spectrum sub-set protein</simpara></entry>
<entry align="left" valign="top"><simpara>xsd: “list_of_ strings” space separated list of PDH IDs that are super-set.</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>A protein for which the matched spectra are a subset of the matched spectra for another protein in the group.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>sequence multiply subsumable protein</simpara></entry>
<entry align="left" valign="top"><simpara>xsd: “list_of_ strings” space separated list of PDH IDs that subsume this PDH.</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>A protein for which the matched peptide sequences are the same, or a subset of, the matched peptide sequences for two or more other proteins combined. These other proteins need not all be in the same group.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>spectrum multiply subsumable protein</simpara></entry>
<entry align="left" valign="top"><simpara>xsd: “list_of_ strings” space separated list of PDH IDs that subsume this PDH.</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>A protein for which the matched spectra are the same, or a subset of, the matched spectra for two or more other proteins combined. These other proteins need not all be in the same group.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">ProteinDetection-Hypothesis</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>marginally distinguished protein</simpara></entry>
<entry align="left" valign="top"><simpara>-</simpara></entry>
<entry align="left" valign="top"><simpara>MAY</simpara></entry>
<entry align="left" valign="top"><simpara>Assigned to a non-leading PDH that has some independent evidence to support its presence relative to the leading protein(s) e.g. the PDH may have a unique peptide but not sufficient to be promoted as, for example, a leading protein of another a PAG.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara>Table 3. New CV terms for reporting protein set (group) relationships and global statistics about the protein identification results.
The semantic validation software for mzIdentML version 1.2.0 reports an error (MUST), a warning (SHOULD) or an informational message (MAY) if these terms are not reported within the file.</simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="3">
<colspec colname="col_1" colwidth="33.3333*"/>
<colspec colname="col_2" colwidth="33.3333*"/>
<colspec colname="col_3" colwidth="33.3334*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Scenario</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Software preference</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Encoding</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Software scores A and B as same-set, C and D as subset.</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software wishes to make A the group representative (arbitrary)</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein &amp; group representative</simpara>
<simpara>B = leading protein</simpara>
<simpara>C = non-leading protein</simpara>
<simpara>D = non-leading protein</simpara>
<simpara>(Use of formal same-set and subset notation is also allowed but optional)</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">As above</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software does not wish to choose which is the group representative</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein</simpara>
<simpara>B = leading protein</simpara>
<simpara>C = non-leading protein</simpara>
<simpara>D = non-leading protein</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Software scores A as best protein, B, C and D are all subset or subsumed</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>N/A</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein</simpara>
<simpara>B = non-leading protein</simpara>
<simpara>C = non-leading protein</simpara>
<simpara>D = non-leading protein</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Software scores all four proteins as same-set or more generally as having equal evidence</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software wishes to make A the group representative (arbitrary)</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein &amp; group representative</simpara>
<simpara>B = leading protein</simpara>
<simpara>C = leading protein</simpara>
<simpara>D = leading protein</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">As above</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software does not wish to choose which is the group representative</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein</simpara>
<simpara>B = leading protein</simpara>
<simpara>C = leading protein</simpara>
<simpara>D = leading protein</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Software scores A as having slightly more evidence than B. B has additional weak independent evidence relative to A. C and D have less evidence than either A or B.</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software wishes to assign A as the leading protein and the independent evidence for B is not sufficient for it to form a new PAG.</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein</simpara>
<simpara>B = non-leading protein &amp; marginally distinguished (optional)</simpara>
<simpara>C = non-leading protein</simpara>
<simpara>D = non-leading protein</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">As above</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software does not wish to choose which is the leading protein out of A and B or group representative</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein</simpara>
<simpara>B = leading protein</simpara>
<simpara>C = non-leading protein</simpara>
<simpara>D = non-leading protein</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">As above</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>Software does not wish to choose which is the leading protein but does select a group representative</simpara></entry>
<entry align="left" valign="top"><simpara>A = leading protein &amp; group representative</simpara>
<simpara>B = leading protein</simpara>
<simpara>C = non-leading protein</simpara>
<simpara>D = non-leading protein</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Table 4.</emphasis> A summary of grouping options and recommendation for CV term annotations, assuming a group of four related proteins A-D.</simpara>
</section>
<section xml:id="support-for-de-novo-sequencing-approaches">
<title>Support for <emphasis>de novo</emphasis> sequencing approaches</title>
<simpara>In mzIdentML version 1.1, &lt;SpectrumIdentificationItem&gt; had a mandatory sub-element &lt;PeptideEvidenceRef&gt; to link each PSM to all the proteins from which it could have originated.
The inclusion of these mandatory requirements makes it difficult to represent results from <emphasis>de novo</emphasis> sequencing and spectral library search approaches where PSMs may not necessarily have originated from a protein database search.
As such, in mzIdentML 1.2.0 &lt;PeptideEvidenceRef&gt; has a cardinality of 0&#8230;&#8203;many.
In all cases of sequence database search, export software MUST include all &lt;PeptideEvidenceRef&gt; elements for every PSM.
In version mzIdentML 1.3.0, further clarifications are added: &lt;SearchDatabase&gt; will include the CV param MS:1000394 (“de novo search or no database used”) and “No database” will be included as an userParam in its subelement &lt;DatabaseName&gt;.</simpara>
<simpara><emphasis>De novo</emphasis> sequencing approaches are therefore supported, but only in a relatively straightforward manner, where complete peptide sequences are identified.
Proposals for representing partial peptide sequences or sequence tags are encouraged for future iterations of the standard.</simpara>
</section>
<section xml:id="spectral-library-searches">
<title>Spectral library searches</title>
<simpara>An alternative to sequence database searches for identifying peptides from MS data is to search a pre-compiled library (spectral library) of PSMs.
These spectral library searches are supported in mzIdentML.
The recommended encoding is similar to sequence database search results.
The main difference is that a &lt;Peptide&gt; entity SHOULD record each library entry that has been matched against.
Additional scores or metadata about the library entity SHOULD be included as &lt;cvParam&gt; elements on &lt;Peptide&gt;.
For searches against spectral archives i.e. where the identity of the library entry is unknown (there is no a peptide sequence assignment to the spectrum in the library), the encoding SHOULD include an empty string in &lt;PeptideSequence&gt;.</simpara>
<simpara>Note – there has been no formal change to the schema or CV requirements from mzIdentML 1.1 to 1.2.0 around spectral library encoding, but the intended encoding has changed.
The mzIdentML 1.1. specifications stated that spectral library entries should be encoded within &lt;DBSequence&gt;, which does not well model the data produced.</simpara>
</section>
<section xml:id="multiple-database-search-engines">
<title>Multiple database search engines</title>
<simpara>Proteomics research groups now commonly analyze MS data using multiple search engines and combine results to improve the number of peptide and protein identifications that can be made.
The output of such approaches can be represented in mzIdentML as follows (see <xref linkend="model-in-xml-schema"/> for documentation of the model elements).
Note that the RECOMMENDED encoding has changed since the version 1.1.0 specification as a result of community feedback.
It has been decided that throughout mzIdentML, the spectrum referenced from a &lt;SpectrumIdentificationResult&gt; MUST be unique within a file <emphasis>i.e.</emphasis> only one set of ranked results can be provided per spectrum.
This has implications for encoding the results of multiple search engines, as only consensus results (after they have been combined) can be represented in a valid mzIdentML 1.2.0 file.
If exporters wish to maintain the original search engine results, these MAY be encoded using &lt;cvParam&gt; elements within &lt;SpectrumIdentificationItem&gt; containing additional scores, statistics and indicating the original rankings from the source search engine.</simpara>
<simpara>The &lt;SpectrumIdentification&gt; element MUST reference a &lt;SpectrumIdentificationProtocol&gt; holding representative parameters used across all search engines (i.e. search tolerances, enzyme and modifications), since these are MANDATORY elements.
If the same search parameters were not employed in all source searches, the parameters should be set with superset or widest values i.e. all modifications that have been searched, widest tolerances and so on.
All search engines that have been employed SHOULD be represented within the &lt;AnalysisSoftwareList&gt;.
The &lt;AnalysisSoftwareList&gt; SHOULD also record the software used to combine results.
It must also be highlighted that mzIdentML cannot be used to model the order in which the software was used (it does not support workflows).</simpara>
<simpara>The same encoding MAY also be used to describe other approaches where different search protocols are applied to the same spectra (for example using different parameter sets with the same search engine), and subsequently combined.
In this case, it is RECOMMENDED that only a single (assumed best) score of any given type is represented once per &lt;SpectrumIdentificationItem&gt;.</simpara>
</section>
<section xml:id="pre-fractionation-of-samples-prior-to-ms-and-splitting-of-searches">
<title>Pre-fractionation of samples prior to MS and splitting of searches</title>
<simpara>It is common in many workflows for pre-fractionation of a sample to be performed prior to MS, for example <emphasis>via</emphasis> 1D or 2D gel electrophoresis or 2D LC.
In some scenarios results of database searches are combined prior to protein inference and in other instances there is no combination of results prior to protein inference.
We have identified the following scenarios and describe the RECOMMENDED encoding in each case in Table 5 below.</simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="2">
<colspec colname="col_1" colwidth="50*"/>
<colspec colname="col_2" colwidth="50*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Scenario</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Encoding</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Scenario 1. i) A sample is fractionated into <emphasis>n</emphasis> sub-samples, prior to <emphasis>n</emphasis> runs on the MS; ii) the search engine performs <emphasis>n</emphasis> searches, producing <emphasis>n</emphasis> protein-lists.</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara><emphasis>n</emphasis> mzIdentML files SHOULD be produced, each containing 1 &lt;SpectrumIdentificationList&gt;, 1 &lt;SpectrumIdentificationProtocol&gt;, 1&lt;SpectrumIdentification&gt;, 1 &lt;ProteinDetection&gt;, 1 &lt;ProteinDetectionList&gt;.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Scenario 2. i) A sample is fractionated into <emphasis>n</emphasis> sub-samples, prior to <emphasis>n</emphasis> runs on the MS; ii) the search engine imports <emphasis>n</emphasis> peak lists and performs <emphasis>n</emphasis> searches but internally integrates results to produce one protein list.</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>One single mzIdentML file SHOULD contain <emphasis>n</emphasis> &lt;SpectrumIdentificationList&gt;s, <emphasis>n</emphasis> &lt;SpectrumIdentificationProtocol&gt;s, <emphasis>n</emphasis> &lt;SpectrumIdentification&gt;s, 1 &lt;ProteinDetection&gt;, 1 &lt;ProteinDetectionList&gt;.</simpara>
<simpara>The &lt;SpectrumIdentificationProtocol&gt;s MUST indicate that pre-fractionation has taken place, using the CV term indicated in Table 1.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Scenario 3. i) A sample is fractionated into <emphasis>n</emphasis> sub-samples, prior to <emphasis>n</emphasis> runs on the MS; ii) the search engine performs <emphasis>n</emphasis> searches, producing <emphasis>n</emphasis> lists of spectrum identifications; iii) post-processing software integrates results to produce one protein list.</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>As Scenario 2.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><emphasis role="strong">Scenario 4. i) There is no sample pre-fractionation and one run on the MS. ii) The spectra are split into <emphasis>n</emphasis> peak list files for searching (for example for parallelisation on a cluster), producing <emphasis>n</emphasis> lists of PSMs iii) post-processing software re-combines results into one mzIdentML file producing 1 protein list.</emphasis></simpara></entry>
<entry align="left" valign="top"><simpara>One single mzIdentML file SHOULD contain 1 &lt;SpectrumIdentificationList&gt;, 1 &lt;SpectrumIdentificationProtocol&gt;, 1 &lt;SpectrumIdentification&gt; referencing <emphasis>n</emphasis> &lt;InputSpectra&gt; sub-elements, 1 &lt;ProteinDetection&gt;, 1 &lt;ProteinDetectionList&gt;.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara>Table 5. A description of RECOMMENDED encodings in mzIdentML, where sample pre-fractionation has taken place.</simpara>
</section>
<section xml:id="encoding-replicate-samples">
<title>Encoding replicate samples</title>
<simpara>One mzIdentML file is intended to capture the analysis of one sample, including rules for pre-fractionation as discussed in <xref linkend="pre-fractionation-of-samples-prior-to-ms-and-splitting-of-searches"/>. For encoding replicate samples (biological or technical), separate mzIdentML files SHOULD be used.
A naming convention using suffixes MAY be adopted but the specifications of such are beyond the scope of this document.</simpara>
</section>
<section xml:id="peptide-level-scores-and-statistical-measures">
<title>Peptide-level scores and statistical measures</title>
<simpara>The format was designed with explicit support for encoding scores or statistical measures for PSMs, for individual proteins and for protein groups.
However, the original design contained no explicit (schema level) support for peptide-level scores i.e. after redundant PSMs reporting on the same peptide have been removed.
One of the challenges in this space is defining the mechanism of grouping multiple PSMs for the same <emphasis>distinct peptide –</emphasis> since in different contexts a distinct peptide could encompass one of the following concepts:</simpara>
<itemizedlist>
<listitem>
<simpara>A peptide sequence with a given set of modifications – located in specified positions, identified from a single charge state.</simpara>
</listitem>
<listitem>
<simpara>A peptide sequence with a given set of modifications – located in specified positions, identified from different charge state ions.</simpara>
</listitem>
<listitem>
<simpara>A peptide sequence with a given set of modifications – regardless of the positions of modifications.</simpara>
</listitem>
<listitem>
<simpara>A peptide sequence regardless of the presence/absence of different modifications.</simpara>
</listitem>
</itemizedlist>
<simpara>A mechanism for encoding these different types of distinct peptide grouping in the mzIdentML 1.2.0 specifications has been defined, using CV terms as described in Figure 1. Three CV terms have currently been added to the PSI-MS CV: “group PSMs by sequence” (MS:1002496), “group PSMs by sequence with modifications” (MS:1002497) and “group PSMs by sequence with modifications and charge” (MS:1002498).</simpara>
<simpara>The following additional features have also been added to mzIdentML 1.2.0 to support peptide-level scores (Figure 1).
First, an additional CV term “peptide-level scoring” (<emphasis role="strong">MS:1002490) MUST be included (when this process is being reported)</emphasis> in &lt;SpectrumIdentificationProtocol&gt;, as shown in Figure 1A and Table 1. In addition, the &lt;SpectrumIdentificationProtocol&gt; contains a &lt;Threshold&gt; element, used in previous versions, for representing the threshold applied at the PSM level.
In mzIdentML 1.2, the element can now be used to demonstrate the threshold applied at the PSM and/or peptide-level, through the use of appropriate CV terms.</simpara>
<simpara>Additionally, a mechanism is needed for capturing how different PSMs are grouped into a single entity.
This is achieved by adding a CV term to every PSM in the file “peptide group ID” (MS:1002520), whereby the associated value is a unique identifier shared between all PSMs in the same peptide group.
In the example in Figure 1D, the unique identifier used is the peptide sequence itself (since when grouping by sequence irrespective of modification status this value must be unique), although this could be an arbitrary value such as an integer code.</simpara>
<simpara>The mzIdentML file must be able to record scores or statistical values at the peptide level.
This is achieved <emphasis>via</emphasis> adding CV terms with identical values to all PSMs within the same peptide-group, with an indication that it is a peptide-level value, <emphasis>via</emphasis> the convention of the prefix “peptide:” in the CV term name (Figure 1E).
Finally, a mechanism has been added for recording peptides both above and below the threshold, to allow complete statistical re-evaluation by downstream software.
PSM-level threshold is covered <emphasis>via</emphasis> the <emphasis>passThreshold</emphasis> attribute on the &lt;SpectrumIdentificationItem&gt; element.
To enable additional thresholding at the peptide-level, a new CV term is required for all PSMs (“peptide passes threshold”, MS:1002500) as shown in Figure 1F.</simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image1.png" contentwidth="100%" align="centre"/>
</imageobject>
<textobject><phrase>image1</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara>Figure 1. The mechanism for encoding peptide-level statistics in mzIdentML 1.2.0.</simpara>
</section>
<section xml:id="encoding-modification-localisation-scores">
<title>Encoding modification localisation scores</title>
<simpara>A new addition to mzIdentML 1.2.0 is the ability to attach scores or statistical values to the position of a modification, with regards to the peptide sequence.
A variety of software packages now export such values, since it is common for there to be more than one possible site of modification.
Evidence from the presence or absence of fragment ions can enable a calculation of the likelihood of different possibilities.
Such evidence trail is particularly important for some downstream uses of the data, such as profiling motifs for positions of modifications or populating databases with “experimentally observed” modification sites.</simpara>
<simpara>The encoding of such scores is achieved in mzIdentML 1.2.0 by making use of a regular expression attached within a &lt;cvParam&gt; at the level of &lt;SpectrumIdentificationItem&gt;.
The following additional features to be present in mzIdentML 1.2.0 (Figure 2).</simpara>
<simpara>To ensure that downstream software is aware that a file contains modification scores, a CV term is added to the &lt;SpectrumIdenticationProtocol&gt; - “Modification localization scoring” (MS:1002491), as shown in Figure 2A and Table 1. Some approaches apply a statistical threshold for accepting or rejecting that a modification position has been confidently identified.
The (re-usable) &lt;Peptide&gt; element has an attribute <emphasis>via</emphasis> which the residue and location of a modification can be recorded.
To remain backwards compatible, we recommend that the software implementing mod scoring (and export) in mzIdentML should continue to use these attributes, populating with the most likely modification position (Figure 2C).
A new CV term (REQUIRED when MS:1002491 is present in the protocol) must be added to every &lt;Modification&gt; element – called “modification index” (MS:1002504), where the value serves as a unique identifier (local only to the containing &lt;Peptide&gt;) to be referenced from the PSM (Figure 2D).</simpara>
<simpara>The modification scores themselves are added as CV terms to the &lt;SpectrumIdentificationItem&gt; element referencing the peptide (e.g. “phosphoRS score”, Figure 2E), with a value provided as a regular expression of four values in a defined order - MOD_INDEX, SCORE, POSITION, PASS_THRESHOLD.
MOD_INDEX is a reference to the* “modification index” identifier provided in the referenced &lt;Peptide&gt; - &lt;Modification&gt; element.
SCORE represents the score or statistical value (double data type) for the given position.
POSITION is the scored modification position with respect to the peptide sequence (where position = 0 is the N-terminus, and the peptide length + 1 is used to indicate the C-terminus).
The POSITION can include the bar symbol ‘|’, as a logical OR, if the score relates to multiple positions that can be distinguished.
PASS_THRESHOLD holds a Boolean (true, false) value to indicate whether the modification position passes the threshold described above.*</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1002380" cvRef="PSI-MS" value="1:0.03:2|3:true" name="modification rescored by false localization rate"/&gt;
 +
&lt;cvParam accession="MS:1002380" cvRef="PSI-MS" value="1:0.97:8|9:false" name="modification rescored by false localization rate"/&gt;</programlisting>
<simpara>The mechanism described MAY be used in conjunction with peptide-level scoring, using specific CV terms for peptide-level modification re-scoring.</simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image2.png" contentwidth="100%" align="centre"/>
</imageobject>
<textobject><phrase>image2</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara>Figure 2. The specification in mzIdentML 1.2.0 for encoding modification localization scores, using CV terms.</simpara>
</section>
<section xml:id="encoding-results-of-crosslinking-searches">
<title>Encoding results of crosslinking searches</title>
<simpara>See the crosslinking extension document in the same folder as this document for all the details.</simpara>
</section>
<section xml:id="encoding-proteogenomics-annotation-data">
<title>Encoding proteogenomics annotation data</title>
<simpara>It is now common to use tandem MS data to improve current gene model annotations, in so-called proteogenomics approaches, for example based on making peptide identifications against the official gene models or against alternative databases generated by gene finders, mapping mRNA transcripts or six frame genome translations.
Where identifications do not match the official genes, they give evidence in support of updates to the gene models.
One of the key concepts required is the mapping of peptides back to chromosomes, including, for example, where they map across splice junctions.
File format specifications are under development that can be used directly for genome visualisation, such as adaptations of the BED and BAM <xref linkend="li2009"/> formats commonly used in genomics.
To ensure a consistent export is possible from mzIdentML to formats designed for genome visualisation or annotation, in mzIdentML 1.2, a consistent encoding of the chromosomal mappings has been developed, as exemplified in Figure 3.</simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image3.png" contentwidth="100%" align="centre"/>
</imageobject>
<textobject><phrase>image3</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara>Figure 3. The encoding for chromosomal coordinates in mzIdentML in support of proteogenomics approaches.</simpara>
</section>
</section>
<section xml:id="other-supporting-materials">
<title>Other supporting materials</title>
<simpara>Example files demonstrating the different uses cases have been developed and are available from the following location: <link xl:href="https://github.com/HUPO-PSI/mzIdentML/tree/master/examples">https://github.com/HUPO-PSI/mzIdentML/tree/master/examples</link>.
The sub-folder names indicate the features of the format being used in each example.</simpara>
</section>
</chapter>
<chapter xml:id="model-in-xml-schema">
<title>Model in XML Schema</title>
<simpara>An overview of the schema is presented in Figure 4. The following documentation is automatically generated from the XML Schema.</simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image4.png" contentwidth="664" contentdepth="373"/>
</imageobject>
<textobject><phrase>image</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara>Figure 4. A diagrammatic overview of the mzIdentML schema (generated by <link xl:href="https://github.com/dgis/xsddiagram">xsddiagram</link>).</simpara>
<section xml:id="element-mzidentml" xreflabel="MzIdentML">
<title>Element &lt;MzIdentML&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The upper-most hierarchy level of mzIdentML with sub-containers for example describing software, protocols and search results (spectrum identifications or protein detection results).</simpara>
<simpara><emphasis role="strong">Type:</emphasis> MzIdentMLType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>creationDate</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:dateTime</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The date on which the file was produced.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>version</simpara></entry>
<entry align="left" valign="top"><simpara>versionRegex</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The version of the schema this instance document refers to, in the format x.y.z.
Changes to z should not affect prevent instance documents from validating.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvlist"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The list of controlled vocabularies used in the file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.
Here, it is used to declare which extension documents are used.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysissoftwarelist"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The software packages used to perform the analyses.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-provider"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The Provider of the mzIdentML record in terms of the contact and software.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-auditcollection"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The complete set of Contacts (people and organisations) for this file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysissamplecollection"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The samples analysed can optionally be recorded using CV terms for descriptions.
If a composite sample has been analysed, the subsample association can be used to build a hierarchical description.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-sequencecollection"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The collection of sequences (DBSequence or Peptide) identified and their relationship between each other (PeptideEvidence) to be referenced elsewhere in the results.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysiscollection"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The analyses performed to get the results, which map the input and output data sets.
Analyses are for example: SpectrumIdentification (resulting in peptides) or ProteinDetection (assemble proteins from peptides).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysisprotocolcollection"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The collection of protocols which include the parameters and settings of the performed analyses.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-datacollection"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The collection of input and output data sets of the analyses.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-bibliographicreference"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Any bibliographic references associated with the file</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image5.png" contentwidth="282" contentdepth="496"/>
</imageobject>
<textobject><phrase>image</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;MzIdentML xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="" xsi:schemaLocation="http://psidev.info/psi/pi/mzIdentML/1.2 mzIdentML1.2.0.xsd" creationDate="2014-06-03T14:52:58.665376Z" version="1.2.0"&gt;
    &lt;cvList&gt;
        &lt;cv fullName="Proteomics Standards Initiative Mass Spectrometry Vocabularies" version="2.32.0" uri="https://raw.githubusercontent.com/HUPO-PSI/psi-ms-CV/master/psi-ms.obo" id="PSI-MS"/&gt;
        &lt;cv fullName="UNIMOD" uri="http://www.unimod.org/obo/unimod.obo" id="UNIMOD"/&gt;
        &lt;cv fullName="UNIT-ONTOLOGY" uri="http://obo.cvs.sourceforge.net/*checkout*/obo/obo/ontology/phenotype/unit.obo" id="UO"/&gt;
    &lt;/cvList&gt;
    &lt;AnalysisSoftwareList/&gt;
    ...
&lt;/MzIdentML&gt;</programlisting>
</section>
<section xml:id="element-additionalsearchparams" xreflabel="AdditionalSearchParams">
<title>Element &lt;AdditionalSearchParams&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The search parameters other than the modifications searched.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AdditionalSearchParams&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002492" name="consensus scoring"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002490" name="peptide-level scoring"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002497" name="group PSMs by sequence with modifications"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002491" name="modification localization scoring"/&gt;
...
&lt;/AdditionalSearchParams&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/AdditionalSearchParams<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">MS:1001302</link> (search engine specific input parameter) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001005">MS:1001005</link> (SEQUEST:CleavesAt)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001007">MS:1001007</link> (SEQUEST:OutputLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001009">MS:1001009</link> (SEQUEST:DescriptionLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001026">MS:1001026</link> (SEQUEST:NormalizeXCorrValues)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001028">MS:1001028</link> (SEQUEST:SequenceHeaderFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001032">MS:1001032</link> (SEQUEST:SequencePartialFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001037">MS:1001037</link> (SEQUEST:ShowFragmentIons)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001038">MS:1001038</link> (SEQUEST:Consensus)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001042">MS:1001042</link> (SEQUEST:LimitTo)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001046">MS:1001046</link> (SEQUEST:sort by dCn)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001066">MS:1001066</link> (ions series considered in search) one or more times<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001210">MS:1001210</link> (mass type settings) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001211">MS:1001211</link> (parent mass type mono)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001212">MS:1001212</link> (parent mass type average)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001255">MS:1001255</link> (fragment mass type average)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001256">MS:1001256</link> (fragment mass type mono)<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of MS:1002489 (special processing) one or more times</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002490" name="peptide-level scoring"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002496" name="group PSMs by sequence"/&gt;
&lt;cvParam accession="MS:1001118" cvRef="PSI-MS" name="param: b ion"/&gt;
&lt;cvParam accession="MS:1001149" cvRef="PSI-MS" name="param: b ion-NH3 DEPRECATED"/&gt;
&lt;cvParam accession="MS:1001150" cvRef="PSI-MS" name="param: b ion-H2O DEPRECATED"/&gt;
&lt;cvParam accession="MS:1001262" cvRef="PSI-MS" name="param: y ion"/&gt;
&lt;cvParam accession="MS:1001151" cvRef="PSI-MS" name="param: y ion-NH3 DEPRECATED"/&gt;
&lt;cvParam accession="MS:1001152" cvRef="PSI-MS" name="param: y ion-H2O DEPRECATED"/&gt;
&lt;cvParam accession="MS:1002494" cvRef="PSI-MS" name="crosslinking search"/&gt;
&lt;cvParam name="no special processing" cvRef="PSI-MS" accession="MS:1002495"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002492" name="consensus scoring"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002497" name="group PSMs by sequence with modifications"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002491" name="modification localization scoring"/&gt;</programlisting>
<simpara><emphasis role="strong">Example userParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;userParam value="true" name="TargetDecoyApproach"/&gt;
&lt;userParam value="-1" name="MinIsotopeError"/&gt;
&lt;userParam value="2" name="MaxIsotopeError"/&gt;
&lt;userParam value="HCD" name="FragmentMethod"/&gt;
&lt;userParam value="QExactive" name="Instrument"/&gt;
&lt;userParam value="iTRAQ" name="Protocol"/&gt;
&lt;userParam value="2" name="NumTolerableTermini"/&gt;
&lt;userParam value="1" name="NumMatchesPerSpec"/&gt;
&lt;userParam value="2" name="MaxNumModifications"/&gt;
&lt;userParam value="6" name="MinPepLength"/&gt;
&lt;userParam value="40" name="MaxPepLength"/&gt;
&lt;userParam value="2" name="MinCharge"/&gt;
&lt;userParam value="3" name="MaxCharge"/&gt;
&lt;userParam value="2a_uniprot" name="Mascot User Comment"/&gt;
&lt;userParam value="ESI-QUAD" name="Mascot Instrument Name"/&gt;
&lt;userParam name="input_consensusXML" unitName="xsd:string" value="leitner004.consensusXML"/&gt;
&lt;userParam name="input_decoys" unitName="xsd:string" value=""/&gt;
&lt;userParam name="decoy_prefix" unitName="xsd:integer" value="1"/&gt;
&lt;userParam name="decoy_string" unitName="xsd:string" value="decoy"/&gt;
&lt;userParam name="precursor:min_charge" unitName="xsd:integer" value="3"/&gt;
&lt;userParam name="precursor:max_charge" unitName="xsd:integer" value="7"/&gt;
&lt;userParam name="fragment:mass_tolerance_xlinks" unitName="xsd:double" value="0.3"/&gt;
&lt;userParam name="peptide:min_size" unitName="xsd:integer" value="5"/&gt;
&lt;userParam name="cross_link:residue1" unitName="xsd:string" value="[K]"/&gt;
&lt;userParam name="cross_link:residue2" unitName="xsd:string" value="[K]"/&gt;
&lt;userParam name="cross_link:mass" unitName="xsd:double" value="138.0680796"/&gt;
&lt;userParam name="cross_link:mass_isoshift" unitName="xsd:double" value="12.075321"/&gt;
&lt;userParam name="cross_link:mass_monolink" unitName="xsd:string" value="[156.07864431, 155.094628715]"/&gt;
&lt;userParam name="modifications:variable_max_per_peptide" unitName="xsd:integer" value="2"/&gt;
&lt;userParam name="algorithm:candidate_search" unitName="xsd:string" value="enumeration"/&gt;
&lt;userParam name="charges" unitName="xsd:string" value="2,3,4,5,6"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for peptide-level statistics:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002490" name="peptide-level scoring"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002496" name="group PSMs by sequence"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for sample pre-fractionation:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002493" name="sample pre-fractionation" value="MudPIT"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for proteogenomics:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1002635" name="proteogenomics search" value=""/&gt;</programlisting>
<simpara><emphasis role="strong">Example for crosslinking:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001211" name="parent mass type mono"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002494" name="crosslinking search"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001256" name="fragment mass type mono"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for modification position scoring:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1002491" name="modification localization scoring"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for de novo sequencing:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001010" name="de novo search"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for consensus scoring:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1002492" name="consensus scoring"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002392" name="PIA:PSM sets created" value="true"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002391" name="PIA:Combined FDRScore calculated" value="true"/&gt;</programlisting>
</section>
<section xml:id="element-affiliation" xreflabel="Affiliation">
<title>Element &lt;Affiliation&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The organization a person belongs to.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AffiliationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>organization_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the organization this contact belongs to.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Affiliation organization_ref="ORG_DOC_OWNER"/&gt;</programlisting>
</section>
<section xml:id="element-ambiguousresidue" xreflabel="AmbiguousResidue">
<title>Element &lt;AmbiguousResidue&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Ambiguous residues e.g. X can be specified by the Code attribute and a set of parameters for example giving the different masses that will be used in the search.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AmbiguousResidueType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>code</simpara></entry>
<entry align="left" valign="top"><simpara>chars</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The single letter code of the ambiguous residue e.g. X.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AmbiguousResidue code="X"&gt;
    &lt;cvParam accession="MS:1001360" cvRef="PSI-MS" value="A C D E F G H I K L M N O P Q R S T U V W Y" name="alternate single letter codes"/&gt;
&lt;/AmbiguousResidue&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/MassTable/AmbiguousResidue<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001359">MS:1001359</link> (ambiguous residues) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001360">MS:1001360</link> (alternate single letter codes)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001361">MS:1001361</link> (alternate mass)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
</section>
<section xml:id="element-analysiscollection" xreflabel="AnalysisCollection">
<title>Element &lt;AnalysisCollection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The analyses performed to get the results, which map the input and output data sets. Analyses are for example: SpectrumIdentification (resulting in peptides) or ProteinDetection (assemble proteins from peptides).</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AnalysisCollectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectrumidentification"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>An Analysis which tries to identify peptides in input spectra, referencing the database searched, the input spectra, the output results and the protocol that is run.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-proteindetection"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>An Analysis which assembles a set of peptides (e.g. from a spectra search analysis) to proteins.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisCollection &gt;
&lt;SpectrumIdentification spectrumIdentificationProtocol_ref="SearchProtocol_1_4299" spectrumIdentificationList_ref="SII_LIST_1_1_4299_120114_20_Orbi2_ZC_QC_220_HSAd0-d4-1to1-3_Din.raw" id="SpecIdent__4299_120114_20_Orbi2_ZC_QC_220_HSAd0-d4-1to1-3_Din.raw"&gt;
&lt;InputSpectra spectraData_ref="SD_4299_120114_20_Orbi2_ZC_QC_220_HSAd0-d4-1to1-3_Din.raw"&gt;&lt;/InputSpectra&gt;
&lt;SearchDatabaseRef searchDatabase_ref="SDB_4299_203"&gt;&lt;/SearchDatabaseRef&gt;
&lt;/SpectrumIdentification&gt;
&lt;SpectrumIdentification spectrumIdentificationProtocol_ref="SearchProtocol_1_4299" spectrumIdentificationList_ref="SII_LIST_1_1_4299_120114_09_Orbi2_ZC_QC_220_HSAd0-d4-1to4-2_Din.raw" id="SpecIdent__4299_120114_09_Orbi2_ZC_QC_220_HSAd0-d4-1to4-2_Din.raw"&gt;
&lt;InputSpectra spectraData_ref="SD_4299_120114_09_Orbi2_ZC_QC_220_HSAd0-d4-1to4-2_Din.raw"&gt;&lt;/InputSpectra&gt;
...
&lt;/AnalysisCollection&gt;</programlisting>
</section>
<section xml:id="element-analysisdata" xreflabel="AnalysisData">
<title>Element &lt;AnalysisData&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Data sets generated by the analyses, including peptide and protein lists.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AnalysisDataType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectrumidentificationlist"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Represents the set of all search results from SpectrumIdentification.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-proteindetectionlist"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The protein list resulting from a protein detection process.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisData&gt;
    &lt;SpectrumIdentificationList xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="SII_LIST_1"&gt;
        &lt;SpectrumIdentificationResult spectrumID="index=6451" spectraData_ref="SD_COMBINED_SE_0" id="SIR_8947"&gt;
            &lt;SpectrumIdentificationItem chargeState="2" experimentalMassToCharge="679.817322" calculatedMassToCharge="679.818488" peptide_ref="AVMDDFAAFVEK_##Oxidation(M):3" rank="1" passThreshold="false" id="SIR_8947_SII_1"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000401820_p:known_378_389"/&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000421027_p:putative_420_431"/&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000483421_p:known_357_368"/&gt;
                ...
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
    &lt;/SpectrumIdentificationList&gt;
&lt;/AnalysisData&gt;</programlisting>
</section>
<section xml:id="element-analysisparams" xreflabel="AnalysisParams">
<title>Element &lt;AnalysisParams&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The parameters and settings for the protein detection given as CV terms.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisParams&gt;
    &lt;cvParam name="mascot:SigThreshold" value="0.05" cvRef="PSI-MS" accession="MS:1001316"/&gt;
    &lt;cvParam name="mascot:MaxProteinHits" value="Auto" cvRef="PSI-MS" accession="MS:1001317"/&gt;
    &lt;cvParam name="mascot:ProteinScoringMethod" value="MudPIT" cvRef="PSI-MS" accession="MS:1001318"/&gt;
    &lt;cvParam name="mascot:MinMSMSThreshold" value="0" cvRef="PSI-MS" accession="MS:1001319"/&gt;
    &lt;cvParam name="mascot:ShowHomologousProteinsWithSamePeptides" value="1" cvRef="PSI-MS" accession="MS:1001320"/&gt;
    &lt;cvParam name="mascot:ShowHomologousProteinsWithSubsetOfPeptides" value="10" cvRef="PSI-MS" accession="MS:1001321"/&gt;
    ...
&lt;/AnalysisParams&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/ProteinDetectionProtocol/AnalysisParams<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">MS:1001302</link> (search engine specific input parameter) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001005">MS:1001005</link> (SEQUEST:CleavesAt)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001007">MS:1001007</link> (SEQUEST:OutputLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001009">MS:1001009</link> (SEQUEST:DescriptionLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001026">MS:1001026</link> (SEQUEST:NormalizeXCorrValues)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001028">MS:1001028</link> (SEQUEST:SequenceHeaderFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001032">MS:1001032</link> (SEQUEST:SequencePartialFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001037">MS:1001037</link> (SEQUEST:ShowFragmentIons)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001038">MS:1001038</link> (SEQUEST:Consensus)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001042">MS:1001042</link> (SEQUEST:LimitTo)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001046">MS:1001046</link> (SEQUEST:sort by dCn)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001194">MS:1001194</link> (quality estimation with decoy database) one or more times</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam name="mascot:SigThreshold" value="0.05" cvRef="PSI-MS" accession="MS:1001316"/&gt;
&lt;cvParam name="mascot:MaxProteinHits" value="Auto" cvRef="PSI-MS" accession="MS:1001317"/&gt;
&lt;cvParam name="mascot:ProteinScoringMethod" value="MudPIT" cvRef="PSI-MS" accession="MS:1001318"/&gt;
&lt;cvParam name="mascot:MinMSMSThreshold" value="0" cvRef="PSI-MS" accession="MS:1001319"/&gt;
&lt;cvParam name="mascot:ShowHomologousProteinsWithSamePeptides" value="1" cvRef="PSI-MS" accession="MS:1001320"/&gt;
&lt;cvParam name="mascot:ShowHomologousProteinsWithSubsetOfPeptides" value="10" cvRef="PSI-MS" accession="MS:1001321"/&gt;
&lt;cvParam name="mascot:RequireBoldRed" value="0" cvRef="PSI-MS" accession="MS:1001322"/&gt;
&lt;cvParam name="mascot:UseUnigeneClustering" value="false" cvRef="PSI-MS" accession="MS:1001323"/&gt;
&lt;cvParam name="mascot:IncludeErrorTolerantMatches" value="1" cvRef="PSI-MS" accession="MS:1001324"/&gt;
&lt;cvParam name="mascot:ShowDecoyMatches" value="0" cvRef="PSI-MS" accession="MS:1001325"/&gt;</programlisting>
</section>
<section xml:id="element-analysisprotocolcollection" xreflabel="AnalysisProtocolCollection">
<title>Element &lt;AnalysisProtocolCollection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The collection of protocols which include the parameters and settings of the performed analyses.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AnalysisProtocolCollectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectrumidentificationprotocol"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The parameters and settings of a SpectrumIdentification analysis.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-proteindetectionprotocol"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The parameters and settings of a ProteinDetection process.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisProtocolCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
    &lt;SpectrumIdentificationProtocol analysisSoftware_ref="AS_mascot_server" id="SIP"&gt;
        &lt;SearchType&gt;
            &lt;cvParam accession="MS:1001083" cvRef="PSI-MS" value="" name="ms-ms search"/&gt;
        &lt;/SearchType&gt;
        &lt;AdditionalSearchParams&gt;
            &lt;cvParam accession="MS:1001211" cvRef="PSI-MS" name="parent mass type mono"/&gt;
            ...
        &lt;/AdditionalSearchParams&gt;
    &lt;/SpectrumIdentificationProtocol&gt;
&lt;/AnalysisProtocolCollection&gt;</programlisting>
</section>
<section xml:id="element-analysissamplecollection" xreflabel="AnalysisSampleCollection">
<title>Element &lt;AnalysisSampleCollection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The samples analysed can optionally be recorded using CV terms for descriptions. If a composite sample has been analysed, the subsample association can be used to build a hierarchical description.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AnalysisSampleCollectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-sample"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A description of the sample analysed by mass spectrometry using CVParams or UserParams.
If a composite sample has been analysed, a parent sample should be defined, which references subsamples.
This represents any kind of substance used in an experimental workflow, such as whole organisms, cells, DNA, solutions, compounds and experimental substances (gels, arrays etc.).</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-analysissoftware" xreflabel="AnalysisSoftware">
<title>Element &lt;AnalysisSoftware&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The software used for performing the analyses.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AnalysisSoftwareType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>uri</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:anyURI</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>URI of the analysis software e.g. manufacturer&#8217;s website</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>version</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The version of Software used.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-contactrole"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The Contact that provided the document instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-softwarename"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the analysis software package, sourced from a CV if available.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-customizations"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>Any customizations to the software, such as alternative scoring mechanisms implemented, should be documented here as free text.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisSoftware uri="http://code.google.com/p/ehu-bio/downloads/list" version="1.1-beta4" name="PAnalyzer (v1.1-beta4)" id="PAnalyzer"&gt;
    &lt;ContactRole contact_ref="PAnalyzer_Author"&gt;
        &lt;Role&gt;
            &lt;cvParam accession="MS:1001271" cvRef="PSI-MS" name="researcher"/&gt;
        &lt;/Role&gt;
    &lt;/ContactRole&gt;
    &lt;SoftwareName&gt;...&lt;/SoftwareName&gt;
&lt;/AnalysisSoftware&gt;</programlisting>
</section>
<section xml:id="element-analysissoftwarelist" xreflabel="AnalysisSoftwareList">
<title>Element &lt;AnalysisSoftwareList&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The software packages used to perform the analyses.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AnalysisSoftwareListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysissoftware"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The software used for performing the analyses.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisSoftwareList xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
    &lt;AnalysisSoftware version="Beta (v9979)" name="MS-GF+" id="ID_software"&gt;
        &lt;SoftwareName&gt;
            &lt;cvParam accession="MS:1002048" cvRef="PSI-MS" name="MS-GF+"/&gt;
        &lt;/SoftwareName&gt;
    &lt;/AnalysisSoftware&gt;
    &lt;AnalysisSoftware name="FalseDiscoveryRate_2014-07-02 12-04-18" id="FalseDiscoveryRate_2014-07-02 12-04-18"&gt;...&lt;/AnalysisSoftware&gt;
&lt;/AnalysisSoftwareList&gt;</programlisting>
</section>
<section xml:id="element-auditcollection" xreflabel="AuditCollection">
<title>Element &lt;AuditCollection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The complete set of Contacts (people and organisations) for this file.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> AuditCollectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-person"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A person&#8217;s name and contact details.
Any additional information such as the address, contact email etc. should be supplied using CV parameters or user parameters.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-organization"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Organizations are entities like companies, universities, government agencies.
Any additional information such as the address, email etc. should be supplied either as CV parameters or as user parameters.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AuditCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
    &lt;Person lastName="secondName" firstName="firstname" id="PERSON_DOC_OWNER"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="address"/&gt;
        &lt;Affiliation organization_ref="ORG_DOC_OWNER"/&gt;
    &lt;/Person&gt;
    &lt;Organization id="ORG_DOC_OWNER" name="myworkplace"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1000586" name="contact name" value="address"/&gt;
        ...
    &lt;/Organization&gt;
&lt;/AuditCollection&gt;</programlisting>
</section>
<section xml:id="element-bibliographicreference" xreflabel="BibliographicReference">
<title>Element &lt;BibliographicReference&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Any bibliographic references associated with the file</simpara>
<simpara><emphasis role="strong">Type:</emphasis> BibliographicReferenceType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>authors</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The names of the authors of the reference.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>doi</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The DOI of the referenced publication.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>editor</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The editor(s) of the reference.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>issue</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The issue name or number.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>pages</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The page numbers.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>publication</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the journal, book etc.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>publisher</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The publisher of the publication.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>title</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The title of the BibliographicReference.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>volume</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The volume name or number.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>year</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The year of publication.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;BibliographicReference id="10.1002/(SICI)1522-2683(19991201)20:18&lt;3551::AID-ELPS3551&gt;3.0.CO;2-2" name="Probability-based protein identification by searching sequence databases using mass spectrometry data" authors="David N. Perkins, Darryl J. C. Pappin, David M. Creasy, John S. Cottrell" publication="Electrophoresis" publisher="Wiley VCH" editor="" year="1999" volume="20" issue="18" pages="3551-3567" title="Probability-based protein identification by searching sequence databases using mass spectrometry data" /&gt;</programlisting>
</section>
<section xml:id="element-contactrole" xreflabel="ContactRole">
<title>Element &lt;ContactRole&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
<emphasis role="strong">Depending on context:</emphasis><?asciidoc-br?></simpara>
<simpara><emphasis role="strong">Type:</emphasis> ContactRoleType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>contact_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>When a ContactRole is used, it specifies which Contact the role is associated with.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-role"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The roles (lab equipment sales, contractor, etc.) the Contact fills.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ContactRole contact_ref="PERSON_DOC_OWNER"&gt;
    &lt;Role&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001271" name="researcher"/&gt;
    &lt;/Role&gt;
&lt;/ContactRole&gt;</programlisting>
</section>
<section xml:id="element-customizations" xreflabel="Customizations">
<title>Element &lt;Customizations&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Any customizations to the software, such as alternative scoring mechanisms implemented, should be documented here as free text.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> xsd:string</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Customizations&gt;
No customisations
&lt;/Customizations&gt;</programlisting>
</section>
<section xml:id="element-cv" xreflabel="cv">
<title>Element &lt;cv&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A source controlled vocabulary from which cvParams will be obtained.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> cvType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>fullName</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The full name of the CV.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The unique identifier of this cv within the document to be referenced by cvParam elements.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>uri</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:anyURI</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The URI of the source CV.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>version</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The version of the CV.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cv fullName="Proteomics Standards Initiative Mass Spectrometry Vocabularies" version="2.32.0" uri="https://raw.githubusercontent.com/HUPO-PSI/psi-ms-CV/master/psi-ms.obo" id="PSI-MS"/&gt;</programlisting>
</section>
<section xml:id="element-cvlist" xreflabel="cvList">
<title>Element &lt;cvList&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The list of controlled vocabularies used in the file.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> CVListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cv"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A source-controlled vocabulary from which cvParams will be obtained.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvList&gt;
    &lt;cv id="PSI-MS" fullName="Proteomics Standards Initiative Mass Spectrometry Vocabularies" uri="https://raw.githubusercontent.com/HUPO-PSI/psi-ms-CV/master/psi-ms.obo" version="3.15.0"/&gt;
    &lt;cv id="UNIMOD" fullName="UNIMOD" uri="http://www.unimod.org/obo/unimod.obo"/&gt;
    &lt;cv id="UO" fullName="UNIT-ONTOLOGY" uri="https://raw.githubusercontent.com/bio-ontology-research-group/unit-ontology/master/unit.obo"/&gt;
    &lt;cv id="XLMOD" fullName="PSI cross-link modifications" uri="https://raw.githubusercontent.com/HUPO-PSI/mzIdentML/master/cv/XLMOD-1.0.0.obo"/&gt;
&lt;/cvList&gt;</programlisting>
</section>
<section xml:id="element-cvparam" xreflabel="cvParam">
<title>Element &lt;cvParam&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A single entry from an ontology or a controlled vocabulary.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> CVParamType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>accession</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The accession or ID number of this CV term in the source CV.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>cvRef</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the cv element from which this term originates.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the parameter.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>unitAccession</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>An accession number identifying the unit within the OBO foundry Unit CV.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>unitCvRef</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>If a unit term is referenced, this attribute MUST refer to the CV 'id' attribute defined in the cvList in this file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>unitName</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the unit.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>value</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The user-entered value of the parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1002520" name="peptide group ID" value="CCPQCCSSGCSQNLCGPLCVTTPYYCTR_##Carbamidomethyl(C):1##Carbamidomethyl(C):2##Carbamidomethyl(C :5##Carbamidomethyl(C):6##Carbamidomethyl(C):10##Carbamidomethyl(C):15##Carbamidomethyl(C :19##Carbamidomethyl(C):26"/&gt;</programlisting>
</section>
<section xml:id="element-databasefilters" xreflabel="DatabaseFilters">
<title>Element &lt;DatabaseFilters&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The specification of filters applied to the database searched.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> DatabaseFiltersType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-filter"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Filters applied to the search database.
The filter MUST include at least one of Include and Exclude.
If both are used, it is assumed that inclusion is performed first.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DatabaseFilters&gt;
    &lt;Filter&gt;
        &lt;FilterType&gt;
            &lt;cvParam accession="MS:1001020" cvRef="PSI-MS" name="DB filter taxonomy"/&gt;
        &lt;/FilterType&gt;
    &lt;/Filter&gt;
&lt;/DatabaseFilters&gt;</programlisting>
</section>
<section xml:id="element-databasename" xreflabel="DatabaseName">
<title>Element &lt;DatabaseName&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The database name may be given as a cvParam if it maps exactly to one of the release databases listed in the CV, otherwise a userParam should be used.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DatabaseName&gt;
    &lt;userParam name="uniprot-human-reviewed-trypsin-april-2016_concatenated_target_decoy.fasta"/&gt;
&lt;/DatabaseName&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/Inputs/SearchDatabase/DatabaseName<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001013">MS:1001013</link> (database name) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001084">MS:1001084</link> (database nr)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001104">MS:1001104</link> (database UniProtKB/Swiss-Prot)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001142">MS:1001142</link> (database IPI_human)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001285">MS:1001285</link> (database IPI_mouse)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001286">MS:1001286</link> (database IPI_rat)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001287">MS:1001287</link> (database IPI_zebrafish)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001288">MS:1001288</link> (database IPI_chicken)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001289">MS:1001289</link> (database IPI_cow)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001290">MS:1001290</link> (database IPI_arabidopsis)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002060">MS:1002060</link> (database UniProtKB/TrEMBL)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001104[MS:1001104]" cvRef="PSI-MS" name="database UniProtKB/Swiss-Prot"/&gt;</programlisting>
<simpara><emphasis role="strong">Example userParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;userParam name="fawaz_PXD000652_combined_concatenated_target_decoy.fasta"/&gt;
&lt;userParam name="no description"/&gt;
&lt;userParam name="Rosetta_uniprot_20130402_mouse_SWISS_can_iso_ECOLI.fasta"/&gt;
&lt;userParam name="Ros_Uniprot_20130402.fasta"/&gt;
&lt;userParam name="26Syeast_test.fasta"/&gt;
&lt;userParam name="Ros_Uniprot_Ecoli_20130402.fasta"/&gt;
&lt;userParam name="uniprot-human-reviewed-trypsin-april-2016_concatenated_target_decoy.fasta"/&gt;
&lt;userParam name="HSA-Active.FASTA"/&gt;</programlisting>
</section>
<section xml:id="element-databasetranslation" xreflabel="DatabaseTranslation">
<title>Element &lt;DatabaseTranslation&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A specification of how a nucleic acid sequence database was translated for searching.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> DatabaseTranslationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>frames</simpara></entry>
<entry align="left" valign="top"><simpara>listOfAllowedFrames</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The frames in which the nucleic acid sequence has been translated as a space separated list</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-translationtable"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The table used to translate codons into nucleic acids e.g. by reference to the NCBI translation table.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DatabaseTranslation frames="1 2 3 -1 -2 -3"&gt;
    &lt;TranslationTable id="TT_1" name="Standard"&gt;
        &lt;cvParam accession="MS:1001025" name="translation table" cvRef="PSI-MS" value="FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"/&gt;
        &lt;cvParam accession="MS:1001410" name="translation start codons" cvRef="PSI-MS" value="---M---------------M---------------M----------------------------"/&gt;
        &lt;cvParam accession="MS:1001423" name="translation table description" cvRef="PSI-MS" value="http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html/index.cgi?chapter=cgencodes#SG1"/&gt;
    &lt;/TranslationTable&gt;
    &lt;TranslationTable id="TT_2" name="Vertebrate Mitochondrial"&gt;...&lt;/TranslationTable&gt;
&lt;/DatabaseTranslation&gt;</programlisting>
</section>
<section xml:id="element-datacollection" xreflabel="DataCollection">
<title>Element &lt;DataCollection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The collection of input and output data sets of the analyses.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> DataCollectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-inputs"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The inputs to the analyses including the databases searched, the spectral data and the source file converted to mzIdentML.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysisdata"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>Data sets generated by the analyses, including peptide and protein lists.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image6.png" contentwidth="566" contentdepth="642"/>
</imageobject>
<textobject><phrase>http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/DataCollection.png</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DataCollection&gt;
&lt;Inputs xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
&lt;SearchDatabase numDatabaseSequences="57566" location="E:\Work\PSI\mzIdentML\ProteinInference\Rosetta2\FASTAswithecoli\Rosetta_uniprot_20130402_
mouse_SWISS_can_iso_ECOLI.fasta" id="SearchDB_1"&gt;
&lt;FileFormat&gt;
&lt;cvParam accession="MS:1001348" cvRef="PSI-MS" name="FASTA format"/&gt;
&lt;/FileFormat&gt;
&lt;DatabaseName&gt;
...
&lt;/DatabaseName&gt;
&lt;/DataCollection&gt;</programlisting>
</section>
<section xml:id="element-dbsequence" xreflabel="DBSequence">
<title>Element &lt;DBSequence&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A database sequence from the specified SearchDatabase (nucleic acid or amino acid). If the sequence is nucleic acid, the source nucleic acid sequence should be given in the seq attribute rather than a translated sequence.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> DBSequenceType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>accession</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The unique accession of this sequence.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>length</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The length of the sequence as a number of bases or residues.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>searchDatabase_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The source database of this sequence.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-seq"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The actual sequence of amino acids or nucleic acid.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DBSequence accession="sp_P20029_GRP78_MOUSE 78 kDa glucose-regulated protein OS=Mus musculus GN=Hspa5 PE=1..." searchDatabase_ref="SearchDB_1" length="655" name="sp_P20029_GRP78_MOUSE 78 kDa glucose-regula ted protein OS=Mus musculus GN=Hspa5 PE=1 SV=3" id="dbseq_sp_P20029_GRP78_MOUSE 78 kDa glucose-regulated protein OS=Mus musculus GN=Hspa5 PE=1..."&gt;
    &lt;Seq&gt;MMKFTVVAAALLLLGAVRAEEEDKKEDVGTVVGIDLGTTYSCVGVFKNGRVEIIANDQGNRITPSYVAFTPEGERLIGDAAKNQLTSNPENTVFDA KRLIGRTWNDPSVQQDIKFLPFKVVEKKTKPYIQVDIGGGQTKTFAPEEISAMVLTKMKETAEAYLGKKVTHAVVTVPAYFNDAQRQATKDAGTIAGLNVM RIINEPTAAAIAYGLDKREGEKNILVFDLGGGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGKDVRKDNRAVQKLRREVEKAKR ALSSQHQARIEIESFFEGEDFSETLTRAKFEELNMDLFRSTMKPVQKVLEDSDLKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEAVAYGA AVQAGVLSGDQDTGDLVLLDVCPLTLGIETVGGVMTKLIPRNTVVPTKKSQIFSTASDNQPTVTIKVYEGERPLTKDNHLLGTFDLTGIPPAPRGVPQIEV TFEIDVNGILRVTAEDKGTGNKNKITITNDQNRLTPEEIERMVNDAEKFAEEDKKLKERIDTRNELESYAYSLKNQIGDKEKLGGKLSSEDKETMEKAVEE KIEWLESHQDADIEDFKAKKKELEEIVQPIISKLYGSGGPPPTGEEDTSEKDEL&lt;/Seq&gt;
&lt;/DBSequence&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/SequenceCollection/DBSequence<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001342">MS:1001342</link> (database sequence details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001088">MS:1001088</link> (protein description)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001090">MS:1001090</link> (taxonomy nomenclature)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001343">MS:1001343</link> (NA sequence)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001344">MS:1001344</link> (AA sequence)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001467">MS:1001467</link> (taxonomy: NCBI TaxID)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001468">MS:1001468</link> (taxonomy: common name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001469">MS:1001469</link> (taxonomy: scientific name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001470">MS:1001470</link> (taxonomy: Swiss-Prot ID)<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001089">MS:1001089</link> (molecule taxonomy) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001090">MS:1001090</link> (taxonomy nomenclature)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001467">MS:1001467</link> (taxonomy: NCBI TaxID)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001468">MS:1001468</link> (taxonomy: common name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001469">MS:1001469</link> (taxonomy: scientific name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001470">MS:1001470</link> (taxonomy: Swiss-Prot ID)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1001088" cvRef="PSI-MS" value="sp_P36938_PGM_ECOLI Phosphoglucomutase OS=Escherichia coli (strain K12) GN=pgm PE=1 SV=1" name="protein description"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for proteogenomics search:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DBSequence searchDatabase_ref="SearchDB_1" accession="generic|A_ENSP00000284981|" id="dbseq_generic|A_ENSP00000284981|"&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002637" name="chromosome name" value="21"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002638" name="chromosome strand" value="-"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002644" name="genome reference version" value="Homo_sapiens.GRCh38.77.gff3"/&gt;
&lt;/DBSequence&gt;</programlisting>
</section>
<section xml:id="element-enzyme" xreflabel="Enzyme">
<title>Element &lt;Enzyme&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The details of an individual cleavage enzyme should be provided by giving a regular expression or a CV term if a "standard" enzyme cleavage has been performed.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> EnzymeType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>cTermGain</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string with restriction
[A-Za-z0-9 ]+</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Element formula gained at CTerm.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>minDistance</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Minimal distance for another cleavage (minimum: 1).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>missedCleavages</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The number of missed cleavage sites allowed by the search.
The attribute MUST be provided if an enzyme has been used.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>nTermGain</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string with restriction
[A-Za-z0-9 ]+</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Element formula gained at NTerm.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>semiSpecific</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:boolean</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Set to true if the enzyme cleaves semi-specifically (i.e. one terminus MUST cleave according to the rules, the other can cleave at any residue), false if the enzyme cleavage is assumed to be specific to both termini (accepting for any missed cleavages).</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-siteregexp"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>Regular expression for specifying the enzyme cleavage site.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-enzymename"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the enzyme from a CV.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Enzyme missedCleavages="1" semiSpecific="false" cTermGain="OH" nTermGain="H" id="ENZ_0"&gt;
    &lt;SiteRegexp&gt;(?&amp;lt;=[KR])&lt;/SiteRegexp&gt;
    &lt;EnzymeName&gt;
        &lt;cvParam accession="MS:1001313" cvRef="PSI-MS" name="Trypsin/P"/&gt;
    &lt;/EnzymeName&gt;
&lt;/Enzyme&gt;</programlisting>
</section>
<section xml:id="element-enzymename" xreflabel="EnzymeName">
<title>Element &lt;EnzymeName&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The name of the enzyme from a CV.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;EnzymeName&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001251" name="Trypsin"/&gt;
&lt;/EnzymeName&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/Enzymes/Enzyme/EnzymeName<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001045">MS:1001045</link> (cleavage agent name) only once<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001091">MS:1001091</link> (NoEnzyme)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001251">MS:1001251</link> (Trypsin)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001303">MS:1001303</link> (Arg-C)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001304">MS:1001304</link> (Asp-N)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001305">MS:1001305</link> (Asp-N_ambic)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001306">MS:1001306</link> (Chymotrypsin)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001307">MS:1001307</link> (CNBr)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001308">MS:1001308</link> (Formic_acid)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001309">MS:1001309</link> (Lys-C)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001310">MS:1001310</link> (Lys-C/P)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001045">et al.</link></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001251" name="Trypsin"/&gt;
&lt;cvParam accession="MS:1001313" cvRef="PSI-MS" name="Trypsin/P"/&gt;</programlisting>
</section>
<section xml:id="element-enzymes" xreflabel="Enzymes">
<title>Element &lt;Enzymes&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The list of enzymes used in experiment</simpara>
<simpara><emphasis role="strong">Type:</emphasis> EnzymesType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>independent</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:boolean</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>If there are multiple enzymes specified, this attribute is set to true if cleavage with different enzymes is performed independently.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-enzyme"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The details of an individual cleavage enzyme should be provided by giving a regular expression or a CV term if a "standard" enzyme cleavage has been performed.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Enzymes&gt;
    &lt;Enzyme missedCleavages="1" semiSpecific="false" cTermGain="OH" nTermGain="H" id="ENZ_0"&gt;
        &lt;SiteRegexp&gt;(?&amp;lt;=[KR])&lt;/SiteRegexp&gt;
        &lt;EnzymeName&gt;
            &lt;cvParam accession="MS:1001313" cvRef="PSI-MS" name="Trypsin/P"/&gt;
        &lt;/EnzymeName&gt;
    &lt;/Enzyme&gt;
    ...
&lt;/Enzymes&gt;</programlisting>
</section>
<section xml:id="element-exclude" xreflabel="Exclude">
<title>Element &lt;Exclude&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
All sequences fulfilling the specifed criteria are excluded.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseFilters/Filter/Exclude<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001512">MS:1001512</link> (Sequence database filters) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001090">MS:1001090</link> (taxonomy nomenclature)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001201">MS:1001201</link> (DB MW filter maximum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001202">MS:1001202</link> (DB MW filter minimum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001203">MS:1001203</link> (DB PI filter maximum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001204">MS:1001204</link> (DB PI filter minimum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001467">MS:1001467</link> (taxonomy: NCBI TaxID)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001468">MS:1001468</link> (taxonomy: common name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001469">MS:1001469</link> (taxonomy: scientific name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001470">MS:1001470</link> (taxonomy: Swiss-Prot ID)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001513">MS:1001513</link> (DB sequence filter pattern)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001512">et al.</link></simpara>
</section>
<section xml:id="element-externalformatdocumentation" xreflabel="ExternalFormatDocumentation">
<title>Element &lt;ExternalFormatDocumentation&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A URI to access documentation and tools to interpret the external format of the ExternalData instance. For example, XML Schema or static libraries (APIs) to access binary formats.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> xsd:anyURI</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-fileformat" xreflabel="FileFormat">
<title>Element &lt;FileFormat&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The format of the ExternalData file, for example "tiff" for image files.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> FileFormatType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;FileFormat&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001401" name="X!Tandem xml format"/&gt;
&lt;/FileFormat&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/Inputs/SearchDatabase/FileFormat<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001347">MS:1001347</link> (database file formats) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001348">MS:1001348</link> (FASTA format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001349">MS:1001349</link> (ASN.1)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001350">MS:1001350</link> (NCBI <emphasis role="strong">.p</emphasis>)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001351">MS:1001351</link> (clustal aln)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001352">MS:1001352</link> (embl em)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001353">MS:1001353</link> (NBRF PIR)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001462">MS:1001462</link> (PEFF format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002659">MS:1002659</link> (UniProtKB text sequence format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002660">MS:1002660</link> (UniProtKB XML sequence format)</simpara>
<simpara>Path /MzIdentML/DataCollection/Inputs/SourceFile/FileFormat<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001040">MS:1001040</link> (intermediate analysis format) only once<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000742">MS:1000742</link> (Bioworks SRF format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001107">MS:1001107</link> (data stored in database)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001199">MS:1001199</link> (Mascot DAT format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001200">MS:1001200</link> (SEQUEST out file format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001242">MS:1001242</link> (SEQUEST out folder)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001243">MS:1001243</link> (SEQUEST summary)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001275">MS:1001275</link> (ProteinScape SearchEvent)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001276">MS:1001276</link> (ProteinScape Gel)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001399">MS:1001399</link> (OMSSA csv format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001400">MS:1001400</link> (OMSSA xml format)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001040">et al.</link><?asciidoc-br?>
Path /MzIdentML/DataCollection/Inputs/SpectraData/FileFormat<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000560">MS:1000560</link> (mass spectrometer file format) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000526">MS:1000526</link> (Waters raw format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000562">MS:1000562</link> (ABI WIFF format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000563">MS:1000563</link> (Thermo RAW format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000564">MS:1000564</link> (PSI mzData format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000565">MS:1000565</link> (Micromass PKL format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000566">MS:1000566</link> (ISB mzXML format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000567">MS:1000567</link> (Bruker/Agilent YEP format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000584">MS:1000584</link> (mzML format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000613">MS:1000613</link> (DTA format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000614">MS:1000614</link> (ProteinLynx Global Server mass spectrum XML format)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000560">et al.</link></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001400" name="OMSSA xml file"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001348" name="FASTA format"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001062" name="Mascot MGF file"/&gt;
&lt;cvParam accession="MS:1001401" cvRef="PSI-MS" name="X\!Tandem xml file"/&gt;
&lt;cvParam accession="MS:1001199" cvRef="PSI-MS" name="Mascot DAT format"/&gt;
&lt;cvParam accession="MS:1000584" cvRef="PSI-MS" name="mzML format"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1000563" name="Thermo Raw file"/&gt;</programlisting>
</section>
<section xml:id="element-filter" xreflabel="Filter">
<title>Element &lt;Filter&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Filters applied to the search database. The filter MUST include at least one of Include and Exclude. If both are used, it is assumed that inclusion is performed first.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> FilterType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-filtertype"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The type of filter e.g. database taxonomy filter, pi filter, mw filter</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><link xl:href="#element-include">Include</link></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>All sequences fulfilling the specifed criteria are included.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-exclude"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>All sequences fulfilling the specifed criteria are excluded.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Filter&gt;
    &lt;FilterType&gt;
        &lt;cvParam accession="MS:1001020" cvRef="PSI-MS" name="DB filter taxonomy"/&gt;
    &lt;/FilterType&gt;
&lt;/Filter&gt;</programlisting>
</section>
<section xml:id="element-filtertype" xreflabel="FilterType">
<title>Element &lt;FilterType&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The type of filter e.g. database taxonomy filter, pi filter, mw filter</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;FilterType&gt;
    &lt;cvParam accession="MS:1001020" cvRef="PSI-MS" name="DB filter taxonomy"/&gt;
&lt;/FilterType&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseFilters/Filter/FilterType<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001511">MS:1001511</link> (Sequence database filter types) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001020">MS:1001020</link> (DB filter taxonomy)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001021">MS:1001021</link> (DB filter on accession numbers)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001022">MS:1001022</link> (DB MW filter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001023">MS:1001023</link> (DB PI filter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001027">MS:1001027</link> (DB filter on sequence pattern)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
</section>
<section xml:id="element-fragmentarray" xreflabel="FragmentArray">
<title>Element &lt;FragmentArray&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
An array of values for a given type of measure and for a particular ion type, in parallel to the index of ions identified.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> FragmentArrayType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>measure_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the Measure defined in the FragmentationTable</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>values</simpara></entry>
<entry align="left" valign="top"><simpara>listOfFloats</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The values of this particular measure, corresponding to the index defined in ion type</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;FragmentArray measure_ref="Measure_Error" values="4.173258879802688E-4 -1.9794682032170385E-5 1.618474794895519E-5 0.001690052197886871 -0.0037214683721344954 0.0020060110579152024 -2.3719321211501665E-4 2.7168621795681247E-4 -0.0019049343519554895 0.0019553613780090018 2.6704080801209784E-4 0.007734020238103767 0.0013568713879976713 1.571508180404635E-4 -0.0017703817320580129 0.013774177127970688 0.0056154565579618065 0.004415735988004599 0.006145015418042021 -0.005059517131940083 0.01419863401793009 0.007626913448120831 0.007892192877989146"/&gt;</programlisting>
</section>
<section xml:id="element-fragmentation" xreflabel="Fragmentation">
<title>Element &lt;Fragmentation&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The product ions identified in this result.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> FragmentationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-iontype"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>IonType defines the index of fragmentation ions being reported, importing a CV term for the type of ion e.g. b ion.
Example: if b3 b7 b8 and b10 have been identified, the index attribute will contain 3 7 8 10, and the corresponding values will be reported in parallel arrays below</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Fragmentation&gt;
    &lt;IonType charge="1" index="1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23"&gt;
        &lt;FragmentArray measure_ref="Measure_MZ" values="175.1193695 232.1403961 289.1618958 452.2268982 509.2429504 566.2701416 653.2999268 710.3218994 767.3411865 854.3770752 911.3968506 968.4257813 1065.472168 1122.492432 1285.553833 1399.612305 1456.62561 1513.645874 1570.669067 1733.721191 1830.793213 1887.808105 1944.829834"/&gt;
        &lt;FragmentArray measure_ref="Measure_Int" values="5939.5844726563 4933.5014648438 13310.7265625 5077.6694335938 5685.9287109375 13253.552734375 7620.0947265625 7724.3696289063 16868.541015625 10552.126953125 11589.0576171875 7839.9741210938 47821.64453125 60335.71484375 21759.3984375 8742.5595703125 11512.0908203125 18130.890625 30577.375 3801.3923339844 8051.07421875 1954.5501708984 4844.9125976563"/&gt;
        &lt;FragmentArray measure_ref="Measure_Error" values="4.173258879802688E-4 -1.9794682032170385E-5 1.618474794895519E-5 0.001690052197886871 -0.0037214683721344954 0.0020060110579152024 -2.3719321211501665E-4 2.7168621795681247E-4 -0.0019049343519554895 0.0019553613780090018 2.6704080801209784E-4 0.007734020238103767 0.0013568713879976713 1.571508180404635E-4 -0.0017703817320580129 0.013774177127970688 0.0056154565579618065 0.004415735988004599 0.006145015418042021 -0.005059517131940083 0.01419863401793009 0.007626913448120831 0.007892192877989146"/&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001220" name="frag: y ion"/&gt;
    &lt;/IonType&gt;
    ...
&lt;/Fragmentation&gt;</programlisting>
</section>
<section xml:id="element-fragmentationtable" xreflabel="FragmentationTable">
<title>Element &lt;FragmentationTable&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Contains the types of measures that will be reported in generic arrays for each SpectrumIdentificationItem e.g. product ion m/z, product ion intensity, product ion m/z error</simpara>
<simpara><emphasis role="strong">Type:</emphasis> FragmentationTableType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-measure"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>References to CV terms defining the measures about product ions to be reported in SpectrumIdentificationItem</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;FragmentationTable&gt;
    &lt;Measure id="Measure_MZ"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001225" name="product ion m/z" unitCvRef="PSI-MS" unitAccession="MS:1000040" unitName="m/z"/&gt;
    &lt;/Measure&gt;
    &lt;Measure id="Measure_Int"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001226" name="product ion intensity" unitCvRef="PSI-MS" unitAccession="MS:1000131" unitName="number of detector counts"/&gt;
    &lt;/Measure&gt;
    ...
&lt;/FragmentationTable&gt;</programlisting>
</section>
<section xml:id="element-fragmenttolerance" xreflabel="FragmentTolerance">
<title>Element &lt;FragmentTolerance&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The tolerance of the search given as a plus and minus value with units.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ToleranceType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;FragmentTolerance&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="20.0 ppm" unitAccession="UO:0000169" unitName="parts per million" unitCvRef="UO"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="20.0 ppm" unitAccession="UO:0000169" unitName="parts per million" unitCvRef="UO"/&gt;
&lt;/FragmentTolerance&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/FragmentTolerance<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001412">MS:1001412</link> (search tolerance plus value) only once<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001413">MS:1001413</link> (search tolerance minus value) only once</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="0.7" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="0.7" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"/&gt;</programlisting>
</section>
<section xml:id="element-include" xreflabel="Include">
<title>Element &lt;Include&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
All sequences fulfilling the specifed criteria are included.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseFilters/Filter/Include<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001512">MS:1001512</link> (Sequence database filters) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001090">MS:1001090</link> (taxonomy nomenclature)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001201">MS:1001201</link> (DB MW filter maximum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001202">MS:1001202</link> (DB MW filter minimum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001203">MS:1001203</link> (DB PI filter maximum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001204">MS:1001204</link> (DB PI filter minimum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001467">MS:1001467</link> (taxonomy: NCBI TaxID)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001468">MS:1001468</link> (taxonomy: common name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001469">MS:1001469</link> (taxonomy: scientific name)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001470">MS:1001470</link> (taxonomy: Swiss-Prot ID)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001513">MS:1001513</link> (DB sequence filter pattern)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001512">et al.</link></simpara>
</section>
<section xml:id="element-inputs" xreflabel="Inputs">
<title>Element &lt;Inputs&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The inputs to the analyses including the databases searched, the spectral data and the source
file converted to mzIdentML.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> InputsType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-sourcefile"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A file from which this mzIdentML instance was created.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-searchdatabase"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A database for searching mass spectra.
Examples include a set of amino acid sequence entries, nucleotide databases (e.g. 6 frame translated) or annotated spectra libraries.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectradata"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A data set containing spectra data (consisting of one or more spectra).</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Inputs xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
    &lt;SourceFile location="C:\Work\PSI\mzIdentML\ProteinInference\Rosetta2\tandem\peaklist2a_plus_ecoli_versus_unimod_ full.xml" id="SourceFile_1"&gt;
        &lt;FileFormat&gt;
            &lt;cvParam accession="MS:1001401" cvRef="PSI-MS" name="X\!Tandem xml file"/&gt;
        &lt;/FileFormat&gt;
    &lt;/SourceFile&gt;
    &lt;SearchDatabase numDatabaseSequences="163648" location="C:/Work/PSI/mzIdentML/ProteinInference/Rosetta2/FASTAs, neat/Rosetta_uniprot_20130402_mouse_FULL_UNIPROT_can+iso.fasta" id="SearchDB_1"&gt;...&lt;/SearchDatabase&gt;
&lt;/Inputs&gt;</programlisting>
</section>
<section xml:id="element-inputspectra" xreflabel="InputSpectra">
<title>Element &lt;InputSpectra&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
One of the spectra data sets used.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> InputSpectraType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>spectraData_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the SpectraData element which locates the input spectra to an external file.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-inputspectrumidentifications" xreflabel="InputSpectrumIdentifications">
<title>Element &lt;InputSpectrumIdentifications&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The lists of spectrum identifications that are input to the protein detection process.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> InputSpectrumIdentificationsType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>spectrumIdentificationList_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the list of spectrum identifications that were input to the process.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-iontype" xreflabel="IonType">
<title>Element &lt;IonType&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
IonType defines the index of fragmentation ions being reported, importing a CV term for the
Type of ion e.g. b ion. Example: if b3 b7 b8 and b10 have been identified, the index attribute
will contain 3 7 8 10, and the corresponding values will be reported in parallel arrays below</simpara>
<simpara><emphasis role="strong">Type:</emphasis> IonTypeType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>charge</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The charge of the identified fragmentation ions.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>index</simpara></entry>
<entry align="left" valign="top"><simpara>listOfIntegers</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The index of ions identified as integers, following standard notation for a-c, x-z e.g. if b3 b5 and b6 have been identified, the index would store "3 5 6".
For internal ions, the index contains pairs defining the start and end point - see specification document for examples.
For immonium ions, the index is the position of the identified ion within the peptide sequence - if the peptide contains the same amino acid in multiple positions that cannot be distinguished, all positions should be given.
For precursor ions, including neutral losses, the index value MUST be 0. For any other ions not related to the position within the peptide sequence e.g. quantification reporter ions, the index value MUST be 0.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fragmentarray"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>An array of values for a given type of measure and for a particular ion type, in parallel to the index of ions identified.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;IonType charge="1" index="1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23"&gt;
    &lt;FragmentArray measure_ref="Measure_MZ" values="175.1193695 232.1403961 289.1618958 452.2268982 509.2429504 566.2701416 653.2999268 710.3218994 767.3411865 854.3770752 911.3968506 968 4257813 1065.472168 1122.492432 1285.553833 1399.612305 1456.62561 1513.645874 1570.669067 1733.721191 1830.793213 1887.808105 1944.829834"/&gt;
    &lt;FragmentArray measure_ref="Measure_Int" values="5939.5844726563 4933.5014648438 13310.7265625 5077.6694335938 5685.9287109375 13253.552734375 7620.0947265625 7724.3696289063 16868.541015625 10552.126953125 11589.0576171875 7839.9741210938 47821.64453125 60335.71484375 21759.3984375 8742.5595703125 11512.0908203125 18130.890625 30577.375 3801.3923339844 8051.07421875 1954.5501708984 4844.9125976563"/&gt;
    &lt;FragmentArray measure_ref="Measure_Error" values="4.173258879802688E-4 -1.9794682032170385E-5 1.618474794895519E-5 0.001690052197886871 -0.0037214683721344954 0.0020060110579152024 -2.3719321211501665E-4 2.7168621795681247E-4 -0.0019049343519554895 0.0019553613780090018 2.6704080801209784E-4 0.007734020238103767 0.0013568713879976713 1.571508180404635E-4 -0.0017703817320580129 0.013774177127970688 0.0056154565579618065 0.004415735988004599 0.006145015418042021 -0.005059517131940083 0.01419863401793009 0.007626913448120831 0.007892192877989146"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001220" name="frag: y ion"/&gt;
&lt;/IonType&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/SpectrumIdentificationResult/SpectrumIdentificationItem/Fragmentation/IonType<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001221">MS:1001221</link> (fragmentation information) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000903">MS:1000903</link> (product ion series ordinal)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000904">MS:1000904</link> (product ion m/z delta)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000926">MS:1000926</link> (product interpretation rank)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001220">MS:1001220</link> (frag: y ion)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001222">MS:1001222</link> (frag: b ion - H2O)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001223">MS:1001223</link> (frag: y ion - H2O)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001224">MS:1001224</link> (frag: b ion)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001225">MS:1001225</link> (product ion m/z)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001227">MS:1001227</link> (product ion m/z error)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001228">MS:1001228</link> (frag: x ion)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001221">et al.</link></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1001224" cvRef="PSI-MS" name="frag: b ion"/&gt;
&lt;cvParam accession="MS:1001220" cvRef="PSI-MS" name="frag: y ion"/&gt;
&lt;cvParam accession="MS:1002681" cvRef="PSI-MS" name="OpenXQuest:combined score" value="21.9678562261903"/&gt;
&lt;cvParam accession="MS:1002511" cvRef="PSI-MS" name="crosslink spectrum identification item" value="3050674907789158263"/&gt;
&lt;cvParam accession="MS:1000894" cvRef="PSI-MS" name="retention time" value="5468.0193" unitAccession="second" unitName="" unitCvRef="se"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001523" name="frag: precursor ion"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002466" name="PeptideShaker PSM score" value="0.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002467" name="PeptideShaker PSM confidence" value="0.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002469" name="PeptideShaker peptide confidence" value="4.0000000000000036"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002468" name="PeptideShaker peptide score" value="-0.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002500" name="peptide passes threshold" value="false"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002520" name="peptide group ID" value="QKAQAAATVVK"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001328" name="OMSSA:evalue" value="68.145917448381"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001117" name="theoretical mass" value="1113.6506071554904" unitCvRef="UO" unitAccession="UO:0000221" unitName="dalton"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002540" name="PeptideShaker PSM confidence type" value="Not Validated"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1000796" name="spectrum title" value="qExactive01819.13825.13825. File:" controllerNumber="1" scan="13825&amp;quot;&amp;quot;"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001222" name="frag: b ion - H2O"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001239" name="frag: immonium ion"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001223" name="frag: y ion - H2O"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001233" name="frag: y ion - NH3"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001521" name="frag: precursor ion - H2O"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002536" name="D-Score" value="2:99.6124031007752:1:true"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001330" name="X!Tandem:expect" value="0.0067"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001232" name="frag: b ion - NH3"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001522" name="frag: precursor ion - NH3"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002674" name="frag: b ion - CH4OS"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001969" name="phosphoRS score" value="1:50.0:4:false"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002550" name="peptide:phosphoRS score" value="1:50.0:4:false"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002553" name="peptide:D-Score" value="1:1.4263074484944571:4:false"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002694" name="frag: precursor ion - CH4OS"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002686" name="frag: y ion - CH4OS"/&gt;</programlisting>
<simpara><emphasis role="strong">Example userParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;userParam name="crosslink_chain" unitName="xsd:string" values="alpha"/&gt;
&lt;userParam name="crosslink_ioncategory" unitName="xsd:string" values="ci"/&gt;</programlisting>
</section>
<section xml:id="element-masstable" xreflabel="MassTable">
<title>Element &lt;MassTable&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The masses of residues used in the search.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> MassTableType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>msLevel</simpara></entry>
<entry align="left" valign="top"><simpara>listOfIntegers</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The MS spectrum that the MassTable refers to e.g. "1" for MS1 "2" for MS2 or "1 2" for MS1 or MS2.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-residue"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The specification of a single residue within the mass table.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-ambiguousresidue"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Ambiguous residues e.g. X can be specified by the Code attribute and a set of parameters for example giving the different masses that will be used in the search.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;MassTable msLevel="1 2" id="MT"&gt;
    &lt;Residue mass="71.03712" code="A"/&gt;
    &lt;Residue mass="103.009186" code="C"/&gt;
    &lt;Residue mass="115.02694" code="D"/&gt;
    &lt;Residue mass="129.04259" code="E"/&gt;
    &lt;Residue mass="147.06842" code="F"/&gt;
    &lt;Residue mass="57.021465" code="G"/&gt;
    ...
&lt;/MassTable&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/MassTable<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001354">MS:1001354</link> (mass table options) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001346">MS:1001346</link> (AAIndex mass table)</simpara>
</section>
<section xml:id="element-measure" xreflabel="Measure">
<title>Element &lt;Measure&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
References to CV terms defining the measures about product ions to be reported in SpectrumIdentificationItem</simpara>
<simpara><emphasis role="strong">Type:</emphasis> MeasureType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Measure id="Measure_Int"&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001226" name="product ion intensity" unitCvRef="PSI-MS" unitAccession="MS:1000131" unitName="number of detector counts"/&gt;
&lt;/Measure&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/FragmentationTable/Measure<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001226">MS:1001226</link> (product ion intensity) only once<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001225">MS:1001225</link> (product ion m/z) only once<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001227">MS:1001227</link> (product ion m/z error) only once</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1001225" cvRef="PSI-MS" unitCvRef="PSI-MS" unitName="m/z" unitAccession="MS:1000040" name="product ion m/z"/&gt;
&lt;cvParam accession="MS:1001226" cvRef="PSI-MS" unitCvRef="PSI-MS" unitName="number of detector counts" unitAccession="MS:1000131" name="product ion intensity"/&gt;
&lt;cvParam accession="MS:1001227" cvRef="PSI-MS" unitCvRef="PSI-MS" unitName="m/z" unitAccession="MS:1000040" name="product ion m/z error"/&gt;</programlisting>
</section>
<section xml:id="element-modification" xreflabel="Modification">
<title>Element &lt;Modification&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A molecule modification specification. If <emphasis>n</emphasis> modifications have been found on a peptide, there should be <emphasis>n</emphasis> instances of Modification. If multiple modifications are provided as cvParams, it is assumed that the modification is ambiguous i.e. one modification or another. A cvParam MUST be provided with the identification of the modification sourced from a suitable CV e.g. UNIMOD. If the modification is not present in the CV (and this will be checked by the semantic validator within a given tolerance window), there is a “unknown modification” CV term that MUST be used instead. A neutral loss should be defined as an additional CVParam within Modification. If more complex information should be given about neutral losses (such as presence/absence on particular product ions), this can additionally be encoded within the FragmentationArray.
MAY also contain the CV term "search modification id ref" (MS:1003393) once to link the Modification to a SearchModification defined in the ModificationParams of the related SpectrumIdentificationProtocol (<xref linkend="linking-searchmodification-elements-to-modification-elements"/>&gt;&gt;). The value of this term is the id of the SearchModification as defined by its “search modification id” (MS:1003392) CV term.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ModificationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>avgMassDelta</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:double</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Atomic mass delta considering the natural distribution of isotopes in Daltons.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>location</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Location of the modification within the peptide - position in peptide sequence, counted from the N-terminus residue, starting at position 1. Specific modifications to the N-terminus should be given the location 0. Modification to the C-terminus should be given as peptide length + 1. If the modification location is unknown e.g. for PMF data, this attribute should be omitted.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>monoisotopicMassDelta</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:double</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Atomic mass delta when assuming only the most common isotope of elements in Daltons.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>residues</simpara></entry>
<entry align="left" valign="top"><simpara>listOfChars</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Specification of the residue (amino acid) on which the modification occurs.
If multiple values are given, it is assumed that the exact residue modified is unknown i.e. the modification is to ONE of the residues listed.
Multiple residues would usually only be specified for PMF data.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Modification location="10" residues="K" monoisotopicMassDelta="138.0680796"&gt;
    &lt;cvParam accession="XLMOD:02001" cvRef="XLMOD" name="DSS"/&gt;
    &lt;cvParam accession="MS:1002509" cvRef="PSI-MS" name="crosslink donor" value="11309529182388590588"/&gt;
&lt;/Modification&gt;</programlisting>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1003393" name="search modification id ref" value="DSSO_donor"&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:35" name="Oxidation"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:37" name="Trimethyl"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:27" name="Glu-&amp;gt;pyro-Glu"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:1" name="Acetyl"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:385" name="Ammonia-loss"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:28" name="Gln-&amp;gt;pyro-Glu"/&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:575" name="Gly-&amp;gt;Val"/&gt;
    &lt;cvParam accession="UNIMOD:214" cvRef="UNIMOD" name="iTRAQ4plex"/&gt;
    &lt;cvParam accession="UNIMOD:7" cvRef="UNIMOD" name="Deamidated"/&gt;
    &lt;cvParam accession="UNIMOD:39" cvRef="UNIMOD" name="Methylthio"/&gt;
    &lt;cvParam accession="XLMOD:02001" cvRef="XLMOD" name="DSS"/&gt;
    &lt;cvParam accession="MS:1002509" cvRef="PSI-MS" name="crosslink donor" value="11309529182388590588"/&gt;
    &lt;cvParam accession="MS:1002510" cvRef="PSI-MS" name="crosslink acceptor" value="2399294065069360606"/&gt;
    &lt;cvParam accession="UNIMOD:1020" name="xlink:DSS" cvRef="UNIMOD"/&gt;
    &lt;cvParam cvRef="XLMOD" accession="XLMOD:01000" name="hydrolyzed BS3"/&gt;
    &lt;cvParam cvRef="XLMOD" accession="XLMOD:01001" name="amidated BS3"/&gt;
    &lt;cvParam cvRef="XLMOD" accession="XLMOD:02000" name="BS3"/&gt;
    &lt;cvParam cvRef="XLMOD" accession="XLMOD:01008" name="hydrolyzed BS3-d4"/&gt;
    &lt;cvParam cvRef="XLMOD" accession="XLMOD:01009" name="amidated BS3-d4"/&gt;
&lt;/cvParam&gt;</programlisting>
<simpara><emphasis role="strong">Example for crosslinking:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Modification location="2" residues="Y" monoisotopicMassDelta="156.07864430999996"&gt;
    &lt;cvParam cvRef="XLMOD" accession="XLMOD:01000" name="hydrolysed BS3"/&gt;
&lt;/Modification&gt;</programlisting>
</section>
<section xml:id="element-modificationparams" xreflabel="ModificationParams">
<title>Element &lt;ModificationParams&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The specification of static/variable modifications (e.g. Oxidation of Methionine) that are to be considered in the spectra search.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ModificationParamsType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-searchmodification"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Specification of a search modification as parameter for a spectra search.
Contains the name of the modification, the mass, the specificity and whether it is a static modification.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ModificationParams&gt;
&lt;SearchModification residues="M" massDelta="15.9949" fixedMod="false"&gt;
&lt;cvParam accession="UNIMOD:35" cvRef="UNIMOD" name="Oxidation"/&gt;
&lt;/SearchModification&gt;
&lt;SearchModification residues="." massDelta="144.102" fixedMod="false"&gt;
&lt;cvParam accession="UNIMOD:214" cvRef="UNIMOD" name="iTRAQ4plex"/&gt;
&lt;cvParam accession="MS:1001189" cvRef="PSI-MS" name="modification specificity peptide N-term"/&gt;
...
&lt;/ModificationParams&gt;</programlisting>
</section>
<section xml:id="element-organization" xreflabel="Organization">
<title>Element &lt;Organization&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Organizations are entities like companies, universities, government agencies. Any additional information such as the address, email etc. should be supplied either as CV parameters or as user parameters.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> OrganizationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-parent"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The containing organization (the university or business which a lab belongs to, etc.)</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Organization name="PeptideShaker developers" id="PS_DEV"&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000586" name="contact name" value="PeptideShaker developers"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="Proteomics Unit, Building for Basic Biology, University of Bergen, Jonas Liesvei 91, N-5009 Bergen, Norway"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000588" name="contact URL" value="http://compomics.github.io/projects/peptide-shaker.html"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000589" name="contact email" value="peptide-shaker@googlegroups.com"/&gt;
&lt;/Organization&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AuditCollection/Organization</simpara>
<simpara>SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000588">MS:1000588</link> (contact URL) one or more times</simpara>
<simpara>SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000587">MS:1000587</link> (contact address) one or more times</simpara>
<simpara>SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000589">MS:1000589</link> (contact email) one or more times</simpara>
<simpara>SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000586">MS:1000586</link> (contact name) one or more times</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1000586" name="contact name" value="address"/&gt;
&lt;cvParam accession="MS:1000588" cvRef="PSI-MS" value="http://www.matrixscience.com" name="contact URL"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="test"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1000589" name="contact email" value="test"/&gt;</programlisting>
</section>
<section xml:id="element-parent" xreflabel="Parent">
<title>Element &lt;Parent&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The containing organization (the university or business which a lab belongs to, etc.)</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParentOrganizationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>organization_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the organization this contact belongs to.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-parenttolerance" xreflabel="ParentTolerance">
<title>Element &lt;ParentTolerance&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The tolerance of the search given as a plus and minus value with units.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ToleranceType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ParentTolerance&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="6.0 ppm" unitAccession="UO:0000169" unitName="parts per million" unitCvRef="UO"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="6.0 ppm" unitAccession="UO:0000169" unitName="parts per million" unitCvRef="UO"/&gt;
&lt;/ParentTolerance&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/ParentTolerance<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001412">MS:1001412</link> (search tolerance plus value) only once<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001413">MS:1001413</link> (search tolerance minus value) only once</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001412" name="search tolerance plus value" value="10.0" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001413" name="search tolerance minus value" value="10.0" unitAccession="UO:0000221" unitName="dalton" unitCvRef="UO"/&gt;</programlisting>
</section>
<section xml:id="element-peptide" xreflabel="Peptide">
<title>Element &lt;Peptide&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
One (poly)peptide (a sequence with modifications). The combination of Peptide sequence and
modifications MUST be unique in the file.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> PeptideType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-peptidesequence"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The amino acid sequence of the (poly)peptide.
If a substitution modification has been found, the original sequence should be reported.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-modification"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A molecule modification specification.
If n modifications have been found on a peptide, there should be n instances of Modification.
If multiple modifications are provided as cvParams, it is assumed that the modification is ambiguous i.e. one modification or another.
A cvParam MUST be provided with the identification of the modification sourced from a suitable CV e.g. UNIMOD.
If the modification is not present in the CV (and this will be checked by the semantic validator within a given tolerance window), there is a â€œunknown modificationâ€ CV term that MUST be used instead.
A neutral loss should be defined as an additional CVParam within Modification.
If more complex information should be given about neutral losses (such as presence/absence on particular product ions), this can additionally be encoded within the FragmentationArray.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-substitutionmodification"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A modification where one residue is substituted by another (amino acid change).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Peptide id="CCPQCCSSGCSQNLCGPLCVTTPYYCTR_##Carbamidomethyl(C):1##Carbamidomethyl(C):2##Carbamidomethyl(C):
5##Carbamidomethyl(C):6##Carbamidomethyl(C):10##Carbamidomethyl(C):15##Carbamidomethyl(C):
19##Carbamidomethyl(C):26"&gt;
&lt;PeptideSequence&gt;CCPQCCSSGCSQNLCGPLCVTTPYYCTR&lt;/PeptideSequence&gt;
&lt;Modification location="1" residues="C" monoisotopicMassDelta="57.02147"&gt;
&lt;cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"&gt;&lt;/cvParam&gt;
&lt;/Modification&gt;
&lt;Modification location="2" residues="C" monoisotopicMassDelta="57.02147"&gt;
&lt;cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"&gt;&lt;/cvParam&gt;
...
&lt;/Peptide&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/SequenceCollection/Peptide<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001355">MS:1001355</link> (peptide descriptions) one or more times</simpara>
<simpara><emphasis role="strong">Example for crosslinking:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Peptide id="54603257_54604608_2_1_p1"&gt;
    &lt;PeptideSequence&gt;KYLYEIAR&lt;/PeptideSequence&gt;
    &lt;Modification location="2" residues="Y" monoisotopicMassDelta="156.07864430999996"&gt;
        &lt;cvParam cvRef="XLMOD" accession="XLMOD:01000" name="hydrolysed BS3"/&gt;
    &lt;/Modification&gt;
    &lt;Modification location="4" residues="Y" monoisotopicMassDelta="155.094628715"&gt;
        &lt;cvParam cvRef="XLMOD" accession="XLMOD:01001" name="amidated BS3"/&gt;
    &lt;/Modification&gt;
    &lt;Modification location="2" monoisotopicMassDelta="0.0"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1002510" name="crosslink acceptor" value="1"/&gt;
    &lt;/Modification&gt;
&lt;/Peptide&gt;
&lt;Peptide id="54603257_54604608_2_1_p0"&gt;
    &lt;PeptideSequence&gt;LSVEAFEK&lt;/PeptideSequence&gt;
    &lt;Modification location="3" monoisotopicMassDelta="138.06807961"&gt;
        &lt;cvParam cvRef="XLMOD" accession="XLMOD:02000" name="BS3"/&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1002509" name="crosslink donor" value="1"/&gt;
    &lt;/Modification&gt;
&lt;/Peptide&gt;</programlisting>
</section>
<section xml:id="element-peptideevidence" xreflabel="PeptideEvidence">
<title>Element &lt;PeptideEvidence&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
PeptideEvidence links a specific Peptide element to a specific position in a DBSequence.
There MUST only be one PeptideEvidence item per Peptide-to-DBSequence-position.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> PeptideEvidenceType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>dBSequence_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the protein sequence in which the specified peptide has been linked.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>end</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The index position of the last amino acid of the peptide inside the protein sequence, where the first amino acid of the protein sequence is position 1. Must be provided unless this is a de novo search.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>frame</simpara></entry>
<entry align="left" valign="top"><simpara>allowed_frames</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The translation frame of this sequence if this is PeptideEvidence derived from nucleic acid sequence</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>isDecoy</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:boolean</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Set to true if the peptide is matched to a decoy sequence.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>peptide_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the identified (poly)peptide sequence in the Peptide element.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>post</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string with restriction
[ABCDEFGHIJKLMNOPQRSTUVWXYZ?\-]{1}</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Post flanking residue.
If the peptide is C-terminal, post="-" and not post="".
If for any reason it is unknown (e.g. denovo), post="?" should be used.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>pre</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string with restriction
[ABCDEFGHIJKLMNOPQRSTUVWXYZ?\-]{1}</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Previous flanking residue.
If the peptide is N-terminal, pre="-" and not pre="".
If for any reason it is unknown (e.g. denovo), pre="?" should be used.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>start</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Start position of the peptide inside the protein sequence, where the first amino acid of the protein sequence is position 1. Must be provided unless this is a de novo search.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>translationTable_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the translation table used if this is PeptideEvidence derived from nucleic acid sequence</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;PeptideEvidence dBSequence_ref="dbseq_generic_B_GENSCAN00000036974_REVERSED_p:genscan" peptide_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_##Carbamidomethyl(C):1##Carbamidomethyl(C): 3##Carbamidomethyl(C):12##Carbamidomethyl(C):18##Carbamidomethyl(C):25##Carbamidomethyl(C): 31##Ammonia-loss(C):1" start="494" end="531" pre="R" post="L" isDecoy="true" id="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic_B_GENSCAN00000036974_REVERSED_p:genscan_494_531"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for proteogenomics search:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;PeptideEvidence dBSequence_ref="dbseq_generic|A_ENSP00000287611|" peptide_ref="YNSQNQSNNQFVLYR_" start="44" end="58" pre="K" post="I" isDecoy="false" id="YNSQNQSNNQFVLYR_generic|A_ENSP00000287611|_44_58"&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002640" name="peptide end on chromosome" value="186717716"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002641" name="peptide exon count" value="1"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002642" name="peptide exon nucleotide sizes" value="45"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002643" name="peptide start positions on chromosome" value="186717673"/&gt;
&lt;/PeptideEvidence&gt;</programlisting>
</section>
<section xml:id="element-peptideevidenceref" xreflabel="PeptideEvidenceRef">
<title>Element &lt;PeptideEvidenceRef&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Reference to the PeptideEvidence element identified. If a specific sequence can be assigned to
multiple proteins and or positions in a protein all possible PeptideEvidence elements should be
referenced here.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> PeptideEvidenceRefType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>peptideEvidence_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the PeptideEvidenceItem element(s).</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<simpara>&lt;PeptideEvidenceRef peptideEvidence_ref="GEGGAQDGSGTEGVGATGAAGGRGAQGAPGGTGGAGSGSGLHHQQDSGYQGASGSGGAQSGGR_generic\|A_ENSP
00000352272_REVERSED\|p:known_125_187"&gt;&lt;/PeptideEvidenceRef&gt;</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1002356" name="PSM-level combined FDRScore" value="3.9523759266648643E-7"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002359" name="distinct peptide-level local FDR" value="0.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001868" name="distinct peptide-level q-value" value="0.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002360" name="distinct peptide-level FDRScore" value="3.0117913560694526E-7"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002500" name="peptide passes threshold" value="true"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002520" name="peptide group ID" value="AVMDDFAAFVEK_##Oxidation(M):3"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002439" name="final PSM list UNDER DISCUSSION"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002511" name="Crosslinked spectrum identification item." value="21"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002545" name="The xi result 'Score'." value="2.769918944845425"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1000797" name="peak list scans" value="6655"/&gt;</programlisting>
<simpara><emphasis role="strong">Example userParams:</emphasis></simpara>
</section>
<section xml:id="element-peptidehypothesis" xreflabel="PeptideHypothesis">
<title>Element &lt;PeptideHypothesis&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Peptide evidence on which this ProteinHypothesis is based by reference to a PeptideEvidence element.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> PeptideHypothesisType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>peptideEvidence_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the PeptideEvidence element on which this hypothesis is based.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectrumidentificationitemref"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Reference(s) to the SpectrumIdentificationItem element(s) that support the given PeptideEvidence element.
Using these references it is possible to indicate which spectra were actually accepted as evidence for this peptide identification in the given protein.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;PeptideHypothesis peptideEvidence_ref="PepEv_1864"&gt;
    &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_1780_1"/&gt;
    &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_2217_1"/&gt;
    &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_3245_1"/&gt;
    &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_4362_1"/&gt;
    &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_5349_1"/&gt;
    &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_5621_1"/&gt;
    ...
&lt;/PeptideHypothesis&gt;</programlisting>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1001097" cvRef="PSI-MS" value="1" name="distinct peptide sequences"/&gt;
&lt;cvParam accession="MS:1002235" cvRef="PSI-MS" value="81.01860914459425" name="ProteoGrouper:PDH score"/&gt;
&lt;cvParam accession="MS:1002401" cvRef="PSI-MS" name="leading protein"/&gt;
&lt;cvParam accession="MS:1002403" cvRef="PSI-MS" name="group representative"/&gt;
&lt;cvParam accession="MS:1001594" cvRef="PSI-MS" value="PDH_15" name="sequence same-set protein"/&gt;
&lt;cvParam accession="MS:1002415" cvRef="PSI-MS" value="true" name="protein group passes threshold"/&gt;
&lt;cvParam accession="MS:1002236" cvRef="PSI-MS" value="43.73236628236426" name="ProteoGrouper:PAG score"/&gt;
&lt;cvParam accession="MS:1002407" cvRef="PSI-MS" value="2" name="cluster identifier"/&gt;
&lt;cvParam accession="MS:1002404" cvRef="PSI-MS" value="4" name="count of identified proteins"/&gt;
&lt;cvParam accession="MS:1001596" cvRef="PSI-MS" value="PDH_239" name="sequence sub-set protein"/&gt;
&lt;cvParam accession="MS:1002402" cvRef="PSI-MS" name="non-leading protein"/&gt;
&lt;cvParam accession="MS:1001598" cvRef="PSI-MS" value="PDH_167" name="sequence subsumable protein"/&gt;
&lt;cvParam name="mascot:score" value="1416.6296969697" cvRef="PSI-MS" accession="MS:1001171"/&gt;
&lt;cvParam name="PAnalyzer:conclusive protein" cvRef="PSI-MS" accession="MS:1002213"/&gt;
&lt;cvParam name="PAnalyzer:non-conclusive protein" cvRef="PSI-MS" accession="MS:1002215"/&gt;
&lt;cvParam name="PAnalyzer:indistinguishable protein" cvRef="PSI-MS" accession="MS:1002214"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001093" name="sequence coverage" value="0.19"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002470" name="PeptideShaker protein group score" value="100.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002471" name="PeptideShaker protein group confidence" value="100.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002542" name="PeptideShaker protein confidence type" value="Confident"/&gt;</programlisting>
<simpara><emphasis role="strong">Example userParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;userParam value="IINEPTAAAIAYGLDK" name="razor peptides"/&gt;
&lt;userParam value="SLSDTLEEVLSSSGEK" name="unique peptides"/&gt;</programlisting>
</section>
<section xml:id="element-peptidesequence" xreflabel="PeptideSequence">
<title>Element &lt;PeptideSequence&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The amino acid sequence of the (poly)peptide. If a substitution modification has been found, the original sequence should be reported.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> sequence</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;PeptideSequence&gt;GEGGAQDGSGTEGVGATGAAGGRGAQGAPGGTGGAGSGSGLHHQQDSGYQGASGSGGAQSGGR&lt;/PeptideSequence&gt;</programlisting>
</section>
<section xml:id="element-person" xreflabel="Person">
<title>Element &lt;Person&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A person&#8217;s name and contact details. Any additional information such as the address, contact email etc. should be supplied using CV parameters or user parameters.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> PersonType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>firstName</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The Person&#8217;s first name.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>lastName</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The Person&#8217;s last/family name.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>midInitials</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The Person&#8217;s middle initial.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-affiliation"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The organization a person belongs to.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Person firstName="test" lastName="test" id="PROVIDER"&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="test"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000588" name="contact URL" value="test"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000589" name="contact email" value="testtest"/&gt;
    &lt;Affiliation organization_ref="ORG_DOC_OWNER"/&gt;
&lt;/Person&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AuditCollection/Person
SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000588">MS:1000588</link> (contact URL) one or more times
SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000587">MS:1000587</link> (contact address) one or more times
SHOULD supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000589">MS:1000589</link> (contact email) one or more times</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1000587" name="contact address" value="address"/&gt;
&lt;cvParam accession="MS:1000589" cvRef="PSI-MS" value="smartinez@proteored.org" name="contact email"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1000588" name="contact URL" value="test"/&gt;</programlisting>
</section>
<section xml:id="element-proteinambiguitygroup" xreflabel="ProteinAmbiguityGroup">
<title>Element &lt;ProteinAmbiguityGroup&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A set of logically related results from a protein detection, for example to represent conflicting assignments of peptides to proteins.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ProteinAmbiguityGroupType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-proteindetectionhypothesis"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single result of the ProteinDetection analysis (i.e. a protein).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinAmbiguityGroup id="PAG_1"&gt;
    &lt;ProteinDetectionHypothesis passThreshold="true" dBSequence_ref="dbseq_sp_Q64467_G3PT_MOUSE Glyceraldehyde-3-phosphate dehydrogenase, testis-specific OS=Mus..." id="PDH_11"&gt;
        &lt;PeptideHypothesis peptideEvidence_ref="PE13_2_63"&gt;
            &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_13_1"/&gt;
        &lt;/PeptideHypothesis&gt;
        &lt;cvParam accession="MS:1001097" cvRef="PSI-MS" value="1" name="distinct peptide sequences"/&gt;
        &lt;cvParam accession="MS:1002235" cvRef="PSI-MS" value="34.57557513936462" name="ProteoGrouper:PDH score"/&gt;
        ...
    &lt;/ProteinDetectionHypothesis&gt;
&lt;/ProteinAmbiguityGroup&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/ProteinDetectionList/ProteinAmbiguityGroup<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002415">MS:1002415</link> (protein group passes threshold) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001147">MS:1001147</link> (protein ambiguity group result details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001164">MS:1001164</link> (Paragon:unused protscore)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001165">MS:1001165</link> (Paragon:total protscore)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001301">MS:1001301</link> (protein rank)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002236">MS:1002236</link> (ProteoGrouper:PAG score)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002407">MS:1002407</link> (cluster identifier)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002415">MS:1002415</link> (protein group passes threshold)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002474">MS:1002474</link> (ProteoAnnotator:non-canonical gene model score)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002475">MS:1002475</link> (ProteoAnnotator:count alternative peptides)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002663">MS:1002663</link> (Morpheus:summed Morpheus score)</simpara>
<simpara><emphasis role="strong">Example for protein grouping:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinAmbiguityGroup id="PAG_4266"&gt;
    &lt;ProteinDetectionHypothesis dBSequence_ref="DBSeq_RRRRRQ7TMJ9|Q7TMJ9_MOUSE" passThreshold="true" id="PDH_RRRRRQ7TMJ9|Q7TMJ9_MOUSE_PAG_4266"&gt;
        &lt;PeptideHypothesis peptideEvidence_ref="PE_APVPPSQAR(0;144.1021)-332-340-RRRRRQ7TMJ9|Q7TMJ9_MOUSE"&gt;
            &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="2:[0,144.1021]:APVPPSQAR:index=26699"/&gt;
        &lt;/PeptideHypothesis&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1002394" name="PIA:protein score" value="2.0881360887005513"/&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1002401" name="leading protein"/&gt;
    &lt;/ProteinDetectionHypothesis&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002415" name="protein group passes threshold" value="true"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002407" name="cluster identifier" value="2814"/&gt;
&lt;/ProteinAmbiguityGroup&gt;</programlisting>
</section>
<section xml:id="element-proteindetection" xreflabel="ProteinDetection">
<title>Element &lt;ProteinDetection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
An Analysis which assembles a set of peptides (e.g. from a spectra search analysis) to proteins.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ProteinDetectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>activityDate</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:dateTime</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>When the protocol was applied.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>proteinDetectionList_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the ProteinDetectionList in the DataCollection section.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>proteinDetectionProtocol_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the detection protocol used for this ProteinDetection.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-inputspectrumidentifications"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The lists of spectrum identifications that are input to the protein detection process.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinDetection id="PD_1" activityDate="2014-01-11T19:42:49" proteinDetectionList_ref="PDL_PAnalyzer" proteinDetectionProtocol_ref="PDP_PAnalyzer"&gt;
    &lt;InputSpectrumIdentifications spectrumIdentificationList_ref="SIL_1"/&gt;
&lt;/ProteinDetection&gt;</programlisting>
</section>
<section xml:id="element-proteindetectionhypothesis" xreflabel="ProteinDetectionHypothesis">
<title>Element &lt;ProteinDetectionHypothesis&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A single result of the ProteinDetection analysis (i.e. a protein).</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ProteinDetectionHypothesisType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>dBSequence_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the corresponding DBSequence entry.
Note - this attribute was optional in mzIdentML 1.1 but is now mandatory in mzIdentML 1.2. Consuming software should assume that the DBSequence entry referenced here is the definitive identifier for the protein.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>passThreshold</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:boolean</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>Set to true if the producers of the file has deemed that the ProteinDetectionHypothesis has passed a given threshold or been validated as correct.
If no such threshold has been set, value of true should be given for all results.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-peptidehypothesis"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Peptide evidence on which this ProteinHypothesis is based by reference to a PeptideEvidence element.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinDetectionHypothesis passThreshold="true" dBSequence_ref="dbseq_tr_Q3V2I5_Q3V2I5_MOUSE Glyceraldehyde-3-phosphate dehydrogenase (Fragment) OS=Mus..." id="PDH_10"&gt;
    &lt;PeptideHypothesis peptideEvidence_ref="PE13_2_62"&gt;
        &lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_13_1"/&gt;
    &lt;/PeptideHypothesis&gt;
    &lt;cvParam accession="MS:1001097" cvRef="PSI-MS" value="1" name="distinct peptide sequences"/&gt;
    &lt;cvParam accession="MS:1002235" cvRef="PSI-MS" value="34.57557513936462" name="ProteoGrouper:PDH score"/&gt;
    &lt;cvParam accession="MS:1001594" cvRef="PSI-MS" value="PDH_11" name="sequence same-set protein"/&gt;
    ...
&lt;/ProteinDetectionHypothesis&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/ProteinDetectionList/ProteinAmbiguityGroup/ProteinDetection
Hypothesis<?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002403">MS:1002403</link> (group representative) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001116">MS:1001116</link> (single protein result details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001088">MS:1001088</link> (protein description)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001093">MS:1001093</link> (sequence coverage)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001097">MS:1001097</link> (distinct peptide sequences)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001098">MS:1001098</link> (confident distinct peptide sequences)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001099">MS:1001099</link> (confident peptide qualification)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001100">MS:1001100</link> (confident peptide sequence number)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001125">MS:1001125</link> (manual validation)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001157">MS:1001157</link> (SEQUEST:sp)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001158">MS:1001158</link> (SEQUEST:Uniq) WARNING: Term has no definition!<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001169">MS:1001169</link> (Paragon:expression change p-value)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001116">et al.</link><?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002402">MS:1002402</link> (non-leading protein) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001153">MS:1001153</link> (search engine specific score) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001154">MS:1001154</link> (SEQUEST:probability)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001155">MS:1001155</link> (SEQUEST:xcorr)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001156">MS:1001156</link> (SEQUEST:deltacn)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001157">MS:1001157</link> (SEQUEST:sp)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001158">MS:1001158</link> (SEQUEST:Uniq) WARNING: Term has no definition!<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001159">MS:1001159</link> (SEQUEST:expectation value)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001160">MS:1001160</link> (SEQUEST:sf)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001161">MS:1001161</link> (SEQUEST:matched ions)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001162">MS:1001162</link> (SEQUEST:total ions)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001163">MS:1001163</link> (SEQUEST:consensus score)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001153">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001060">MS:1001060</link> (quality estimation method details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001058">MS:1001058</link> (quality estimation by manual validation)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001194">MS:1001194</link> (quality estimation with decoy database)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001447">MS:1001447</link> (prot:FDR threshold)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001448">MS:1001448</link> (pep:FDR threshold)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001454">MS:1001454</link> (quality estimation with implicite decoy sequences)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001494">MS:1001494</link> (no threshold)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001574">MS:1001574</link> (report only spectra assigned to identified proteins)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001060">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001101">MS:1001101</link> (protein group or subset relationship) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001591">MS:1001591</link> (anchor protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001592">MS:1001592</link> (family member protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001593">MS:1001593</link> (group member with undefined relationship OR ortholog protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001594">MS:1001594</link> (sequence same-set protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001595">MS:1001595</link> (spectrum same-set protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001596">MS:1001596</link> (sequence sub-set protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001597">MS:1001597</link> (spectrum sub-set protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001598">MS:1001598</link> (sequence subsumable protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001599">MS:1001599</link> (spectrum subsumable protein)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002213">MS:1002213</link> (PAnalyzer:conclusive protein)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001101">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of MS:1002664 (interaction score derived from cross-linking) one or more times<?asciidoc-br?>
e.g.: MS:1002677 (residue-pair-level global FDR)<?asciidoc-br?>
e.g.: MS:1002676 (protein-pair-level global FDR)<?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002401">MS:1002401</link> (leading protein) only once</simpara>
<simpara><emphasis role="strong">Example for protein grouping:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinDetectionHupothesis&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002394" name="PIA:protein score" value="107.73038501509386"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002401" name="leading protein"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001594" name="sequence same-set protein" value="PDH_Q3TWF2_PAG_2106 PDH_Q3U7T8_PAG_2106 PDH_Q3U9G2_PAG_2106"/&gt;
&lt;/ProteinDetectionHupothesis&gt;</programlisting>
</section>
<section xml:id="element-proteindetectionlist" xreflabel="ProteinDetectionList">
<title>Element &lt;ProteinDetectionList&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The protein list resulting from a protein detection process.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ProteinDetectionListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-proteinambiguitygroup"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A set of logically related results from a protein detection, for example to represent conflicting assignments of peptides to proteins.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinDetectionList id="PDL_1" xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
&lt;ProteinAmbiguityGroup id="PAG_0"&gt;
&lt;ProteinDetectionHypothesis passThreshold="true" dBSequence_ref="dbseq_sp_P16627_HS71L_MOUSE Heat shock 70 kDa protein 1-like OS=Mus musculus GN=Hspa1l..." id="PDH_15"&gt;
&lt;PeptideHypothesis peptideEvidence_ref="PE16_2_69"&gt;
&lt;SpectrumIdentificationItemRef spectrumIdentificationItem_ref="SII_16_1"/&gt;
&lt;/PeptideHypothesis&gt;
&lt;PeptideHypothesis peptideEvidence_ref="PE17_2_107"&gt;
...
&lt;/PeptideHypothesis&gt;
&lt;/ProteinDetectionHypothesis&gt;
&lt;/ProteinDetectionList&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/ProteinDetectionList<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001184">MS:1001184</link> (search statistics) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001035">MS:1001035</link> (date / time search performed)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001036">MS:1001036</link> (search time taken)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001177">MS:1001177</link> (number of molecular hypothesis considered)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002404">MS:1002404</link> (count of identified proteins)<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002404">MS:1002404</link> (count of identified proteins) only once</simpara>
</section>
<section xml:id="element-proteindetectionprotocol" xreflabel="ProteinDetectionProtocol">
<title>Element &lt;ProteinDetectionProtocol&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The parameters and settings of a ProteinDetection process.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ProteinDetectionProtocolType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>analysisSoftware_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The protein detection software used, given as a reference to the SoftwareCollection section.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-analysisparams"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The parameters and settings for the protein detection given as CV terms.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-threshold"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The threshold(s) applied to determine that a result is significant.
If multiple terms are used it is assumed that all conditions are satisfied by the passing results.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image7.png" contentwidth="566" contentdepth="336"/>
</imageobject>
<textobject><phrase>http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/ProteinDetectionProtocol.png</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;ProteinDetectionProtocol id="PDP_PAnalyzer" analysisSoftware_ref="PAnalyzer"&gt;
    &lt;AnalysisParams&gt;
        &lt;cvParam name="mascot:SigThreshold" value="0.05" cvRef="PSI-MS" accession="MS:1001316"/&gt;
        &lt;cvParam name="mascot:MaxProteinHits" value="Auto" cvRef="PSI-MS" accession="MS:1001317"/&gt;
        &lt;cvParam name="mascot:ProteinScoringMethod" value="MudPIT" cvRef="PSI-MS" accession="MS:1001318"/&gt;
        &lt;cvParam name="mascot:MinMSMSThreshold" value="0" cvRef="PSI-MS" accession="MS:1001319"/&gt;
        &lt;cvParam name="mascot:ShowHomologousProteinsWithSamePeptides" value="1" cvRef="PSI-MS" accession="MS:1001320"/&gt;
        ...
    &lt;/AnalysisParams&gt;
&lt;/ProteinDetectionProtocol&gt;</programlisting>
</section>
<section xml:id="element-provider" xreflabel="Provider">
<title>Element &lt;Provider&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The Provider of the mzIdentML record in terms of the contact and software.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ProviderType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>analysisSoftware_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The Software that produced the document instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-contactrole"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The Contact that provided the document instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Provider xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="PROVIDER"&gt;
    &lt;ContactRole contact_ref="PERSON_DOC_OWNER"&gt;
        &lt;Role&gt;
            &lt;cvParam accession="MS:1001271" cvRef="PSI-MS" name="researcher"/&gt;
        &lt;/Role&gt;
    &lt;/ContactRole&gt;
&lt;/Provider&gt;</programlisting>
</section>
<section xml:id="element-residue" xreflabel="Residue">
<title>Element &lt;Residue&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The specification of a single residue within the mass table.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ResidueType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>code</simpara></entry>
<entry align="left" valign="top"><simpara>chars</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The single letter code for the residue.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>mass</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:float</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The residue mass in Daltons (not including any fixed modifications).</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-role" xreflabel="Role">
<title>Element &lt;Role&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The roles (lab equipment sales, contractor, etc.) the Contact fills.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> RoleType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Role&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001267" name="software vendor"/&gt;
&lt;/Role&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/Provider/ContactRole/Role<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001266">MS:1001266</link> (role type) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001267">MS:1001267</link> (software vendor)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001268">MS:1001268</link> (programmer)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001269">MS:1001269</link> (instrument vendor)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001270">MS:1001270</link> (lab personnel)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001271">MS:1001271</link> (researcher)</simpara>
<simpara>Path /MzIdentML/AnalysisSoftwareList/AnalysisSoftware/ContactRole/Role<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001266">MS:1001266</link> (role type) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001267">MS:1001267</link> (software vendor)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001268">MS:1001268</link> (programmer)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001269">MS:1001269</link> (instrument vendor)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001270">MS:1001270</link> (lab personnel)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001271">MS:1001271</link> (researcher)</simpara>
<simpara>Path /MzIdentML/AnalysisSampleCollection/Sample/ContactRole/Role<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001266">MS:1001266</link> (role type) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001267">MS:1001267</link> (software vendor)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001268">MS:1001268</link> (programmer)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001269">MS:1001269</link> (instrument vendor)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001270">MS:1001270</link> (lab personnel)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001271">MS:1001271</link> (researcher)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001271" name="researcher"/&gt;
&lt;cvParam accession="MS:1001267" cvRef="PSI-MS" name="software vendor"/&gt;</programlisting>
</section>
<section xml:id="element-sample" xreflabel="Sample">
<title>Element &lt;Sample&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A description of the sample analysed by mass spectrometry using CVParams or UserParams. If a composite sample has been analysed, a parent sample should be defined, which references subsamples. This represents any kind of substance used in an experimental workflow, such as whole organisms, cells, DNA, solutions, compounds and experimental substances (gels, arrays etc.).</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SampleType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-contactrole"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The Contact that provided the document instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-subsample"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>References to the individual component samples within a mixed parent sample.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-searchdatabase" xreflabel="SearchDatabase">
<title>Element &lt;SearchDatabase&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A database for searching mass spectra. Examples include a set of amino acid sequence entries, nucleotide databases (e.g. 6 frame translated) or annotated spectra libraries.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SearchDatabaseType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>location</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:anyURI</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The location of the data file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>numDatabaseSequences</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:long</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The total number of sequences in the database.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>numResidues</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:long</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The number of residues in the database.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>releaseDate</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:dateTime</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The date and time the database was released to the public; omit this attribute when the date and time are unknown or not applicable (e.g. custom databases).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>version</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The version of the database.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-externalformatdocumentation"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A URI to access documentation and tools to interpret the external format of the ExternalData instance.
For example, XML Schema or static libraries (APIs) to access binary formats.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fileformat"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The format of the ExternalData file, for example "tiff" for image files.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-databasename"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The database name may be given as a cvParam if it maps exactly to one of the release databases listed in the CV, otherwise a userParam should be used.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SearchDatabase numDatabaseSequences="40400" location="C:\Users\hba041\My_Git_Applications\peptide-shaker.wiki\data\2016_04_05\uniprot-human-reviewed-trypsin-april-2016_concatenated_target_decoy.fasta" id="SearchDB_1"&gt;
    &lt;FileFormat&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001348" name="FASTA format"/&gt;
    &lt;/FileFormat&gt;
    &lt;DatabaseName&gt;
        &lt;userParam name="uniprot-human-reviewed-trypsin-april-2016_concatenated_target_decoy.fasta"/&gt;
    &lt;/DatabaseName&gt;
    ...
&lt;/SearchDatabase&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/Inputs/SearchDatabase<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000561">MS:1000561</link> (data file checksum type) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000568">MS:1000568</link> (MD5)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000569">MS:1000569</link> (SHA-1)<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001011">MS:1001011</link> (search database details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001014">MS:1001014</link> (database local file path)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001015">MS:1001015</link> (database original uri)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001016">MS:1001016</link> (database version)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001017">MS:1001017</link> (database release date)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001020">MS:1001020</link> (DB filter taxonomy)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001021">MS:1001021</link> (DB filter on accession numbers)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001022">MS:1001022</link> (DB MW filter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001023">MS:1001023</link> (DB PI filter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001024">MS:1001024</link> (translation frame)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001025">MS:1001025</link> (translation table)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001011">et al.</link></simpara>
</section>
<section xml:id="element-searchdatabaseref" xreflabel="SearchDatabaseRef">
<title>Element &lt;SearchDatabaseRef&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
One of the search databases used.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SearchDatabaseRefType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>searchDatabase_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the database searched.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SearchDatabaseRef searchDatabase_ref="SDB_4299_203"/&gt;</programlisting>
</section>
<section xml:id="element-searchmodification" xreflabel="SearchModification">
<title>Element &lt;SearchModification&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Specification of a search modification as parameter for a spectra search. Contains the name of the modification, the mass, the specificity and whether it is a static modification.
It MAY provide the CV term “search modification id” (MS:1003392) once, to provide an identifier for this SearchModification (<xref linkend="linking-searchmodification-elements-to-modification-elements"/>). This identifier MUST be unique within the &lt;SpectrumIdentificationProtocol&gt; element. If there are multiple &lt;SpectrumIdentificationProtocol&gt; elements with in the file, then the values of “search modification id” SHOULD be identical for identical modifications.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SearchModificationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>fixedMod</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:boolean</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>True, if the modification is static (i.e. occurs always).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>massDelta</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:float</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The mass delta of the searched modification in Daltons.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>residues</simpara></entry>
<entry align="left" valign="top"><simpara>listOfCharsOrAny</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The residue(s) searched with the specified modification.
For N or C terminal modifications that can occur on any residue, the . character should be used to specify any, otherwise the list of amino acids should be provided.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-specificityrules"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The specificity rules of the searched modification including for example the probability of a modification&#8217;s presence or peptide or protein termini.
Standard fixed or variable status should be provided by the attribute fixedMod.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SearchModification residues="E" massDelta="-18.010565" fixedMod="false"&gt;
    &lt;SpecificityRules&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001189" name="modification specificity peptide N-term"/&gt;
    &lt;/SpecificityRules&gt;
    &lt;cvParam cvRef="UNIMOD" accession="UNIMOD:27" name="Glu-&amp;gt;pyro-Glu"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002504" name="modification index" value="3"/&gt;
&lt;/SearchModification&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/ModificationParams/SearchModification<?asciidoc-br?>
MAY supply term <phrase role="underline">MS:1003392</phrase> (search modification id) only once<?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002509"><phrase role="underline">MS:1002509</phrase></link> (crosslink donor) only once<?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002510"><phrase role="underline">MS:1002510</phrase></link> (crosslink acceptor) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/UNIMOD:0"><phrase role="underline">UNIMOD:0</phrase></link> (unimod root node) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001471"><phrase role="underline">MS:1001471</phrase></link> (peptide modification details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001460"><phrase role="underline">MS:1001460</phrase></link> (unknown modification)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001524"><phrase role="underline">MS:1001524</phrase></link> (fragment neutral loss)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001525"><phrase role="underline">MS:1001525</phrase></link> (precursor neutral loss)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001972"><phrase role="underline">MS:1001972</phrase></link> (PTM scoring algorithm version)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002028"><phrase role="underline">MS:1002028</phrase></link> (nucleic acid base modification)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002029"><phrase role="underline">MS:1002029</phrase></link> (original nucleic acid sequence)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002030"><phrase role="underline">MS:1002030</phrase></link> (modified nucleic acid sequence)<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001460"><phrase role="underline">MS:1001460</phrase></link> (unknown modification) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/XLMOD:00002"><phrase role="underline">XLMOD:00002</phrase></link> (cross-linker related PTM) only once<?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002504"><phrase role="underline">MS:1002504</phrase></link> (modification index) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/XLMOD:00004"><phrase role="underline">XLMOD:00004</phrase></link> (cross-linker) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MOD:00000"><phrase role="underline">MOD:00000</phrase></link> (protein modification) only once</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="UNIMOD" accession="UNIMOD:35" name="Oxidation"/&gt;
&lt;cvParam cvRef="UNIMOD" accession="UNIMOD:4" name="Carbamidomethyl"/&gt;
&lt;cvParam accession="UNIMOD:214" cvRef="UNIMOD" name="iTRAQ4plex"/&gt;
&lt;cvParam accession="MS:1001189" cvRef="PSI-MS" name="modification specificity peptide N-term"/&gt;
&lt;cvParam accession="UNIMOD:39" cvRef="UNIMOD" name="Methylthio"/&gt;
&lt;cvParam accession="UNIMOD:7" cvRef="UNIMOD" name="Deamidated"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002504" name="modification index" value="0"/&gt;
&lt;cvParam cvRef="XLMOD" accession="XLMOD:02000" name="BS3"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002509" name="crosslink donor" value="0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002510" name="crosslink acceptor" value="0"/&gt;
&lt;cvParam cvRef="XLMOD" accession="XLMOD:02004" name="BS3-d4"/&gt;
&lt;cvParam cvRef="XLMOD" accession="XLMOD:01001" name="amidated BS3"/&gt;
&lt;cvParam cvRef="XLMOD" accession="XLMOD:01000" name="hydrolyzed BS3"/&gt;
&lt;cvParam cvRef="UNIMOD" accession="UNIMOD:1020" name="Xlink:DSS"/&gt;
&lt;cvParam cvRef="XLMOD" accession="XLMOD:01009" name="amidated BS3-d4"/&gt;
&lt;cvParam cvRef="XLMOD" accession="XLMOD:01008" name="hydrolyzed BS3-d4"/&gt;</programlisting>
</section>
<section xml:id="element-searchtype" xreflabel="SearchType">
<title>Element &lt;SearchType&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The type of search performed e.g. PMF, Tag searches, MS-MS</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SearchType&gt;
    &lt;cvParam accession="MS:1001083" cvRef="PSI-MS" value="" name="ms-ms search"/&gt;
&lt;/SearchType&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/SearchType<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001080">MS:1001080</link> (search type) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001010">MS:1001010</link> (de novo search)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001031">MS:1001031</link> (spectral library search)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001081">MS:1001081</link> (pmf search)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001082">MS:1001082</link> (tag search)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001083">MS:1001083</link> (ms-ms search)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001584">MS:1001584</link> (combined pmf + ms-ms search)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002490">MS:1002490</link> (peptide-level scoring)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002491">MS:1002491</link> (modification localization scoring)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002492">MS:1002492</link> (consensus scoring)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002493">MS:1002493</link> (sample pre-fractionation)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001080">et al.</link></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001083" name="ms-ms search"/&gt;</programlisting>
</section>
<section xml:id="element-seq" xreflabel="Seq">
<title>Element &lt;Seq&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The actual sequence of amino acids or nucleic acid.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> sequence</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Seq&gt;MMKFTVVAAALLLLGAVRAEEEDKKEDVGTVVGIDLGTTYSCVGVFKNGRVEIIANDQGNRITPSYVAFTPEGERLIGDAAKNQLTSNPENTVFDA KRLIGRTWNDPSVQQDIKFLPFKVVEKKTKPYIQVDIGGGQTKTFAPEEISAMVLTKMKETAEAYLGKKVTHAVVTVPAYFNDAQRQATKDAGTIAGLNVM RIINEPTAAAIAYGLDKREGEKNILVFDLGGGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGKDVRKDNRAVQKLRREVEKAKR ALSSQHQARIEIESFFEREDFSETLTRAKFEELNMDLFRSTMKPVQKVLEDSDLKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEAVAYGA AVQAGVLSGGQDTGDLVLLDVCPLTLGIETVGGVMTKLIPRNTVVPTKKSQIFSTASDNQPTVTIKVYEGERPLTKDNHLLGTFDLTGIPPAPRGVPQIEV TFEIDVNGILRVTAEDKGTGNKNKITITNDQNRLTPEEIERMVNDAEKFAEEDKKLKERIDTRNELESYAYSLKNQIGDKEKLGGKLSSEDKETMEKAVEE KIEWLESHQDADIEDFKAKKKELEEIVQPIISKLYGSGGPPPTGEEDTSEKDEL&lt;/Seq&gt;</programlisting>
</section>
<section xml:id="element-sequencecollection" xreflabel="SequenceCollection">
<title>Element &lt;SequenceCollection&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The collection of sequences (DBSequence or Peptide) identified and their relationship between each other (PeptideEvidence) to be referenced elsewhere in the results.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SequenceCollectionType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-dbsequence"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A database sequence from the specified SearchDatabase (nucleic acid or amino acid).
If the sequence is nucleic acid, the source nucleic acid sequence should be given in the seq attribute rather than a translated sequence.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-peptide"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>One (poly)peptide (a sequence with modifications).
The combination of Peptide sequence and modifications MUST be unique in the file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-peptideevidence"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>PeptideEvidence links a specific Peptide element to a specific position in a DBSequence.
There MUST only be one PeptideEvidence item per Peptide-to-DBSequence-position.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image8.png" contentwidth="576" contentdepth="652"/>
</imageobject>
<textobject><phrase>http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/SequenceCollection.png</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SequenceCollection xmlns="http://psidev.info/psi/pi/mzIdentML/1.2"&gt;
    &lt;DBSequence accession="sp_Q64467_G3PT_MOUSE Glyceraldehyde-3-phosphate dehydrogenase, testis-specific OS=Mus..." searchDatabase_ref="SearchDB_1" length="440" name="sp_Q64467_G3PT_MOUSE Glyceraldehyde-3-phosphate dehydrogenase, testis-specific OS=Mus musculus GN=Gapdhs PE=1 SV=1" id="dbseq_sp_Q64467_G3PT_MOUSE Glyceraldehyde-3-phosphate dehydrogenase, testis-specific OS=Mus..."&gt;
        &lt;Seq&gt;MSRRDVVLTNVTVVQLRRDRCPCPCPCPCPCPCPVIRPPPPKLEDPPPTVEEQPPPPPPPPPPPPPPPPPPPPQIEPDKFEEAPPPPP PPPPPPPPPP PPLQKPARELTVGINGFGRIGRLVLRVCMEKGIRVVAVNDPFIDPEYMVYMFKYDSTHGRYKGNVEHKNGQLVVDNLEINTYQCKDPKEIPWSSIGNPYVVEC TGVYLSIEAASAHISSGARRVVVTAPSPDAPMFVMGVNEKDYNPGSMTIVSNASCTTNCLAPLAKVIHENFGIVEGLMTTVHSYTATQKTVDGPSKKDWRGGR GAHQNIIPSSTGAAKAVGKVIPELKGKLTGMAFRVPTPNVSVVDLTCRLAKPASYSAITEAVKAAAKGPLAGILAYTEDQVVSTDFNGNPHSSIFDAKAGIAL NDNFVKLVAWYDNEYGYSNRVVDLLRYMFSREK&lt;/Seq&gt;
    &lt;/DBSequence&gt;
    &lt;DBSequence accession="tr_Q3UEM8_Q3UEM8_MOUSE Putative uncharacterized protein (Fragment) OS=Musmusculus..." searchDatabase_ref="SearchDB_1" length="520" name="tr_Q3UEM8_Q3UEM8_MOUSE Putative uncharacterized protein (Fragment) OS=Mus musculus GN=Hspa5 PE=2 SV=1" id="dbseq_tr_Q3UEM8_Q3UEM8_MOUSE Putative uncharacterized protein (Fragment) OS=Mus musculus..."&gt;
        &lt;Seq&gt;MMKFTVVAAALLLLGAVRAEEEDKKEDVGTVVGIDLGTTYSCVGVFKNGRVEIIANDQGNRITPSYVAFTPEGERLIGDAAKNQLTSNP ENTVFDAK RLIGRTWNDPSVQQDIKFLPFKVVEKKTKPYIQVDIGGGQTKTFAPEEISAMVLTKMKETAEAYLGKKVTHAVVTVPAYFNDAQRQATKDAGTIAGLNVMRI INEPTAAAIAYGLDKREGEKNILVFDLGGGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHFIKLYKKKTGKDVRKDNRAVQKLRREVEKAKRALS SQHQARIEIESFFEGEDFSETLTRAKFEELNMDLFRSTIKPVQKVLEDSDLKKSDIDEIVLVGGSTRIPKIQQLVKEFFNGKEPSRGINPDEAVAYGAAVQA GVLSGDQDTGDLVLLDVCPLTLGIETVGGVMTKLIPRNTVVPTKKSQIFSTASDNQPTVTIKVYEGERPLTKDNHLLGTFDLTGIPPAPRGVPQIEVTFEID VNGILRVTAEDKGTG&lt;/Seq&gt;
    &lt;/DBSequence&gt;
    ...
&lt;/SequenceCollection&gt;</programlisting>
</section>
<section xml:id="element-siteregexp" xreflabel="SiteRegexp">
<title>Element &lt;SiteRegexp&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Regular expression for specifying the enzyme cleavage site.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> xsd:string</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-softwarename" xreflabel="SoftwareName">
<title>Element &lt;SoftwareName&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The name of the analysis software package, sourced from a CV if available.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SoftwareName&gt;
    &lt;cvParam accession="MS:1002244" cvRef="PSI-MS" name="mzidLib:FalseDiscoveryRate"/&gt;
&lt;/SoftwareName&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisSoftwareList/AnalysisSoftware/SoftwareName<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001456">MS:1001456</link> (analysis software) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000532">MS:1000532</link> (Xcalibur)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000533">MS:1000533</link> (Bioworks)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000534">MS:1000534</link> (MassLynx)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000535">MS:1000535</link> (FlexAnalysis)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000536">MS:1000536</link> (Data Explorer)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000537">MS:1000537</link> (4700 Explorer)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000539">MS:1000539</link> (Voyager Biospectrometry Workstation System)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000551">MS:1000551</link> (Analyst)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000600">MS:1000600</link> (Proteios)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000601">MS:1000601</link> (ProteinLynx Global Server)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001456">et al.</link></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001475" name="OMSSA"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002237" name="mzidLib"/&gt;
&lt;cvParam accession="MS:1001476" cvRef="PSI-MS" name="X\!Tandem"/&gt;
&lt;cvParam accession="MS:1002239" cvRef="PSI-MS" name="mzidLib:Tandem2Mzid"/&gt;
&lt;cvParam accession="MS:1002244" cvRef="PSI-MS" name="mzidLib:FalseDiscoveryRate"/&gt;
&lt;cvParam accession="MS:1002242" cvRef="PSI-MS" name="mzidLib:Thresholder"/&gt;
&lt;cvParam accession="MS:1002241" cvRef="PSI-MS" name="mzidLib:ProteoGrouper"/&gt;
&lt;cvParam accession="MS:1002048" cvRef="PSI-MS" name="MS-GF+"/&gt;
&lt;cvParam accession="MS:1001207" cvRef="PSI-MS" name="Mascot"/&gt;
&lt;cvParam accession="MS:1001478" cvRef="PSI-MS" name="Mascot Parser"/&gt;
&lt;cvParam accession="MS:1002076" cvRef="PSI-MS" name="PAnalyzer"/&gt;
&lt;cvParam accession="MS:1001456" cvRef="PSI-MS" name="analysis software"/&gt;
&lt;cvParam accession="MS:1000752" cvRef="PSI-MS" name="TOPP software"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002458" name="PeptideShaker"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002544" name="xiFDR"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002543" name="xiFDR"/&gt;</programlisting>
</section>
<section xml:id="element-sourcefile" xreflabel="SourceFile">
<title>Element &lt;SourceFile&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A file from which this mzIdentML instance was created.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SourceFileType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>location</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:anyURI</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The location of the data file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-externalformatdocumentation"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A URI to access documentation and tools to interpret the external format of the ExternalData instance.
For example, XML Schema or static libraries (APIs) to access binary formats.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fileformat"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The format of the ExternalData file, for example "tiff" for image files.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SourceFile location="C:\Users\hba041\My_Git_Applications\peptide-shaker.wiki\data\2016_04_05\.PeptideShaker_unzip_temp\searchgui_out_PeptideShaker_temp\qExactive01819 .t.xml" id="SourceFile_2"&gt;
    &lt;FileFormat&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001401" name="X!Tandem xml format"/&gt;
    &lt;/FileFormat&gt;
&lt;/SourceFile&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/Inputs/SourceFile<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000561">MS:1000561</link> (data file checksum type) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000568">MS:1000568</link> (MD5)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000569">MS:1000569</link> (SHA-1)</simpara>
</section>
<section xml:id="element-specificityrules" xreflabel="SpecificityRules">
<title>Element &lt;SpecificityRules&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The specificity rules of the searched modification including for example the probability of a
modification&#8217;s presence or peptide or protein termini. Standard fixed or variable status should
be provided by the attribute fixedMod.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpecificityRulesType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpecificityRules&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002057" name="modification specificity protein N- term"/&gt;
&lt;/SpecificityRules&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/ModificationParams/SearchModification/SpecificityRules<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001056">MS:1001056</link> (modification specificity rule) only once<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001189">MS:1001189</link> (modification specificity peptide N-term)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001190">MS:1001190</link> (modification specificity peptide C-term)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001875">MS:1001875</link> (modification motif)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001876">MS:1001876</link> (modification probability)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002057">MS:1002057</link> (modification specificity protein N-term)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1002058">MS:1002058</link> (modification specificity protein C-term)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1001189" cvRef="PSI-MS" name="modification specificity peptide N-term"/&gt;
&lt;cvParam accession="MS:1002057" cvRef="PSI-MS" name="modification specificity protein N-term"/&gt;</programlisting>
</section>
<section xml:id="element-spectradata" xreflabel="SpectraData">
<title>Element &lt;SpectraData&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A data set containing spectra data (consisting of one or more spectra).</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectraDataType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>location</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:anyURI</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The location of the data file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-externalformatdocumentation"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A URI to access documentation and tools to interpret the external format of the ExternalData instance.
For example, XML Schema or static libraries (APIs) to access binary formats.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fileformat"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The format of the ExternalData file, for example "tiff" for image files.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectrumidformat"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The format of the spectrum identifier within the source file</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectraData location="E:\Work\PSI\mzIdentML\ProteinInference\Rosetta2\Peaklistswithecoli\Rosetta2a_Ecoli_spectra .mgf" name="Rosetta2a_Ecoli_spectra.mgf" id="SID_1"&gt;
    &lt;FileFormat&gt;
        &lt;cvParam accession="MS:1001062" cvRef="PSI-MS" name="Mascot MGF format"/&gt;
    &lt;/FileFormat&gt;
    &lt;SpectrumIDFormat&gt;
        &lt;cvParam accession="MS:1000774" cvRef="PSI-MS" name="multiple peak list nativeID format"/&gt;
    &lt;/SpectrumIDFormat&gt;
    ...
&lt;/SpectraData&gt;</programlisting>
</section>
<section xml:id="element-spectrumidentification" xreflabel="SpectrumIdentification">
<title>Element &lt;SpectrumIdentification&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
An Analysis which tries to identify peptides in input spectra, referencing the database searched, the input spectra, the output results and the protocol that is run.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIdentificationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>activityDate</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:dateTime</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>When the protocol was applied.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>spectrumIdentificationList_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the &lt;SpectrumIdentificationList&gt; produced by this analysis in the &lt;DataCollection&gt; section.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>spectrumIdentificationProtocol_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the search protocol used for this SpectrumIdentification.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-inputspectra"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>One of the spectra data sets used.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-searchdatabaseref"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>One of the search databases used.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentification spectrumIdentificationProtocol_ref="SearchProtocol_1_4299" spectrumIdentificationList_ref="SII_LIST_1_1_4299_120114_20_Orbi2_ZC_QC_220_HSAd0-d4-1to1-3_Din.raw" id="SpecIdent__4299_120114_20_Orbi2_ZC_QC_220_HSAd0-d4-1to1-3_Din.raw"&gt;
    &lt;InputSpectra spectraData_ref="SD_4299_120114_20_Orbi2_ZC_QC_220_HSAd0-d4-1to1-3_Din.raw"/&gt;
    &lt;SearchDatabaseRef searchDatabase_ref="SDB_4299_203"/&gt;
&lt;/SpectrumIdentification&gt;</programlisting>
</section>
<section xml:id="element-spectrumidentificationitem" xreflabel="SpectrumIdentificationItem">
<title>Element &lt;SpectrumIdentificationItem&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
An identification of a single (poly)peptide, resulting from querying an input spectra, along with
the set of confidence values for that identification. PeptideEvidence elements should be given
for all mappings of the corresponding Peptide sequence within protein sequences.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIdentificationItemType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>calculatedMassToCharge</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:double</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The theoretical mass-to-charge value calculated for the peptide in Daltons / charge.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>calculatedPI</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:float</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The calculated isoelectric point of the (poly)peptide, with relevant modifications included.
Do not supply this value if the PI cannot be calcuated properly.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>chargeState</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The charge state of the identified peptide.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>experimentalMassToCharge</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:double</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The mass-to-charge value measured in the experiment in Daltons / charge.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>massTable_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>A reference SHOULD be given to the MassTable used to calculate the sequenceMass only if more than one MassTable has been given.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>passThreshold</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:boolean</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>Set to true if the producers of the file has deemed that the identification has passed a given threshold or been validated as correct.
If no such threshold has been set, value of true should be given for all results.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>peptide_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the identified (poly)peptide sequence in the Peptide element.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>rank</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:int</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>For an MS/MS result set, this is the rank of the identification quality as scored by the search engine. 1 is the top rank.
If multiple identifications have the same top score, they should all be assigned rank =1. For PMF data, the rank attribute may be meaningless and values of rank = 0 should be given.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>sample_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>A reference should be provided to link the SpectrumIdentificationItem to a Sample if more than one sample has been described in the AnalysisSampleCollection.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-peptideevidenceref"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>Reference to the PeptideEvidence element identified.
If a specific sequence can be assigned to multiple proteins and or positions in a protein all possible PeptideEvidence elements should be referenced here.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fragmentation"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The product ions identified in this result.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<simpara><inlinemediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image9.png" contentwidth="576" contentdepth="326"/>
</imageobject>
<textobject><phrase><link xl:href="http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/SpectrumIdentificationItem.png">http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/SpectrumIdentificationItem.png</link></phrase></textobject>
</inlinemediaobject></simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationItem passThreshold="true" rank="1" peptide_ref="NMGGPYGGGNYGPGGSGGSGGYGGR_15.99491461956-ATAA-2" calculatedMassToCharge="1103.45377262
38118" experimentalMassToCharge="1103.454029541017" chargeState="2" id="SII_1920_1"&gt;
&lt;PeptideEvidenceRef peptideEvidence_ref="PepEv_5164"/&gt;
&lt;Fragmentation&gt;
&lt;IonType charge="1" index="1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23"&gt;
&lt;FragmentArray measure_ref="Measure_MZ" values="175.1193695 232.1403961 289.1618958
452.2268982 509.2429504 566.2701416 653.2999268 710.3218994 767.3411865 854.3770752 911.3968506
968.4257813 1065.472168 1122.492432 1285.553833 1399.612305 1456.62561 1513.645874 1570.669067
1733.721191 1830.793213 1887.808105 1944.829834"/&gt;
&lt;FragmentArray measure_ref="Measure_Int" values="5939.5844726563 4933.5014648438
13310.7265625 5077.6694335938 5685.9287109375 13253.552734375 7620.0947265625 7724.3696289063
16868.541015625 10552.126953125 11589.0576171875 7839.9741210938 47821.64453125 60335.71484375
21759.3984375 8742.5595703125 11512.0908203125 18130.890625 30577.375 3801.3923339844 8051.07421875
1954.5501708984 4844.9125976563"/&gt;
&lt;FragmentArray measure_ref="Measure_Error" values="4.173258879802688E-4
-1.9794682032170385E-5 1.618474794895519E-5 0.001690052197886871 -0.0037214683721344954
0.0020060110579152024 -2.3719321211501665E-4 2.7168621795681247E-4 -0.0019049343519554895
0.0019553613780090018 2.6704080801209784E-4 0.007734020238103767 0.0013568713879976713
1.571508180404635E-4 -0.0017703817320580129 0.013774177127970688 0.0056154565579618065
0.004415735988004599 0.006145015418042021 -0.005059517131940083 0.01419863401793009
0.007626913448120831 0.007892192877989146"/&gt;
...
&lt;/SpectrumIdentificationItem&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/SpectrumIdentificationResult/SpectrumIdentificationItem<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001405">MS:1001405</link> (spectrum identification result details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000016">MS:1000016</link> (scan start time)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000796">MS:1000796</link> (spectrum title)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000797">MS:1000797</link> (peak list scans)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000798">MS:1000798</link> (peak list raw scans)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000903">MS:1000903</link> (product ion series ordinal)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000904">MS:1000904</link> (product ion m/z delta)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000926">MS:1000926</link> (product interpretation rank)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001030">MS:1001030</link> (number of peptide seqs compared to each spectrum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001035">MS:1001035</link> (date / time search performed)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001036">MS:1001036</link> (search time taken)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001405">et al.</link></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1001330" cvRef="PSI-MS" value="2.5E-6" name="X\!Tandem:expect"/&gt;
&lt;cvParam accession="MS:1001331" cvRef="PSI-MS" value="43.2" name="X\!Tandem:hyperscore"/&gt;
&lt;cvParam accession="MS:1001250" cvRef="PSI-MS" value="0.0" name="local FDR"/&gt;
&lt;cvParam accession="MS:1001868" cvRef="PSI-MS" value="0.0" name="distinct peptide-level q-value"/&gt;
&lt;cvParam accession="MS:1001874" cvRef="PSI-MS" value="3.113325031133251E-4" name="FDRScore"/&gt;
&lt;cvParam accession="MS:1000796" cvRef="PSI-MS" value="Locus:11.1.1.4652.4 File:" R1="" p450="" iTRAQ="" QS="" name="spectrum title"/&gt;
&lt;cvParam accession="MS:1002049" cvRef="PSI-MS" value="129" name="MS-GF:RawScore"/&gt;
&lt;cvParam accession="MS:1002050" cvRef="PSI-MS" value="143" name="MS-GF:DeNovoScore"/&gt;
&lt;cvParam accession="MS:1002052" cvRef="PSI-MS" value="6.3746987E-25" name="MS-GF:SpecEValue"/&gt;
&lt;cvParam accession="MS:1002053" cvRef="PSI-MS" value="1.3781529E-17" name="MS-GF:EValue"/&gt;
&lt;cvParam accession="MS:1002054" cvRef="PSI-MS" value="0.0" name="MS-GF:QValue"/&gt;
&lt;cvParam accession="MS:1002055" cvRef="PSI-MS" value="0.0" name="MS-GF:PepQValue"/&gt;
&lt;cvParam accession="MS:1002351" cvRef="PSI-MS" value="0.0" name="PSM-level local FDR"/&gt;
&lt;cvParam accession="MS:1002354" cvRef="PSI-MS" value="0.0" name="PSM-level q-value"/&gt;
&lt;cvParam accession="MS:1002355" cvRef="PSI-MS" value="1.5603866050496166E-18" name="PSM-level FDRScore"/&gt;
&lt;cvParam accession="MS:1001171" cvRef="PSI-MS" value="25.37" name="Mascot:score"/&gt;
&lt;cvParam accession="MS:1001172" cvRef="PSI-MS" value="0.0813522191664226" name="Mascot:expectation value"/&gt;
&lt;cvParam accession="MS:1001175" cvRef="PSI-MS" name="peptide shared in multiple proteins"/&gt;
&lt;cvParam accession="MS:1001363" cvRef="PSI-MS" name="peptide unique to one protein"/&gt;
&lt;cvParam accession="MS:1001371" cvRef="PSI-MS" value="40" name="Mascot:identity threshold"/&gt;
&lt;cvParam accession="MS:1001370" cvRef="PSI-MS" value="27" name="Mascot:homology threshold"/&gt;
&lt;cvParam accession="MS:1001030" cvRef="PSI-MS" value="10148" name="number of peptide seqs compared to each spectrum"/&gt;
&lt;cvParam accession="MS:1001114" cvRef="PSI-MS" unitCvRef="UO" unitName="second" unitAccession="UO:0000010" value="1741" name="retention time(s)"/&gt;
&lt;cvParam accession="MS:1002681" cvRef="PSI-MS" name="OpenXQuest:combined score" value="0.552164719139592"/&gt;
&lt;cvParam accession="MS:1002511" cvRef="PSI-MS" name="crosslink spectrum identification item" value="11309529182388590588"/&gt;
&lt;cvParam accession="MS:1000894" cvRef="PSI-MS" name="retention time" value="5468.0193" unitAccession="second" unitName="" unitCvRef="se"/&gt;</programlisting>
<simpara><emphasis role="strong">Example userParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;userParam value="0" name="IsotopeError"/&gt;
&lt;userParam value="HCD" name="AssumedDissociationMethod"/&gt;</programlisting>
<simpara><emphasis role="strong">Example for peptide-level statistics:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="710.352539" calculatedMassToCharge="710.352984" peptide_ref="KMDLSDEGGGGVRYPGLHPK_##Oxidation(M):2" rank="1" passThreshold="false" id="SIR_3397_SII_1"&gt;
    &lt;PeptideEvidenceRef peptideEvidence_ref="KMDLSDEGGGGVRYPGLHPK_generic|B_GENSCAN00000016205_REVERSED|p:genscan_42_61"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002356" name="PSM-level combined FDRScore" value="0.38641138028680944"/&gt;
    &lt;userParam name="search engines identifying PSM" value="2"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002359" name="distinct peptide-level local FDR" value="0.419570671870644"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001868" name="distinct peptide-level q-value" value="0.4192650334075724"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002360" name="distinct peptide-level FDRScore" value="0.41934590570107133"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002500" name="peptide passes threshold" value="true"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002520" name="peptide group ID" value="KMDLSDEGGGGVRYPGLHPK_##Oxidation(M):2"/&gt;
&lt;/SpectrumIdentificationItem&gt;</programlisting>
<simpara><emphasis role="strong">Example for crosslinking:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationItem chargeState="4" experimentalMassToCharge="0.0" peptide_ref="54600873_54605193_9_1_p1" rank="1" passThreshold="false" id="SII_21_1"&gt;
    &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_psm252621611_pep54605193_protP02768-A_target_137"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002511" name="crosslink spectrum identification item" value="21"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002545" name="The xi result 'Score'." value="2.769918944845425"/&gt;
&lt;/SpectrumIdentificationItem&gt;</programlisting>
</section>
<section xml:id="element-spectrumidentificationitemref" xreflabel="SpectrumIdentificationItemRef">
<title>Element &lt;SpectrumIdentificationItemRef&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Reference(s) to the SpectrumIdentificationItem element(s) that support the given PeptideEvidence element. Using these references, it is possible to indicate which spectra were actually accepted as evidence for this peptide identification in the given protein.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIdentificationItemRefType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>spectrumIdentificationItem_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the SpectrumIdentificationItem element(s).</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001591" name="anchor protein"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001593" name="group member with undefined relationship OR ortholog protein"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002676" name="protein-pair-level global FDR" value="0.a:null:1.0:true"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002677" name="residue-pair-level global FDR" value="0.a:58:0.04716981132075472:true"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002415" name="protein group passes threshold" value="true"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002404" name="count of identified protein" value="2"/&gt;</programlisting>
</section>
<section xml:id="element-spectrumidentificationlist" xreflabel="SpectrumIdentificationList">
<title>Element &lt;SpectrumIdentificationList&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
Represents the set of all search results from SpectrumIdentification.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIdentificationListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>numSequencesSearched</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:long</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The number of database sequences searched against.
This value should be provided unless a de novo search has been performed.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fragmentationtable"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>Contains the types of measures that will be reported in generic arrays for each SpectrumIdentificationItem e.g. product ion m/z, product ion intensity, product ion m/z error</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><link xl:href="#element-spectrumidentificationresult">SpectrumIdentificationResult</link></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>All identifications made from searching one spectrum.
For PMF data, all peptide identifications will be listed underneath as SpectrumIdentificationItems.
For MS/MS data, there will be ranked SpectrumIdentificationItems corresponding to possible different peptide IDs.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<simpara><inlinemediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image10.png" contentwidth="575" contentdepth="432"/>
</imageobject>
<textobject><phrase><link xl:href="http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/SpectrumIdentificationList.png">http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/SpectrumIdentificationList.png</link></phrase></textobject>
</inlinemediaobject></simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationList xmlns="http://psidev.info/psi/pi/mzIdentML/1.2" id="SII_LIST_1"&gt;
    &lt;SpectrumIdentificationResult spectrumID="index=6451" spectraData_ref="SD_COMBINED_SE_0" id="SIR_8947"&gt;
        &lt;SpectrumIdentificationItem chargeState="2" experimentalMassToCharge="679.817322" calculatedMassToCharge="679.818488" peptide_ref="AVMDDFAAFVEK_##Oxidation(M):3" rank="1" passThreshold="false" id="SIR_8947_SII_1"&gt;
            &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000401820_p:known_378_389"/&gt;
            &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000421027_p:putative_420_431"/&gt;
            &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000483421_p:known_357_368"/&gt;
            &lt;PeptideEvidenceRef peptideEvidence_ref="AVMDDFAAFVEK_generic_A_ENSP00000480485_p:known_357_368"/&gt;
            ...
        &lt;/SpectrumIdentificationItem&gt;
    &lt;/SpectrumIdentificationResult&gt;
&lt;/SpectrumIdentificationList&gt;</programlisting>
</section>
<section xml:id="element-spectrumidentificationprotocol" xreflabel="SpectrumIdentificationProtocol">
<title>Element &lt;SpectrumIdentificationProtocol&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The parameters and settings of a SpectrumIdentification analysis.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIdentificationProtocolType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>analysisSoftware_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The search algorithm used, given as a reference to the &lt;SoftwareCollection&gt; section.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-searchtype"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The type of search performed e.g. PMF, Tag searches, MS-MS</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><link xl:href="#element-additionalsearchparams">AdditionalSearchParams</link></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The search parameters other than the modifications searched.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-modificationparams"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The specification of static/variable modifications (e.g. Oxidation of Methionine) that are to be considered in the spectra search.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-enzymes"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The list of enzymes used in experiment</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><link xl:href="#element-masstable">MassTable</link></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>The masses of residues used in the search.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-fragmenttolerance"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The tolerance of the search given as a plus and minus value with units.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-parenttolerance"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The tolerance of the search given as a plus and minus value with units.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-threshold"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The threshold(s) applied to determine that a result is significant.
If multiple terms are used it is assumed that all conditions are satisfied by the passing results.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-databasefilters"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>The specification of filters applied to the database searched.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-databasetranslation"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A specification of how a nucleic acid sequence database was translated for searching.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Graphical Context:</emphasis></simpara>
<informalfigure>
<mediaobject>
<imageobject>
<imagedata fileref="img/mzidentml/image11.png" contentwidth="557" contentdepth="813"/>
</imageobject>
<textobject><phrase>http://www.peptideatlas.org/PSI/schemas/mzIdentML/1.2/figures/SpectrumIdentificationProtocol.png</phrase></textobject>
</mediaobject>
</informalfigure>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationProtocol id="SIP_10589554385233790425" analysisSoftware_ref="SOF_10581839310406754333"&gt;
    &lt;SearchType&gt;
        &lt;cvParam accession="MS:1001083" cvRef="PSI-MS" name="ms-ms search"/&gt;
    &lt;/SearchType&gt;
    &lt;AdditionalSearchParams&gt;
        &lt;cvParam accession="MS:1002494" cvRef="PSI-MS" name="crosslinking search"/&gt;
        &lt;userParam name="input_consensusXML" unitName="xsd:string" value="leitner004.consensusXML"/&gt;
        ...
    &lt;/AdditionalSearchParams&gt;
&lt;/SpectrumIdentificationProtocol&gt;</programlisting>
</section>
<section xml:id="element-spectrumidentificationresult" xreflabel="SpectrumIdentificationResult">
<title>Element &lt;SpectrumIdentificationResult&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
All identifications made from searching one spectrum. For PMF data, all peptide identifications
will be listed underneath as SpectrumIdentificationItems. For MS/MS data, there will be ranked
SpectrumIdentificationItems corresponding to possible different peptide IDs.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIdentificationResultType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>spectraData_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to a spectra data set (e.g. a spectra file).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>spectrumID</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The locally unique id for the spectrum in the spectra data set specified by SpectraData_ref.
External guidelines are provided on the use of consistent identifiers for spectra in different external formats.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-spectrumidentificationitem"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>An identification of a single (poly)peptide, resulting from querying an input spectra, along with the set of confidence values for that identification.
PeptideEvidence elements should be given for all mappings of the corresponding Peptide sequence within protein sequences.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationResult spectrumID="index=7665" spectraData_ref="SD_COMBINED_SE_0" id="SIR_7191"&gt;
    &lt;SpectrumIdentificationItem chargeState="4" experimentalMassToCharge="1123.974121" calculatedMassToCharge="1123.968707" peptide_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_##Carbamidomethyl(C):1##Carbamidomethyl(C):3##Carbamidomethyl(C):12##Carbamidomethyl(C):18##Carbamidomethyl(C):25##Carbamidomethyl(C):31##Ammonia-loss(C):1" rank="1" passThreshold="false" id="SIR_7191_SII_1"&gt;
        &lt;PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic_A_ENSP00000376692_REVERSED_p:novel_575_612"/&gt;
        &lt;PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic_B_GENSCAN00000036974_REVERSED_p:genscan_494_531"/&gt;
        &lt;PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic_A_ENSP00000471297_REVERSED_p:putative_641_678"/&gt;
        &lt;PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic_A_ENSP00000319883_REVERSED_p:known_633_670"/&gt;
        &lt;PeptideEvidenceRef peptideEvidence_ref="CRCQYSGVNNLCHTSSHCPNQGSTCENVDTCLKPDEPK_generic_A_ENSP00000472280_REVERSED_p:putative_622_659"/&gt;
        ...
    &lt;/SpectrumIdentificationItem&gt;
&lt;/SpectrumIdentificationResult&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/AnalysisData/SpectrumIdentificationList/SpectrumIdentificationResult<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001405">MS:1001405</link> (spectrum identification result details) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000016">MS:1000016</link> (scan start time)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000796">MS:1000796</link> (spectrum title)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000797">MS:1000797</link> (peak list scans)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000798">MS:1000798</link> (peak list raw scans)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000903">MS:1000903</link> (product ion series ordinal)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000904">MS:1000904</link> (product ion m/z delta)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000926">MS:1000926</link> (product interpretation rank)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001030">MS:1001030</link> (number of peptide seqs compared to each spectrum)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001035">MS:1001035</link> (date / time search performed)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001036">MS:1001036</link> (search time taken)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001405">et al.</link></simpara>
</section>
<section xml:id="element-spectrumidformat" xreflabel="SpectrumIDFormat">
<title>Element &lt;SpectrumIDFormat&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The format of the spectrum identifier within the source file</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SpectrumIDFormatType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIDFormat&gt;
    &lt;cvParam accession="MS:1000774" cvRef="PSI-MS" name="multiple peak list nativeID format"/&gt;
&lt;/SpectrumIDFormat&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/DataCollection/Inputs/SpectraData/SpectrumIDFormat<?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000767">MS:1000767</link> (native spectrum identifier format) only once<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000768">MS:1000768</link> (Thermo nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000769">MS:1000769</link> (Waters nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000770">MS:1000770</link> (WIFF nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000771">MS:1000771</link> (Bruker/Agilent YEP nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000772">MS:1000772</link> (Bruker BAF nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000773">MS:1000773</link> (Bruker FID nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000774">MS:1000774</link> (multiple peak list nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000775">MS:1000775</link> (single peak list nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000776">MS:1000776</link> (scan number only nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000777">MS:1000777</link> (spectrum identifier nativeID format)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1000767">et al.</link><?asciidoc-br?>
MUST supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001529">MS:1001529</link> (spectra data details) only once<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001530">MS:1001530</link> (mzML unique identifier)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001531">MS:1001531</link> (spectrum from ProteinScape database nativeID format)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001532">MS:1001532</link> (spectrum from database string nativeID format)</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1000774" name="multiple peak list nativeID format"/&gt;</programlisting>
</section>
<section xml:id="element-subsample" xreflabel="SubSample">
<title>Element &lt;SubSample&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
References to the individual component samples within a mixed parent sample.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SubSampleType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>sample_ref</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>A reference to the child sample.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-substitutionmodification" xreflabel="SubstitutionModification">
<title>Element &lt;SubstitutionModification&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
A modification where one residue is substituted by another (amino acid change).</simpara>
<simpara><emphasis role="strong">Type:</emphasis> SubstitutionModificationType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>avgMassDelta</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:double</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Atomic mass delta considering the natural distribution of isotopes in Daltons.
This should only be reported if the original amino acid is known i.e. it is not "X" !location !xsd:int !optional !Location of the modification within the peptide - position in peptide sequence, counted from the N-terminus residue, starting at position 1. Specific modifications to the N-terminus should be given the location 0. Modification to the C-terminus should be given as peptide length + 1.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>monoisotopicMassDelta</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:double</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>Atomic mass delta when assuming only the most common isotope of elements in Daltons.
This should only be reported if the original amino acid is known i.e. it is not "X" !originalResidue a! xsd:string with restriction
[ABCDEFGHIJKLMNOPQRSTUVWXYZ?\-]{1}</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The original residue before replacement.</simpara></entry>
<entry align="left" valign="top"><simpara>replacementResidue</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string with restriction
[ABCDEFGHIJKLMNOPQRSTUVWXYZ?\-]{1}</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
</section>
<section xml:id="element-threshold" xreflabel="Threshold">
<title>Element &lt;Threshold&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
<emphasis role="strong">Depending on context (SpectrumIdentificationProtocol or ProteinDetectionProtocol):</emphasis><?asciidoc-br?></simpara>
<simpara><emphasis role="strong">Type:</emphasis> ParamListType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis> none</simpara>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-userparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>1</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single user-defined parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;Threshold&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001364" name="distinct peptide-level global FDR" value="1.0"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002350" name="PSM-level global FDR" value="1.0"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002567" name="phosphoRS score threshold" value="95.0"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1002557" name="D-Score threshold" value="95.0"/&gt;
&lt;/Threshold&gt;</programlisting>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/Threshold<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">MS:1001302</link> (search engine specific input parameter) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001005">MS:1001005</link> (SEQUEST:CleavesAt)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001007">MS:1001007</link> (SEQUEST:OutputLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001009">MS:1001009</link> (SEQUEST:DescriptionLines<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001026">MS:1001026</link> (SEQUEST:NormalizeXCorrValues)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001028">MS:1001028</link> (SEQUEST:SequenceHeaderFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001032">MS:1001032</link> (SEQUEST:SequencePartialFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001037">MS:1001037</link> (SEQUEST:ShowFragmentIons)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001038">MS:1001038</link> (SEQUEST:Consensus)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001042">MS:1001042</link> (SEQUEST:LimitTo)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001046">MS:1001046</link> (SEQUEST:sort by dCn)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001153">MS:1001153</link> (search engine specific score) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001154">MS:1001154</link> (SEQUEST:probability)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001155">MS:1001155</link> (SEQUEST:xcorr)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001156">MS:1001156</link> (SEQUEST:deltacn)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001157">MS:1001157</link> (SEQUEST:sp)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001158">MS:1001158</link> (SEQUEST:Uniq)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001159">MS:1001159</link> (SEQUEST:expectation value)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001160">MS:1001160</link> (SEQUEST:sf)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001161">MS:1001161</link> (SEQUEST:matched ions)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001162">MS:1001162</link> (SEQUEST:total ions)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001163">MS:1001163</link> (SEQUEST:consensus score)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001153">et al.</link>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001494">MS:1001494</link> (no threshold) only once<?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001448">MS:1001448</link> (pep:FDR threshold) only once</simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/ProteinDetectionProtocol/Threshold<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">MS:1001302</link> (search engine specific input parameter) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001005">MS:1001005</link> (SEQUEST:CleavesAt)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001007">MS:1001007</link> (SEQUEST:OutputLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001009">MS:1001009</link> (SEQUEST:DescriptionLines)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001026">MS:1001026</link> (SEQUEST:NormalizeXCorrValues)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001028">MS:1001028</link> (SEQUEST:SequenceHeaderFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001032">MS:1001032</link> (SEQUEST:SequencePartialFilter)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001037">MS:1001037</link> (SEQUEST:ShowFragmentIons)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001038">MS:1001038</link> (SEQUEST:Consensus)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001042">MS:1001042</link> (SEQUEST:LimitTo)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001046">MS:1001046</link> (SEQUEST:sort by dCn)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001302">et al.</link><?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001153">MS:1001153</link> (search engine specific score) one or more times<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001154">MS:1001154</link> (SEQUEST:probability)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001155">MS:1001155</link> (SEQUEST:xcorr)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001156">MS:1001156</link> (SEQUEST:deltacn)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001157">MS:1001157</link> (SEQUEST:sp)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001158">MS:1001158</link> (SEQUEST:Uniq)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001159">MS:1001159</link> (SEQUEST:expectation value)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001160">MS:1001160</link> (SEQUEST:sf)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001161">MS:1001161</link> (SEQUEST:matched ions)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001162">MS:1001162</link> (SEQUEST:total ions)<?asciidoc-br?>
e.g.: <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001163">MS:1001163</link> (SEQUEST:consensus score)<?asciidoc-br?>
<link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001153">et al.</link><?asciidoc-br?>
MAY supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001447">MS:1001447</link> (prot:FDR threshold) only once<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="https://www.ebi.ac.uk/ols/ontologies/ms/terms?iri=http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FMS_1002664&amp;lang=en&amp;viewMode=All&amp;siblings=false">MS:1002664</link> (interaction score derived from crosslinking) one or more times<?asciidoc-br?>
MAY supply a <emphasis role="strong">child</emphasis> term of <link xl:href="https://www.ebi.ac.uk/ols/ontologies/ms/terms?iri=http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FMS_1002482&amp;lang=en&amp;viewMode=All&amp;siblings=false">MS:1002482</link> (statistical threshold) one or more times</simpara>
<simpara><emphasis role="strong">Example cvParams:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam cvRef="PSI-MS" accession="MS:1001494" name="no threshold"/&gt;
&lt;cvParam accession="MS:1002351" cvRef="PSI-MS" value="0.01" name="PSM-level local FDR"/&gt;
&lt;cvParam accession="MS:1001316" cvRef="PSI-MS" value="0.05" name="Mascot:SigThreshold"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1001364" name="distinct peptide-level global FDR" value="1.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002350" name="PSM-level global FDR" value="1.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002567" name="phosphoRS score threshold" value="95.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002557" name="D-Score threshold" value="95.0"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1002369" name="protein group-level global FDR" value="0.01"/&gt;</programlisting>
</section>
<section xml:id="element-translationtable" xreflabel="TranslationTable">
<title>Element &lt;TranslationTable&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
The table used to translate codons into nucleic acids e.g. by reference to the NCBI translation
table.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> TranslationTableType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>id</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>An identifier is an unambiguous string that is unique within the scope (i.e. a document, a set of related documents, or a repository) of its use.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The potentially ambiguous common identifier, such as a human-readable name for the instance.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Subelement Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">minOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">maxOccurs</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara><xref linkend="element-cvparam"/></simpara></entry>
<entry align="left" valign="top"><simpara>0</simpara></entry>
<entry align="left" valign="top"><simpara>unbounded</simpara></entry>
<entry align="left" valign="top"><simpara>A single entry from an ontology or a controlled vocabulary.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<simpara><emphasis role="strong">cvParam Mapping Rules:</emphasis></simpara>
<simpara>Path /MzIdentML/AnalysisProtocolCollection/SpectrumIdentificationProtocol/DatabaseTranslation/TranslationTable<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001410">MS:1001410</link> (translation start codons) only once<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001025">MS:1001025</link> (translation table) only once<?asciidoc-br?>
MUST supply term <link xl:href="http://www.ebi.ac.uk/ols/beta/ontologies/ms/terms?iri=http://purl.obolibrary.org/obo/MS_1001423">MS:1001423</link> (translation table description) only once</simpara>
</section>
<section xml:id="element-userparam" xreflabel="userParam">
<title>Element &lt;userParam&gt;</title>
<simpara><emphasis role="strong">Definition:</emphasis>
In case more information about the ions annotation has to be conveyed, that has no fit in FragmentArray. Note: It is suggested that the value attribute takes the form of a list of the same size as FragmentArray values. However, there is no formal encoding and it cannot be expeceted that other software will process or impart that information properly.</simpara>
<simpara><emphasis role="strong">Type:</emphasis> UserParamType</simpara>
<simpara><emphasis role="strong">Attributes:</emphasis></simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="4">
<colspec colname="col_1" colwidth="25*"/>
<colspec colname="col_2" colwidth="16.6666*"/>
<colspec colname="col_3" colwidth="16.6666*"/>
<colspec colname="col_4" colwidth="41.6668*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Attribute Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Data Type</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Use</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Definition</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>name</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>required</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the parameter.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>type</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The datatype of the parameter, where appropriate (e.g.: xsd:float).</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>unitAccession</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>An accession number identifying the unit within the OBO foundry Unit CV.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>unitCvRef</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>If a unit term is referenced, this attribute MUST refer to the CV 'id' attribute defined in the cvList in this file.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>unitName</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The name of the unit.</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>value</simpara></entry>
<entry align="left" valign="top"><simpara>xsd:string</simpara></entry>
<entry align="left" valign="top"><simpara>optional</simpara></entry>
<entry align="left" valign="top"><simpara>The user-entered value of the parameter.</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara><emphasis role="strong">Subelements:</emphasis> none</simpara>
<simpara><emphasis role="strong">Example Context:</emphasis></simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;userParam value="VLENAEGDR; ASSGLNEDEIQK; MQELAQVSQK; KTAEDYLGEPVTEAVITVPAYFNDAQR; SLGQFNLDGINPAPR; MPMVQK; IIAADNGDAWVEVK; DVSIMPFK; KDVNPDEAVAIGAAVQGGVLTGDVK; KFEELVQTR; NDPLAMQR; VAEFFGK; QVEEAGDKLPADDK; MAPPQISAEVLKK; KQVEEAGDKLPADDK; LINYLVEEFK; MAPPQISAEVLK; QAVTNPQNTLFAIK; TFEVLATNGDTHLGGEDFDSR; VALQDAGLSVSDIDDVILVGGQTR; FQDEEVQR" name="unique peptides"/&gt;</programlisting>
</section>
</chapter>
<chapter xml:id="specific-comments-on-schema">
<title>Specific Comments on schema</title>
<simpara>In this section, several points of documentation are elaborated beyond the core specification in <xref linkend="model-in-xml-schema"/>.</simpara>
<section xml:id="file-extension-and-compression">
<title>File extension and compression</title>
<simpara>It is noted that standard file compression algorithms greatly reduce the mzIdentML file sizes, speeding up file transfers and uploads / downloads.
It is also noted that software implementing mzIdentML import or export will be expected to benefit in performance from working with compressed mzIdentML, since the compression and decompression algorithms are expected to give significant performance gains over disk access times for non-compressed files.
As such, it is RECOMMENDED that mzIdentML files are compressed using gzip from all software that exports mzIdentML and software that imports SHOULD be expected to read gzipped files, as well as native (non-compressed) mzIdentML files.
The file extension for native mzIdentML files SHOULD be “.mzid” and for compressed files SHOULD be “mzid.gz”.</simpara>
</section>
<section xml:id="referencing-elements-within-the-document">
<title>Referencing elements within the document</title>
<simpara>A number of elements within the schema have an attribute which is used to reference an element elsewhere in the file using the unique identifier of the referenced element.
These attributes are named following the convention: “[elementName]_ref”.
The uniqueness of the value in the “id” attribute of elements is validated using xsd:key, and the integrity of the reference is validated using xsd:keyref, defined within the schema.</simpara>
</section>
<section xml:id="searches-against-nucleotide-sequences">
<title>Searches against nucleotide sequences</title>
<simpara>The “seq” attribute on &lt;DBSequence&gt; SHOULD contain the nucleic acid sequence if a nucleic acid database was searched (rather than up to six translated sequences).
&lt;Peptide&gt; represents the identified amino acid sequence (including modifications) and, as such, the &lt;peptideSequence&gt; elements SHOULD store the translated amino acid sequences.
&lt;PeptideEvidence&gt; contains the <emphasis>DBSequence_Ref</emphasis> together with the translation frame and a <emphasis>TranslationTable_Ref</emphasis> attribute (see below).
The <emphasis>Peptide_Ref</emphasis> is done in &lt;SpectrumIdentificationItem&gt; as in the case for an amino acid database.
If protein detection is performed, there are &lt;PeptideHypothesis&gt; elements referencing &lt;PeptideEvidence&gt; elements from &lt;SpectrumIdentificationItem&gt; sections.
For clarification, see the example instance document for a nucleic acid search (<xref linkend="other-supporting-materials"/>).</simpara>
<simpara>In the &lt;SpectrumIdentificationProtocol&gt;, &lt;TranslationTable&gt; is used to specify how nucleic acid sequences are translated into amino acid sequences as follows:</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DatabaseTranslation frames="1 2 3 -1 -2 -3"&gt;
    &lt;TranslationTable id="TT_1" name="Standard"&gt;
        &lt;cvParam accession="MS:1001025" name="translation table" cvRef="PSI-MS" value="FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"/&gt;
        &lt;cvParam accession="MS:1001410" name="translation start codons" cvRef="PSI-MS" value="---M---------------M---------------M----------------------------"/&gt;
        &lt;cvParam accession="MS:1001423" name="translation table description" cvRef="PSI-MS" value="http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html/index.cgi?chapter=cgencodes#SG1"/&gt;
    &lt;/TranslationTable&gt;
    &lt;TranslationTable id="TT_2" name="Vertebrate Mitochondrial"&gt;
        &lt;cvParam accession="MS:1001025" name="translation table" cvRef="PSI-MS" value="FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG"/&gt;
        &lt;cvParam accession="MS:1001410" name="translation start codons" cvRef="PSI-MS" value="--------------------------------MMMM---------------M------------"/&gt;
        &lt;cvParam accession="MS:1001423" name="translation table description" cvRef="PSI-MS" value="http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html/index.cgi?chapter=cgencodes#SG2"/&gt;
    &lt;/TranslationTable&gt;
&lt;/DatabaseTranslation&gt;</programlisting>
<simpara>The attribute “frames” specifies which frames are considered and one or more translation tables can be specified using CV parameters.
The translation table is defined here: <link xl:href="http://www.ncbi.nlm.nih.gov/IEB/ToolBox/SDKDOCS/SEQFEAT.HTML#_Genetic_Codes">http://www.ncbi.nlm.nih.gov/IEB/ToolBox/SDKDOCS/SEQFEAT.HTML#_Genetic_Codes</link>:</simpara>
<simpara>"The genetic codes themselves are arrays of 64 amino acid codes.
The index to the position in the array of the amino acid is derived from the codon by the following method:</simpara>
<simpara>index = (base1 16) + (base2 4) + (base3 1)</simpara>
<simpara>where T=0, C=1, A=2, G=3”</simpara>
<simpara>The same encoding technique is used to specify start codons.
Alphabet names are prefixed with "s" (e.g. sncbieaa) to indicate start codon arrays.
Each cell of a start codon array contains either the gap code ("-" for ncbieaa) or an amino acid code if it is valid to use the codon as a start codon.
Currently all starts are set to code for methionine, since it has never been convincingly demonstrated that a protein can start with any other amino acid.
However, if other amino acids are shown to be used as starts, this structure can easily accommodate that information.</simpara>
<simpara>For each peptide, the frame and translation table should be specified in the &lt;PeptideEvidence&gt; element:</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;PeptideEvidence id="”1”" TranslationTable_ref="TT_1" frame="1"/&gt;</programlisting>
</section>
<section xml:id="reporting-peptide-and-protein-identifications-passing-a-significance-threshold">
<title>Reporting peptide and protein identifications passing a significance threshold</title>
<simpara>The elements &lt;SpectrumIdentificationItem&gt; and &lt;ProteinDetectionHypothesis&gt; have a mandatory Boolean attribute <emphasis>passThreshold</emphasis> that allows a file producer to indicate that an identification has passed a given threshold or that it has been manually validated.
Depending on the intended purpose of the file, the file producer MAY wish to report a number of identifications that fall below the given significance threshold, for example to allow global statistical analyses to be performed which are not possible if only identifications passing the threshold are reported.
Thresholds for peptide-spectrum matches or for protein identification should be encoded as instances of &lt;cvParam&gt; within &lt;SpectrumDetectionProtocol&gt; or &lt;ProteinDetectionProtocol&gt;, respectively, as follows.
If the file producer does not want to indicate that a threshold has been set, all identifications MUST have passThreshold = “true” and the “no threshold” CV term should be given within the protocols.</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SpectrumIdentificationProtocol id="SIP" AnalysisSoftware_ref="AS_mascot_server"&gt;
    …
    &lt;Threshold&gt;
        +
        &lt;cvParam accession="MS:1001316" name="mascot:SigThreshold" cvRef="PSI-MS" value="0.05"/&gt;
        +
    &lt;/Threshold&gt;
    &lt;ProteinDetectionProtocol id="PDP_MascotParser_1" AnalysisSoftware_ref="AS_mascot_parser"&gt;
        …
        &lt;Threshold&gt;
            &lt;cvParam accession="MS:1001316" name="mascot:SigThreshold" cvRef="PSI-MS" value="0.05"/&gt;
        &lt;/Threshold&gt;
    &lt;/ProteinDetectionProtocol&gt;
&lt;/SpectrumIdentificationProtocol&gt;</programlisting>
<simpara>The reporting of significance thresholds at the PSM and peptide level (mzIdentML 1.2) is explained in <xref linkend="peptide-level-scores-and-statistical-measures"/>. Reporting of threshold for modification position (also mzIdentML 1.2) is explained in <xref linkend="encoding-modification-localisation-scores"/>.</simpara>
</section>
<section xml:id="using-decoy-databases-to-set-different-thresholds-of-false-discovery-rate">
<title>Using decoy databases to set different thresholds of false discovery rate</title>
<simpara>mzIdentML supports the reporting of searches against decoy databases, constructed and searched using many of the currently known methods.
A &lt;SpectrumIdentificationItem&gt; can be marked as matching a decoy peptide using the <emphasis>isDecoy</emphasis> attribute of the referenced &lt;PeptideEvidence&gt; element, thus allowing the false discovery rate to be calculated across an entire file.
The <emphasis>DBSequence_Ref</emphasis> references the decoy protein record.</simpara>
<simpara>Implementers of the format SHOULD report the peptide identifications that pass the threshold they wish to communicate to a consumer of the data.
For example, a threshold could be set by p-value, false discovery rate, by a native search engine score (or a more complex system documented with CV terms in &lt;Threshold&gt;), and those peptides reported (passing the threshold) are used to determine which proteins have been detected.
It is not guaranteed that a consumer of an mzIdentML file will be able to calculate other results, or global false discovery rates, using different thresholds from the reported information, although in some circumstances they may be able to, for example, if a user reports the complete output of a search against a target and decoy search.</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;SequenceCollection&gt;
    &lt;PeptideEvidence isDecoy="true" post="D" pre="K" end="404" start="392" peptide_ref="HAVGGRYSSLLCK__57.0215@C$403;_" dBSequence_ref="dbseq_REV_psu|NC_LIV_113200" id="PE6_2_4"/&gt;
    &lt;SequenceCollection&gt;
        &lt;DataCollection&gt;
            &lt;Inputs&gt;
                &lt;SearchDatabase location="/localdirectory/18.E_coli_K12_edit.fasta" id="K12_nosignal" name="K12" numDatabaseSequences="9376" releaseDate="01-2008-08-2008" version="1.0"&gt;
                    &lt;FileFormat&gt;
                        &lt;cvParam accession="MS:1001348" name="FASTA format" cvRef="PSI-MS"/&gt;
                    &lt;/FileFormat&gt;
                    &lt;DatabaseName&gt;
                        &lt;userParam name="18.E_coli_K12_edit.fasta"/&gt;
                    &lt;/DatabaseName&gt;
                    &lt;cvParam accession="MS:1001197" name="*DB composition target+decoy*" cvRef="PSI-MS"/&gt;
                    &lt;cvParam accession="MS:1001283" name="*decoy DB accession regexp*" value="*Rnd*" cvRef="PSI-MS"/&gt;
                    &lt;cvParam accession="MS:1001195" name="*decoy DB type reverse*" cvRef="PSI-MS"/&gt;
                &lt;/SearchDatabase&gt;
            &lt;/Inputs&gt;
            &lt;AnalysisData&gt;
                &lt;SpectrumIdentificationList&gt;
                    &lt;SpectrumIdentificationResult&gt;
                        &lt;SpectrumIdentificationItem passThreshold="false" rank="1" peptide_ref="HAVGGRYSSLLCK__57.0215@C$403;_" experimentalMassToCharge="1448.756" chargeState="2" id="SII_6_1"&gt;
                            &lt;PeptideEvidenceRef peptideEvidence_ref="PE6_2_4"/&gt;
                            &lt;cvParam accession="MS:1001329" name="OMSSA:pvalue" cvRef="PSI-MS" value="0.00073351"/&gt;
                        &lt;/SpectrumIdentificationItem&gt;
                    &lt;/SpectrumIdentificationResult&gt;
                &lt;/SpectrumIdentificationList&gt;
            &lt;/AnalysisData&gt;
        &lt;/DataCollection&gt;
    &lt;/SequenceCollection&gt;
&lt;/SequenceCollection&gt;</programlisting>
</section>
<section xml:id="database-filter">
<title>Database Filter</title>
<simpara>The format can specify that a sequence database has been filtered, for example based on pI, protein mass, taxonomy or even a set of accession numbers for a second pass search.
For example, all animals except mice would be encoded as (NCBI:33208 is metazoa, NCBI:10090 is <emphasis>Mus musculus</emphasis>):</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;DatabaseFilters&gt;
    &lt;Filter&gt;
        &lt;FilterType&gt;
            &lt;cvParam accession="MS:1001020" name="DB filter taxonomy" cvRef="PSI-MS"/&gt;
        &lt;/FilterType&gt;
        &lt;Include&gt;
            &lt;cvParam accession="MS:1001467" name="taxonomy: NCBI TaxID" cvRef="PSI-MS" value="33208"/&gt;
        &lt;/Include&gt;
        &lt;Exclude&gt;
            &lt;cvParam accession="MS:1001467" name="taxonomy: NCBI TaxID" cvRef="PSI-MS" value="10090"/&gt;
        &lt;/Exclude&gt;
    &lt;/Filter&gt;
&lt;/DatabaseFilters&gt;</programlisting>
</section>
<section xml:id="types-of-parameters-and-values">
<title>Types of parameters and values</title>
<simpara>There are several types for parameters that are used in the schema:</simpara>
<simpara>&lt;ParamListType&gt;: A list (i.e. unbounded number) of &lt;ParamGroup&gt; elements.</simpara>
<simpara>&lt;ParamGroup&gt;: A choice between &lt;cvParam&gt; or &lt;userParam&gt; elements.</simpara>
<simpara>&lt;ParamType&gt;: A single reference to &lt;ParamGroup&gt;, which allows a choice between either &lt;cvParam&gt; or &lt;userParam&gt; elements at the specified point in the schema.</simpara>
<simpara>&lt;cvParamType&gt;: A single entry from an ontology or a CV.
Attributes: <emphasis>accession</emphasis>, <emphasis>cvRef</emphasis>, <emphasis>name</emphasis>, <emphasis>value</emphasis>, <emphasis>unitAccession</emphasis>, <emphasis>unitName</emphasis>, <emphasis>unitCvRef</emphasis>.</simpara>
<simpara>&lt;userParamType&gt;: A single user-defined parameter.
Attributes: <emphasis>name</emphasis>, <emphasis>value</emphasis>, <emphasis>unitAccession</emphasis>, <emphasis>unitName</emphasis>, <emphasis>unitCvRef</emphasis>.</simpara>
</section>
<section xml:id="reporting-fragmentation-ions">
<title>Reporting fragmentation ions</title>
<simpara>mzIdentML employs an array type structure to support the reporting of ion types identified in an MS/MS analysis, coupled with CV parameters to retain flexibility in the types of ions that can be reported.</simpara>
<simpara>A brief example is given here to explain how these structures should be used where <emphasis>y11</emphasis>, <emphasis>y8</emphasis> and <emphasis>y7</emphasis> have been identified with charge = 2+.
First, the types of measures to be reported are given in the &lt;FragmentationTable&gt; using &lt;cvParam&gt; instances.
Second, each &lt;SpectrumIdentificationItem&gt; contains an index of values (11, 8 and 7 for each y ion) and parallel arrays that reference back to each &lt;Measure&gt; defined in the &lt;FragmentationTable&gt;.
In the example, the <emphasis>y8</emphasis> ion has a product ion <emphasis>m/z</emphasis> = 436.4, product ion intensity = 11 and product ion <emphasis>m/z</emphasis> error = 0.1284 (the second position in the index of each array).</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;FragmentationTable&gt;
    &lt;Measure id="m_mz"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001225" name="product ion m/z"/&gt;
    &lt;/Measure&gt;
    &lt;Measure id="m_intensity"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001226" name="product ion intensity"/&gt;
    &lt;/Measure&gt;
    &lt;Measure id="m_error"&gt;
        &lt;cvParam cvRef="PSI-MS" accession="MS:1001227" name="product ion m/z error" unitAccession="MS:1000040" unitName="m/z" unitCvRef="PSI-MS"/&gt;
    &lt;/Measure&gt;
&lt;/FragmentationTable&gt;
&lt;IonType index="11 8 7" charge="2"&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001220" name="frag: y ion"/&gt;
    &lt;FragmentArray values="551.3 436.4 380.1" measure_ref="m_mz"/&gt;
    &lt;FragmentArray values="800 11 46" measure_ref="m_intensity"/&gt;
    &lt;FragmentArray values="0.4752 0.1284 0.3704" measure_ref="m_error"/&gt;
&lt;/IonType&gt;</programlisting>
<section xml:id="internal-fragments-and-immonium-ions">
<title>Internal fragments and immonium ions</title>
<simpara>mzIdentML supports the reporting of internal fragment ions, of which an immonium ion is a special case comprising a single side chain (<link xl:href="http://www.matrixscience.com/help/fragmentation_help.html">http://www.matrixscience.com/help/fragmentation_help.html</link>).
For internal and immonium ions, the index is used in two different ways.
Internal fragments are reported using the index structure to identify the start and end of the ion within the sequence.
The example shows how the index performs this different role, as it identifies pairs of internal ions: <emphasis>ya2-5</emphasis>, <emphasis>ya3-7</emphasis>, <emphasis>ya3-8</emphasis>, <emphasis>ya4-8</emphasis>, <emphasis>ya5-8</emphasis>, <emphasis>ya5-11</emphasis>, <emphasis>ya8-11</emphasis>.</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;IonType index="2 5 3 7 3 8 4 8 5 8 5 11 8 11" charge="1"&gt;
    &lt;FragmentArray values="315.2 388.1 501.4 444.1 342.8 669.901495 412.4" measure_ref="m_mz"/&gt;
    &lt;FragmentArray values="44 63 10430 75 48 6420 31" measure_ref="m_intensity"/&gt;
    &lt;FragmentArray values="-0.0027 -0.1191 0.0969 -0.1817 -0.4340 0.4721 0.1082" measure_ref="m_error"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001366" name="frag: internal ya ion"/&gt;
&lt;/IonType&gt;</programlisting>
<simpara>For immonium ions, the index is the position of the identified ion within the peptide sequence.
If the peptide contains the same amino acid in multiple positions that cannot be distinguished, all positions should be given.
Example, where immonium ions have been found matching T and G in the following peptide sequence FGGEENTY (positions 2 or 3, and position 7):</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;IonType charge="1" index="2 3 7"&gt;
    &lt;FragmentArray values="288.2 286.1 387.2 371.127841" measure_ref="m_mz"/&gt;
    &lt;FragmentArray values="2137 83 656 1663" measure_ref="m_intensity"/&gt;
    &lt;FragmentArray values="0.0260 -0.1125 -0.0602 -0.1011" measure_ref="m_error"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001239" name="frag: immonium ion"/&gt;
&lt;/IonType&gt;</programlisting>
</section>
<section xml:id="encoding-neutral-loss-fragment-ions">
<title>Encoding Neutral loss fragment ions</title>
<simpara>The encoding of the identification of neutral loss fragment ions has changed from version 1.1 to version 1.2.0. The CV previously contained an attempt to enumerate all possible neutral losses from all types of fragment ions, leading to a long and incomplete list of possible terms.
As such, pairs of CV terms are now allowed in version 1.2.0 to describe both the ion type and the type of neutral loss, as follows:</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;IonType charge="2" index="3"&gt;
    &lt;FragmentArray measure_ref="Measure_MZ" values="165.0899353"/&gt;
    &lt;FragmentArray measure_ref="Measure_Int" values="2058.2841796875"/&gt;
    &lt;FragmentArray measure_ref="Measure_Error" values="0.007899949187986977"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1001220" name="frag: y ion"/&gt;
    &lt;cvParam cvRef="PSI-MS" accession="MS:1000336" name="neutral loss" value="H3N"/&gt;
&lt;/IonType&gt;</programlisting>
</section>
</section>
<section xml:id="enzyme-definition">
<title>Enzyme definition</title>
<simpara>The &lt;SpectrumIdentificationProtocol&gt; SHOULD contain a specification of which enzyme (if any) was applied in the search.
The element &lt;Enzyme&gt; has optional sub-elements for specifying the &lt;EnzymeName&gt; using a CV term and the cleavage site, using a regular expression.
Regular expressions should be encoded following the notation of Perl Compatible Regular Expressions (PCRE regex, <link xl:href="https://www.pcre.org/">https://www.pcre.org</link>, matching the syntax and semantics of Perl version 5).
The PSI-MS CV contains terms for the most common enzymes with pre-defined regular expressions (Table 6).
If the enzyme used is present in the PSI-MS CV, the term MUST be encoded under &lt;EnzymeName&gt; unless the rule given in the CV does not match that used by the software or if the enzyme used is not present in the CV, in which case the regular expression used MUST be given in the element &lt;SiteRegexp&gt;.
If the &lt;EnzymeName&gt; element is used, the regular expression MAY also be provided additionally.
For a no enzyme search, (i.e. one where there may be a cleavage at any residue), the CV term MS:1001091 ‘NoEnzyme’ MUST be specified, and the missedCleavages and semiSpecific attributes SHOULD NOT be specified.
If two or more enzymes are used, multiple &lt;Enzyme&gt; elements SHOULD be provided rather than trying to build a regular expression covering all cleavage sites.
If the software uses a name for an enzyme other than the one specified in the CV, a user param term MAY also be given.</simpara>
<simpara>The following guidelines SHOULD be followed when generating regular expressions in an instance document for enzymes not present in the CV: 1) use the PCRE supplied negation syntax for look-ahead and look-behind assertions and 2) use the most compact representation possible for a regex.
The start of a match specifies the cleavage point.
For example the enzyme trypsin, which cleaves following a K or R residue unless the next residue is P, has the regular expression:</simpara>
<simpara>(?&#8656;[KR])(?!P)</simpara>
<simpara>The ?&#8656; is a "zero-width positive look-behind assertion", and [] means one of this character set.
So, this rule is to look behind for a K or R. ?! is a zero-width positive look-ahead assertion, and ?!P means any character that is not P. An example of an “N-term” enzyme is Asp-N which cleaves before D or B. This can be described using the PCRE:</simpara>
<simpara>(?=[BD])</simpara>
<simpara>The ?= is a “zero-width positive look-ahead assertion.”</simpara>
<simpara>A simple 3 line perl program can be written to test a regular expression:</simpara>
<simpara>$protein = "ABCDKPEFGHIJKLMNOPQRSTUVWXYZ";</simpara>
<simpara>@peptides = split(/(?&#8656;[KR])( ?!P)/, $protein);</simpara>
<simpara>print join "\n", @peptides;</simpara>
<simpara>The program returns:</simpara>
<simpara>ABCDKPEFGHIJK</simpara>
<simpara>LMNOPQR</simpara>
<simpara>STUVWXYZ</simpara>
<informaltable frame="all" rowsep="1" colsep="1">
<tgroup cols="2">
<colspec colname="col_1" colwidth="50*"/>
<colspec colname="col_2" colwidth="50*"/>
<thead>
<row>
<entry align="left" valign="top"><emphasis role="strong">Enzyme Name</emphasis></entry>
<entry align="left" valign="top"><emphasis role="strong">Regular expression</emphasis></entry>
</row>
</thead>
<tbody>
<row>
<entry align="left" valign="top"><simpara>Trypsin</simpara></entry>
<entry align="left" valign="top"><simpara>(?&#8656;[KR])(?!P)</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>Arg-C</simpara></entry>
<entry align="left" valign="top"><simpara>(?&#8656;R)(?!P)</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>Asp-N</simpara></entry>
<entry align="left" valign="top"><simpara>(?=[BD])</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>Asp-N_ambic</simpara></entry>
<entry align="left" valign="top"><simpara>(?=[DE])</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>Chymotrypsin</simpara></entry>
<entry align="left" valign="top"><simpara>(?&#8656;[FYWL])(?!P)</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>CNBr</simpara></entry>
<entry align="left" valign="top"><simpara>(?&#8656;M)</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>Formic_acid</simpara></entry>
<entry align="left" valign="top"><simpara><indexterm>
<primary>?&#8656;D</primary>
</indexterm>?&#8656;D</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara><indexterm>
<primary>?=D</primary>
</indexterm>?=D</simpara></entry>
<entry align="left" valign="top"><simpara>Lys-C</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;K)(?!P)</simpara></entry>
<entry align="left" valign="top"><simpara>Lys-C/P</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;K)</simpara></entry>
<entry align="left" valign="top"><simpara>PepsinA</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[FL])</simpara></entry>
<entry align="left" valign="top"><simpara>TrypChymo</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[FYWLKR])(?!P)</simpara></entry>
<entry align="left" valign="top"><simpara>Trypsin/P</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[KR])</simpara></entry>
<entry align="left" valign="top"><simpara>V8-DE</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[BDEZ])(?!P)</simpara></entry>
<entry align="left" valign="top"><simpara>V8-E</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[EZ])(?!P)</simpara></entry>
<entry align="left" valign="top"><simpara>Leukocyte elastase</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[ALIV])(?!P)</simpara></entry>
<entry align="left" valign="top"><simpara>Proline endopeptidase</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[HKR]P)(?!P)</simpara></entry>
<entry align="left" valign="top"><simpara>Glutamyl endopeptidase</simpara></entry>
</row>
<row>
<entry align="left" valign="top"><simpara>(?&#8656;[^E]E)</simpara></entry>
<entry align="left" valign="top"><simpara>2-iodobenzoate</simpara></entry>
</row>
</tbody>
</tgroup>
</informaltable>
<simpara>Table 6. Common enzymes and the cleavage site specified as regular expressions as represented in the PSI-MS CV.</simpara>
</section>
<section xml:id="unknown-modifications">
<title>Unknown modifications</title>
<simpara>In version 1.1.0 onwards of mzIdentML there has been a change with respect to how “unknown modifications” (i.e. those not present in an allowed CV) are reported on peptides.
In version 1.0, &lt;userParam&gt; elements were allowed on &lt;Peptide&gt; to capture these modifications.
In version 1.1.0 onwards, only &lt;cvParam&gt; elements can be given on &lt;Peptide&gt; and a term “unknown modification” has been added to the PSI-MS CV.
This term MUST only be used if the identified modification is not present in Unimod (or other allowed CV such as PSI-MOD, although PSI-MOD is now deprecated), according to the identity of the residue modified and the delta mass, within the parent tolerance specified in the search.
The semantic validator will check any uses of the “unknown modification” term (MS:1001460) and reject files if the modification is present in Unimod.</simpara>
</section>
<section xml:id="identifications-based-on-multiple-mass-spectra">
<title>Identifications based on multiple mass spectra</title>
<simpara>This Section has been added to version 1.3.0 of the specification.
Some analysis workflows utilise multiple spectra to arrive at a given identification, for instance, the following crosslinking search strategies:</simpara>
<orderedlist numeration="lowerroman">
<listitem>
<simpara>“light” and “heavy” isotopes of the crosslinker are used as a pair, combined together and searched once;</simpara>
</listitem>
<listitem>
<simpara>multiple spectra of the same precursor are acquired, <emphasis>e.g.</emphasis> using different fragmentation techniques like EThcD;</simpara>
</listitem>
<listitem>
<simpara>when using a cleavable crosslinker and both MS3 spectra of the cleaved peptides and the MS2 spectrum of the crosslinked peptide pair are considered in the identification process.</simpara>
</listitem>
</orderedlist>
<simpara>mzIdentML 1.2.0 included a method for encoding such cases using the “combined spectra” type of input file format.
This essentially associates a single &lt;SpectrumIdentificationResult&gt; element with a comma separated list of spectrum identifiers.
This would work for crosslinking search strategy (i) if all the spectra contributing to an identification share the same acquisition settings, but it presents a problem for cases (ii) and (iii).
The &lt;SpectrumIdentificationResult&gt; element with the comma separated list of identifiers is inside a single &lt;SpectrumIdentificationList&gt; element and this can only be associated with a single &lt;SpectrumAnalysisProtocol&gt; element.
But in cases (ii) and (iii) the spectra referenced have different acquisition settings, which would require the use of distinct &lt;SpectrumAnalysisProtocol&gt; elements to be encoded correctly.
Many of the workflows using cleavable crosslinkers fall into categories (ii) and (iii) above.</simpara>
<simpara>The consequence of the use of the “combined spectra” type of input file format is that the different spectra cannot be associated with different &lt;SpectrumAnalysisProtocol&gt; elements.
But doing so is necessary to correctly encode cases (ii) and (iii) above (to capture, for example, different acquisition settings for different fragmentation modes).</simpara>
<simpara>mzIdentML 1.3.0 now advises the use of the new CV term “identification based on multiple spectra” (MS:1003332) to encode identifications based on multiple spectra.
This CV term goes inside &lt;SpectrumIdentificationItem&gt; elements.
The “combined spectra” type of input file format from version 1.2.0 has been retired and is not part of the 1.3.0 specification.</simpara>
<simpara>The values of the “identification based on multiple spectra” CV term (MS:1003332) take the form: [identifier string]:[P or C].
For example:</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="1234:P" name="identification based on multiple spectra"/&gt;</programlisting>
<simpara>The letters ‘P’ and ‘C’ refer to ‘parent’ and ‘child’.
&lt;SpectrumIdentificationItem&gt; elements marked ‘parent’ cover the entire identification (in the case of crosslinking, the crosslinked peptide pair) and those marked ‘child’ only identify a constituent part of the whole identification (typically a single peptide in an MS3 scan).
Identifications based on multiple spectra MUST have both ‘P’ and ‘C’ marked constituent spectra identifications for cases that include at least one child element, or neither ‘P’ nor ‘C’ in the case that there is no parent/child relationship.
There is no limit on the number of constituent spectra that are marked either ‘P’ or ‘C’.</simpara>
<simpara>The unique identifier string (“1234” in the above example) can associate &lt;SpectrumIdentificationItem&gt; elements across different &lt;SpectrumIdentificationList&gt; elements.
These &lt;SpectrumIdentificationList&gt; elements can then be associated with different &lt;SpectrumIdentificationProtocol&gt; elements.</simpara>
<simpara>There may not be a parent-child relationship between the scan events supporting an identification since one scan may not represent a constituent part of the other.
To describe such a sibling relationship between scans, the ‘P’ and ‘C’ can be omitted, as is the case in Figure 6.</simpara>
<simpara>The associated example file <link xl:href="https://github.com/HUPO-PSI/mzIdentML/blob/master/examples/1_3examples/crosslinking/multiple_spectra_per_id_1_3_0_draft.mzid">multiple_spectra_per_id_1_3_0_draft.mzid</link> shows a common workflow for cleavable crosslinkers which uses two levels of MS2 (one using HCD as the fragmentation mode, one using ETD as the fragmentation mode) followed by MS3 scans.
To encode this, three distinct &lt;SpectrumIdentificationProtocol&gt; elements are needed: one for HCD MS2 scans, one for ETD MS2 scans, and one for MS3 scans.
Three distinct &lt;SpectrumIdentification&gt; elements associate each of the &lt;SpectrumIdentificationProtocol&gt; elements with different &lt;SpectrumIdentificationList&gt; elements.
Figure 5 shows an excerpt from this example file to illustrate the use of the new CV terms.</simpara>
<simpara>Figure 6 shows this approach used in the context of encoding identifications of glycopeptides.</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisData&gt;
    &lt;!-- for the ETD to be correctly encoded it needs to go into a separate list --&gt;
    &lt;SpectrumIdentificationList id="sil_ETD"&gt;
        &lt;SpectrumIdentificationResult spectrumID="index=2" spectraData_ref="pk_id" id="SIR_2"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="100" peptide_ref="p1" rank="1" passThreshold="false" id="ETD_SII_0"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_p1"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1002511" name="crosslink spectrum identification item" value="1"/&gt;
                &lt;!-- this flags it as part of the crosslinked identification '1234' --&gt;
                &lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:P*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="*1234:1E-08*"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="100" peptide_ref="p2" rank="1" passThreshold="false" id="ETD_SII_1"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_p2"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1002511" name="crosslink spectrum identification item" value="1"/&gt;
                &lt;!-- this flags it as part of the crosslinked identification '1234' --&gt;
                &lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:P*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="*1234:1E-08*"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
    &lt;/SpectrumIdentificationList&gt;
    &lt;!-- as the MS3 may have different search params, e.g. mass tolerance, they need to go into a separate list --&gt;
    &lt;SpectrumIdentificationList id="sil_MS3"&gt;
        &lt;!-- HCD MS3 match peptide 1 A--&gt;
        &lt;SpectrumIdentificationResult spectrumID="index=3" spectraData_ref="pk_id" id="SIR_3"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="50" peptide_ref="p1_a" rank="1" passThreshold="false" id="MS3_SII_0"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_p1_a"/&gt;
                &lt;!-- this flags it as part of the crosslinked identification '1234' --&gt;
                &lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:C*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="**1234:1E-0**8"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
        &lt;!-- HCD MS3 match peptide 2 T--&gt;
        &lt;SpectrumIdentificationResult spectrumID="index=4" spectraData_ref="pk_id" id="SIR_4"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="50" peptide_ref="p2_t" rank="1" passThreshold="false" id="MS3_SII_1"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_p2_t"/&gt;
                &lt;!-- this flags it as part of the crosslinked identification '1234' --&gt;
                &lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:C*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="*1234:1E-08*"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
        &lt;!-- HCD MS3 match peptide 1 T--&gt;
        &lt;SpectrumIdentificationResult spectrumID="index=3" spectraData_ref="pk_id" id="SIR_5"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="51" peptide_ref="p1_t" rank="1" passThreshold="false" id="MS3_SII_2"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_p1_t"/&gt;
                &lt;!-- this flags it as part of the crosslinked identification '1234' --&gt;
                &lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:C*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="*1234:1E-08*"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
        &lt;!-- HCD MS3 match peptide 2 A--&gt;
        &lt;SpectrumIdentificationResult spectrumID="index=4" spectraData_ref="pk_id" id="SIR_6"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="49" peptide_ref="p2_a" rank="1" passThreshold="false" id="MS3_SII_3"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="pepevid_p2_a"/&gt;
                &lt;!-- this flags it as part of the crosslinked identification '1234' --&gt;
                &lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:C*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="*1234:1E-08*"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
    &lt;/SpectrumIdentificationList&gt;
&lt;/AnalysisData&gt;</programlisting>
<simpara><emphasis role="strong">Figure 5.</emphasis> XML snippet showing the new CV terms "identification based on multiple spectra" (MS:1003332) and "Posterior Error Probability from multiple spectra identification" (MS:1003336) used in the context of crosslinking.</simpara>
<simpara>To encode peptide level scores for multiple spectra identifications different CV terms MUST be used.
The values of these new terms take the form: [multiple spectra identification identifier]: [score].
The identifier before the colon MUST be an identifier used for an identification based on multiple spectra.
For example:</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;cvParam accession="MS:1003332" cvRef="PSI-MS" value="*1234:P*" name="identification based on multiple spectra"/&gt;
&lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability for an identification based on multiple spectra" value="*1234:1E-08*"/&gt;</programlisting>
<simpara>A new CV term for "Posterior Error Probability from multiple spectra identification" (MS:1003336) has also been introduced in this context.
For search specific, match level scores for identifications based on multiple spectra, new CV terms must be created, these should be children of MS:1003334 (“Parent term for PSM-level scores for identifications based on multiple spectra”).</simpara>
<programlisting language="xml" linenumbering="unnumbered">&lt;AnalysisData&gt;
    &lt;!-- HCD Spectrum List --&gt;
    &lt;SpectrumIdentificationList id="SII_LIST_4_1"&gt;
        &lt;SpectrumIdentificationResult spectrumID="controllerType=0 controllerNumber=1 scan=3832" spectraData_ref="SD_17022_recal_B210619_02_Lumos_ZC_CO_190_D2I_SDA- WT12019_09_19_OPRmix_35trig_EThcD35.raw" id="SIR_4"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="821.6863" calculatedMassToCharge="821.68451" peptide_ref="T_O-Glycosylation_EAQT_O-Glycosylation_T_O- Glycosylation_PLAA_Hex_4_HexNAc_4" rank="1" passThreshold="true" id="SII_4_1"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="T_O-Glycosylation_EAQT_O-Glycosylation_T_O- Glycosylation_PLAA_Hex_4_HexNAc_4_pe1"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1002520" value="T_O-Glycosylation_EAQT_O- Glycosylation_T_O-Glycosylation_PLAA_Hex_4_HexNAc_4:256" name="peptide group ID"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1002354" value="0.005" name="PSM-level q-value"/&gt;
                &lt;cvParam cvRef="”PSI-MS”" accession="MS:1003332" value="*256*" name="identification based on multiple spectra"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="256:1E-08"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
    &lt;/SpectrumIdentificationList&gt;
    &lt;!-- EThcD Spectrum List --&gt;
    &lt;SpectrumIdentificationList id="SII_LIST_5_1"&gt;
        &lt;SpectrumIdentificationResult spectrumID="controllerType=0 controllerNumber=1 scan=3836" spectraData_ref="SD_17022_recal_B210619_02_Lumos_ZC_CO_190_D2I_SDA- WT12019_09_19_OPRmix_35trig_EThcD35.raw" id="SIR_5"&gt;
            &lt;SpectrumIdentificationItem chargeState="3" experimentalMassToCharge="821.6863" calculatedMassToCharge="821.68451" peptide_ref="T_O-Glycosylation_EAQT_O-Glycosylation_T_O- Glycosylation_PLAA_Hex_4_HexNAc_4" rank="1" passThreshold="true" id="SII_5_1"&gt;
                &lt;PeptideEvidenceRef peptideEvidence_ref="T_Hex_1_HexNAc_1_EAQT_Hex_2_HexNAc_2_T_Hex_1_HexNAc_1_PLAA_pe1"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1002520" value="T_O-Glycosylation_EAQT_O-Glycosylation_T_O- Glycosylation_PLAA_Hex_4_HexNAc_4:256" name="peptide group ID"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003147" value="1:0.97:1:true" name="PTMProphet probability"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003147" value="2:0.83:5:true" name="PTMProphet probability"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003147" value="3:0.89:6:true" name="PTMProphet probability"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1002354" value="0.001" name="PSM-level q-value"/&gt;
                &lt;cvParam cvRef="”PSI-MS”" accession="MS:1003332" name="identification based on multiple spectra" value="*256*"/&gt;
                &lt;cvParam cvRef="PSI-MS" accession="MS:1003336" name="Posterior Error Probability from multiple spectra identification" value="256:1E-08"/&gt;
            &lt;/SpectrumIdentificationItem&gt;
        &lt;/SpectrumIdentificationResult&gt;
    &lt;/SpectrumIdentificationList&gt;
&lt;/AnalysisData&gt;</programlisting>
<simpara><emphasis role="strong">Figure 6.</emphasis> XML snippet showing the new CV terms "identification based on multiple spectra" (MS:1003332) and "Posterior Error Probability from multiple spectra identification" (MS:1003336) used in the context of encoding glycopeptides, using HCD-EThCD spectrum pairs for glycan localization (taken from dataset PXD020077).</simpara>
</section>
<section xml:id="linking-searchmodification-elements-to-modification-elements">
<title>Linking SearchModification elements to Modification elements</title>
<simpara>mzIdentML version 1.3.0 introduces two new CV terms to link &lt;SearchModification&gt; elements and &lt;Modification&gt; elements - “search modification id” (MS:1003392) which goes inside &lt;SearchModification&gt; elements, and “search modification id ref” (MS:1003393) which goes inside &lt;Modification&gt; elements.
This allows for more detailed information on modifications to be provided without redundant repetition of this information throughout the file.
Making this link is optional but recommended where possible.
In the case of open modification searches, such a link cannot be made.</simpara>
<simpara>The values of “search modification id” (MS:1003392) MUST be unique within the &lt;SpectrumIdentificationProtocol&gt; element.</simpara>
<simpara>However, if there are multiple &lt;SpectrumIdentificationProtocol&gt; elements within the file, then the values of “search modification id” SHOULD be identical for identical modifications across different &lt;SpectrumIdentificationProtocol&gt; elements.
This is to avoid the duplication of &lt;Peptide&gt; elements with the same &lt;Modification&gt; but identified with a different &lt;SpectrumIdentificationProtocol&gt;).</simpara>
</section>
</chapter>
<chapter xml:id="conclusions">
<title>Conclusions</title>
<simpara>This document contains the specifications for using the mzIdentML format to represent results from peptide and protein identification pipelines, in the context of a proteomics investigation.
This specification, in conjunction with the XML Schema, mapping file and CV constitute a proposal for a standard from the Proteomics Standards Initiative.</simpara>
</chapter>
<chapter xml:id="authors-and-contributors">
<title>Authors and Contributors</title>
<simpara>Colin W. Combe, University of Edinburgh</simpara>
<simpara>Lars Kolbowski, Technical University Berlin</simpara>
<simpara>Lutz Fischer, University of Edinburgh</simpara>
<simpara>Ville Koskinen, Matrix Science, London</simpara>
<simpara>Joshua Klein, Boston University</simpara>
<simpara>Alexander Leitner, ETH Zurich</simpara>
<simpara>Juan Antonio Vizcaíno, European Informatics Institute</simpara>
<simpara>Andy Jones, University of Liverpool</simpara>
<simpara>Juri Rappsilber, University of Edinburg &amp; Technical University Berlin</simpara>
<simpara>The mzIdentML version 1.2.0 authors were as follows:</simpara>
<simpara>Juan Antonio Vizcaíno, European Bioinformatics Institute</simpara>
<simpara>Gerhard Mayer, Medizinisches Proteom-Center, Ruhr-Universität Bochum</simpara>
<simpara>Martin Eisenacher, Medizinisches Proteom-Center, Ruhr-Universität Bochum</simpara>
<simpara>Sean Seymour, SCIEX / Applied Biosystems</simpara>
<simpara>Terry Farrah, Institute for Systems Biology</simpara>
<simpara>Eric Deutsch, Institute for Systems Biology</simpara>
<simpara>Andrew R Jones, University of Liverpool</simpara>
<simpara>The mzIdentML version 1.1.0 authors were as follows:</simpara>
<simpara>David Creasy, Matrix Science</simpara>
<simpara>Florian Reisinger, European Bioinformatics Institute</simpara>
<simpara>Johannes Griss, European Bioinformatics Institute</simpara>
<simpara>Juan Antonio Vizcaíno, European Bioinformatics Institute</simpara>
<simpara>Matthew Chambers, Vanderbilt University Medical Center</simpara>
<simpara>Gerhard Mayer, Medizinisches Proteom-Center, Ruhr-Universität Bochum</simpara>
<simpara>Martin Eisenacher, Medizinisches Proteom-Center, Ruhr-Universität Bochum</simpara>
<simpara>Andrew Jones, University of Liverpool</simpara>
<simpara>The mzIdentML version 1.0 authors were as follows:</simpara>
<simpara>Angel Pizarro, Center for Bioinformatics, University of Pennsylvania</simpara>
<simpara>David Creasy, MatrixScience</simpara>
<simpara>Phil Jones, European Bioinformatics Institute</simpara>
<simpara>Andreas Bertsch, Eberhard Karls University Tübingen</simpara>
<simpara>Jenny Siepen, University of Manchester</simpara>
<simpara>Martin Eisenacher, Medizinisches Proteom-Center, Ruhr-Universität Bochum</simpara>
<simpara>Andrew Jones, University of Liverpool</simpara>
<simpara>In addition to the authors indicated above, the following people contributed to the model development, gave feedback or tested mzIdentML:</simpara>
<itemizedlist>
<listitem>
<simpara>Simon Hubbard, University of Manchester</simpara>
</listitem>
<listitem>
<simpara>Julian Selley, University of Manchester</simpara>
</listitem>
<listitem>
<simpara>Zsuzsanna Bencsath-Makkai, Biomedical Engineering, McGill University</simpara>
</listitem>
<listitem>
<simpara>Randy Julian, IndigoBio</simpara>
</listitem>
<listitem>
<simpara>Pierre-Alain Binz, GeneBio Geneva</simpara>
</listitem>
<listitem>
<simpara>Alex Masselot, GeneBio Geneva</simpara>
</listitem>
<listitem>
<simpara>Lennart Martens, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Henning Hermjakob, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Luisa Montecchi, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Richard Côté, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Marc Sturm, Eberhard Karls University, Tübingen</simpara>
</listitem>
<listitem>
<simpara>Jim Shofstahl, Thermo Fisher</simpara>
</listitem>
<listitem>
<simpara>David Horn, Agilent</simpara>
</listitem>
<listitem>
<simpara>Jimmy Eng, Fred Hutchinson Cancer Research</simpara>
</listitem>
<listitem>
<simpara>Brian Searle, Proteome Software</simpara>
</listitem>
<listitem>
<simpara>Phillip Young, Waters</simpara>
</listitem>
<listitem>
<simpara>Michael Kohl, Medizinisches Proteom-Center, Ruhr-Universität Bochum, Germany</simpara>
</listitem>
<listitem>
<simpara>Christian Stephan, Medizinisches Proteom-Center, Ruhr-Universität Bochum, Germany</simpara>
</listitem>
<listitem>
<simpara>Eugene Kapp, Ludwig Institute for Cancer Research</simpara>
</listitem>
<listitem>
<simpara>Michael Coleman, Stowers Institute</simpara>
</listitem>
<listitem>
<simpara>Julian Uszkoreit, Medizinisches Proteom-Center, Ruhr-Universität Bochum, Germany</simpara>
</listitem>
<listitem>
<simpara>Oliver Kohlbacher, Eberhard Karls University, Tübingen</simpara>
</listitem>
<listitem>
<simpara>Mathias Walzer, Eberhard Karls University, Tübingen</simpara>
</listitem>
<listitem>
<simpara>David Ovelleiro, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Alberto Medina, ProteoRed Consortium, Spain</simpara>
</listitem>
<listitem>
<simpara>Salvador Martínez-Bartolomé, ProteoRed Consortium, Spain</simpara>
</listitem>
<listitem>
<simpara>Laurent Gatto, University of Cambridge</simpara>
</listitem>
<listitem>
<simpara>Simon Perkins, University of Liverpool</simpara>
</listitem>
<listitem>
<simpara>Harald Barsnes, University of Bergen</simpara>
</listitem>
<listitem>
<simpara>Marc Vaudel, University of Bergen</simpara>
</listitem>
<listitem>
<simpara>Yasset Perez-Riverol, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Tobias Ternent, European Bioinformatics Institute</simpara>
</listitem>
<listitem>
<simpara>Eugen Netz, Max Planck Institute for Developmental Biology, Tübingen</simpara>
</listitem>
<listitem>
<simpara>Robert J. Chalkley, University of California San Franscisco</simpara>
</listitem>
<listitem>
<simpara>Fawaz Ghali, University of Liverpool</simpara>
</listitem>
<listitem>
<simpara>Timo Sachsenberg, Eberhard Karls University, Tübingen</simpara>
</listitem>
<listitem>
<simpara>Andrea Sinz, Martin Luther University Halle-Wittenberg</simpara>
</listitem>
</itemizedlist>
</chapter>
<bibliography xml:id="references">
<title>References</title>
<bibliodiv>
<bibliomixed>
<bibliomisc><anchor xml:id="martens2011" xreflabel="[1]"/>[1] Martens, L., Chambers, M., Sturm, M., Kessner, D., Levander, F., Shofstahl, J., Tang, W. H., Römpp, A., Neumann, S., Pizarro, A. D., Montecchi-Palazzi, L., Tasman, N., Coleman, M., Reisinger, F., Souda, P., Hermjakob, H., Binz, P.-A., and Deutsch, E. W. (2011) mzML—a Community Standard for Mass Spectrometry Data. <emphasis>Molecular &amp; Cellular Proteomics</emphasis> 10, R110.0001332.</bibliomisc>
</bibliomixed>
<bibliomixed>
<bibliomisc><anchor xml:id="walzer2013" xreflabel="[2]"/>[2] Walzer, M., Qi, D., Mayer, G., Uszkoreit, J., Eisenacher, M., Sachsenberg, T., Gonzalez-Galarza, F. F., Fan, J., Bessant, C., Deutsch, E. W., Reisinger, F., Vizcaino, J. A., Medina-Aunon, J. A., Albar, J. P., Kohlbacher, O., and Jones, A. R. (2013) The mzQuantML data standard for mass spectrometry-based quantitative studies in proteomics. <emphasis>Molecular &amp; cellular proteomics : MCP</emphasis> 12, 2332-23403.</bibliomisc>
</bibliomixed>
<bibliomixed>
<bibliomisc><anchor xml:id="griss20014" xreflabel="[3]"/>[3] Griss, J., Jones, A. R., Sachsenberg, T., Walzer, M., Gatto, L., Hartler, J., Thallinger, G. G., Salek, R. M., Steinbeck, C., Neuhauser, N., Cox, J., Neumann, S., Fan, J., Reisinger, F., Xu, Q. W., Del Toro, N., Perez-Riverol, Y., Ghali, F., Bandeira, N., Xenarios, I., Kohlbacher, O., Vizcaino, J. A., and Hermjakob, H. (2014) The mzTab data exchange format: communicating mass-spectrometry-based proteomics and metabolomics experimental results to a wider audience. <emphasis>Molecular &amp; cellular proteomics : MCP</emphasis> 13, 2765-27754.</bibliomisc>
</bibliomixed>
<bibliomixed>
<bibliomisc><anchor xml:id="kerrien2007" xreflabel="[4]"/>[4] Kerrien, S., Orchard, S., Montecchi-Palazzi, L., Aranda, B., Quinn, A., Vinod, N., Bader, G., Xenarios, I., Wojcik, J., Sherman, D., Tyers, M., Salama, J., Moore, S., Ceol, A., Chatr-aryamontri, A., Oesterheld, M., Stumpflen, V., Salwinski, L., Nerothin, J., Cerami, E., Cusick, M., Vidal, M., Gilson, M., Armstrong, J., Woollard, P., Hogue, C., Eisenberg, D., Cesareni, G., Apweiler, R., and Hermjakob, H. (2007) Broadening the horizon - level 2.5 of the HUPO-PSI format for molecular interactions. <emphasis>BMC Biology</emphasis> 5, 445.</bibliomisc>
</bibliomixed>
<bibliomixed>
<bibliomisc><anchor xml:id="mayer2013" xreflabel="[5]"/>[5] Mayer, G., Montecchi-Palazzi, L., Ovelleiro, D., Jones, A. R., Binz, P.-A., Deutsch, E. W., Chambers, M., Kallhardt, M., Levander, F., Shofstahl, J., Orchard, S., Antonio Vizcaíno, J., Hermjakob, H., Stephan, C., Meyer, H. E., and Eisenacher, M. (2013) The HUPO proteomics standards initiative- mass spectrometry controlled vocabulary. <emphasis>Database</emphasis> 20136.</bibliomisc>
</bibliomixed>
<bibliomixed>
<bibliomisc><anchor xml:id="seymour2014" xreflabel="[6]"/>[6] Seymour, S. L., Farrah, T., Binz, P. A., Chalkley, R. J., Cottrell, J. S., Searle, B. C., Tabb, D. L., Vizcaino, J. A., Prieto, G., Uszkoreit, J., Eisenacher, M., Martinez-Bartolome, S., Ghali, F., and Jones, A. R. (2014) A standardized framing for reporting protein identifications in mzIdentML 1.2. <emphasis>Proteomics</emphasis> 14, 2389-23997.</bibliomisc>
</bibliomixed>
<bibliomixed>
<bibliomisc><anchor xml:id="li2009" xreflabel="[7]"/>[7] Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., and Durbin, R. (2009) The Sequence Alignment/Map format and SAMtools. <emphasis>Bioinformatics (Oxford, England)</emphasis> 25, 2078-2079</bibliomisc>
</bibliomixed>
</bibliodiv>
</bibliography>
<chapter xml:id="intellectual-property-statement">
<title>Intellectual Property Statement</title>
<simpara>The PSI takes no position regarding the validity or scope of any intellectual property or other rights that might be claimed to pertain to the implementation or use of the technology described in this document or the extent to which any license under such rights might or might not be available; neither does it represent that it has made any effort to identify any such rights.
Copies of claims of rights made available for publication and any assurances of licenses to be made available, or the result of an attempt made to obtain a general license or permission for the use of such proprietary rights by implementers or users of this specification can be obtained from the PSI Chair.</simpara>
<simpara>The PSI invites any interested party to bring to its attention any copyrights, patents or patent applications, or other proprietary rights that may cover technology that may be required to practice this recommendation.
Please address the information to the PSI Chair (see contacts information at PSI website).</simpara>
</chapter>
<chapter xml:id="copyright-notice">
<title>Copyright Notice</title>
<simpara>Copyright &#169; Proteomics Standards Initiative (2024).
All Rights Reserved.</simpara>
<simpara>This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works.
However, this document itself may not be modified in any way, such as by removing the copyright notice or references to the PSI or other organizations, except as needed for the purpose of developing Proteomics Recommendations in which case the procedures for copyrights defined in the PSI Document process must be followed, or as required to translate it into languages other than English.</simpara>
<simpara>The limited permissions granted above are perpetual and will not be revoked by the PSI or its successors or assigns.</simpara>
<simpara>This document and the information contained herein is provided on an "AS IS" basis and THE PROTEOMICS STANDARDS INITIATIVE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE."</simpara>
</chapter>
</book>