Paweł Widera, 10/05/2007 08:35 PM
XML output definition added.


= Standardising Results with XML =

ProCKSI utilises a variety of similarity comparison methods (e.g. USM, MaxCMO, TMalign, ...), each producing different similarity measures (e.g. Zscore, TMscore, RMSD, ...). Each method also writes its output in a different format, with additional content such as alignments or rotation matrices. Some produce just one output file, others a set of linked HTML files.

== Input ==

Optional tags: '''exclude''' (measure, result), '''log''' (no log is generated if not specified) [[BR]]
Optional attributes: '''description'''

{{{
<job id="ID" description="TEXT">
  <log filename="FILENAME" />
  <input type="structure|tree|contact map|similarity matrix">
    <item id="ID" label="TEXT" filename="FILENAME" />
    :::
    <item id="ID" label="TEXT" filename="FILENAME" />
  </input>
  <method id="ID" name="TEXT">
    <param name="TEXT">VALUE</param>
    :::
    <param name="TEXT">VALUE</param>
    <exclude>
      <measure>NAME</measure>
      :::
      <measure>NAME</measure>
      <result>NAME</result>
      :::
      <result>NAME</result>
    </exclude>
  </method>
  :::
  <method ...>
    ...
  </method>
</job>
}}}

The input data can be protein structures, similarity trees, contact maps or similarity matrices. Every specified method must be able to operate on the given data files; this dependency could be verified automatically using XML Schema.
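The dependency between input type and methods can also be illustrated outside of XML Schema. The following is a minimal sketch in Python, assuming a job file that follows the input template above. The sample values (`job1`, the file names, the `example` parameter) and the `accepted` mapping of method names to input types are hypothetical and serve only as an illustration; they are not part of the ProCKSI definition.

```python
import xml.etree.ElementTree as ET

# Hypothetical job description following the input template.
SAMPLE_INPUT = """\
<job id="job1" description="demo job">
  <log filename="job1.log" />
  <input type="structure">
    <item id="p1" label="protein 1" filename="p1.pdb" />
    <item id="p2" label="protein 2" filename="p2.pdb" />
  </input>
  <method id="m1" name="USM">
    <param name="example">1</param>
    <exclude>
      <measure>ExampleMeasure</measure>
    </exclude>
  </method>
</job>
"""


def parse_job(xml_text):
    """Read the input type, the items and the methods of a job description."""
    root = ET.fromstring(xml_text)
    input_node = root.find("input")
    methods = []
    for method in root.findall("method"):
        methods.append({
            "id": method.get("id"),
            "name": method.get("name"),
            # <param name="...">VALUE</param> pairs of this method
            "params": {p.get("name"): p.text for p in method.findall("param")},
            # measures listed under the optional <exclude> tag
            "excluded_measures": [m.text for m in method.findall("exclude/measure")],
        })
    return {
        "id": root.get("id"),
        "input_type": input_node.get("type"),
        "items": [dict(item.attrib) for item in input_node.findall("item")],
        "methods": methods,
    }


def incompatible_methods(job, accepted):
    """Return names of methods that cannot operate on the job's input type.

    `accepted` maps a method name to the set of input types it supports
    (an assumed lookup table, not defined by the XML format itself).
    """
    return [m["name"] for m in job["methods"]
            if job["input_type"] not in accepted.get(m["name"], set())]


job = parse_job(SAMPLE_INPUT)
# Assumed capability table: here USM is declared to accept contact maps only.
print(incompatible_methods(job, {"USM": {"contact map"}}))
```

A check like this would run before a job is started, so that a method is never handed a data type it cannot read.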

== Output ==

Optional tags: '''log''', '''message''', '''similarity''' (used only if output is a ''comparison'') [[BR]]
Optional attributes: '''description''', '''node''', '''start''', '''end''', '''ref_id''' (only if output type is ''composition''), '''ref_id2''' (only if output type is not ''comparison'')

{{{
<job id="ID" description="TEXT" node="TEXT" start="TIME" end="TIME">
  <log filename="FILENAME" />
  <message type="error|warning|info">TEXT</message>
  :::
  <message type="error|warning|info">TEXT</message>
  <input type="structure|tree|contact map|similarity matrix">
    <item id="ID" label="TEXT" filename="FILENAME" />
    :::
    <item id="ID" label="TEXT" filename="FILENAME" />
  </input>
  <parameters>
    <method id="ID" name="NAME">
      <parameter name="TEXT">VALUE</parameter>
      :::
      <parameter name="TEXT">VALUE</parameter>
    </method>
    :::
    <method ...>
      ...
    </method>
  </parameters>
  <output type="transformation|comparison|composition" ref_id="ID" ref_id2="ID">
    <method id="ID">
      <message type="error|warning|info">TEXT</message>
      :::
      <message type="error|warning|info">TEXT</message>
      <similarity measure="NAME">VALUE</similarity>
      :::
      <similarity measure="NAME">VALUE</similarity>
      <file type="TEXT" label="TEXT" name="FILENAME" />
      :::
      <file type="TEXT" label="TEXT" name="FILENAME" />
    </method>
  </output>
  :::
  <output ...>
    ...
  </output>
</job>
}}}

A message (an error, a warning or additional information) can be reported at the global or at the method level. Input data and parameters defined in the input file can be repeated in the output if needed, which makes the output self-contained. An output can be a 1->1 transformation (e.g. structure -> contact map), a 2->1 comparison (e.g. 2*structure -> similarity measure) or an N->1 composition (e.g. N*tree -> total tree, or N*similarity matrix -> consensus similarity matrix). Results other than similarity measures for a pair of proteins are stored in external files and are only referenced from the XML file.
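To show how a consumer might read such a result file, here is a minimal sketch in Python. It assumes an output document that follows the output template; the sample values (`job1`, `m1`, the message texts, the file name and the numbers) are made up for illustration, and the optional attributes are left out. Similarity values are collected per output and method, and global messages are kept apart from method-level ones.

```python
import xml.etree.ElementTree as ET

# Hypothetical result document following the output template.
SAMPLE_OUTPUT = """\
<job id="job1">
  <message type="info">global example message</message>
  <output type="comparison">
    <method id="m1">
      <message type="warning">method-level example message</message>
      <similarity measure="TMscore">0.5</similarity>
      <similarity measure="RMSD">2.0</similarity>
      <file type="alignment" label="example alignment" name="p1_p2.aln" />
    </method>
  </output>
</job>
"""


def collect_similarities(xml_text):
    """Map (output index, method id, measure name) to a float value.

    Only outputs of type "comparison" carry <similarity> tags.
    """
    root = ET.fromstring(xml_text)
    table = {}
    for index, output in enumerate(root.findall("output")):
        if output.get("type") != "comparison":
            continue
        for method in output.findall("method"):
            for sim in method.findall("similarity"):
                key = (index, method.get("id"), sim.get("measure"))
                table[key] = float(sim.text)
    return table


def collect_messages(xml_text):
    """Return (global messages, method-level messages) as (type, text) pairs."""
    root = ET.fromstring(xml_text)
    # Direct children of <job> are global messages.
    global_msgs = [(m.get("type"), m.text) for m in root.findall("message")]
    # Messages nested in <output>/<method> belong to a single method run.
    method_msgs = [(m.get("type"), m.text)
                   for m in root.findall("output/method/message")]
    return global_msgs, method_msgs


def referenced_files(xml_text):
    """List the external result files that the XML only references."""
    root = ET.fromstring(xml_text)
    return [f.get("name") for f in root.findall("output/method/file")]


print(collect_similarities(SAMPLE_OUTPUT))
print(collect_messages(SAMPLE_OUTPUT))
print(referenced_files(SAMPLE_OUTPUT))
```

Keeping the similarity values in the XML and everything else in referenced files means a reader like this never has to parse method-specific formats to build a similarity table.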

The alignment data could be described in the XML file, as there is no single format used by all programs. This is yet to be decided.

ProCKSI-core-API.png (13.3 KB) Paweł Widera, 08/27/2013 03:16 AM

ProCKSI-core-API.xmi (12.5 KB) Paweł Widera, 08/27/2013 03:16 AM