Datasets » History » Version 21

Paweł Widera, 04/28/2014 05:43 PM
Formatting corrected.

1 1 Anonymous
2 19 Anonymous
h1. Datasets
3 19 Anonymous
4 19 Anonymous
5 19 Anonymous
6 19 Anonymous
h2. How to label datasets
7 19 Anonymous
8 1 Anonymous
Datasets are labeled in the following way:
9 19 Anonymous
<pre>
10 19 Anonymous
[[DatasetName]].tgz
11 19 Anonymous
[[DatasetName]]_Literature.tgz
12 19 Anonymous
[[DatasetName]]_NumberOfChains_Extraction.tgz
13 19 Anonymous
</pre>
14 1 Anonymous
15 1 Anonymous
whereas:
16 20 Paweł Widera
|| *DatasetName*     || Authors or web that site proposed the dataset ||
17 21 Paweł Widera
|| *Literature*      || Special name given in the literatur ||
18 20 Paweł Widera
|| *NumberOfChains*  || Number of (extracted) chains contained in the dataset ||
19 21 Paweł Widera
|| *Extraction*      || Extraction of the models/chains: _none_, _first-first_, _first-all_, _all-all_ ||
20 18 Anonymous
21 19 Anonymous
22 19 Anonymous
h2. Available Datasets
23 19 Anonymous
24 21 Paweł Widera
The following datasets are available to download:
25 21 Paweł Widera
|| *DatasetName*          || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
26 21 Paweł Widera
|| LelukKoniecznyRoterman || -            || -               || 3.5        || "Download":http://www.ico2s.org/data/instances/procksi/LelukKoniecznyRoterman.tgz ||
27 21 Paweł Widera
||                        || first-first  || 6               || 0.4        || "Download":http://www.ico2s.org/data/instances/procksi/LelukKoniecznyRoterman_6_first-first.tgz ||
28 21 Paweł Widera
||                        || first-all    || 15              || 0.9        || "Download":http://www.ico2s.org/data/instances/procksi/LelukKoniecznyRoterman_15_first-all.tgz ||
29 18 Anonymous
30 21 Paweł Widera
|| *DatasetName*          || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
31 21 Paweł Widera
|| ChewKedem              || -            || -               || 3.8        || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem.tgz ||
32 21 Paweł Widera
||                        || first-first  || 34              || 1.3        || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem_34_first-first.tgz ||
33 21 Paweł Widera
||                        || first-all    || 54              || 2.0        || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem_54_first-all.tgz ||
34 21 Paweł Widera
||                        || all-all      || 132             || 4.1        || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem_132_all-all.tgz ||
35 1 Anonymous
36 21 Paweł Widera
|| *DatasetName*          || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
37 21 Paweł Widera
|| ProteinKinaseResource  || -            || -               || 3.6        || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource.tgz ||
38 21 Paweł Widera
||                        || first-first  || 45              || 2.4        || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource_45_first-first.tgz ||
39 21 Paweł Widera
||                        || first-all    || 49              || 2.5        || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource_49_first-all.tgz ||
40 21 Paweł Widera
||                        || all-all      || 106             || 4.0        || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource_106_all-all.tgz ||
41 1 Anonymous
42 21 Paweł Widera
|| *DatasetName*          || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
43 21 Paweł Widera
|| Skolnick               || -            || -               || 5.1        || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick.tgz ||
44 21 Paweł Widera
||                        || first-first  || 33              || 1.1        || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick_33_first-first.tgz ||
45 21 Paweł Widera
||                        || first-all    || 65              || 2.1        || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick_65_first-all.tgz ||
46 21 Paweł Widera
||                        || all-all      || 179             || 5.9        || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick_179_all-all.tgz ||
47 18 Anonymous
48 21 Paweł Widera
|| *DatasetName*          || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
49 21 Paweł Widera
|| RostSander             || -            || -               || 7.4        || "Download":http://www.ico2s.org/data/instances/procksi/RostSander.tgz ||
50 21 Paweł Widera
||                        || RS126        || 126             || 4.3        || "Download":http://www.ico2s.org/data/instances/procksi/RostSander_RS126.tgz ||
51 21 Paweł Widera
||                        || first-first  || 119             || 4.4        || "Download":http://www.ico2s.org/data/instances/procksi/RostSander_119_first-first.tgz ||
52 21 Paweł Widera
||                        || first-all    || 212             || 7.6        || "Download":http://www.ico2s.org/data/instances/procksi/RostSander_212_first-all.tgz ||
53 18 Anonymous
54 21 Paweł Widera
|| *DatasetName*          || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
55 21 Paweł Widera
|| KinjoHorimotoNishikawa || -            || -               || 98         || "Download":http://www.ico2s.org/data/instances/procksi/KinjoHorimotoNishikawa.tgz ||
56 21 Paweł Widera
||                        || first-first  || 1012            || 46         || "Download":http://www.ico2s.org/data/instances/procksi/KinjoHorimotoNishikawa_1012_first-first.tgz ||
57 21 Paweł Widera
||                        || first-all    || 2013            || 88         || "Download":http://www.ico2s.org/data/instances/procksi/KinjoHorimotoNishikawa_2013_first-all.tgz ||
58 17 Anonymous
59 21 Paweł Widera
|| *DatasetName*        || *Description*                               || *Extraction* ||*!NumberOfChains*||*Size in MB*|| *Link* ||
60 21 Paweł Widera
|| Shah                 || Randomly selected 1000 proteins from PDB    || -            || -               || 114        || "Download":http://www.ico2s.org/data/instances/procksi/Shah.tgz ||
61 21 Paweł Widera
||                      ||                                             || first-first  || 1000            || 41         || "Download":http://www.ico2s.org/data/instances/procksi/Shah_1000_first-first.tgz ||
62 21 Paweł Widera
||                      ||                                             || first-all    || 1943            || 80         || "Download":http://www.ico2s.org/data/instances/procksi/Shah_1943_first-all.tgz ||
63 21 Paweł Widera
||                      ||                                             || all-all      || 4007            || 124        || "Download":http://www.ico2s.org/data/instances/procksi/Shah_4007_all-all.tgz ||
64 17 Anonymous
65 17 Anonymous
66 21 Paweł Widera
|| *DatasetName*        || *Description*                                            || *Extraction* ||*!NumberOfChains*|| *Size in GB*          || *Link* ||
67 21 Paweł Widera
|| PDB_SELECT30_04-2008 || Downloaded from PDB web site on 10/04/2008               || -            || -               || *1.1*,  ucmp*: *4.8*  || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_7307.tar.gz ||
68 21 Paweł Widera
||                      || with criteria "Remove similar sequences at 30% identity" || first-first  || 7183            || *0.285*, ucmp*: *1.2* || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_7183_first-first.tar.gz ||
69 21 Paweł Widera
||                      ||                                                          || first-all    || 14651           || *0.60*, ucmp:*2.7*    || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_14651_first-all.tar.gz ||
70 21 Paweł Widera
||                      ||                                                          || all-all      || 43025           || *1.2*, ucmp*: *5.5*   || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_43025_all-all.tar.gz ||
71 19 Anonymous
72 21 Paweł Widera
|| *DatasetName*        || *Description*                           || *Extraction* ||*!NumberOfChains*|| *Size in GB*           || *Link* ||
73 21 Paweł Widera
|| PDB_SELECT25_10-2007 || PDB_SELECT25 as of October2007          || -            || -               || *0.746*, ucmp*: *3.4*  || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_3560.tar.gz ||
74 21 Paweł Widera
||                      || it's a six monthly updated list of      || first-first  || 3464            || *0.12*,  ucmp*: *0.54* || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_3464_first-first.tar.gz ||
75 21 Paweł Widera
||                      || non-redundent protein structures mostly || first-all    || 8581            || *0.30*, ucmp*: *1.4*   || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_8581_first-all.tar.gz  ||
76 21 Paweł Widera
||                      || used in Protein Structure Prediction    || all-all      ||31288            || *0.854*, ucmp*: *4.1*  || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_31288_all-all.tar.gz ||
77 20 Paweł Widera
78 1 Anonymous
79 1 Anonymous
*ucmp: uncompressed