Datasets » History » Version 20

Paweł Widera, 12/09/2013 02:45 PM
Table formatting corrected. Links to datasets fixed.

1 1 Anonymous
2 19 Anonymous
h1. Datasets
3 19 Anonymous
4 19 Anonymous
5 19 Anonymous
6 19 Anonymous
h2. How to label datasets
7 19 Anonymous
8 1 Anonymous
Datasets are labeled in the following way:
9 19 Anonymous
<pre>
10 19 Anonymous
[[DatasetName]].tgz
11 19 Anonymous
[[DatasetName]]_Literature.tgz
12 19 Anonymous
[[DatasetName]]_NumberOfChains_Extraction.tgz
13 19 Anonymous
</pre>
14 1 Anonymous
15 1 Anonymous
whereas:
16 20 Paweł Widera
|| *DatasetName*     || Authors or web that site proposed the dataset ||
17 20 Paweł Widera
|| *Literature*       || Special name given in the literatur ||
18 20 Paweł Widera
|| *NumberOfChains*  || Number of (extracted) chains contained in the dataset ||
19 20 Paweł Widera
|| *Extraction*       || Extraction of the models/chains: _none_, _first-first_, _first-all_, _all-all_ ||
20 18 Anonymous
21 19 Anonymous
22 19 Anonymous
h2. Available Datasets
23 19 Anonymous
24 18 Anonymous
The following datasets are available from the repository (special privileges required!):
25 20 Paweł Widera
|| *DatasetName*     || *Extraction*  || *!NumberOfChains*|| *Size in MB*|| *Link* ||
26 20 Paweł Widera
|| LelukKoniecznyRoterman|| -           || -              || 3.5       || "Download":http://www.ico2s.org/data/instances/procksi/LelukKoniecznyRoterman.tgz ||
27 20 Paweł Widera
||                        || first-first || 6              || 0.4       || "Download":http://www.ico2s.org/data/instances/procksi/LelukKoniecznyRoterman_6_first-first.tgz ||
28 20 Paweł Widera
||                        || first-all   || 15             || 0.9       || "Download":http://www.ico2s.org/data/instances/procksi/LelukKoniecznyRoterman_15_first-all.tgz ||
29 18 Anonymous
30 20 Paweł Widera
|| *DatasetName*     || *Extraction*  || *!NumberOfChains*|| *Size in MB*|| *Link* ||
31 20 Paweł Widera
|| ChewKedem             || -           || -              || 3.8       || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem.tgz ||
32 20 Paweł Widera
||                        || first-first || 34             || 1.3       || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem_34_first-first.tgz ||
33 20 Paweł Widera
||                        || first-all   || 54             || 2.0       || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem_54_first-all.tgz ||
34 20 Paweł Widera
||                        || all-all     || 132            || 4.1       || "Download":http://www.ico2s.org/data/instances/procksi/ChewKedem_132_all-all.tgz ||
35 1 Anonymous
36 20 Paweł Widera
|| *DatasetName*     || *Extraction*  || *!NumberOfChains*|| *Size in MB*|| *Link* ||
37 20 Paweł Widera
|| ProteinKinaseResource || -           || -              || 3.6       || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource.tgz ||
38 20 Paweł Widera
||                        || first-first || 45             || 2.4       || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource_45_first-first.tgz ||
39 20 Paweł Widera
||                        || first-all   || 49             || 2.5       || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource_49_first-all.tgz ||
40 20 Paweł Widera
||                        || all-all     || 106            || 4.0       || "Download":http://www.ico2s.org/data/instances/procksi/ProteinKinaseResource_106_all-all.tgz ||
41 1 Anonymous
42 20 Paweł Widera
|| *DatasetName*     || *Extraction*  || *!NumberOfChains*|| *Size in MB*|| *Link* ||
43 20 Paweł Widera
|| Skolnick               || -           || -              || 5.1       || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick.tgz ||
44 20 Paweł Widera
||                        || first-first || 33             || 1.1       || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick_33_first-first.tgz ||
45 20 Paweł Widera
||                        || first-all   || 65             || 2.1       || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick_65_first-all.tgz ||
46 20 Paweł Widera
||                        || all-all     || 179            || 5.9       || "Download":http://www.ico2s.org/data/instances/procksi/Skolnick_179_all-all.tgz ||
47 18 Anonymous
48 20 Paweł Widera
|| *DatasetName*     || *Extraction*  || *!NumberOfChains*|| *Size in MB*|| *Link* ||
49 20 Paweł Widera
|| RostSander            || -           || -              || 7.4       || "Download":http://www.ico2s.org/data/instances/procksi/RostSander.tgz ||
50 20 Paweł Widera
||                        || RS126       || 126            || 4.3       || "Download":http://www.ico2s.org/data/instances/procksi/RostSander_RS126.tgz ||
51 20 Paweł Widera
||                        || first-first || 119            || 4.4       || "Download":http://www.ico2s.org/data/instances/procksi/RostSander_119_first-first.tgz ||
52 20 Paweł Widera
||                        || first-all   || 212            || 7.6       || "Download":http://www.ico2s.org/data/instances/procksi/RostSander_212_first-all.tgz ||
53 18 Anonymous
54 20 Paweł Widera
|| *DatasetName*           || *Extraction*  || *!NumberOfChains*|| *Size in MB*|| *Link* ||
55 20 Paweł Widera
|| KinjoHorimotoNishikawa|| -           || -              || 98        || "Download":http://www.ico2s.org/data/instances/procksi/KinjoHorimotoNishikawa.tgz ||
56 20 Paweł Widera
||                        || first-first || 1012           || 46        || "Download":http://www.ico2s.org/data/instances/procksi/KinjoHorimotoNishikawa_1012_first-first.tgz ||
57 20 Paweł Widera
||                        || first-all   || 2013           || 88        || "Download":http://www.ico2s.org/data/instances/procksi/KinjoHorimotoNishikawa_2013_first-all.tgz ||
58 17 Anonymous
59 20 Paweł Widera
|| *DatasetName* || *Description*         || *Extraction*  || *!NumberOfChains*|| *Size in MB*        || *Link* ||
60 20 Paweł Widera
|| Shah               || Randomly selected 1000 proteins from PDB    || -           || -              || 114        || "Download":http://www.ico2s.org/data/instances/procksi/Shah.tgz                   
61 20 Paweł Widera
||                    ||                           || first-first || 1000           || 41         || "Download":http://www.ico2s.org/data/instances/procksi/Shah_1000_first-first.tgz ||
62 20 Paweł Widera
||                    ||                           || first-all   || 1943           || 80         || "Download":http://www.ico2s.org/data/instances/procksi/Shah_1943_first-all.tgz ||
63 20 Paweł Widera
||                    ||                           || all-all     || 4007           || 124        || "Download":http://www.ico2s.org/data/instances/procksi/Shah_4007_all-all.tgz ||
64 17 Anonymous
65 17 Anonymous
66 20 Paweł Widera
|| *DatasetName* || *Description*         || *Extraction*  || *!NumberOfChains*|| *Size in GB*        || *Link* ||
67 20 Paweł Widera
||  PDB_SELECT30_04-2008             || Downloaded from PDB web site on 10/04/2008              || -           || -              || *1.1*,  ucmp*: *4.8*||   "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_7307.tar.gz     ||                    
68 20 Paweł Widera
||                    || with criteria "Remove similar sequences at 30% identity"          || first-first || 7183           || *0.285*, ucmp*: *1.2*    || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_7183_first-first.tar.gz ||
69 20 Paweł Widera
||                    ||                           || first-all   || 14651                || *0.60* , ucmp:*2.7*         || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_14651_first-all.tar.gz ||
70 20 Paweł Widera
||                    ||                           || all-all     || 43025               || *1.2*   , ucmp*: *5.5*        ||  "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT30_04-2008_43025_all-all.tar.gz ||
71 19 Anonymous
72 20 Paweł Widera
|| *DatasetName* || *Description*         || *Extraction*  || *!NumberOfChains*|| *Size in GB*        || *Link* ||
73 20 Paweł Widera
|| PDB_SELECT25_10-2007              || PDB_SELECT25 as of October2007           || -           || -              || *0.746*, ucmp*: *3.4*        ||                     "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_3560.tar.gz ||
74 20 Paweł Widera
||                    || it's a six monthly updated list of-      || first-first || 3464           ||*0.12*,  ucmp*: *0.54*,           || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_3464_first-first.tar.gz ||
75 20 Paweł Widera
||                    || non-redundent protein structures                          || first-all   || 8581               || *0.30*, ucmp*: *1.4*         || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_8581_first-all.tar.gz  ||
76 20 Paweł Widera
||                    || Mostly used in Protein Structure Prediction                          || all-all     ||31288                || *0.854* , ucmp*: *4.1*        || "Download":http://www.ico2s.org/data/instances/procksi/PDB_SELECT25_10-2007_31288_all-all.tar.gz ||
77 20 Paweł Widera
78 1 Anonymous
79 1 Anonymous
*ucmp: uncompressed