Datasets » History » Version 19

Anonymous, 04/23/2008 09:39 AM
Change links to datasets

h1. Datasets

h2. How to label datasets

Datasets are labeled in the following way:
<pre>
[[DatasetName]].tgz
[[DatasetName]]_Literature.tgz
[[DatasetName]]_NumberOfChains_Extraction.tgz
</pre>

where:
 || *!DatasetName*     || Authors or web site that proposed the dataset
 || *Literature*       || Special name given in the literature
 || *!NumberOfChains*  || Number of (extracted) chains contained in the dataset
 || *Extraction*       || Extraction of the models/chains: _none_, _first-first_, _first-all_, _all-all_

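The labeling scheme can be turned into a small parser. The following is only a sketch: the names @DatasetFile@ and @parse_dataset_filename@ are hypothetical and not part of the repository. It accepts both @.tgz@ and @.tar.gz@, and it assumes that a dataset name in the first two forms contains no underscore, since the scheme alone cannot separate an underscore inside a name from the _Literature_ suffix.

```python
import re
from typing import NamedTuple, Optional


class DatasetFile(NamedTuple):
    """Fields of a dataset archive name (hypothetical helper type)."""
    dataset: str
    literature: Optional[str] = None
    chains: Optional[int] = None
    extraction: Optional[str] = None


# Archive suffixes: ".tgz" per the labeling scheme, ".tar.gz" as an assumption.
_ARCHIVE_SUFFIX = re.compile(r"\.(?:tgz|tar\.gz)$")

# Third form: DatasetName_NumberOfChains_Extraction
_EXTRACTED = re.compile(
    r"^(?P<dataset>.+)_(?P<chains>\d+)_"
    r"(?P<extraction>none|first-first|first-all|all-all)$"
)


def parse_dataset_filename(filename: str) -> DatasetFile:
    """Split an archive name into the fields of the labeling scheme."""
    stem, replaced = _ARCHIVE_SUFFIX.subn("", filename)
    if not replaced:
        raise ValueError(f"not a dataset archive: {filename!r}")

    match = _EXTRACTED.match(stem)
    if match:
        return DatasetFile(
            dataset=match["dataset"],
            chains=int(match["chains"]),
            extraction=match["extraction"],
        )

    # First or second form: DatasetName or DatasetName_Literature.
    dataset, _, literature = stem.partition("_")
    return DatasetFile(dataset=dataset, literature=literature or None)
```

For example, @parse_dataset_filename("ChewKedem_34_first-first.tgz")@ yields the dataset name @ChewKedem@, 34 chains and the extraction @first-first@.
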
h2. Available Datasets

The following datasets are available from the repository (special privileges required!):
 || *!DatasetName*          || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || LelukKoniecznyRoterman  || -            || -                 || 3.5          || [/procksi/datasets/LelukKoniecznyRoterman.tgz Download]
 ||                         || first-first  || 6                 || 0.4          || [/procksi/datasets/LelukKoniecznyRoterman_6_first-first.tgz Download]
 ||                         || first-all    || 15                || 0.9          || [/procksi/datasets/LelukKoniecznyRoterman_15_first-all.tgz Download]

 || *!DatasetName*          || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || ChewKedem               || -            || -                 || 3.8          || [/procksi/datasets/ChewKedem.tgz Download]
 ||                         || first-first  || 34                || 1.3          || [/procksi/datasets/ChewKedem_34_first-first.tgz Download]
 ||                         || first-all    || 54                || 2.0          || [/procksi/datasets/ChewKedem_54_first-all.tgz Download]
 ||                         || all-all      || 132               || 4.1          || [/procksi/datasets/ChewKedem_132_all-all.tgz Download]

 || *!DatasetName*          || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || ProteinKinaseResource   || -            || -                 || 3.6          || [/procksi/datasets/ProteinKinaseResource.tgz Download]
 ||                         || first-first  || 45                || 2.4          || [/procksi/datasets/ProteinKinaseResource_45_first-first.tgz Download]
 ||                         || first-all    || 49                || 2.5          || [/procksi/datasets/ProteinKinaseResource_49_first-all.tgz Download]
 ||                         || all-all      || 106               || 4.0          || [/procksi/datasets/ProteinKinaseResource_106_all-all.tgz Download]

 || *!DatasetName*          || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || Skolnick                || -            || -                 || 5.1          || [/procksi/datasets/Skolnick.tgz Download]
 ||                         || first-first  || 33                || 1.1          || [/procksi/datasets/Skolnick_33_first-first.tgz Download]
 ||                         || first-all    || 65                || 2.1          || [/procksi/datasets/Skolnick_65_first-all.tgz Download]
 ||                         || all-all      || 179               || 5.9          || [/procksi/datasets/Skolnick_179_all-all.tgz Download]

 || *!DatasetName*          || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || RostSander              || -            || -                 || 7.4          || [/procksi/datasets/RostSander.tgz Download]
 ||                         || RS126        || 126               || 4.3          || [/procksi/datasets/RostSander_RS126.tgz Download]
 ||                         || first-first  || 119               || 4.4          || [/procksi/datasets/RostSander_119_first-first.tgz Download]
 ||                         || first-all    || 212               || 7.6          || [/procksi/datasets/RostSander_212_first-all.tgz Download]

 || *!DatasetName*          || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || KinjoHorimotoNishikawa  || -            || -                 || 98           || [/procksi/datasets/KinjoHorimotoNishikawa.tgz Download]
 ||                         || first-first  || 1012              || 46           || [/procksi/datasets/KinjoHorimotoNishikawa_1012_first-first.tgz Download]
 ||                         || first-all    || 2013              || 88           || [/procksi/datasets/KinjoHorimotoNishikawa_2013_first-all.tgz Download]

 || *!DatasetName* || *Description* || *Extraction* || *!NumberOfChains* || *Size in MB* || *Link*
 || Shah           || 1000 proteins randomly selected from the PDB || -            || -    || 114 || [/procksi/datasets/Shah.tgz Download]
 ||                ||                                              || first-first  || 1000 || 41  || [/procksi/datasets/Shah_1000_first-first.tgz Download]
 ||                ||                                              || first-all    || 1943 || 80  || [/procksi/datasets/Shah_1943_first-all.tgz Download]
 ||                ||                                              || all-all      || 4007 || 124 || [/procksi/datasets/Shah_4007_all-all.tgz Download]

 || *!DatasetName* || *Description* || *Extraction* || *!NumberOfChains* || *Size in GB* || *Link*
 || PDB_SELECT30_04-2008 || Downloaded from the PDB web site on 10/04/2008 with the criterion "Remove similar sequences at 30% identity" || -            || -     || *1.1*, ucmp: *4.8*   || [/procksi/datasets/PDB_SELECT30_04-2008_7307.tar.gz Download]
 ||                      ||                            || first-first  || 7183  || *0.285*, ucmp: *1.2* || [/procksi/datasets/PDB_SELECT30_04-2008_7183_first-first.tar.gz Download]
 ||                      ||                            || first-all    || 14651 || *0.60*, ucmp: *2.7*  || [/procksi/datasets/PDB_SELECT30_04-2008_14651_first-all.tar.gz Download]
 ||                      ||                            || all-all      || 43025 || *1.2*, ucmp: *5.5*   || [/procksi/datasets/PDB_SELECT30_04-2008_43025_all-all.tar.gz Download]

 || *!DatasetName* || *Description* || *Extraction* || *!NumberOfChains* || *Size in GB* || *Link*
 || PDB_SELECT25_10-2007 || PDB_SELECT25 as of October 2007: a list of non-redundant protein structures, updated every six months and mostly used in protein structure prediction || -            || -     || *0.746*, ucmp: *3.4* || [/procksi/datasets/PDB_SELECT25_10-2007_3560.tar.gz Download]
 ||                      ||                            || first-first  || 3464  || *0.12*, ucmp: *0.54* || [/procksi/datasets/PDB_SELECT25_10-2007_3464_first-first.tar.gz Download]
 ||                      ||                            || first-all    || 8581  || *0.30*, ucmp: *1.4*  || [/procksi/datasets/PDB_SELECT25_10-2007_8581_first-all.tar.gz Download]
 ||                      ||                            || all-all      || 31288 || *0.854*, ucmp: *4.1* || [/procksi/datasets/PDB_SELECT25_10-2007_31288_all-all.tar.gz Download]

ucmp: uncompressed size
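
Once an archive has been downloaded, its uncompressed size can be checked locally against the ucmp figures. The sketch below uses Python's standard @tarfile@ module; the function name @summarize_archive@ is hypothetical. Whether an archive stores exactly one file per chain is not stated on this page, so the file count is only a rough cross-check against _NumberOfChains_.

```python
import tarfile
from typing import Tuple


def summarize_archive(archive_path: str) -> Tuple[int, int]:
    """Return (number of regular files, total uncompressed size in bytes).

    Works for gzip-compressed tar archives, i.e. both .tgz and .tar.gz.
    """
    file_count = 0
    total_bytes = 0
    with tarfile.open(archive_path, "r:gz") as archive:
        for member in archive:
            # Skip directories, links and other non-file entries.
            if member.isfile():
                file_count += 1
                total_bytes += member.size
    return file_count, total_bytes


# Usage (assuming the archive has already been downloaded to the
# current directory):
#   files, size = summarize_archive("Skolnick_33_first-first.tgz")
#   print(files, size / 1e6, "MB uncompressed")
```

The total only sums file contents, so it may differ slightly from the size reported by the file system after unpacking.
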