Datasets » History » Version 18

Anonymous, 04/23/2008 09:39 AM
Change links to datasets

1 1 Anonymous
= Datasets =
2 1 Anonymous
3 1 Anonymous
== How to label datasets ==
4 1 Anonymous
Datasets are labeled in the following way:
5 1 Anonymous
{{{
6 1 Anonymous
DatasetName.tgz
7 1 Anonymous
DatasetName_Literature.tgz
8 1 Anonymous
DatasetName_NumberOfChains_Extraction.tgz
9 1 Anonymous
}}}
10 1 Anonymous
11 1 Anonymous
whereas:
12 1 Anonymous
 || '''!DatasetName'''     || Authors or web that site proposed the dataset
13 1 Anonymous
 || '''Literature'''       || Special name given in the literatur
14 1 Anonymous
 || '''!NumberOfChains'''  || Number of (extracted) chains contained in the dataset
15 1 Anonymous
 || '''Extraction'''       || Extraction of the models/chains: ''none'', ''first-first'', ''first-all'', ''all-all''
16 1 Anonymous
17 1 Anonymous
== Available Datasets ==
18 1 Anonymous
The following datasets are available from the repository (special privileges required!):
19 1 Anonymous
 || '''!DatasetName'''      || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB''' || '''Link''' 
20 18 Anonymous
 || !LelukKoniecznyRoterman || -            || -               || 3.5        || [/procksi/datasets/LelukKoniecznyRoterman.tgz Download]
21 18 Anonymous
 ||                         || first-first  || 6               || 0.4        || [/procksi/datasets/LelukKoniecznyRoterman_6_first-first.tgz Download]
22 18 Anonymous
 ||                         || first-all    || 15              || 0.9        || [/procksi/datasets/LelukKoniecznyRoterman_15_first-all.tgz Download]
23 1 Anonymous
24 1 Anonymous
 || '''!DatasetName'''      || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB''' || '''Link''' 
25 18 Anonymous
 || !ChewKedem              || -            || -               || 3.8        || [/procksi/datasets/ChewKedem.tgz Download]
26 18 Anonymous
 ||                         || first-first  || 34              || 1.3        || [/procksi/datasets/ChewKedem_34_first-first.tgz Download]
27 18 Anonymous
 ||                         || first-all    || 54              || 2.0        || [/procksi/datasets/ChewKedem_54_first-all.tgz Download]
28 18 Anonymous
 ||                         || all-all      || 132             || 4.1        || [/procksi/datasets/ChewKedem_132_all-all.tgz Download]
29 1 Anonymous
30 1 Anonymous
 || '''!DatasetName'''      || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB''' || '''Link''' 
31 18 Anonymous
 || !ProteinKinaseResource  || -            || -               || 3.6        || [/procksi/datasets/ProteinKinaseResource.tgz Download]
32 18 Anonymous
 ||                         || first-first  || 45              || 2.4        || [/procksi/datasets/ProteinKinaseResource_45_first-first.tgz Download]
33 18 Anonymous
 ||                         || first-all    || 49              || 2.5        || [/procksi/datasets/ProteinKinaseResource_49_first-all.tgz Download]
34 18 Anonymous
 ||                         || all-all      || 106             || 4.0        || [/procksi/datasets/ProteinKinaseResource_106_all-all.tgz Download]
35 1 Anonymous
36 1 Anonymous
 || '''!DatasetName'''      || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB''' || '''Link''' 
37 18 Anonymous
 || Skolnick                || -            || -               || 5.1        || [/procksi/datasets/Skolnick.tgz Download]
38 18 Anonymous
 ||                         || first-first  || 33              || 1.1        || [/procksi/datasets/Skolnick_33_first-first.tgz Download]
39 18 Anonymous
 ||                         || first-all    || 65              || 2.1        || [/procksi/datasets/Skolnick_65_first-all.tgz Download]
40 18 Anonymous
 ||                         || all-all      || 179             || 5.9        || [/procksi/datasets/Skolnick_179_all-all.tgz Download]
41 1 Anonymous
42 1 Anonymous
 || '''!DatasetName'''      || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB''' || '''Link''' 
43 18 Anonymous
 || !RostSander             || -            || -               || 7.4        || [/procksi/datasets/RostSander.tgz Download]
44 18 Anonymous
 ||                         || RS126        || 126             || 4.3        || [/procksi/datasets/RostSander_RS126.tgz Download]
45 18 Anonymous
 ||                         || first-first  || 119             || 4.4        || [/procksi/datasets/RostSander_119_first-first.tgz Download]
46 18 Anonymous
 ||                         || first-all    || 212             || 7.6        || [/procksi/datasets/RostSander_212_first-all.tgz Download]
47 1 Anonymous
48 1 Anonymous
 || '''!DatasetName'''            || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB''' || '''Link''' 
49 18 Anonymous
 || !KinjoHorimotoNishikawa || -            || -               || 98         || [/procksi/datasets/KinjoHorimotoNishikawa.tgz Download]
50 18 Anonymous
 ||                         || first-first  || 1012            || 46         || [/procksi/datasets/KinjoHorimotoNishikawa_1012_first-first.tgz Download]
51 18 Anonymous
 ||                         || first-all    || 2013            || 88         || [/procksi/datasets/KinjoHorimotoNishikawa_2013_first-all.tgz Download]
52 2 Anonymous
53 12 Anonymous
 || '''!DatasetName'''  || '''Description'''          || '''Extraction'''   || '''!NumberOfChains''' || '''Size in MB'''         || '''Link''' 
54 18 Anonymous
 || Shah                || Randomly selected 1000 proteins from PDB     || -            || -               || 114         || [/procksi/datasets/Shah.tgz Download]                   
55 18 Anonymous
 ||                     ||                            || first-first  || 1000            || 41          || [/procksi/datasets/Shah_1000_first-first.tgz Download]
56 18 Anonymous
 ||                     ||                            || first-all    || 1943            || 80          || [/procksi/datasets/Shah_1943_first-all.tgz Download]
57 18 Anonymous
 ||                     ||                            || all-all      || 4007            || 124         || [/procksi/datasets/Shah_4007_all-all.tgz Download]
58 5 Anonymous
59 5 Anonymous
60 12 Anonymous
 || '''!DatasetName'''  || '''Description'''          || '''Extraction'''   || '''!NumberOfChains''' || '''Size in GB'''         || '''Link''' 
61 17 Anonymous
 ||  PDB_SELECT30_04-2008              || Downloaded from PDB web site on 10/04/2008               || -            || -               || '''1.1''',  ucmp*: '''4.8''' ||   [/procksi/datasets/PDB_SELECT30_04-2008_7307.tar.gz Download]      ||                    
62 17 Anonymous
 ||                     || with criteria "Remove similar sequences at 30% identity"           || first-first  || 7183            || '''0.285''', ucmp*: '''1.2'''     || [/procksi/datasets/PDB_SELECT30_04-2008_7183_first-first.tar.gz Download]
63 17 Anonymous
 ||                     ||                            || first-all    || 14651                 || '''0.60''' , ucmp:'''2.7'''          || [/procksi/datasets/PDB_SELECT30_04-2008_14651_first-all.tar.gz Download]
64 17 Anonymous
 ||                     ||                            || all-all      || 43025                || '''1.2'''   , ucmp*: '''5.5'''         ||  [/procksi/datasets/PDB_SELECT30_04-2008_43025_all-all.tar.gz Download]
65 6 Anonymous
66 12 Anonymous
 || '''!DatasetName'''  || '''Description'''          || '''Extraction '''   || '''!NumberOfChains''' || '''Size in GB'''         || '''Link''' 
67 17 Anonymous
 || PDB_SELECT25_10-2007               || PDB_SELECT25 as of October2007            || -            || -               || '''0.746''', ucmp*: '''3.4'''         ||                     [/procksi/datasets/PDB_SELECT25_10-2007_3560.tar.gz Download]
68 17 Anonymous
 ||                     || it's a six monthly updated list of-       || first-first  || 3464            ||'''0.12''',  ucmp*: '''0.54''',            || [/procksi/datasets/PDB_SELECT25_10-2007_3464_first-first.tar.gz Download]
69 17 Anonymous
 ||                     || non-redundent protein structures                           || first-all    || 8581                || '''0.30''', ucmp*: '''1.4'''          || [/procksi/datasets/PDB_SELECT25_10-2007_8581_first-all.tar.gz  Download]
70 17 Anonymous
 ||                     || Mostly used in Protein Structure Prediction                           || all-all      ||31288                 || '''0.854''' , ucmp*: '''4.1'''         || [/procksi/datasets/PDB_SELECT25_10-2007_31288_all-all.tar.gz Download]
71 9 Anonymous
72 9 Anonymous
*ucmp: uncompressed