
Anonymous, 04/23/2008 09:39 AM
Change links to datasets


Datasets

How to label datasets

Datasets are labeled in the following way:

[[DatasetName]].tgz
[[DatasetName]]_Literature.tgz
[[DatasetName]]_NumberOfChains_Extraction.tgz

where:

|| !DatasetName    || Authors or web site that proposed the dataset
|| Literature      || Special name given in the literature
|| !NumberOfChains || Number of (extracted) chains contained in the dataset
|| Extraction      || Extraction of the models/chains: none, first-first, first-all, all-all
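
The labeling scheme above can be checked mechanically. The following is a minimal sketch, not part of the repository tooling: `DatasetFile` and `parse_dataset_filename` are hypothetical names introduced here for illustration. It assumes archives end in `.tgz` or `.tar.gz` (both endings occur in the tables on this page) and that the extraction field is one of the four values listed above. Names that fit neither the plain nor the chains/extraction pattern are read as `DatasetName_Literature`, split at the last underscore, so a dataset name that itself ends in an underscore-separated token would be misread.

```python
import re
from typing import NamedTuple, Optional

# Extraction values as listed in the labeling table.
EXTRACTIONS = ("none", "first-first", "first-all", "all-all")


class DatasetFile(NamedTuple):
    """Hypothetical container for the fields encoded in an archive name."""
    name: str
    literature: Optional[str] = None
    chains: Optional[int] = None
    extraction: Optional[str] = None


# DatasetName_NumberOfChains_Extraction, matched from the right so that
# dataset names containing underscores are kept whole.
_EXTRACTED = re.compile(
    r"^(?P<name>.+)_(?P<chains>\d+)_(?P<extraction>%s)$" % "|".join(EXTRACTIONS)
)


def parse_dataset_filename(filename: str) -> DatasetFile:
    """Split an archive name into the fields of the labeling scheme."""
    for ext in (".tar.gz", ".tgz"):
        if filename.endswith(ext):
            stem = filename[: -len(ext)]
            break
    else:
        raise ValueError(f"unexpected extension: {filename!r}")

    match = _EXTRACTED.match(stem)
    if match:
        return DatasetFile(
            match["name"],
            chains=int(match["chains"]),
            extraction=match["extraction"],
        )
    if "_" in stem:
        # Assumed to be DatasetName_Literature; split at the last underscore.
        name, literature = stem.rsplit("_", 1)
        return DatasetFile(name, literature=literature)
    return DatasetFile(stem)


if __name__ == "__main__":
    for example in ("Skolnick.tgz", "RostSander_RS126.tgz", "ChewKedem_34_first-first.tgz"):
        print(example, "->", parse_dataset_filename(example))
```

Matching the chains/extraction pattern first and anchoring it at the end of the name keeps the parser from confusing a chain count with part of the dataset name.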

Available Datasets

The following datasets are available from the repository (special privileges required!):

|| !DatasetName      || Extraction   || !NumberOfChains || Size in MB || Link 
|| LelukKoniecznyRoterman  || -            || -               || 3.5        || [/procksi/datasets/LelukKoniecznyRoterman.tgz Download]
||                         || first-first  || 6               || 0.4        || [/procksi/datasets/LelukKoniecznyRoterman_6_first-first.tgz Download]
||                         || first-all    || 15              || 0.9        || [/procksi/datasets/LelukKoniecznyRoterman_15_first-all.tgz Download]

|| !DatasetName      || Extraction   || !NumberOfChains || Size in MB || Link 
|| ChewKedem              || -            || -               || 3.8        || [/procksi/datasets/ChewKedem.tgz Download]
||                         || first-first  || 34              || 1.3        || [/procksi/datasets/ChewKedem_34_first-first.tgz Download]
||                         || first-all    || 54              || 2.0        || [/procksi/datasets/ChewKedem_54_first-all.tgz Download]
||                         || all-all      || 132             || 4.1        || [/procksi/datasets/ChewKedem_132_all-all.tgz Download]
|| !DatasetName      || Extraction   || !NumberOfChains || Size in MB || Link 
|| ProteinKinaseResource  || -            || -               || 3.6        || [/procksi/datasets/ProteinKinaseResource.tgz Download]
||                         || first-first  || 45              || 2.4        || [/procksi/datasets/ProteinKinaseResource_45_first-first.tgz Download]
||                         || first-all    || 49              || 2.5        || [/procksi/datasets/ProteinKinaseResource_49_first-all.tgz Download]
||                         || all-all      || 106             || 4.0        || [/procksi/datasets/ProteinKinaseResource_106_all-all.tgz Download]
|| !DatasetName      || Extraction   || !NumberOfChains || Size in MB || Link 
|| Skolnick                || -            || -               || 5.1        || [/procksi/datasets/Skolnick.tgz Download]
||                         || first-first  || 33              || 1.1        || [/procksi/datasets/Skolnick_33_first-first.tgz Download]
||                         || first-all    || 65              || 2.1        || [/procksi/datasets/Skolnick_65_first-all.tgz Download]
||                         || all-all      || 179             || 5.9        || [/procksi/datasets/Skolnick_179_all-all.tgz Download]
|| !DatasetName      || Extraction   || !NumberOfChains || Size in MB || Link 
|| RostSander             || -            || -               || 7.4        || [/procksi/datasets/RostSander.tgz Download]
||                         || RS126        || 126             || 4.3        || [/procksi/datasets/RostSander_RS126.tgz Download]
||                         || first-first  || 119             || 4.4        || [/procksi/datasets/RostSander_119_first-first.tgz Download]
||                         || first-all    || 212             || 7.6        || [/procksi/datasets/RostSander_212_first-all.tgz Download]
|| !DatasetName            || Extraction   || !NumberOfChains || Size in MB || Link 
|| KinjoHorimotoNishikawa || -            || -               || 98         || [/procksi/datasets/KinjoHorimotoNishikawa.tgz Download]
||                         || first-first  || 1012            || 46         || [/procksi/datasets/KinjoHorimotoNishikawa_1012_first-first.tgz Download]
||                         || first-all    || 2013            || 88         || [/procksi/datasets/KinjoHorimotoNishikawa_2013_first-all.tgz Download]
|| !DatasetName  || Description          || Extraction   || !NumberOfChains || Size in MB         || Link 
|| Shah                || Randomly selected 1000 proteins from PDB     || -            || -               || 114         || [/procksi/datasets/Shah.tgz Download]                   
||                     ||                            || first-first  || 1000            || 41          || [/procksi/datasets/Shah_1000_first-first.tgz Download]
||                     ||                            || first-all    || 1943            || 80          || [/procksi/datasets/Shah_1943_first-all.tgz Download]
||                     ||                            || all-all      || 4007            || 124         || [/procksi/datasets/Shah_4007_all-all.tgz Download]
|| !DatasetName  || Description          || Extraction   || !NumberOfChains || Size in GB         || Link 
|| PDB_SELECT30_04-2008 || Downloaded from PDB web site on 10/04/2008 || -            || -               || 1.1, ucmp*: 4.8   || [/procksi/datasets/PDB_SELECT30_04-2008_7307.tar.gz Download]
||                     || with criteria "Remove similar sequences at 30% identity" || first-first  || 7183            || 0.285, ucmp*: 1.2 || [/procksi/datasets/PDB_SELECT30_04-2008_7183_first-first.tar.gz Download]
||                     ||                            || first-all    || 14651           || 0.60, ucmp*: 2.7  || [/procksi/datasets/PDB_SELECT30_04-2008_14651_first-all.tar.gz Download]
||                     ||                            || all-all      || 43025           || 1.2, ucmp*: 5.5   || [/procksi/datasets/PDB_SELECT30_04-2008_43025_all-all.tar.gz Download]
|| !DatasetName  || Description          || Extraction   || !NumberOfChains || Size in GB         || Link 
|| PDB_SELECT25_10-2007 || PDB_SELECT25 as of October 2007 || -            || -               || 0.746, ucmp*: 3.4 || [/procksi/datasets/PDB_SELECT25_10-2007_3560.tar.gz Download]
||                     || A list of non-redundant protein structures, || first-first  || 3464            || 0.12, ucmp*: 0.54 || [/procksi/datasets/PDB_SELECT25_10-2007_3464_first-first.tar.gz Download]
||                     || updated every six months.  || first-all    || 8581            || 0.30, ucmp*: 1.4  || [/procksi/datasets/PDB_SELECT25_10-2007_8581_first-all.tar.gz Download]
||                     || Mostly used in protein structure prediction || all-all      || 31288           || 0.854, ucmp*: 4.1 || [/procksi/datasets/PDB_SELECT25_10-2007_31288_all-all.tar.gz Download]

*ucmp: uncompressed