Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins

This page provides the datasets of allergen and non-allergen amino acid sequences used in the manuscript “Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins” by Björklund et al. (2005), Bioinformatics 21:39-50.

The zip-compressed file “Sequence Data Files.zip” contains three sequence data files:

  • “Allergens.txt”, which contains the allergen amino acid sequences.
  • “NonTrain.txt” and “NonTest.txt”, which contain the non-allergen sequences used for training and validation, respectively.

The sequence names are given as Swiss-Prot, TrEMBL or Entrez entries, and the entries were downloaded from either the SWALL or the Entrez database.
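
For readers who want to work with the three files programmatically, the following Python sketch shows one way to read them straight from the archive. It rests on an assumption that this page does not confirm: that each file is in FASTA layout, with a header line starting with ">" that carries the sequence name, followed by one or more lines of residues. The function names parse_fasta and load_sequence_sets are illustrative only and are not part of the published method.

```python
import zipfile

# File names as listed in the dataset description above.
DATA_FILES = ("Allergens.txt", "NonTrain.txt", "NonTest.txt")


def parse_fasta(text):
    """Parse FASTA-style text into a list of (name, sequence) tuples.

    ASSUMPTION: the data files use FASTA layout (">" header lines followed
    by sequence lines). Adjust the parser if the real layout differs.
    """
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:].strip(), []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def load_sequence_sets(zip_source):
    """Return {file name: [(name, sequence), ...]} for the three data files.

    zip_source may be a path or a binary file-like object.
    """
    sets = {}
    with zipfile.ZipFile(zip_source) as archive:
        for member in archive.namelist():
            # Ignore any folder prefix inside the archive.
            base = member.rsplit("/", 1)[-1]
            if base in DATA_FILES:
                # latin-1 maps every byte, so decoding cannot fail.
                text = archive.read(member).decode("latin-1")
                sets[base] = parse_fasta(text)
    return sets
```

Calling load_sequence_sets("Sequence Data Files.zip") would then return one list of records per file, so the allergen set and the two non-allergen sets can be kept apart for training and validation.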


A link to Sequence Data Files.zip (WinZip) is found on the right of this page.

A substantially improved method, including new datasets, is available here: Computational Detection of Allergenic ...

Updated: 27/06/2011

More about Sequence Data Files.zip

 » Sequence Data Files.zip

National Food Agency, Box 622, SE-751 26 Uppsala, +46 18 175500