Array ( [0] => {{short description|Database of protein sequences and functional information}} [1] => {{Distinguish|UniPro}} [2] => {{infobox biodatabase [3] => |title = UniProt [4] => |logo = [[Image:UPlogo1.png|220px]] [5] => |description = '''UniProt''' is the '''Uni'''versal '''Prot'''ein resource, a central repository of [[protein]] data created by combining the Swiss-Prot, TrEMBL and PIR-PSD [[database]]s. [6] => |scope = Protein annotation [7] => |organism = All [8] => |center = [[European Bioinformatics Institute|EMBL-EBI]], UK; [[Swiss Institute of Bioinformatics|SIB]], Switzerland; [[Protein Information Resource|PIR]], US. [9] => |laboratory = [10] => |author = [11] => |citation = UniProt Consortium{{cite journal|last1=UniProt|first1=Consortium.|title=UniProt: a hub for protein information.|journal=Nucleic Acids Research|date=January 2015|volume=43|issue=Database issue|pages=D204–12|pmid=25348405|doi=10.1093/nar/gku989|pmc=4384041}} [12] => |released = [13] => |standard = [14] => |format = Custom flat file, [[FASTA]], [[General feature format|GFF]], [[Resource Description Framework|RDF]], [[XML]]. [15] => |url = {{URL|www.uniprot.org}}
{{URL|www.uniprot.org/news/}} [16] => |download = {{URL|www.uniprot.org/downloads}} & for downloading complete data sets {{URL|ftp.uniprot.org}} [17] => |webservice = Yes – [[Java (programming language)|JAVA]] [[API]] see info {{URL|www.ebi.ac.uk/uniprot/remotingAPI/|here}} & [[REST]] see info {{URL|www.uniprot.org/faq/28|here}} [18] => |sql = [19] => |sparql = [20] => |webapp = Advanced search, [[BLAST (biotechnology)|BLAST]], [[Clustal]]O, bulk retrieval/download, ID mapping [21] => |standalone = [22] => |license = [[Creative Commons]] Attribution-NoDerivs [23] => |versioning = Yes [24] => |frequency = 8 weeks [25] => |curation = Yes – manual and automatic. Rules for automatic annotation generated by database curators and computational algorithms. [26] => |bookmark = Yes – both individual protein entries and searches [27] => |version = [28] => }} [29] => [30] => '''UniProt''' is a freely accessible database of [[protein sequence]] and functional information, many entries being derived from [[genome sequencing project]]s. It contains a large amount of information about the biological function of proteins derived from the research literature. It is maintained by the UniProt consortium, which consists of several European [[bioinformatics]] organisations and a foundation from [[Washington, DC]], USA. [31] => [32] => ==The UniProt consortium== [33] => [34] => The UniProt consortium comprises the [[European Bioinformatics Institute]] (EBI), the [[Swiss Institute of Bioinformatics]] (SIB), and the [[Protein Information Resource]] (PIR). EBI, located at the [[Wellcome Trust Genome Campus]] in Hinxton, UK, hosts a large resource of bioinformatics databases and services. SIB, located in Geneva, Switzerland, maintains the [[ExPASy]] (Expert Protein Analysis System) servers that are a central resource for proteomics tools and databases. PIR, hosted by the National Biomedical Research Foundation (NBRF) at the Georgetown University Medical Center in Washington, DC, US, is heir to the oldest protein sequence database, [[Margaret Oakley Dayhoff|Margaret Dayhoff]]'s Atlas of Protein Sequence and Structure, first published in 1965.{{cite book |author=Dayhoff, Margaret O. |title=Atlas of protein sequence and structure |publisher=National Biomedical Research Foundation |location=Silver Spring, Md |year=1965 }} In 2002, EBI, SIB, and PIR joined forces as the UniProt consortium.{{cite web|url=http://www.genome.gov/page.cfm?pageID=10005283|title=2002 Release: NHGRI Funds Global Protein Database|website=National Human Genome Research Institute (NHGRI)|access-date=14 April 2018|archive-date=24 September 2015|archive-url=https://web.archive.org/web/20150924040602/http://www.genome.gov/page.cfm?pageID=10005283|url-status=dead}} [35] => [36] => ==The roots of the UniProt databases== [37] => [38] => Each consortium member is heavily involved in protein database maintenance and annotation. Until recently, EBI and SIB together produced the Swiss-Prot and TrEMBL databases, while PIR produced the Protein Sequence Database (PIR-PSD).{{Cite journal [39] => | doi = 10.1093/bib/3.3.275 [40] => | last1 = O'Donovan | first1 = C. [41] => | last2 = Martin | first2 = M. J. [42] => | last3 = Gattiker | first3 = A. [43] => | last4 = Gasteiger | first4 = E. [44] => | last5 = Bairoch | first5 = A. [45] => | last6 = Apweiler | first6 = R. [46] => | title = High-quality protein knowledge resource: SWISS-PROT and TrEMBL [47] => | journal = Briefings in Bioinformatics [48] => | volume = 3 [49] => | issue = 3 [50] => | pages = 275–284 [51] => | year = 2002 [52] => | pmid = 12230036 [53] => | doi-access = free [54] => | url = https://access.archive-ouverte.unige.ch/access/metadata/ae37e636-0172-4a75-9ab1-6830a29e0529/download [55] => }}{{Cite journal [56] => | last1 = Wu | first1 = C. H. [57] => | last2 = Yeh | first2 = L. S. [58] => | last3 = Huang | first3 = H. [59] => | last4 = Arminski | first4 = L. [60] => | last5 = Castro-Alvear | first5 = J. [61] => | last6 = Chen | first6 = Y. [62] => | last7 = Hu | first7 = Z. [63] => | last8 = Kourtesis | first8 = P. [64] => | last9 = Ledley | first9 = R. S. [65] => | last10 = Suzek | first10 = B. E. [66] => | last11 = Vinayaka | first11 = C. R. [67] => | last12 = Zhang | first12 = J. [68] => | last13 = Barker | first13 = W. C. [69] => | title = The Protein Information Resource [70] => | journal = Nucleic Acids Research [71] => | volume = 31 [72] => | issue = 1 [73] => | pages = 345–347 [74] => | year = 2003 [75] => | pmid = 12520019 [76] => | pmc = 165487 | doi=10.1093/nar/gkg040 [77] => }}{{Cite journal [78] => | last1 = Boeckmann | first1 = B. [79] => | last2 = Bairoch | first2 = A. [80] => | last3 = Apweiler | first3 = R. [81] => | last4 = Blatter | first4 = M. C. [82] => | last5 = Estreicher | first5 = A. [83] => | last6 = Gasteiger | first6 = E. [84] => | last7 = Martin | first7 = M. J. [85] => | last8 = Michoud | first8 = K. [86] => | last9 = O'Donovan | first9 = C. [87] => | last10 = Phan | first10 = I. [88] => | last11 = Pilbout | first11 = S. [89] => | last12 = Schneider | first12 = M. [90] => | title = The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [91] => | doi = 10.1093/nar/gkg095 [92] => | journal = Nucleic Acids Research [93] => | volume = 31 [94] => | issue = 1 [95] => | pages = 365–370 [96] => | year = 2003 [97] => | pmid = 12520024 [98] => | pmc =165542 [99] => }} These databases coexisted with differing [[Peptide sequence|protein sequence]] coverage and annotation priorities. [100] => [101] => Swiss-Prot was created in 1986 by [[Amos Bairoch]] during his PhD and developed by the [[Swiss Institute of Bioinformatics]] and subsequently developed by [[Rolf Apweiler]] at the [[European Bioinformatics Institute]].{{Cite journal [102] => | last1 = Bairoch | first1 = A. [103] => | last2 = Apweiler | first2 = R. [104] => | title = The SWISS-PROT protein sequence data bank and its new supplement TREMBL [105] => | journal = Nucleic Acids Research [106] => | volume = 24 [107] => | issue = 1 [108] => | pages = 21–25 [109] => | year = 1996 [110] => | pmid = 8594581 [111] => | pmc = 145613 [112] => | doi=10.1093/nar/24.1.21 [113] => }}{{Cite journal | last1 = Bairoch | first1 = A. | title = Serendipity in bioinformatics, the tribulations of a Swiss bioinformatician through exciting times! | doi = 10.1093/bioinformatics/16.1.48 | journal = Bioinformatics | volume = 16 | issue = 1 | pages = 48–64 | year = 2000 | pmid = 10812477| doi-access = free | url = https://access.archive-ouverte.unige.ch/access/metadata/a71280fd-f9ba-4a35-b227-c5f4735edacf/download }}Séverine Altairac, "[http://expasy.org/prolune/pdf/prolune018_fr.pdf Naissance d’une banque de données: Interview du prof. Amos Bairoch]". ''[http://expasy.org/prolune/ Protéines à la Une]'', August 2006. {{ISSN|1660-9824}}. Swiss-Prot aimed to provide reliable protein sequences associated with a high level of annotation (such as the description of the function of a protein, its [[Protein domain|domain]] structure, [[Post translational modification|post-translational modifications]], variants, etc.), a minimal level of [[Data redundancy|redundancy]] and high level of integration with other databases. Recognizing that sequence data were being generated at a pace exceeding Swiss-Prot's ability to keep up, TrEMBL (Translated EMBL Nucleotide Sequence Data Library) was created to provide automated annotations for those proteins not in Swiss-Prot. Meanwhile, PIR maintained the PIR-PSD and related databases, including [[iProClass]], a database of protein sequences and curated families. [114] => [115] => The consortium members pooled their overlapping resources and expertise, and launched UniProt in December 2003. [116] => [117] => ==Organization of the UniProt databases== [118] => [119] => UniProt provides four core databases: UniProtKB (with sub-parts Swiss-Prot and TrEMBL), UniParc, UniRef and Proteome. [120] => [121] => ===UniProtKB=== [122] => [123] => UniProt Knowledgebase (UniProtKB) is a protein database partially curated by experts, consisting of two sections: UniProtKB/Swiss-Prot (containing reviewed, manually annotated entries) and UniProtKB/TrEMBL (containing unreviewed, automatically annotated entries).{{Cite journal [124] => | last1 = Uniprot | first1 = C. [125] => | title = The Universal Protein Resource (UniProt) in 2010 [126] => | doi = 10.1093/nar/gkp846 [127] => | journal = Nucleic Acids Research [128] => | volume = 38 [129] => | issue = Database issue [130] => | pages = D142–D148 [131] => | year = 2009 [132] => | pmid = 19843607 [133] => | pmc =2808944 [134] => }} {{As of|2023|02|22}}, release "2023_01" of UniProtKB/Swiss-Prot contains 569,213 sequence entries (comprising 205,728,242 amino acids abstracted from 291,046 references) and release "2023_01" of UniProtKB/TrEMBL contains 245,871,724 sequence entries (comprising 85,739,380,194 amino acids).{{cite web|url=https://web.expasy.org/docs/relnotes/relstat.html|title=UniProtKB/Swiss-Prot Release 2023_01 statistics|website=web.expasy.org|access-date=31 March 2023}} [135] => [136] => ====UniProtKB/Swiss-Prot==== [137] => [138] => UniProtKB/Swiss-Prot is a manually annotated, non-redundant protein sequence database. It combines information extracted from scientific literature and [[biocurator]]-evaluated computational analysis. The aim of UniProtKB/Swiss-Prot is to provide all known relevant information about a particular protein. Annotation is regularly reviewed to keep up with current scientific findings. The manual annotation of an entry involves detailed analysis of the protein sequence and of the scientific literature.{{cite web|url=https://www.uniprot.org/faq/45 |date=September 21, 2011 |title=How do we manually annotate a UniProtKB entry?|website=UniProt|access-date=14 April 2018 |url-status=dead |archive-url= https://web.archive.org/web/20131213073315/https://www.uniprot.org/faq/45 |archive-date= Dec 13, 2013 }} [139] => [140] => Sequences from the same [[gene]] and the same [[species]] are merged into the same database entry. Differences between sequences are identified, and their cause documented (for example [[alternative splicing]], [[Genetic diversity|natural variation]], incorrect [[Eukaryotic translation#Initiation|initiation]] sites, incorrect [[exon]] boundaries, [[Frameshift mutation|frameshifts]], unidentified conflicts). A range of sequence analysis tools is used in the annotation of UniProtKB/Swiss-Prot entries. Computer-predictions are manually evaluated, and relevant results selected for inclusion in the entry. These predictions include post-translational modifications, [[transmembrane domain]]s and [[Membrane topology|topology]], [[signal peptide]]s, domain identification, and [[protein family]] classification.{{Cite journal [141] => | last1 = Apweiler | first1 = R. [142] => | last2 = Bairoch | first2 = A. [143] => | last3 = Wu | first3 = C. H. [144] => | last4 = Barker | first4 = W. C. [145] => | last5 = Boeckmann | first5 = B. [146] => | last6 = Ferro | first6 = S. [147] => | last7 = Gasteiger | first7 = E. [148] => | last8 = Huang | first8 = H. [149] => | last9 = Lopez | first9 = R. [150] => | last10 = Magrane | first10 = M. [151] => | last11 = Martin | first11 = M. J. [152] => | last12 = Natale | first12 = D. A. [153] => | last13 = o’Donovan | first13 = C. [154] => | last14 = Redaschi | first14 = N. [155] => | last15 = Yeh | first15 = L. S. [156] => | title = UniProt: The Universal Protein knowledgebase [157] => | doi = 10.1093/nar/gkh131 [158] => |doi-access=free [159] => | journal = Nucleic Acids Research [160] => | volume = 32 [161] => | issue = 90001 [162] => | pages = 115D–1119 [163] => | year = 2004 [164] => | pmid = 14681372 [165] => | pmc =308865 [166] => }} [167] => [168] => Relevant publications are identified by searching databases such as [[PubMed]]. The full text of each paper is read, and information is extracted and added to the entry. Annotation arising from the scientific literature includes, but is not limited to:{{Cite journal | last1 = Apweiler | first1 = R. | last2 = Bairoch | first2 = A. | last3 = Wu | first3 = C. H. | doi = 10.1016/j.cbpa.2003.12.004 | title = Protein sequence databases | journal = Current Opinion in Chemical Biology | volume = 8 | issue = 1 | pages = 76–80 | year = 2004 | pmid = 15036160}} [169] => *Protein and gene names [170] => *Function [171] => *[[Enzyme]]-specific information such as [[Catalysis|catalytic activity]], [[Cofactor (biochemistry)|cofactors]] and [[Active site|catalytic residues]] [172] => *[[Subcellular localization|Subcellular location]] [173] => *[[Protein-protein interaction]]s [174] => *Pattern of expression [175] => *Locations and roles of significant domains and sites [176] => *[[Ion]]-, [[Substrate (biochemistry)|substrate]]- and cofactor-binding sites [177] => *Protein variant forms produced by natural genetic variation, [[RNA editing]], alternative splicing, [[proteolytic]] processing, and post-translational modification [178] => [179] => Annotated entries undergo quality assurance before inclusion into UniProtKB/Swiss-Prot. When new data becomes available, entries are updated. [180] => [181] => ====UniProtKB/TrEMBL==== [182] => [183] => UniProtKB/TrEMBL contains high-quality computationally analyzed records, which are enriched with automatic annotation. It was introduced in response to increased dataflow resulting from genome projects, as the time- and labour-consuming manual annotation process of UniProtKB/Swiss-Prot could not be broadened to include all available protein sequences. The translations of annotated coding sequences in the [[INSDC|EMBL-Bank/GenBank/DDBJ nucleotide sequence database]] are automatically processed and entered in UniProtKB/TrEMBL. [184] => UniProtKB/TrEMBL also contains sequences from [[Protein Data Bank|PDB]], and from gene prediction, including [[Ensembl]], [[RefSeq]] and [[Consensus CDS Project|CCDS]].{{cite web|url=https://www.uniprot.org/faq/37|title=Where do the UniProtKB protein sequences come from?|website=UniProt |date=September 21, 2011 |access-date=14 April 2018 |url-status=dead |archive-url=https://web.archive.org/web/20131215223948/https://www.uniprot.org/faq/37 |archive-date= Dec 15, 2013 }} Since 22 July 2021 it also includes structures predicted with [[AlphaFold|AlphaFold2]].{{cite web |title=Putting the power of AlphaFold into the world's hands |url=https://deepmind.com/blog/article/putting-the-power-of-alphafold-into-the-worlds-hands |date= 22 July 2022 |first1= Demis |last1=Hassabis |website=Deepmind |access-date=24 July 2021 |url-status=live |archive-url=https://web.archive.org/web/20210724034333/https://deepmind.com/blog/article/putting-the-power-of-alphafold-into-the-worlds-hands |archive-date= 24 July 2021 }} [185] => [186] => ===UniParc=== [187] => [188] => UniProt Archive (UniParc) is a comprehensive and non-redundant database, which contains all the protein sequences from the main, publicly available protein sequence databases.{{Cite journal [189] => | last1 = Leinonen | first1 = R. [190] => | last2 = Diez | first2 = F. G. [191] => | last3 = Binns | first3 = D. [192] => | last4 = Fleischmann | first4 = W. [193] => | last5 = Lopez | first5 = R. [194] => | last6 = Apweiler | first6 = R. [195] => | doi = 10.1093/bioinformatics/bth191 [196] => |url=https://academic.oup.com/bioinformatics/article/20/17/3236/185740 [197] => | title = UniProt archive [198] => | journal = Bioinformatics [199] => | volume = 20 [200] => | issue = 17 [201] => | pages = 3236–3237 [202] => | year = 2004 [203] => | pmid = 15044231 [204] => | doi-access = free [205] => |url-status=live |archive-url= https://web.archive.org/web/20240330230423/https://watermark.silverchair.com/bioinformatics_20_17_3236.pdf?token=AQECAHi208BE49Ooan9kkhW_Ercy7Dm3ZL_9Cf3qfKAc485ysgAAA34wggN6BgkqhkiG9w0BBwagggNrMIIDZwIBADCCA2AGCSqGSIb3DQEHATAeBglghkgBZQMEAS4wEQQMP9mu-T8EUm-Q6Lf9AgEQgIIDMcbOoJLLoOPgo-wQyUWIt5mj5BPdBSlnXllqsCHaRfV0xiOcThLWvaWthRcPV7v4ucTNtx1q1tXDoVjKzMG4TWbWC_3ddH2WTh6auCHEBHAYoJJ_29joWila6lfpHWCv-XkLMBBAYQkbNaTZIw-ywI9qlFkgYz5DIK83PmTio_xQ6OBES9aqriHeIzRRPDkXU3QzdkfPw-USBipDF1zAOEtM6J2p7xmjKmW6DQq-_3hb7Ju82VNJoLgwtr0oKZG6845mV5MKPvrS7jL0UKYgNtkNwEcEuZ6ojBKkB_oXPTrRyqyD6kZBeVaxJcgEVmaVaNGMLxXhpC3V8q63SaoJ5DSVlnqIJQTHauFQ6hrQcoBQSOVwT8-3Yj-dUuDfopzsdOIDmwYiV71sjabJOuVC60NYR7ieEhQGp0SemDlPAAgtDOml6ivO5PSCPGy7oAlScLAwSgFKvJSci6OpW-AnhOaNFAA3EBiXj9Wb5eDjKc2-WZ4lmTDaEnmke07lMzdSaAA6i7o7h0roXIhLkVv5YKIz6kvR1dSNM2sT7LseDdJm0bQpP-WVGAtRlh6PCmnez8owewKsFxtx35mNfzv_wZejzS4A-a1oqtwpMQyYqqsQbHmsJQC-52scqM8hqOxnpXVlhSFBP_2sOE4ggiGNWBUFTUnzPc45CJ8f4cddREt5CefFUvccgHq1vbUpbMkmY6LDcsKVJEc2sSv02JnarmMrw1DTKizqGCFN3Vxebv8iI7ApcgLrH2VYd9bSI_0KbYejDkuIPRTjFd7e8BYdOYAdDLtOmLpDdxw53KJzVkudKBWyAG6vYJYQ16g2h897FN58JYAdCem3uq9FVMWtma840gBmir-_uYrbwKudMO7gKEcbWPE9RAaMqLWwhmBRsFnjQUztoR2loO-iMerIl44k2bjzSM-_8Yeo40-iTuGERBD6A7tPdzOvSp6PFpPGQcNljrqfi3Xf-tNmL7aiB2HM4yCdIepndC_v05MNPLUdbS-SxmsOnuJwRKljCsj2qbyLLpKSSXvh_GVCJ0BPRJP2c1FCCswrg5Yi-ywo7K1rhsgr9XdFU9yR0dBTSg6eBb4 |archive-date= Mar 30, 2024 [206] => }} Proteins may exist in several different source databases, and in multiple copies in the same database. In order to avoid redundancy, UniParc stores each unique sequence only once. Identical sequences are merged, regardless of whether they are from the same or different species. Each sequence is given a stable and unique identifier (UPI), making it possible to identify the same protein from different source databases. UniParc contains only protein sequences, with no annotation. Database cross-references in UniParc entries allow further information about the protein to be retrieved from the source databases. When sequences in the source databases change, these changes are tracked by UniParc and history of all changes is archived. [207] => [208] => ====Source databases==== [209] => [210] => Currently UniParc contains protein sequences from the following publicly available databases: [211] => * [[INSDC]] [[EMBL]]-Bank/[[DDBJ]]/[[GenBank]] nucleotide sequence databases [212] => * [[Ensembl]] [213] => * [[European Patent Office]] (EPO) [214] => * [[FlyBase|FlyBase: the primary repository of genetic and molecular data for the insect family Drosophilidae]] (FlyBase) [215] => * [[H-Invitational|H-Invitational Database]] (H-Inv) [216] => * [[International Protein Index]] (IPI) [217] => * [[Japan Patent Office]] (JPO) [218] => * [[Protein Information Resource]] (PIR-PSD) [219] => * [[Protein Data Bank]] (PDB) [220] => * [[Protein Research Foundation]] (PRF){{Cite web|url=http://www.prf.or.jp/index-e.html|title=Protein Research Foundation}} [221] => * [[RefSeq]] [222] => * [[Saccharomyces Genome Database]] (SGD) [223] => * [[The Arabidopsis Information Resource]] (TAIR) [224] => * [[TROME]]ftp://ftp.isrec.isb-sib.ch/pub/databases/trome{{Dead link|date=February 2022 |bot=InternetArchiveBot |fix-attempted=yes }} [225] => * [[US Patent Office]] (USPTO) [226] => * UniProtKB/Swiss-Prot, UniProtKB/Swiss-Prot protein isoforms, UniProtKB/TrEMBL [227] => * [[Vertebrate and Genome Annotation Database]] (VEGA) [228] => * [[WormBase]] [229] => [230] => ===UniRef=== [231] => [232] => The UniProt Reference Clusters (UniRef) consist of three databases of clustered sets of protein sequences from UniProtKB and selected UniParc records.{{Cite journal [233] => | last1 = Suzek | first1 = B. E. [234] => | last2 = Huang | first2 = H. [235] => | last3 = McGarvey | first3 = P. [236] => | last4 = Mazumder | first4 = R. [237] => | last5 = Wu | first5 = C. H. [238] => | title = UniRef: Comprehensive and non-redundant UniProt reference clusters [239] => | doi = 10.1093/bioinformatics/btm098 [240] => | journal = Bioinformatics [241] => | volume = 23 [242] => | issue = 10 [243] => | pages = 1282–1288 [244] => | year = 2007 [245] => | pmid = 17379688 [246] => | doi-access = [247] => }} The UniRef100 database combines identical sequences and sequence fragments (from any [[organism]]) into a single UniRef entry. The sequence of a representative protein, the [[Accession number (bioinformatics)|accession numbers]] of all the merged entries and links to the corresponding UniProtKB and UniParc records are displayed. UniRef100 sequences are clustered using the CD-HIT [[algorithm]] to build UniRef90 and UniRef50.{{Cite journal [248] => | doi = 10.1093/bioinformatics/17.3.282 [249] => | last1 = Li | first1 = W. [250] => | last2 = Jaroszewski | first2 = L. [251] => | last3 = Godzik | first3 = A. [252] => | title = Clustering of highly homologous sequences to reduce the size of large protein databases [253] => | journal = Bioinformatics [254] => | volume = 17 [255] => | issue = 3 [256] => | pages = 282–283 [257] => | year = 2001 [258] => | pmid = 11294794 [259] => | doi-access = [260] => }} Each cluster is composed of sequences that have at least 90% or 50% sequence identity, respectively, to the longest sequence. Clustering sequences significantly reduces database size, enabling faster sequence searches. [261] => [262] => UniRef is available from the [http://ftp.uniprot.org/pub/databases/uniprot/current_release/uniref/ UniProt FTP site]. [263] => [264] => ==Funding== [265] => [266] => UniProt is funded by grants from the [[National Human Genome Research Institute]], the [[National Institutes of Health]] (NIH), the [[European Commission]], the Swiss Federal Government through the Federal Office of Education and Science, [[CaBIG|NCI-caBIG]], and the US Department of Defense. [267] => [268] => ==References== [269] => {{reflist|2}} [270] => [271] => ==External links== [272] => {{Wikidata property|P352}} [273] => *[https://www.uniprot.org UniProt] [274] => [275] => {{Bioinformatics}} [276] => [277] => [278] => [[Category:Biological databases]] [279] => [[Category:Online databases]] [280] => [[Category:Proteomics]] [281] => [[Category:Science and technology in Cambridgeshire]] [282] => [[Category:South Cambridgeshire District]] [283] => [[Category:Bioinformatics]] [284] => [[Category:Computational biology]] [] => )
good wiki

UniProt

UniProt is a freely accessible database of protein sequence and functional information, many entries being derived from genome sequencing projects. It contains a large amount of information about the biological function of proteins derived from the research literature.

More about us

About

Expert Team

Vivamus eget neque lacus. Pellentesque egauris ex.

Award winning agency

Lorem ipsum, dolor sit amet consectetur elitorceat .

10 Year Exp.

Pellen tesque eget, mauris lorem iupsum neque lacus.

You might be interested in