NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F074439



Overview

Basic Information
Family ID: F074439
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 119
Average Sequence Length: 79 residues
Representative Sequence: KAKVAAGVKSSYLELDRSRQLYQLARRMVSAAGVVNASYKSDDPEVESAQAKMEADMFRAELEYRQAYARLKSLMGEK
Number of Associated Samples: 73
Number of Associated Scaffolds: 119

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 0.84 %
% of genes from short scaffolds (< 2000 bps): 0.84 %
Associated GOLD sequencing projects: 67
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.72


Most Common Taxonomy
Group: Unclassified (99.160 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil (35.294 % of family members)
Environment Ontology (ENVO): Unclassified (44.538 % of family members)
Earth Microbiome Project Ontology (EMPO): Unclassified (42.017 % of family members)
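
The percentages on this page are reported relative to the 119 family members, so each can be turned back into an approximate member count. The sketch below shows the arithmetic. The helper name `members_from_percent` and the rounding to the nearest whole member are illustrative assumptions, not part of NMPFamsDB.

```python
def members_from_percent(percent: float, total: int = 119) -> int:
    """Approximate member count behind a reported percentage.

    Assumption: the percentage is a fraction of `total` family members,
    rounded for display, so the nearest integer recovers the count.
    """
    return round(percent / 100 * total)


# Percentages as reported on this page for family F074439 (119 sequences).
reported = {
    "Unclassified taxonomy": 99.160,
    "GOLD ecosystem, wetland soil": 35.294,
    "ENVO unclassified": 44.538,
    "EMPO unclassified": 42.017,
    "Genes near scaffold ends": 0.84,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of 119 members")
```

Read this way, a value of 0.84 % anywhere on the page corresponds to a single member of the family.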




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 65.09%, β-sheet: 0.00%, Coil/Unstructured: 34.91%

Predicted 3D Structure

pTM-score: 0.72

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
f.5.1.0: automated matches | d1yc9a_ | 1yc9 | 0.8759
f.5.1.1: Outer membrane efflux proteins (OEP) | d1ek9a_ | 1ek9 | 0.86339
f.5.1.0: automated matches | d3pika_ | 3pik | 0.86108
f.5.1.0: automated matches | d5azsa_ | 5azs | 0.84311
a.238.1.1: BAR domain | d1urua_ | 1uru | 0.84054



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 119 Family Scaffolds
PF10677 | DUF2490 | 4.20
PF13590 | DUF4136 | 4.20
PF01758 | SBF | 3.36
PF04203 | Sortase | 2.52
PF02656 | DUF202 | 2.52
PF01060 | TTR-52 | 1.68
PF04055 | Radical_SAM | 1.68
PF13186 | SPASM | 1.68
PF04264 | YceI | 1.68
PF00484 | Pro_CA | 1.68
PF07885 | Ion_trans_2 | 1.68
PF00072 | Response_reg | 0.84
PF03724 | META | 0.84
PF00873 | ACR_tran | 0.84
PF01734 | Patatin | 0.84
PF01569 | PAP2 | 0.84
PF13620 | CarboxypepD_reg | 0.84
PF14279 | HNH_5 | 0.84
PF01594 | AI-2E_transport | 0.84
PF00076 | RRM_1 | 0.84
PF13563 | 2_5_RNA_ligase2 | 0.84
PF14023 | DUF4239 | 0.84
PF00691 | OmpA | 0.84
PF01610 | DDE_Tnp_ISL3 | 0.84
PF00069 | Pkinase | 0.84
PF04366 | Ysc84 | 0.84
PF00884 | Sulfatase | 0.84
PF14707 | Sulfatase_C | 0.84
PF01263 | Aldose_epim | 0.84

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 3.36
COG2149 | Uncharacterized membrane protein YidH, DUF202 family | Function unknown [S] | 2.52
COG3764 | Sortase (surface protein transpeptidase) | Cell wall/membrane/envelope biogenesis [M] | 2.52
COG0288 | Carbonic anhydrase | Inorganic ion transport and metabolism [P] | 1.68
COG2353 | Polyisoprenoid-binding periplasmic protein YceI | General function prediction only [R] | 1.68
COG0628 | Predicted PurR-regulated permease PerM | General function prediction only [R] | 0.84
COG0676 | D-hexose-6-phosphate mutarotase | Carbohydrate transport and metabolism [G] | 0.84
COG1752 | Predicted acylesterase/phospholipase RssA, contains patatin domain | General function prediction only [R] | 0.84
COG2017 | Galactose mutarotase or related enzyme | Carbohydrate transport and metabolism [G] | 0.84
COG2930 | Lipid-binding SYLF domain, Ysc84/FYVE family | Lipid transport and metabolism [I] | 0.84
COG3187 | Heat shock protein HslJ | Posttranslational modification, protein turnover, chaperones [O] | 0.84
COG3464 | Transposase | Mobilome: prophages, transposons [X] | 0.84
COG3621 | Patatin-like phospholipase/acyl hydrolase, includes sporulation protein CotR | General function prediction only [R] | 0.84
COG4667 | Predicted phospholipase, patatin/cPLA2 family | Lipid transport and metabolism [I] | 0.84



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.16 %
All Organisms | root | All Organisms | 0.84 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300022527|Ga0242664_1032654 | All Organisms → cellular organisms → Bacteria | 883




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 35.29%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 14.29%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 11.76%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.76%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 8.40%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.20%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.52%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.52%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.68%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.68%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.84%
Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.84%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.84%
Bog | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Bog | 0.84%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.84%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.84%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.84%



Associated Samples

Taxon OID | Sample Name | Habitat Type
2170459003 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cm | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300004082 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004137 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF202 (Metagenome Metatranscriptome) | Environmental
3300004635 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005591 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300014838 | Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300017928 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_1 | Environmental
3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental
3300017943 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017995 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1 | Environmental
3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental
3300018012 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021181 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300022527 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-O (Metagenome Metatranscriptome) (v2) | Environmental
3300022533 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022712 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022717 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022724 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024225 | Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic - CZU5 | Host-Associated
3300027729 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 (SPAdes) | Environmental
3300027767 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM2 (SPAdes) | Environmental
3300027768 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM1 (SPAdes) | Environmental
3300027783 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM2 (SPAdes) | Environmental
3300027829 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1 (SPAdes) | Environmental
3300027884 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes) | Environmental
3300030917 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FB5 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030979 | Forest soil microbial communities from France, for metatranscriptomics studies - Site 11 - Champenoux / Amance forest (Eukaryote Community Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032515 | FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data) | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300032782 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1 | Environmental
3300032783 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3 | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032828 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4 | Environmental
3300032829 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3 | Environmental
3300032892 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 | Environmental
3300032893 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1 | Environmental
3300032895 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.3 | Environmental
3300032897 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.5 | Environmental
3300032954 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2 | Environmental
3300032955 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental
3300033134 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.2 | Environmental
3300033158 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.1 | Environmental
3300034163 | Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_04D_14 | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
E4A_08263910 | 2170459003 | Grass Soil | SLAVQLTKAKVASGVKSSYLELERARQLVQLARRMASASRMVNASYRSDDQEVESAQARMEADMFRAELEYRQAYAKLKGLMGNQ
JGIcombinedJ26739_1009166602 | 3300002245 | Forest Soil | DLGVQLVKAKVAAAVKSTYFELERSRQFTQLARRMVSSTRVVEASYRPDNPEVESASARMEADMFRAELEYRQAFARLRTLMGEK*
Ga0062385_10395729 | 3300004080 | Bog Forest Soil | AGVKSTYFELDRSRQLYQLTRRMVSAGQVVDASYHSDNPEMESAQAKMEADMFRADFEYRQAYNKLKALMGEK*
Ga0062384_1004864741 | 3300004082 | Bog Forest Soil | EHGVKEVNAQAEMADLAVQLTKAKVAAGVKNSYFELDRSRQLYQLTRRMVSAVQVVDASYKSDKPEVESAQAKMEADMFRADFEYRQAYAKLKVLMGVK*
Ga0062389_1034729131 | 3300004092 | Bog Forest Soil | VKSSYFELDRSRQLYQLARRMVSAVQVVNASYKSDDPEVESAQAKMEADMFRADFEYRQAYGKLKVLIGKE*
Ga0062389_1048038711 | 3300004092 | Bog Forest Soil | KAQSEMADLGVVLTKAKVAAGVKSSYLELDRSRQLYQLARRMVSAAGVVSANYKSDDPEVESAQAKMEADMFRAELEYREAYAKLKTLTGTR*
Ga0058883_15413081 | 3300004137 | Forest Soil | RKLSQLARRMVSATQVIEASYKSDDPEVESARAKMEADMFRAELEYRQAYARLKNLMGNKWAERK*
Ga0062388_1017849581 | 3300004635 | Bog Forest Soil | KAQSEMADLGVVLTKAKVAAGVKSSYLELDRSRELYQLARRMVSAAGVVSANYKSDDPEVESAQARMEADMFRAELEYREAYSKLKTLTGTR*
Ga0070761_104126461 | 3300005591 | Soil | ESRANAEAADLGVQLTKAKVAASIKNSYFELQRSRQLTQLARRMAAATRLVEASYQPDNPEVESARAKVEADMFRAELEYRQAFARLKSLMGER*
Ga0099793_101291373 | 3300007258 | Vadose Zone Soil | VKESNAQAEMAEMAVPLTKAKVTAEVKSSYFELERSRKLSQLARRMVSAGQVVEASFQSDNPEVEPEQAKMEADMFRSELEYRQAYARLKSLMDRK*
Ga0099793_103049971 | 3300007258 | Vadose Zone Soil | VKESNAQAEMAEMAVPLTKAKVTAEVKSSYFELERSRKLSQLARRMVSASQVVEASYQSDNPDVESAEAKIEADMFRAELEYRQAYARLKSLMDGK*
Ga0099794_106585542 | 3300007265 | Vadose Zone Soil | FELERSRKLSELARRMVSETRVVDASIPSGNSEVESARAKMEADMFRAELEYRQAYSRLKSLMGGK*
Ga0134121_127492501 | 3300010401 | Terrestrial Soil | AATVKTSYLELDRSRQLYQLARRMVSATQVVEASYRPEDLEGESARAKIKADMFRAELEYRQAYARLKALMGPQ*
Ga0150983_155413812 | 3300011120 | Forest Soil | YLELERSRQLVQLARRMVSATRIVEASYRSDDPEVESAQAKMEADMYRAELEYRQAYARLRSLMGYK*
Ga0137382_100013011 | 3300012200 | Vadose Zone Soil | KAKVTAEVKSSYFELERSRKLSQLARRMVSAGQVVEASFQSDNPEVEPEQAKMEADMFRSELEYRQAYARLKSLMDRK*
Ga0137382_102655563 | 3300012200 | Vadose Zone Soil | TAEVKSSYFELERSRKLSQLARRMVSAGQVVEASFQSDNPEVEPERAKMEADMFRAELEYRQAYARLKNLMDRK*
Ga0137363_105491112 | 3300012202 | Vadose Zone Soil | QAEMAEMAVPLTKAKVAADVKSSYFELERSRKLSQLARRMVASTQVVEASYKSEDSEVDSARAKMEAEMFRAELEYRQAYSRLKSLTGGK*
Ga0137381_116274361 | 3300012207 | Vadose Zone Soil | VKSSYFELERSRKLSQLARRMVSASQVVEASYQSDNPDVESAQAKMEADMFRAELEYRQAYARLKSLMDGK*
Ga0137372_101055521 | 3300012350 | Vadose Zone Soil | TKAKVAETVRTSYLELDRSRQLYELARRMVSATQVVEASYKLNDPEVESARAKIKADMFRAELEYRQAYARLKALMGAQ*
Ga0137360_103842963 | 3300012361 | Vadose Zone Soil | SNAQAEMAEMAVPLTKAKVTAEVKSSYFELERSRKLSQLARRMVSAGQVVEASFQSDNPEVEPEQAKMEADMFRSELEYRQAYARLKNLMDRK*
Ga0137360_112939641 | 3300012361 | Vadose Zone Soil | SNAQAEMAEMAVPLTKAKVTAEVKSSYFELERSRKLSQLARRMVSAGQVAEASFQSDNPEVEPERAKMEADMFRAELEYRQAYARLKNLMDRK*
Ga0137394_105295993 | 3300012922 | Vadose Zone Soil | ERSRKLSQLARRMVSAGQVVEASFQSDNPEVEPEQAKMEAAMFRSELEYRQAYARLKSLMDRK*
Ga0137416_100911161 | 3300012927 | Vadose Zone Soil | RKLSQMARRMVSAVQVVDASYQSDNSEVESARARIEADMFRTELEYRQAYSGLKSLMGGK
Ga0164308_113942911 | 3300012985 | Soil | KREHGVKEAKAQAEMADIGVQLTKAKVAGAVKSSYLELDRSRQLYQLARRMVSATQVIAASYKPDDPEVKSARAKMEADMFRAELEYRQAYAKLKALMGAQ*
Ga0182030_111805731 | 3300014838 | Bog | VAGAAKSSYFELERSRKLSQLARRMVSASKVVEASYQPDNPEVDSARAKMEADMFRAELEYRQAYARVKSLMGDK*
Ga0137409_100528271 | 3300015245 | Vadose Zone Soil | LTKAKVTAEVKSSYFELERSRKLSQLARRMVSASQVVEASYQSDNPDVESAQAKIEADMFRAELEYRQAYARLKSLMDGK*
Ga0137409_100932781 | 3300015245 | Vadose Zone Soil | LTKAKVTAEVKSSYFELERSRKLSQLARRMVSAGQVVEASFQSDNPEVEPEQAKMEADMFRSELEYRQAYARLKNLMDRK*
Ga0132258_136987852 | 3300015371 | Arabidopsis Rhizosphere | KAQAEMADLGVQLTKAKVAAKVKSSYLELDRSRQLYQLACRMASATQVVQASYRPDEPEVESARAKMKAEMFRAELEYRQAYAKLKAQMGAQ*
Ga0187806_10898782 | 3300017928 | Freshwater Sediment | RQLYQLTRRMVSTTQVVEASYKPDDPEVESARAKMEADMFRAELEYRQAYSKLKALMGAQ
Ga0187806_13751142 | 3300017928 | Freshwater Sediment | LDRSRQLYQLARRMASAARVVDASYHSDNPEVASAQAKMEADMFRAELEYRQAYAHLKSLMGH
Ga0187801_104824002 | 3300017933 | Freshwater Sediment | GVKSSYFELDRSRQLYQLTRRMVSAGHVVDASYKSGDPEVESAQAKMEADMFRAELEYRQAYARLKTLMGDK
Ga0187819_101353803 | 3300017943 | Freshwater Sediment | SRKLSQLARQMVSATRVVEASYQSDNPDVESTQAKMEAEMFRAELEYRQAYARLKSLMGD
Ga0187819_104359471 | 3300017943 | Freshwater Sediment | LTKAKVAAGVKSSYFELERSRKLSQLARRMVSATRVVEASYQSDNPEVESAQAKMEAEMFRAELEYRQAFARLKSLMGDK
Ga0187819_105693352 | 3300017943 | Freshwater Sediment | ADLGVQLTKAKVAAGVKSSYFELERSRKLSQLARQMVSATRVVEASYQSDNPEVESAQAKMEAEMFRAELEYRQAFAQLKSLMGDK
Ga0187819_106293792 | 3300017943 | Freshwater Sediment | GVKSSYFELERSRKLSQLARQMVSATRVVEASYQSDNPEVECAQAKMEAEMFRAELEYRQAFARLKSLMGDK
Ga0187817_101362333 | 3300017955 | Freshwater Sediment | ADLAVQLTKAKVAAGVKSSYFELERSRKLSQLARQMVSATRVVEASYHSDNPDLESAQAKMEAEMFRAELEYRQAFARLKSLMGDK
Ga0187817_106720051 | 3300017955 | Freshwater Sediment | AQAEMADLAVQLTKAKVAAGVKSSYFELERSRKLSQLARQMVSATRVVEAGYNSDNPEVESAQAKMEAEMFRAELEYRQAFARLKSLMGDK
Ga0187817_109384681 | 3300017955 | Freshwater Sediment | GFKREHGVKERSAQAEMADLAVQLTKAKVAAGVKSSHFELDRSRQLYQLACRMVSGAQVVDASYRSDDPDAKAARAQMEADMFRAELEYRQAYAKVKSLMGVE
Ga0187816_101226392 | 3300017995 | Freshwater Sediment | LYQLARRMASAARVVDASYHSDNPEVASAQAKMEADMFRAELQYREAYAKLKGLTGPR
Ga0187816_101566322 | 3300017995 | Freshwater Sediment | YFELERSRKLSQLVRQMVSATRVVEASYQSDNPDVESTQAKMEAEMFRAELEYRQAYARLKSLMGDK
Ga0187804_104181281 | 3300018006 | Freshwater Sediment | AEMADLAVELTKAKVAAGVKSSYFELDRSRQLYQLARRVVSSAGVVNASYKSEDPEVESARAKMEADMFRAELEYRQAYAKLKALTGAQ
Ga0187810_104605731 | 3300018012 | Freshwater Sediment | KVAAGVKSSYFELDRSRQLYQLARRVVSSAGVVNASYKSEDPEVESARAKMEADMFRAELEYRQAYAKLKALTGAQ
Ga0210403_102460952 | 3300020580 | Soil | AEMAEMAVGLTKAKVAVGVKSSYLELDRSRKLYQLTRRMVSAGRVVDASYHPDDPEIQSAQAKMEADMFRAELEYREAYARVKSVMGGK
Ga0210403_112435092 | 3300020580 | Soil | KAQAEAADLGVQLTKAKVGAGVKSSYLELERSRQLVQLARRMVSATRIVEASYRSDDPEVESAQAKMEVDMYRAELEYRQAYARLRSLMGYK
Ga0210395_109255372 | 3300020582 | Soil | RPTPARRLEQCAIHPASYFDLDRSRQLYQLARRMASAAQVVDASYKSDNPEVESAQAKMEADMFRADFEYRQAYAKLKCLMGVK
Ga0210404_108174752 | 3300021088 | Soil | ADVKSSYFELERSRKLSQLARRMVSATQVMEASVQPDNSDVESAQAKMEADMFRAELEYRQAYSKLKSLMGGK
Ga0210404_109228031 | 3300021088 | Soil | LERSRKLSQLARRMVSATQVIEASYKSDDPEVESARAKMEADMFRAELEYRQAYARLKNLMGDK
Ga0210405_100257956 | 3300021171 | Soil | VTQGVKSSYLELERSRKLSQLARRMVSATQVVEASYKSDDPDVESAVARMEADMFRAELEYRQAYAKLKNLMGNK
Ga0210396_114506981 | 3300021180 | Soil | EAADLAVQLTKAKVAQGVKSSYLELERSRKLSQLARRMVSATQVIEARYKSDDPEVASARAKMEADMFRAELEYRQAYARLKSLMGSR
Ga0210388_101321311 | 3300021181 | Soil | AKVAASIKNSYFELQRSRQLTQLARRMAAATRLVEASYQPDNPEVESARAKVEADMFRAELEYRQAFARLKSLMGER
Ga0210389_107460472 | 3300021404 | Soil | DRSRKLYQLTRRMVSAGRVVDASYHPDDPEIQSAQAKMEADMFRAELEYREAYARVKSVMGGK
Ga0210387_103291671 | 3300021405 | Soil | KAKVAAGVKSSYLELNRSRKLYQLTRRMVSAGRVVDASYRSDNPEIKSAQARMEADMFRAELEYRQAYARLKSLTGGK
Ga0242664_10326542 | 3300022527 | Soil | QGVKSSYLELERSRKLSQLARRMVSATQVIEARYKSDDPEVASARAKMEADMFRAELEYRQAYARLKSLMGSR
Ga0242662_101274392 | 3300022533 | Soil | AELAVPLTKAKVSAEVKSSYFELERSRKLSEMARRMVSASQVVYASARSDNSDVESAQAKIEADMFRAELEYRQAYARLKSLMGVN
Ga0242653_10709841 | 3300022712 | Soil | ADLGVPLTKAKVAAGVKSSYYELERSRKLSQLARRMVSATQVIEASYKSDDPEVESARAKMEADMFRAELEYRQAYARLKNLMGDK
Ga0242661_11182551 | 3300022717 | Soil | RSRKLSQLARRMVSVTQVVEASYKSDDPDVESARAKMEADMFRAELEYRQAYAKLKALMGAR
Ga0242665_103326072 | 3300022724 | Soil | VPLTKAKVSAEVKSSYFELERARQLSQLARRMVSATQVVEASVHSENADVETAQAKMEADMFRAELEYRQAYSKLKSLMGAK
Ga0224572_10962091 | 3300024225 | Rhizosphere | KSSYFDLNRSRQLYQLARRMASAAQVVDASYKSDNPEVESAQAKMEADMFRADFEYRQAYAKLKALMGVK
Ga0209248_102196481 | 3300027729 | Bog Forest Soil | AGVKSSYFELDRSRQLYQLARRMVSATRVVDASYKSDDPEVDSAQAKMEADMYRAELEYRQAYAHLKGLMGNK
Ga0209655_101692642 | 3300027767 | Bog Forest Soil | AGVKSSYLELDRSRELYQLARRMVSAAGVVSANYKSDDPEVESAQARMEADMFRAELEYREAYSKLKTLTGTR
Ga0209772_102557962 | 3300027768 | Bog Forest Soil | GVKSSYLELDRSRELYQLARRMVSAAGVVSANYKSDDPEVESAQARMEADMFRAELEYREAYSKLKTLTGTR
Ga0209448_100889451 | 3300027783 | Bog Forest Soil | KAKVAAGVKSSYLELDRSRQLYQLARRMVSAAGVVNASYKSDDPEVESAQAKMEADMFRAELEYRQAYARLKSLMGEK
Ga0209773_100193874 | 3300027829 | Bog Forest Soil | DRSRQLYQLARRMVSAAGVVNASYKSDDPEVESAQAKMEADMFRAELEYRQAYAKLKDLTGTR
Ga0209275_106963292 | 3300027884 | Soil | FELDRSRQLYQLARRMVSSAGVLNASYKSDNPEVESEKAKMEADMFRAELEYRQAYAKLKSLTGAR
Ga0075382_117687401 | 3300030917 | Soil | GVQLTKAKAAEGIKSSKLELQRSRQQAQLARRMVSAWSIVDVSYQPDNPDVESARAKMEADMFRAELEYRQAYARLKSLMGGK
Ga0068589_116578351 | 3300030979 | Soil | LELQRSRQLAQLARRMVVSAGSIVDVSYQQDNPDVESAKAKMEADMFRAELEYRQAYARLKSLMGGK
Ga0170824_1166549153 | 3300031231 | Forest Soil | DGFKREHGVKEAKINAQAADLGVQLTKAKAAEGIKSSKLELQQSRQRAQLARRMVSAWSIVDVSYQPHNQDVESARAKMEADMFRAELEYRQAYARLKGLMGGK
Ga0170820_156874291 | 3300031446 | Forest Soil | AKINAEAADLGVQLTKAKAAEGIKNSYLELQRSRQLAQLARRMVVSAGSIVDVSYQPDNPDVESARAKMEADMFRAELEHRQAFARLKGLMGGK
Ga0170820_159502613 | 3300031446 | Forest Soil | SRQQAQLARRMVSAWSIVDVSYQPDNPDVESARARMEADMFRVELEYRQAYARLKGLMGG
Ga0310686_1074671122 | 3300031708 | Soil | VKEVNAQAEMADLGVQLTKAKVAAGVKSSYFELDRSRQLYQLARRMVSAAQVVDASYKSDNPEVESAQAKMEADMFRADLEYRQAYAKLKALMGGEQRAF
Ga0310686_1123383621 | 3300031708 | Soil | AGAVKSSYLDLDRSRQLAQLARRMVTATQVVEASYHADNSDNAEVESARAKMEADMFRAELEYREAYAKLKSLMGHN
Ga0310686_1146902461 | 3300031708 | Soil | VKSSYFELDRSRQLYQLARRMVSAAQVVDASYKSDNPAVESAQAKMEADMFRAELEYRQAYARLKSLMGNR
Ga0310686_1199542321 | 3300031708 | Soil | AADLGVELTKAKVAASVKNTYFELERSRQFTQLARRMMSATRVVEASYQADNPEVESARAKVEADMFRAELEYRQAYGRLKTLMGDK
Ga0307479_105246832 | 3300031962 | Hardwood Forest Soil | AEAADLGVQLTKAKVGAGVKSSYLELERSRQLVQLARRMMSATRIVEASYRSDDPEVESAQAKMEADMYRAELEYRQAYARLRSLMGHK
Ga0307479_118307981 | 3300031962 | Hardwood Forest Soil | QLTKAKVGAGVKSSYLELERSRQLVQLARRMVSATRIVEASYRSDDPEVESAQAKMEVDMYRAELEYRQAYARLRSLMGYK
Ga0348332_143149642 | 3300032515 | Plant Litter | MKEASAQAEAADLAVQLTKAKVAQGVKSSYLELERSRKLSQLARRMVSATQVIEAGYKSDDPEVESARAKMEADMFRAELEYSQA
Ga0335085_102685431 | 3300032770 | Soil | EHGVKERNAQAEAADLAVQLTKAKAAAAVKSSYLELDRLRNLYLLARRMVSAAQFMDTSYKSGDLDAQAAKAQMEADMYRAELEYRQAYAKLQSLTGN
Ga0335085_102799273 | 3300032770 | Soil | EHGVKERNAQAEAADLAVQLTKAKAAAAVKSSYLELDRLRNLYLLARRMVSAAQFMDTGYKSGDLDAQAAKAQMEADMYRAELEYRQAYAKVKSLTGN
Ga0335082_100014801 | 3300032782 | Soil | QAEMADMGVQLTKAKVAAAVKSAYFDLERSRQLYQLASRMVTATQVVPASYSPDDPEVESARATMEAEMLRSELEYRQAYARLKALTGTH
Ga0335082_100556741 | 3300032782 | Soil | QLTKAKVAAAVKSAYFELERSRQLSQLARRMVSSTQLVEASYNPNDPEVESARATIEADMFRTELEYRQAYAKLKALMGTR
Ga0335082_105392731 | 3300032782 | Soil | AADLAVQLTKAKVAAAVKSSYLELDRLRNLYLLSRRMVSAAQVMDTSYKSGDLDAQATKAQMEADMYRAELEYRQAYAKVKSLAGN
Ga0335082_105695972 | 3300032782 | Soil | DLGVQLTKAKVAAAVKSSYFELERSRQFTQLARRMMSATQVVEASYNPDDPEVESARAKVEADMFRAELEYRQAYAKLKALMGAQ
Ga0335082_117081751 | 3300032782 | Soil | KAKVAAAVKNSYFEMDRSRELFQLARRMVSATQVVEASYKPGDPEIESAQAKMEADMFRAELEYRQAYTKLKALMGAQ
Ga0335079_100096561 | 3300032783 | Soil | ANAEAADLGVQLTKAKVAAGVKSSSLELDRSRQLYQLARRMVSASQFVETSYKPDDPEVASARAKMEADMFRAELEYRQAYAKLKALVGTQ
Ga0335079_100626525 | 3300032783 | Soil | AADLAVQLTKAKVAAAVKSSYLELDRLRNLYLLARRMVSAAQFMDTSYKSDDLDAQAAKAQMEADMYRAELEYRQAYAKVKSLAGN
Ga0335078_118769791 | 3300032805 | Soil | GVQLTKAKVAAAVKTSYFELERSRQFSQLARRMVSATRMVEASYQPNNPEVESARAKMEAEMFRAELEYRQAYAKLKALMGTQ
Ga0335080_100525904 | 3300032828 | Soil | EHGVKERNAQAEAADLAVQLTKAKAAAAVKSSYLELDRLRNLYLLARRMVSAAHVMDTSYKSGDLDAQAVKAQMEADMYRAELEYRQAYAKVKSLTGN
Ga0335080_100905106 | 3300032828 | Soil | QLTKAKVAAAVKNAYFELERSRQLSQLARRMVSSTQVIEASYNPDDPEVESARATMEADMFRTELEYRQAYAKLKALTGSH
Ga0335080_101040201 | 3300032828 | Soil | QLYQLASRMVTATQVVQVNYSPDDPEVESARATMEAEMFRTELEYRQAYAKLKALTGAH
Ga0335080_105565022 | 3300032828 | Soil | DRLRNLYLLARRMVSAAQFMDTSYKSDDLDAQAAKAQMEADMYRAELEYRQAYAKVKSLAGN
Ga0335070_100669004 | 3300032829 | Soil | AEAADLAVQLTKAKAAAAVKSSYLELDRLRNLYLLARRMVSAAQFMDTSYKSGDLDAQAAKAQMEADMYRAELEYRQAYAKLQSLTGN
Ga0335081_122060261 | 3300032892 | Soil | ELDRSRQLYQLACRMVSATHVVEASYKFDDPEVESAQAKMEADMFRAELEYRQAYAKLRALMGGQ
Ga0335081_126572472 | 3300032892 | Soil | LSKAKAAAGVKSSYLELERSRQLAQLARRMVTASEIVNASYKPDDPEIESARAKMEADMFRAELEYRQAYGRLKNLLGH
Ga0335069_102727613 | 3300032893 | Soil | LTKAKVAAAVKSAYFELERSRQLYQLASRMVTATQVVPVSYNLEDPEVESARASMEAEMFRTELEYRQAYAKVKALTGSH
Ga0335069_103552803 | 3300032893 | Soil | SRQLYQLARRMVSASQLVEASYRPDDPEAVSARAKMEADMFRAELEYRQAYAKLKISVSG
Ga0335069_107021552 | 3300032893 | Soil | STYLELDRSRQLYQLASRMVSASQVVEASYTPDNPEVESARAKMEADMFRAELEYRQAYAKVKALMGAQ
Ga0335069_111244062 | 3300032893 | Soil | RKFTQLARRMVSATRVVEVSYHPEDPEVEAARARLEADMFRTELEYRQAYAKLKALLGAR
Ga0335069_120289921 | 3300032893 | Soil | ELERSRQFTQLARRMASSVQVVEASYKADDPEVQSARAKLEADMFRAELEYRQAYARLMTLVGNK
Ga0335074_110369763 | 3300032895 | Soil | KAKVAGAVKTTYLELDRSRQLYQLARRMVSATQVVEASYKPDDPEVESARAKMEADMFRAELEYRQAYTQLKTMMGAQ
Ga0335071_101524463 | 3300032897 | Soil | RQFTQLARRMASSVQVVEASYKADDPEVQSARAKLEADMFRAELEYRQAYARLMTLVGNK
Ga0335071_105003432 | 3300032897 | Soil | KVAAAVKSSYFELERSRQFTQMARRMMSSARLVEASYKSEDPEVDSARARMEADMFRAELEYRQAYAKLKALTGAH
Ga0335071_107873571 | 3300032897 | Soil | EHGVQEVKAQSQMADMGVQLTKAKVAAAVKNAYFELERSRQLYQLASRMVSSTQVVEASYNPNDPEVESARATMEADMFRTELEYRQAYAKLKGLMGTR
Ga0335071_111098991 | 3300032897 | Soil | EAADLGVELTKAKVAGAVKTSYLELDRSRQLYQLARRMVSATQVVEASYKPDDPEVESARAKMEADMFRAELEYRQAYAQLKALMGK
Ga0335071_117031942 | 3300032897 | Soil | KAKAAASVKTSSLELDRSRQLYQLARRMVSASQLVEASYRPDDPEAVSARAKMEADMFRAELEYRQAYAKLKISVSGQ
Ga0335071_117325641 | 3300032897 | Soil | VQLTKAKVAAAVRSSYLDLDRSRQLFQLASRMVSATKVVAASYNPDDPDVESARAKMEADMLRAELEYRQAYAKVKVLMGGAQ
Ga0335083_104658702 | 3300032954 | Soil | DLGVELVKAKVAAAVKNSYFEMDRSRELYQLARRMVSATQVVEASYKPGDPEIESAQAKMEADMFRAELEYRQAYTKLKALMGAQ
Ga0335076_107412391 | 3300032955 | Soil | DLAVQLTKAKVAAGVKNSYFEMERSRKLSELAHRMVSGAQVVDASYRSDDPDAKAAWAQMEADEFRAELEYRLAYAKLRALMGAD
Ga0335076_108166331 | 3300032955 | Soil | KSSYLELERSRQLAQLARRMVSASAIVNASYKTDDPEIESAQAKMEADMFRSELEYRQAYDRLKNLLGH
Ga0335076_109444811 | 3300032955 | Soil | DGIKESKANAEAADLGVQLTKAKVAAGVKSSYLELDRSRQFYQLARRMVSASQFVDASYKPDDPEVASARAKMEADMFRAELEYRQAYAKVKAQMGGQ
Ga0335076_117233761 | 3300032955 | Soil | LTKAKVAAAVKSAYFELERSRQFTQLARRMMSATQVVEASYKPDDPEVESARATMEADMFRAELEYRQAYAKVKTLMGAQ
Ga0335084_103013391 | 3300033004 | Soil | SSSLELERSRQLYQLARRMVSATQFVEASYRSDDPEAVSARARMEADMFRAELEYRQAYARLKAEMGAQ
Ga0335084_103371631 | 3300033004 | Soil | MADLGVQLTKAKVAAAVKSSYFELERSRQFTQLARRMASSARLVEASAKADDPEVESAQAQLEADMFRAEFEYRQAYAKLKALIDVQ
Ga0335084_120581041 | 3300033004 | Soil | ADLGVQLTKAKVAAKVKSSYLELDRSRQLYQLARRMVSATQAVEASYKPDDPDVASARAKMEADMFRAELEYRQAYGKLKALMGAQ
Ga0335073_110117413 | 3300033134 | Soil | KSSSLELERSRQLYQLARRMVSATQFVEASYRPDDPEAVSARARMEADMFRAELEYRQAYAKLKALMGGQ
Ga0335073_119800841 | 3300033134 | Soil | KAKVAAAVKSSYLDLDRSRQLYQLASRMVSATKVVEASYNPDDPDVESARATMEADMFRAELEYRQAYAKVKALMGAQ
Ga0335077_100711021 | 3300033158 | Soil | YFELERSRQLSQLARRMVSSTQLVEASYNPNDPEVESARATIEADMFRTELEYRQAYAKLKALMGTR
Ga0335077_102111181 | 3300033158 | Soil | HDVQESRANAEAADLGVQLTKAKVAAGVKSSSLELDRSRQLYQLARRMVSASQFVETSYKPDDPEVASARAKMEADMFRAELEYRQAYAKLKALVGTQ
Ga0335077_107074341 | 3300033158 | Soil | DRSRQLYQLARRMVSATQFVEASYKPDDPEVTSARAKMEADLFRAELEYRLAYDKVKAQMRGQ
Ga0370515_0235295_3_290 | 3300034163 | Untreated Peat Soil | GEARANAEAADLGVQLTKAKVAGAVKSTYFGLESSRKLTELARRMVSANQLVDASYKPGNTEVESAQAKMEAEMFRAELEYRQAYAKVKALMGPN




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice