NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F096699


Overview

Basic Information
Family ID F096699
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 84 residues
Representative Sequence MSNQYRIEPVGSAFIVIDDLGEPVGRYPTEDAARQDIERCKKEDAMWETAKLLVDTAIKAHMQMHGVDRATAAYWVNSAMGGV
Number of Associated Samples 66
Number of Associated Scaffolds 104
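
As a minimal sketch, the representative sequence listed above can be written out as a FASTA record for use with external tools. The header text, the `to_fasta` helper, and the 60-column wrap are illustrative assumptions; the format of NMPFamsDB's own downloads is not shown on this page.

```python
FAMILY_ID = "F096699"
REPRESENTATIVE = (
    "MSNQYRIEPVGSAFIVIDDLGEPVGRYPTEDAARQDIERCKKEDAMWETAKLLVDTAIKAHMQMHGVDRATAAYWVNSAMGGV"
)


def to_fasta(seq_id: str, sequence: str, width: int = 60) -> str:
    """Return a FASTA record with the sequence wrapped to `width` columns."""
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return f">{seq_id}\n" + "\n".join(chunks) + "\n"


if __name__ == "__main__":
    # Hypothetical header; any identifier that suits the downstream tool works.
    print(to_fasta(f"{FAMILY_ID}_representative", REPRESENTATIVE), end="")
    print(f"{len(REPRESENTATIVE)} residues")
```

The record can be redirected to a file and passed to any tool that reads FASTA.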

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 66.67 %
% of genes near scaffold ends (potentially truncated) 0.96 %
% of genes from short scaffolds (< 2000 bps) 0.96 %
Associated GOLD sequencing projects 56
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70


Most Common Taxonomy
Group Unclassified (99.038 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (57.692 % of family members)
Environment Ontology (ENVO) Unclassified (58.654 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (57.692 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 42.34%, β-sheet: 13.51%, Coil/Unstructured: 44.14%

Predicted 3D Structure

pTM-score: 0.70

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
c.23.1.3: Positive regulator of the amidase operon AmiR | d1qo0d_ | 1qo0 | 0.62564
a.104.1.1: Cytochrome P450 | d3czha1 | 3czh | 0.60638
a.127.1.2: HAL/PAL-like | d1w27a_ | 1w27 | 0.60191
c.124.1.5: IF2B-like | d1t5oa1 | 1t5o | 0.60117
a.216.1.1: I/LWEQ domain | d1r0da_ | 1r0d | 0.59578



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF00589 | Phage_integrase | 3.85
PF14373 | Imm_superinfect | 2.88
PF13643 | DUF4145 | 2.88
PF13664 | DUF4149 | 2.88
PF04607 | RelA_SpoT | 1.92
PF01541 | GIY-YIG | 0.96
PF13279 | 4HBT_2 | 0.96
PF13466 | STAS_2 | 0.96
PF00005 | ABC_tran | 0.96
PF00656 | Peptidase_C14 | 0.96
PF00239 | Resolvase | 0.96
PF13455 | MUG113 | 0.96
PF10282 | Lactonase | 0.96
PF02574 | S-methyl_trans | 0.96
PF13676 | TIR_2 | 0.96
PF12697 | Abhydrolase_6 | 0.96
PF07730 | HisKA_3 | 0.96
PF07508 | Recombinase | 0.96
PF00376 | MerR | 0.96
PF05065 | Phage_capsid | 0.96
PF13205 | Big_5 | 0.96
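
The frequencies above are percentages of the family's 104 scaffolds, so each value should correspond to a small whole number of scaffolds (0.96 % is about 1 of 104). The sketch below converts a few rows back to counts. The `percent_to_count` helper is hypothetical, and reading the column as "scaffolds carrying the domain, divided by 104" is an assumption based on the column header.

```python
TOTAL_SCAFFOLDS = 104  # "Number of Associated Scaffolds" for this family

# A few rows from the Pfam table: Pfam ID -> % frequency.
pfam_freq = {
    "PF00589": 3.85,
    "PF14373": 2.88,
    "PF04607": 1.92,
    "PF01541": 0.96,
}


def percent_to_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent * total / 100)


if __name__ == "__main__":
    for pfam_id, pct in pfam_freq.items():
        count = percent_to_count(pct)
        print(f"{pfam_id}: {pct:.2f} % ~ {count} of {TOTAL_SCAFFOLDS} scaffolds")
```

Read this way, even the most frequent neighbouring domain appears on only a handful of scaffolds.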

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 1.92
COG0646 | Methionine synthase I (cobalamin-dependent), methyltransferase domain | Amino acid transport and metabolism [E] | 0.96
COG2040 | Homocysteine/selenocysteine methylase (S-methylmethionine-dependent) | Amino acid transport and metabolism [E] | 0.96
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.96
COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 0.96
COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 0.96
COG4249 | Uncharacterized conserved protein, contains caspase domain | General function prediction only [R] | 0.96
COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 0.96
COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 0.96
COG4653 | Predicted phage phi-C31 gp36 major capsid-like protein | Mobilome: prophages, transposons [X] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 99.04 %
All Organisms | root | All Organisms | 0.96 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002917|JGI25616J43925_10126836 | Not Available | 1034
3300011269|Ga0137392_10111092 | Not Available | 2174
3300020140|Ga0179590_1000671 | All Organisms → cellular organisms → Bacteria | 4574




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 57.69%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 24.04%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.73%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 2.88%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.88%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.96%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.96%
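
The habitat paths above are hierarchical, so the distribution can be rolled up to a coarser level by truncating each path and summing the percentages. The sketch below does this for the rows listed; the `aggregate_by_level` helper is hypothetical and is not the database's own aggregation code.

```python
from collections import defaultdict

SEP = " → "

# (ecosystem path, % of family members), copied from the habitat table.
habitat_rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 57.69),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 24.04),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 6.73),
    ("Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands", 2.88),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 2.88),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 1.92),
    ("Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil", 1.92),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 0.96),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 0.96),
]


def aggregate_by_level(rows, depth):
    """Sum percentages over the first `depth` levels of each ecosystem path."""
    totals = defaultdict(float)
    for path, percent in rows:
        key = SEP.join(path.split(SEP)[:depth])
        totals[key] += percent
    return dict(totals)


if __name__ == "__main__":
    for key, percent in aggregate_by_level(habitat_rows, depth=2).items():
        print(f"{key}: {percent:.2f} %")
```

Because each listed percentage is rounded to two decimals, the rolled-up totals may fall slightly short of 100 %.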




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300002910 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm | Environmental
3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental
3300004082 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300020140 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300024288 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300032954 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25613J43889_102289631 | 3300002907 | Grasslands Soil | FAVSASTPPSFLHSLNGLWVFQFGNLQYSGVGVTPMTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA*
JGI25615J43890_10584251 | 3300002910 | Grasslands Soil | MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA*
JGI25616J43925_101268361 | 3300002917 | Grasslands Soil | MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSA
Ga0062384_1003892612 | 3300004082 | Bog Forest Soil | MSTSKRIELAGSAFIVIDDLGEQVGAYPTENAAQKDIERCEREDRMYEVAKLLVDTAIKAPMQLHGVDRETARYWVCGAMG
Ga0062389_1005609643 | 3300004092 | Bog Forest Soil | MSTSKRIELAGSAFIVIDDLGEQVGAYPTENAAQKDIERCEREDRMYEVAKLLVDTAIKAPMQLHGVDRETARYWVCGAMGVV*
Ga0066690_100935392 | 3300005177 | Soil | MKPAGTKFVVIDPWGEQVNTYPTEDAAKKDIERCKREDRMCETAKQLVDTAIKAHMQMFGVDRETARYWVCRAMDVV*
Ga0099793_101341553 | 3300007258 | Vadose Zone Soil | MTNDYRIEPDGPAFIVIDPWGEQLVNTYPTEGAARQDIERCKKEDAMYETAKQLVDFAIKAHMQVFGVDRETARYWVCSAAEASD*
Ga0099829_111798121 | 3300009038 | Vadose Zone Soil | MINEYRIEPVGSRFAVIDPWGERVGTYPTGEVAKQDIERCKKEDRMYETAKQLMDTAIKAHMQMFEIDRETAQYWIRSASEVV*
Ga0137392_101110925 | 3300011269 | Vadose Zone Soil | MSTEYRVEAAGSQFTVIDPWGEQVNTYPTERAAKQDIERCKKQDSMYETAKHLVDAAIKAHMQMFEVDRETARNWICSASEVVG*
Ga0137392_106427681 | 3300011269 | Vadose Zone Soil | MNNDYRIKSDGPEFTVIDPWGERVDVYSTEDAARQDIERCKKEDAMWETAKLLVDAAIKAHMQIHDVDRETSSYW
Ga0137391_106886762 | 3300011270 | Vadose Zone Soil | LNRGLPMKTDYRIEPDGGEFILVDASGETVGVYPSQDAAKQDIARCEQEDAMYETAKQLVATAVKAHMQMHGVDRETASYWIRSAAETVD*
Ga0137393_102548422 | 3300011271 | Vadose Zone Soil | MSNEYRIEPAGPAFIVIDYAGEQVNTYLTEEAAKQDIERCKKEDAMYETAKQLVDTAIKAHMQMFGVDPRVGAVLD*
Ga0137389_100323186 | 3300012096 | Vadose Zone Soil | MMLGYSSLLVKLMTNDYRIEPVGSLFIVVDPWGEIVNRFSTEDAARQDIERCEREDAMWETAKMMVDTTVKAHMRMHGVDRATSRYWIQSAAETSD*
Ga0137389_117651551 | 3300012096 | Vadose Zone Soil | MSSEYRVETDGSQFAVIDPWGEQVNAYLTEAAAQQDIERCKKEDVMYETAKQLVDTAVKSHMQMFGIDRETASYWIRSASEVVG*
Ga0137388_105897592 | 3300012189 | Vadose Zone Soil | MNTEYRLETAGEQFIVVDPWGEQVDTYLTEEVAKQDIERCKKEDLMYKTAKQVVDTAIKAHMQMFGVDRETARYWIQSASETVD*
Ga0137388_118709642 | 3300012189 | Vadose Zone Soil | MSSEYRVETDGSQFAVIDPWGEQVNAYLTEAAAQQDIEHCKKEDVMYETAKQLVDTAVKSHMQMFGIDRETASYWIRSASEVVG*
Ga0137364_100581074 | 3300012198 | Vadose Zone Soil | MSKDYRIEPAGTHFTVIDPTGEQVDTYPSKEAAEQDIKHCKKEDRMYETAKHFVDTAIKAHMERFGIDREMARYWIHSAAETF*
Ga0137363_100557814 | 3300012202 | Vadose Zone Soil | MSNEYRIEPAGPAFIVIDYAGEQVGTYPTEDAAKRDIERCQREDAMYETAKQLVDIAIKAHMERFGVDPRVGAVLD*
Ga0137363_102447142 | 3300012202 | Vadose Zone Soil | MSNEYRIEPAGPSFIVIDGLGEQVGTYPTEDAAQQDVERCKREDAMYETAKLLVDSAIKAHMQMHSVDRETARYWISSAMDVVD*
Ga0137363_103556101 | 3300012202 | Vadose Zone Soil | MSNGYRVEPVGNAFIVIDDLGEPVGRYPTEEEAQKDIERCKKEDAMWETAKLLVDTAIKAHMQMYSVDRETARYWINSALGS*
Ga0137399_100099005 | 3300012203 | Vadose Zone Soil | VERLMTKEYRIEPVGSLFIVVDPWGEIVNRCSTVDEARQDIERCKKEDAMYEGAKTFLDIAVKAHMRMFEVDRETARYWVCSAMDVV*
Ga0137399_100198935 | 3300012203 | Vadose Zone Soil | MSNEYRIESVGSQFIVIDPAGEIVNRYTTEDAALQDIGRCKKEDAMYAAAKQLVDTAIKAHMQVFGVDRETARYWVSSAMEVVV*
Ga0137399_105595061 | 3300012203 | Vadose Zone Soil | VGSQFIVIDPAGEIVNRYTTEDAALQDIGRCKKEDAMYETAEQLVDAAVKAHMERFGVARETARYWIFSASEVVD*
Ga0137399_114515532 | 3300012203 | Vadose Zone Soil | MSTEYRIEPVGSLFIVVDPWGEIVDRCSTEEAARQDIERCEREDALYETAKLLVDTAIKAHMRMHGVDREAARYWVSNAMETAD*
Ga0137362_101137671 | 3300012205 | Vadose Zone Soil | AFIVIDPAGEIVNRYTTEDAALQDIGRCKKEDAMYATAKQLVDTAINAHMQVFGVDRETARYWVCSASEVV*
Ga0137362_101341933 | 3300012205 | Vadose Zone Soil | MDLEDSNTVVGDTYRVECVGSQFIVVDPAGEIVNRYNTEDAALQDIERCKKEDAMFETGKTLIDLAIKAHMQRFGIDRETAVYWIRTASEVVG*
Ga0137362_111359091 | 3300012205 | Vadose Zone Soil | TKYRVETAGSQFIVNDPWGEQVGTYLTEEAAKQDIERCKKEDAMYETAKQLVDTTVKAHMQMFRVGRETASYWIRSASETVEAALNT*
Ga0137362_117352781 | 3300012205 | Vadose Zone Soil | FGDLQYSGVGVTLMTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQNIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA*
Ga0137377_116991482 | 3300012211 | Vadose Zone Soil | VKPLTNDYRIEPVGTAFIVIDGLGEQVNTYPTEDAAQQDIERCHKEDAMWETAKQLVDTAIKAHMRIHDVDRETAKRLIRDSAEVAD*
Ga0137360_107622752 | 3300012361 | Vadose Zone Soil | MSNEYKIEPVGNAFIVIDDLGERVDTYPSKEAAQQDVERCKREDAMWSTAKQLVDTAVKAHMQMHGVDRETARYWVSSATDVV*
Ga0137361_103536593 | 3300012362 | Vadose Zone Soil | MSTKYRVETAGSQFIVNDPWGEQVGTYLTEEAAKQDIERCKKEDAMYETAKQLVDTTVKAHMQMFRVGRETASYWIRSASETVEAALNT*
Ga0137361_118886122 | 3300012362 | Vadose Zone Soil | MTNEYRIEAAGPAFIVIDPWGERVDTYHTEAAAKQDIERCKREERMYETAKQLVDTAIKAHMQMFGVDRETARYWVCSAAEVMD*
Ga0137358_100644551 | 3300012582 | Vadose Zone Soil | MKTDYRIEPVGSLFIVVDPWGEIVNRFSTEDAARQDIERCEREDAMWETAKMMVDTAVKAHMKMHCVDRNTSRYWISSAAETVD*
Ga0137397_100687243 | 3300012685 | Vadose Zone Soil | MTNEYRIEPVGSLFIVVDPWGEIVNRCSTEEAAKADIERCEQQDAMWEPAKLMVDTAIKAHMRMHGVDRETARYWLSSAMDVV*
Ga0137396_100496915 | 3300012918 | Vadose Zone Soil | MSNEYRIESVGSQFIVIDPAGEIVNRYTTEDAALQDIGRCKKEDAMYETAKQLVDFAIKAHMQVFGVDRETARYWV
Ga0137396_104653752 | 3300012918 | Vadose Zone Soil | MSYEYRVASAGNQFIVIDDAGEQVGTYPTEEAAKRDIERCKKEDAMYETAKQLVDAAVKAHMQRFGIDRETGELLDSQRVRRDIDSGHRG*
Ga0137394_100253384 | 3300012922 | Vadose Zone Soil | MRNEYRIEPAGSQFTVIDPWDEQVNTYPTEDAAKQDIERCKKEDAMYKTARQLVDSTIRTHMQMFGVDRETARYWVFSAAEVMD*
Ga0137394_100390825 | 3300012922 | Vadose Zone Soil | MNTNEYRIEPAGPAFIVIDPWGEAVNTYATLDAAKQDIERCKREDAMHETAKQLVDTAIKTHTHMFGVDRETASYWVCSAMEVVD*
Ga0137394_101828772 | 3300012922 | Vadose Zone Soil | MTNEYRIEPVGSLFIVVDPWGEIVNRCSTEEAAKADIERCEQQDAMWETAKLMVDTAIKAHMRMHGVDRETARYWLSSAMDVV*
Ga0137394_103383122 | 3300012922 | Vadose Zone Soil | MKTDYCIEPSGPAFTVIDPWGETVNTYATEGAAQHDIERCKREDAMYETAKQLVDTTIKAHMKMHGVDRETARYWVCSAMDVAD*
Ga0137394_112285302 | 3300012922 | Vadose Zone Soil | EYRIEPVGSLFIVVDPWGEIVNRFSTEDAARQDIERCEREDAMWETAKMMVDTAVKAHMKMHCVDRAVSRYWISSAAEVAE*
Ga0137359_101296555 | 3300012923 | Vadose Zone Soil | LFIVVDPWGEIVNRFSTEESARQDIERCEREDAMWQTAKMMVDTGIRAHMRMHGVDRETSKRLIRDAAEVAD*
Ga0137359_113053231 | 3300012923 | Vadose Zone Soil | MSTEYHIEIVGSRFIVIDPDGEIVNTFHTEDAAKQDLERCKKNDEMWETAKLLVDIAIKTHMEVFGVDRETSRYWISSAA*
Ga0137413_105402342 | 3300012924 | Vadose Zone Soil | MSAEYRIEPIDNAFIVIDSWGEQLVHTYPPEDEARQDIGRCKKEEAMWETTKLLVDAAIKAHMQIHDIGRETAAYWINSALGGA*
Ga0137419_100238763 | 3300012925 | Vadose Zone Soil | MTNDYRIEPAGPAFIVIDPWGEQLVNTYPTEGAARQDIERCKKEDAMYETAKQLVDFAIKAHMQVFGVDRETARYWVCSAAEASD*
Ga0137419_108804611 | 3300012925 | Vadose Zone Soil | MNTDYRIEPAGPAFIVIDPWGEQLVNTYPTEDAARQDIERCKREDAMYETAKQLVDTAVKAHMQMFGVDRETASYWIHSAAEVVA*
Ga0137419_108857382 | 3300012925 | Vadose Zone Soil | MKYEYRIEPVGSAFIVIDDLGEPVGRYPTEAAAQQDIERCKKEDAMWESAKLLVDAAIKAHMQMHDVDRETAAYWINSAMGGA*
Ga0137419_111417311 | 3300012925 | Vadose Zone Soil | VIDDLGEPVGRYPTEEAARQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA*
Ga0153915_100744402 | 3300012931 | Freshwater Wetlands | MAMGTEYRIESAGTQFIVIDPAGEQVDTYPTQEAAKQDLERCKKDDSIWETAKLLVEIAIKAHMQMFGVDRETARYWVRSASDVMD*
Ga0153915_103667982 | 3300012931 | Freshwater Wetlands | VSNEYRVETAGGESAVIDPCGEQVDTYPAADSAKQDIERCKKDDAMWSTAKQLVDTVVKAHMQMFEVDCETASYWIRSASEVME*
Ga0153915_120508031 | 3300012931 | Freshwater Wetlands | KYRIESTGTQFVVIDPAGAQVDSYPTEEAAKQDVERCKCEDRMYEIAKLLVDIAIKSHMQMFGVDRETAQDCVSSAAEVVD*
Ga0137410_111265542 | 3300012944 | Vadose Zone Soil | MSNEYRIESVGSQFIVVDPAGEIVNRYTTEDAAEQDIERCKREDAMYQTAKQLVDTAIKAHMQMFGVDRETASYWIRSAAEASD*
Ga0137405_11651585 | 3300015053 | Vadose Zone Soil | MNTNYRIEPVGSLFIVVDLWGEIVNRCSTEEAARQDIERCQKEDAMWETAKLMVDIAVKAHMRMHGVDRAVARYWIQSAAEVAD*
Ga0137405_11651591 | 3300015053 | Vadose Zone Soil | LEDSRRRKLGYSSHLVTTMNTNYRIEPVGSLFIVVDLWGEIVNRCSTEEAARQDIERCQKEDAMWETAKLMVDIAVKAHMRMHGVDRAVARYWIQSAAEVAD*
Ga0137420_14933011 | 3300015054 | Vadose Zone Soil | VQIIVLQYSGVGVIAMSTEYRIESSGNTFIVIDPWGEPLVHMYPTEDAARQDIERCKKEDAMWESAKLLVDAAIKAHMQMHDVDRETATYWVCSAMDVVD
Ga0137418_106400932 | 3300015241 | Vadose Zone Soil | MTTNYRIECVGRQFIVVDPAGEIANRHTTEEAAMADIERCKLEDTTYETAKQLVDTAIKAHMERFGVDRETARYWVCSAAEASD*
Ga0137409_112787091 | 3300015245 | Vadose Zone Soil | MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEVAQQNIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINS
Ga0179590_10006713 | 3300020140 | Vadose Zone Soil | MTNEYRIESVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA
Ga0179592_103813192 | 3300020199 | Vadose Zone Soil | MTNDYRIEPAGPAFIVIDDLGERVGTYPTEEIAKQDIQRCKKEDAMYQTAKQLVDTAIKAHMEMFGVDRETARYWVCSASETAE
Ga0210407_101323322 | 3300020579 | Soil | MATKYRIEPAGSAFVVIDDAGEQVGTYPTENLAKQDIERCLKEDAMYETAKQLVDFAIKTHMQMFRVDRAVSRYWIGSAMEVV
Ga0210403_104592792 | 3300020580 | Soil | MSIEYRIEPAGNQFIVIDPWGEQLVHTYPTEDAARQDIERCKKEEAMWETAKLLVDAAIKAHMQIHDVDRETAAYWINSALGGV
Ga0210403_113318941 | 3300020580 | Soil | LGKLEYSGVGVTTVSTKYCIEPAGNQFIVIDPWSEQLVHTYPTEDAARQDSELCMKEDAMWETAKLLVDAAIKVRMEIHGV
Ga0210403_114410332 | 3300020580 | Soil | MSTEYCIEPAGTGFIVIDPWGEQLVHTYPTEDAARHDIERCKKEEAMWETAKLLVDTAIKAHMQIHDVDREEAAYWINSALGGT
Ga0210399_101583933 | 3300020581 | Soil | MSTEYHIEPSGSQFIVIDPGGEQVNAHPTEDAAKQEIERCKKEDAMWDAAKTLIENAIQAHMEMFGVDRQTARYWINSAAGVNE
Ga0210399_103250432 | 3300020581 | Soil | MSDEYRIEPAGPSFIVIDDAGERVGTYPTEDAARQAIERCEKEDRMYETAKQLVDTAIKAHMQMFGIDRETARYWICSASEATD
Ga0210401_1001424410 | 3300020583 | Soil | VGILQYSGVGVTPMIYDYRIEPVGSAFIVIDDLGEPVGRYPTEDAARQEIERCKKEEAMWGTAKLLVDVAIKAHMQIHDVDRETAAYWINSALRGDLNR
Ga0210401_100318047 | 3300020583 | Soil | MSTEYRIQPCENGFLVIDPWGEQLAHTYPTEDAARQDIERCKKEEAMWETTKQLVDAAIKAHMQIHDVDRETAAYWVNSALGGT
Ga0210401_101198802 | 3300020583 | Soil | MSTEYRIEPDGPEFTVIDPWGEQLVHTYPTEDAARQDIERCKKEEVMWETAKLLVDTAIKAHMQLHGVDRETAAYWVNSALGGV
Ga0210404_105939061 | 3300021088 | Soil | TEKRSCASWVDMVEGNYSGYRIEPAGSQFAVIDPAGEQVDTYPTEEAAKQDTERCKKEDAMYETAKRLVDTAVKAHMQMFGVDRETARYWIKSALGGV
Ga0210406_108995282 | 3300021168 | Soil | MTTEYRIEPAGPAYIVIDPWGEQLVHRYPTEDAARQDIERCKKEEAMWETAKLLVDTAIKAHMQIHDVDRETAAYWVNCALGGT
Ga0210406_109417742 | 3300021168 | Soil | MSTDYRIEPVGNQLIVIDPLGEIVGRYPTEDAARRDIERCKKEDAMWETAKLLVDAAIKAHMQLHDVD
Ga0210406_110166101 | 3300021168 | Soil | MSNEYRIEPAGPSFTVIDPWGEQVNKYPTEETAKQDIERCKREDAMWDAAKQLVDTAIKAHMQMFGVDRETARCWVC
Ga0210400_112368581 | 3300021170 | Soil | MSNDYHIEPVGSAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEEAMWETAKLLVDAAIKAHMQIHGVDRETAAYWVNSALGGV
Ga0210396_114289451 | 3300021180 | Soil | MRYEYRIESVANAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEEAMWETAKLLVDAAIKAHMQIHDVDRETAAYWINSALGGV
Ga0210393_113883551 | 3300021401 | Soil | IVIDPWGKNLVDSFLTEEEAQHAIEECKKEDAMWETTELLVDPAIKAHMQMHDVDREIGAYWINSALGGS
Ga0210397_100042882 | 3300021403 | Soil | MNTNYRIEPVGSLFIVVDPWGEIVNRFSTEDAAQQDIERCKREDRMYDTAKQLVDIAIKAHMQMHCVDRETARYWVSSAAEATD
Ga0210389_1000015544 | 3300021404 | Soil | MSTEYRIEPTGSQFIVVDPWGEQLVDTYPTEDAARQDIERCKKEDAMWETAKLLVDTAIKAHMQIHDVDRETAAYWVNSALGGV
Ga0210387_1000010365 | 3300021405 | Soil | MSIEYRIEPVGNQFIVIDPWGENLVDSFLTEEEAQHAIEECKKEDAMWETTELLVDPAIKAHMQMHDVDREIGAYWINSALGGS
Ga0210398_100351192 | 3300021477 | Soil | MTNDYRIESAGSAFILIDDADEQVNTYRTEDAAKKDLERCKREDAMWESAKFLVDIAIKTHMEMYGVDRDTARYWINSAMGGA
Ga0210402_111368982 | 3300021478 | Soil | MSNQYRIEPVGSAFIVIDDLGEPVGRYPTEDAARQDIERCKKEDAMWETAKLLVDTAIKAHMQMHGVDRATAAYWVNSAMGGV
Ga0210410_1000415720 | 3300021479 | Soil | MSTEYRIEPAGPAYIVIDPWGEQLVHTYPTEDAARQDIERCKKEEAMWETAKLLVDTAIKAHMQIHDVDRETAAYWINSALGGT
Ga0210410_109262192 | 3300021479 | Soil | MNTEYRIEPAGSQFIVIDPWGEYLVHTFQTEEAAQHAIAECVKEEAMWETAKLLVDAAIKAHMQIHDVDRVTAAYWVNSALGGT
Ga0210409_106149011 | 3300021559 | Soil | MRYEYRIESVANAFIVIDDLGEPVGRYPTEEAARQDIQRCKREDEMWESAKLLVDAAIKAHMEKHGVGRESAAYWINSALGGV
Ga0179589_106002152 | 3300024288 | Vadose Zone Soil | ASALEDSRRKLEYSGVGVMPMSTDYRIEPVGSAFIVIDDLGEPVGRYPTEDAAQQDIQRCKKEDAMWETAKLLVDNAIKAHMQMHDVDRETAAYWINSALGGT
Ga0137417_14925623 | 3300024330 | Vadose Zone Soil | MNTDYRIEPAGPAFIVIDPWGEQLVNTYPTEDAARQDIERCKREDAMYETAKQLVDTAVKAHMQMFGVDRETASYWIHSAAEVVA
Ga0209240_11191572 | 3300026304 | Grasslands Soil | MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA
Ga0209131_13942841 | 3300026320 | Grasslands Soil | GVTPMTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA
Ga0209648_100342276 | 3300026551 | Grasslands Soil | MSNEYRIEPAGTQFTVIDPWGEQLVNTYPTEEAAKQDIERCKREDRMYETAKQLVDTAVKAHMQMNGVDRETARYWVCSAMDVV
Ga0209648_108345832 | 3300026551 | Grasslands Soil | MTNDYRIEPVGSAFIVIDDLGEPVGRYPTEEAAQQDIQRCKREDAMWETAKLLVDVAIKAHMQMHDVDRETSAYWINSALGGV
Ga0179593_10971714 | 3300026555 | Vadose Zone Soil | MTNDYRIEPAGTAFIVIDDLGEQVDTYPTEEAAKQDIERCKREDRMYETAKLLVDTAIKAHMERFGVDRETARYWIFSASEVVD
Ga0179593_11302011 | 3300026555 | Vadose Zone Soil | MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEVAQQNIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA
Ga0179587_100365254 | 3300026557 | Vadose Zone Soil | MPMTNDYRIEPAGPAFIVIDPWGEQLVNTYPTEGAARQDIERCKKEDAMYETAKQLVDFAIKAHMQVFGVDRETARYWVCSAAEASD
Ga0179587_101627343 | 3300026557 | Vadose Zone Soil | MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGV
Ga0179587_103694572 | 3300026557 | Vadose Zone Soil | MSHEYRIQPAGPSFIVIDPWGEQVSTYPTEESAQQDIERCKGEEKMYETAKQLVDAAVTAHVQMHGVDRETARYWVCSAAEVVA
Ga0179587_105433612 | 3300026557 | Vadose Zone Soil | GSQFIVIDPAGEIVNRHTTEDAALQDIGRCKKEDAMYETAEQLVDAAVKAHMERFGVARETARYWIFSASEVVD
Ga0179587_109981412 | 3300026557 | Vadose Zone Soil | MSTEYRIEPDGTQFILIDDLGEQVGTYPTADAAKQDIERCKREEAMYETAKQLVDTAIKTHMEMFGVDRETAMHWIRSASEAAD
Ga0209526_100453193 | 3300028047 | Forest Soil | MPSEYHIQPAGPSFIVIDDLGEQVGTYPTEEAAKQDIERCKREDAMYESAKLLVGIAIKAHMARFGVDREESRYWVCSAMEAV
Ga0137415_110193062 | 3300028536 | Vadose Zone Soil | MATEYRIEIVGSRFIVIDPDGEIVNTFHTEDAAKQDLERCKKNDEMWETAKLLVDIAIKTHM
Ga0073994_122006352 | 3300030991 | Soil | MPMSNEYHIEPAGPAFIVIDPWGEQLVNTYPTIEAARQDIERCKREDAMYETAKQLVDTAIKAHMQMFEVDRETARYWVSSAGRGF
Ga0307475_101353472 | 3300031754 | Hardwood Forest Soil | MATKYRIEPAGSAFVVIDDAGEQMGTYPTENLAKQDIERCLKEDAMYETAKQLVDFAIKTHMQMFRVDRAVSRYWIGSAMEVV
Ga0307479_106546702 | 3300031962 | Hardwood Forest Soil | MEELADMSNDYRIEPAGTAFIVIDPAGEQLVDTYPTEAAAKHDIERCVKEDAMYETAKQLVDTAIKAHMQMFGIDRETSRYWVCSAAEASD
Ga0307471_1007932593 | 3300032180 | Hardwood Forest Soil | MSNEYRIEPAGRAFIVIDDVGEQVDTYPTKEAAQQDIERCKKEDAMYETAKQLVDIAIKAHMHMHGVDRETARYWVCSAMDVVG
Ga0335085_100691644 | 3300032770 | Soil | MSTDYRIEPVGHQFIVLDPWGEQLVNTYPTEEAARQDIERCKKEDAMWETAKLLVDNAVRAFMQLHGVDRETAERAICDVMGG
Ga0335083_109738411 | 3300032954 | Soil | SDYQFTVVDPWGEQLVDICPTEDAARQDIERCKKEDAMWETGKLLVDTAIKAHMQLHNVGRDTALRLLRDAAEVTDWR
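
Some sequences in this listing end in `*` and some do not. In translated gene calls a trailing `*` conventionally marks a stop codon, but the page does not say so, so treating it that way is an assumption. Under that assumption, the sketch below strips the marker before computing lengths and records which entries carried it. The `split_stop` helper is hypothetical, and the two rows are copied from the listing.

```python
# (protein ID, sequence as listed) for two family members from the table.
members = [
    ("JGI25615J43890_10584251",
     "MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSALGGA*"),
    ("JGI25616J43925_101268361",
     "MTNEYRIEPVSNAFIVIDDLGEPVGRYPTEEAAQQDIERCKKEDAMWETAKLLVDTAIKSHMKIHGVDRETAAYWINSA"),
]


def split_stop(sequence: str):
    """Return (residues, has_stop); assumes a trailing '*' marks a stop codon."""
    has_stop = sequence.endswith("*")
    return sequence.rstrip("*"), has_stop


if __name__ == "__main__":
    for protein_id, sequence in members:
        residues, has_stop = split_stop(sequence)
        marker = "with" if has_stop else "without"
        print(f"{protein_id}: {len(residues)} residues, {marker} trailing '*'")
```

Stripping the marker first keeps length statistics comparable across entries.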




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice