NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F082403

Metagenome Family F082403

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F082403
Family Type Metagenome
Number of Sequences 113
Average Sequence Length 70 residues
Representative Sequence LREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Number of Associated Samples 80
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 7.96 %
% of genes from short scaffolds (< 2000 bps) 1.77 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (92.035 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(57.522 % of family members)
Environment Ontology (ENVO) Unclassified
(60.177 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(63.717 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 65.52%    β-sheet: 0.00%    Coil/Unstructured: 34.48%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 113 Family Scaffolds
PF07687M20_dimer 11.50
PF07995GSDH 7.08
PF01139RtcB 2.65
PF10091Glycoamylase 1.77
PF02517Rce1-like 1.77
PF13520AA_permease_2 1.77
PF00196GerE 0.88
PF05368NmrA 0.88
PF03551PadR 0.88
PF12802MarR_2 0.88
PF12158DUF3592 0.88
PF01569PAP2 0.88
PF08557Lipid_DES 0.88
PF02566OsmC 0.88
PF01323DSBA 0.88
PF01784NIF3 0.88
PF07883Cupin_2 0.88
PF12697Abhydrolase_6 0.88
PF13618Gluconate_2-dh3 0.88
PF13540RCC1_2 0.88
PF05685Uma2 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 113 Family Scaffolds
COG2133Glucose/arabinose dehydrogenase, beta-propeller foldCarbohydrate transport and metabolism [G] 7.08
COG1690RNA-splicing ligase RtcB, repairs tRNA damageTranslation, ribosomal structure and biogenesis [J] 2.65
COG1266Membrane protease YdiL, CAAX protease familyPosttranslational modification, protein turnover, chaperones [O] 1.77
COG4449Predicted protease, Abi (CAAX) familyGeneral function prediction only [R] 1.77
COG0327Putative GTP cyclohydrolase 1 type 2, NIF3 familyCoenzyme transport and metabolism [H] 0.88
COG1695DNA-binding transcriptional regulator, PadR familyTranscription [K] 0.88
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 0.88
COG1764Organic hydroperoxide reductase OsmC/OhrADefense mechanisms [V] 0.88
COG1765Uncharacterized OsmC-related proteinGeneral function prediction only [R] 0.88
COG1846DNA-binding transcriptional regulator, MarR familyTranscription [K] 0.88
COG3323PII-like insert in the uncharacterized protein YqfO, YbgI/NIF3 familyFunction unknown [S] 0.88
COG4636Endonuclease, Uma2 family (restriction endonuclease fold)General function prediction only [R] 0.88


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A92.04 %
All OrganismsrootAll Organisms7.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300009089|Ga0099828_10025182All Organisms → cellular organisms → Bacteria4715Open in IMG/M
3300011403|Ga0137313_1003899All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes2825Open in IMG/M
3300011445|Ga0137427_10040253All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium1842Open in IMG/M
3300012349|Ga0137387_10028753All Organisms → cellular organisms → Bacteria3564Open in IMG/M
3300012354|Ga0137366_10023178All Organisms → cellular organisms → Bacteria4834Open in IMG/M
3300012923|Ga0137359_10621821All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium946Open in IMG/M
3300015241|Ga0137418_10010781All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes8405Open in IMG/M
3300026296|Ga0209235_1038364All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium2408Open in IMG/M
3300028536|Ga0137415_10006328All Organisms → cellular organisms → Bacteria11736Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil57.52%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil11.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil5.31%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.42%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.54%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.65%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.77%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.77%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.77%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.89%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.89%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.89%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.89%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.89%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.89%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.89%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.89%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005206Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009678Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011403Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT166_2EnvironmentalOpen in IMG/M
3300011445Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25382J43887_1028650333300002908Grasslands SoilVTYGPLALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLS
Ga0062590_10243237513300004157SoilVSHGLPALREDVAAPRQRGVSPGSAAPTGSDRTLLLECRLEQLRGALDEARAEADQARVRLAEAAAREAGETRRLT
Ga0063356_10001080313300004463Arabidopsis Thaliana RhizosphereMPTGSDRTLLLECRLEQLRGALDEARAEADQARVRLAAAAAREAGETRRLSLLQDEVARAREEVAALHRRLEHSEALRAK
Ga0062592_10046540633300004480SoilMTHGPALREGVTAPRRGSLSAGSATPAGSDRTVLLECRLEQLRGALDEARAEADLARVRLAEAAAREVGETQRLSLLQDEVAQAR
Ga0066676_1023878523300005186SoilLREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETLRLSV
Ga0068995_1002388513300005206Natural And Restored WetlandsVTHGPPALREGITAPPRPTGSDRTLLLECRLEQLRGALDEARAEADQARVRLAAAAAREAGETRRLSLLQDEVARAREEVAALHR
Ga0070700_10110075513300005441Corn, Switchgrass And Miscanthus RhizosphereVGSDRTVLLECRLEQLRAALDEARADADQARIRLAEAAARETGETQRLSALQGELARAREEVAALHRRLEHSEALRAKLQGHLIESEPRED
Ga0070696_10094928113300005546Corn, Switchgrass And Miscanthus RhizosphereVRYGPFALREGVTAPRRSGVSPVSGGAAPTGSDRTLLLECRLEQLRGALEEARAEADQARIRL
Ga0066704_1038653413300005557SoilVTYGPFALREGVTAPRRSGVSPASAPGGSDRTLLLECRLEQLRGALDEARA
Ga0066699_1115502013300005561SoilLREGVTAPRRSTPTGGGSGGSDRTLLLECRLEQLRGALDEARAEADQARIRL
Ga0066659_1180677613300006797SoilLREGITAPRRSTPAGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREASETQR
Ga0079221_1022337133300006804Agricultural SoilLREGVTAPRRSTATEGGSGGSDRTLLLECRLEQLRGALDEARAEADQARVRLAEAAVREAGETRRLSLLQDELARAREEVAALHRRLEHSEALRAKL
Ga0079220_1198018513300006806Agricultural SoilVTHGPLAPREGIIGSDRTFLLECRLDQLRGALDEARAEADQARVRLAEALAREAGETRRLSLLQDELARAREEVAALHRRLEHSEALRAKLQGHLFE
Ga0075426_1079925423300006903Populus RhizosphereVTHGPFALREGVTAPRRSGVAPGSAPTGGGSGGSDRTLLLECRLEQLRGALDEARAEADQ
Ga0099793_1026691113300007258Vadose Zone SoilLREGVTASRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAA
Ga0099793_1041447613300007258Vadose Zone SoilVKDGPLALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAE
Ga0099793_1041766913300007258Vadose Zone SoilLREGVTAPRRSPPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHR
Ga0066710_10279946523300009012Grasslands SoilVTYGPFALREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEA
Ga0099830_1053862423300009088Vadose Zone SoilLREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVA
Ga0099828_1002518263300009089Vadose Zone SoilVTYGPFALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREA
Ga0099828_1087086523300009089Vadose Zone SoilVTYGPFALREGVTAPRQSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREA
Ga0099827_1006441933300009090Vadose Zone SoilLREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLA
Ga0111538_1093542623300009156Populus RhizosphereLREGVTAPRRSGVSPVSGGAAPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLA
Ga0111538_1206710513300009156Populus RhizosphereVGSDRTVLLECRLEQLRGALDEARADADQARVRLAEAASRETGETQRLSALQGELARAREEVAALHRRLEHSEALRAKLQGHLIESEPREDA
Ga0105252_1021053813300009678SoilLPGSAAPTGSDRALLLECRLEELRGALDEARAEADQARVRLAESASREAGETRRLSLLQDEVARARDEVAALHRRLEHSE
Ga0134070_1001493233300010301Grasslands SoilVTYGPFALREGVTAPRRSTPVGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETLRLSVLQDEVARARAEVAALHRRL
Ga0134084_1016936223300010322Grasslands SoilLREGVTAPRRSGVSPGPAPPTSSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREA
Ga0134084_1042359723300010322Grasslands SoilVTYGPFALREGVTAPHRSTATGGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAAL
Ga0134084_1047825223300010322Grasslands SoilLREGVTAPRPSTAAGSDRTLLLECRLEQLRGALDEARAEADQAR
Ga0134065_1012495113300010326Grasslands SoilVTYGPFALREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARADADQARIRLAEAAARE
Ga0134111_1025721113300010329Grasslands SoilLREGVTAPRRSTPTAGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVVQDEVARARA
Ga0134111_1033695613300010329Grasslands SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAR
Ga0134111_1040535523300010329Grasslands SoilVTYGPFALREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0134071_1037464323300010336Grasslands SoilLRAGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHS
Ga0134127_1175288823300010399Terrestrial SoilVRYGPFALREGVTAPRRSGVSPVSGEAAPTGVGSDRTVLLECRLEQLRAALDEARADADQ
Ga0137313_100389943300011403SoilMAVTAGSVVPTGSDRTLLLECRLEQLRGALDEARAEADQARVRLAQAAAREAGETRRMSLLQDEVAR
Ga0137427_1004025313300011445SoilMAVTAGSVVPTGSDRTLLLECRLEQLRGALDEARAEADRARVRLAQAAAREAGETRRMSLLQDEVARAREEVAALHRRLEHSEALR
Ga0137427_1036550923300011445SoilLREGVTAPRSGVGSDRTLLLECRLEQLRGALDEARAEADQARVRLAEAAAREAGETRRMSLLQDEVARAREEVAALHRRLEHSEALR
Ga0137389_1016521913300012096Vadose Zone SoilVTYGPFALREGVTAPRRSAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAE
Ga0137388_1105938923300012189Vadose Zone SoilLREGVTAPRRSGISPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARI
Ga0137364_1018438023300012198Vadose Zone SoilVTHGSFALREGVTAPRRSPPAGSDRTLLLECRLEQLRGALDEARAEADQARLRLAEAAAREAAETLRLSVLQDEVARARA
Ga0137364_1050649323300012198Vadose Zone SoilLREGVTAPRRSGVSPGPAPPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREA
Ga0137383_1012290533300012199Vadose Zone SoilVTYGPFALREGVTAPHRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLA
Ga0137399_1017114913300012203Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEAL
Ga0137399_1083476423300012203Vadose Zone SoilLLECRLEQLRGALDEARAEADQARVRLAEAAAREAGETRRLSVLQDEVARAREEVAALHRRLEHSEALRAKLQG
Ga0137399_1152576513300012203Vadose Zone SoilMHFTIVTYGPFALREGVTTPRRSTPTGGGSGGSDRTLLLECRLEQL
Ga0137374_1017813713300012204Vadose Zone SoilLREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARI
Ga0137374_1070344523300012204Vadose Zone SoilLREGVTALRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137380_1104339723300012206Vadose Zone SoilLREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137381_1006568313300012207Vadose Zone SoilVTHGPFALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADLARIRLAEAAAREAGETLRLSVLQDEVARARAEVAALHRRLEHSEA
Ga0137381_1053301323300012207Vadose Zone SoilLREGVTAPRRSGVSPGPAPPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137381_1074107923300012207Vadose Zone SoilLREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEA
Ga0137376_1010798113300012208Vadose Zone SoilVTHGPFALREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAE
Ga0137376_1011647833300012208Vadose Zone SoilLREGVTAPRRSGVSPGPVAPTGSDRTLLLECRLEQLRGALDEARAEADQARIRL
Ga0137379_1125883023300012209Vadose Zone SoilLREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLE
Ga0137378_1071848813300012210Vadose Zone SoilVTYGPFALREGVTAPRRSTPTGGGSGGSDRTLLLECRLEQLRGALDEARAEADQAR
Ga0137370_1050270713300012285Vadose Zone SoilLREGITAPRSGVGSDRTLLLECRLEQLRGALDEARAEADQARVRLAEAAAREAGETRRMSLLQDEVARAREEVAALHRRLEQSEALRAKLQGHLFES
Ga0137370_1053403913300012285Vadose Zone SoilVTHGPFALREGVTAPRRSPPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEA
Ga0137387_1002875313300012349Vadose Zone SoilLREGVTAPRRSTPAGSERTLLLECRLEQLRGALDEARAEADQARVRLAEAAAREAGETQRLSVLQDEVARAR
Ga0137387_1105889513300012349Vadose Zone SoilVTYGPFALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEA
Ga0137372_1009654533300012350Vadose Zone SoilVTYGPFALREGVTAPHRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAG
Ga0137386_1082631323300012351Vadose Zone SoilLREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEALRA
Ga0137366_1001517163300012354Vadose Zone SoilLREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137366_1002317813300012354Vadose Zone SoilVTYGPFALREGVTAPHRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137369_1038590623300012355Vadose Zone SoilLREGVTAPRRSTPTGGGSGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137369_1073132823300012355Vadose Zone SoilVTHGPFALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRL
Ga0137384_1116192213300012357Vadose Zone SoilLREGVTAPRRSGVSPGPAPPTGSDRTLLLECRLEQLRGALDEARAEADQARVRLAEAAAREAGETLRLSVLQDEVARARAEVAALHRRLEHSEALRAKLQ
Ga0137368_1034290823300012358Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0137375_1086431923300012360Vadose Zone SoilLREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAAL
Ga0137361_1059741013300012362Vadose Zone SoilVTHGPFALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSV
Ga0137398_1005219813300012683Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETRRLGALQD
Ga0137397_1051163923300012685Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLA
Ga0137397_1057735823300012685Vadose Zone SoilLREGVTAPRRSTPASSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREVGET
Ga0137397_1113407723300012685Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEALRAK
Ga0137396_1008802513300012918Vadose Zone SoilVTHGPFALREGVTAPRRSGVSPTSAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQR
Ga0137396_1051013233300012918Vadose Zone SoilLRQGVTAPRRSPPAGSDRTLLLECRLEQLRGALDEARAEADQA
Ga0137396_1089832013300012918Vadose Zone SoilVSDGPPGFREFTHFIIVKYGPFALREGVTAPRRSPPAGSDRTLLLECRLEQLRGALDEAR
Ga0137394_1005030033300012922Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREVGETQRLSVLQDEVARARAEV
Ga0137394_1075228313300012922Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRR
Ga0137394_1150276913300012922Vadose Zone SoilLREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLGVL
Ga0137359_1062182133300012923Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEAL
Ga0137419_1002696453300012925Vadose Zone SoilLREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEA
Ga0137419_1009863613300012925Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVA
Ga0137419_1115532613300012925Vadose Zone SoilVTHGPFALREGVTAPRRSTPSGSDRTLLLECRLEQLRGAVDEARAEADQARIRLAEAAAREAGETLRLSVLQDE
Ga0137419_1125021723300012925Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLA
Ga0137416_1150091123300012927Vadose Zone SoilLREGITAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREVGETQRLSVLQDEVARARAEV
Ga0137404_1203734023300012929Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETRRLSLL
Ga0137410_1026912313300012944Vadose Zone SoilVTHGPFALREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARADADQARIRLAEAAAREAGETQRLSVLQDEVARAR
Ga0137410_1041580813300012944Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAALHRRLEHSEALRAK
Ga0137410_1092303323300012944Vadose Zone SoilVTYGPFALREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRL
Ga0134077_1035488423300012972Grasslands SoilLREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0134078_1053624423300014157Grasslands SoilVTYGPFALREGVTAPHRSTPTGGGSGGSDRTLLLECRLEQLRGALDEARAEADQARLRLAEAAGREAAETQRLSVLQDEI
Ga0137420_113560333300015054Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVASLHRR
Ga0137418_10005447133300015241Vadose Zone SoilLREGVTAPLRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVLQDEVARARAEVAALH
Ga0137418_1001078173300015241Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARTRLAEAAAREAGETQRLSVLQDEVARARAEVAALH
Ga0137409_1009968013300015245Vadose Zone SoilLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVL
Ga0134089_1036115013300015358Grasslands SoilLREGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEADQA
Ga0134089_1049265823300015358Grasslands SoilLRAGVTAPRRSTPTGSDRTLLLECRLEQLRGALDEARAEAD
Ga0132257_10161465423300015373Arabidopsis RhizosphereLREGVTAPHRSGVSPVSAGAAPTAVGSDRTVLLECRLEQLRGALDEARADADQA
Ga0184618_1004134023300018071Groundwater SedimentLREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLA
Ga0137408_138262323300019789Vadose Zone SoilVTYGPFALREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRVAEAAAREAG
Ga0193723_105378143300019879SoilVTYGPFALREGVTAPRRNGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRL
Ga0193735_110296823300020006SoilVTYGPFALREGVTAPRSGVGSDRTFLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETRRVSLLQDEVARAREEVAALHRRLEQSE
Ga0210378_1004989223300021073Groundwater SedimentLREGVTAPRRSGVSPASAAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAG
Ga0209235_103836413300026296Grasslands SoilVTYGPFALREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIQLA
Ga0209236_100741313300026298Grasslands SoilVTYGPFALREGVTAPRRSGVSPASAPGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAA
Ga0209468_101499133300026306SoilVTYGPFALREGVTAPRRSTPAGSDRTLLLECRLEQLRGALDEARADADQARIRL
Ga0209761_102051913300026313Grasslands SoilLREGVTAPRRSTPTGGGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGET
Ga0209378_105224913300026528SoilLREGVTAPRRSGVSPASAPGGSDRTLLLECRLEQLRGALDEARAEADQA
Ga0137415_10006328123300028536Vadose Zone SoilVKHGPFALREGVTAPRRNGVSPGSAAPVGSDRTLLLECRLEQLRGALDEAR
Ga0307293_1025417213300028711SoilLREGVTAPRRSPPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAE
Ga0307473_1050544923300031820Hardwood Forest SoilVTYGLLALREGVTAPRRSGVSPGSPPAGSDRTLLLECRLEQLRGALDEARAEADQARIRLAEAAAREAGETQRLSVIQDEVA
Ga0326723_0476388_304_5703300034090Peat SoilMTHGPALREGVTAPRRGSLSAGSATPAGSDRTVLLECRLEQLRGALDEARAEADLARVRLAEAAAREVGETQRLSLLQDEVAQAREEVA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.