NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F042849



Overview

Basic Information
Family ID F042849
Family Type Metagenome
Number of Sequences 157
Average Sequence Length 116 residues
Representative Sequence VGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSSRLRALRPREVVVFDLGL
Number of Associated Samples 118
Number of Associated Scaffolds 157

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 106
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.58


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (37.580 % of family members)
Environment Ontology (ENVO) Unclassified (50.955 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (68.153 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 45.45%, β-sheet: 9.79%, Coil/Unstructured: 44.76%

Predicted 3D Structure

pTM-score: 0.58

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
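The rule in the note above can be written as a one-line check. This is only a minimal sketch: the 0.7 cutoff and the 0.58 score come from this page, while the function and constant names are hypothetical and not part of NMPFamsDB.

```python
# Cutoff quoted in the "Low Quality Model" note on this page.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff (hypothetical helper)."""
    return ptm_score < threshold


# pTM-score reported for family F042849.
print(is_low_confidence(0.58))  # prints True
```

A score exactly at the cutoff is not flagged here, which matches the strict "pTM < 0.7" wording of the note.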



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 157 Family Scaffolds
PF00486 | Trans_reg_C | 86.62
PF07730 | HisKA_3 | 2.55
PF13185 | GAF_2 | 2.55
PF03989 | DNA_gyraseA_C | 1.91
PF02801 | Ketoacyl-synt_C | 1.27
PF02518 | HATPase_c | 1.27
PF01966 | HD | 0.64
PF02347 | GDC-P | 0.64
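The Pfam frequencies are percentages of the 157 family scaffolds, so each one corresponds to a whole number of scaffolds. The sketch below recovers those approximate counts by rounding. It assumes the percentages were computed against exactly 157 scaffolds; the helper name is hypothetical, and the results are estimates rather than counts published by the database.

```python
# Number of family scaffolds stated in the table header on this page.
FAMILY_SCAFFOLDS = 157

# Pfam ID -> (name, % frequency), copied from the Pfam table on this page.
PFAM_FREQUENCIES = {
    "PF00486": ("Trans_reg_C", 86.62),
    "PF07730": ("HisKA_3", 2.55),
    "PF13185": ("GAF_2", 2.55),
    "PF03989": ("DNA_gyraseA_C", 1.91),
    "PF02801": ("Ketoacyl-synt_C", 1.27),
    "PF02518": ("HATPase_c", 1.27),
    "PF01966": ("HD", 0.64),
    "PF02347": ("GDC-P", 0.64),
}


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Return the nearest whole number of scaffolds for a rounded percentage."""
    return round(percent / 100 * total)


for pfam_id, (name, percent) in PFAM_FREQUENCIES.items():
    count = percent_to_count(percent)
    print(f"{pfam_id} {name}: about {count} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under this assumption, Trans_reg_C neighbors about 136 of the 157 scaffolds, while each of the rarest domains corresponds to a single scaffold.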

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 157 Family Scaffolds
COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 2.55
COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 2.55
COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 2.55
COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 2.55
COG0188 | DNA gyrase/topoisomerase IV, subunit A | Replication, recombination and repair [L] | 1.91
COG0403 | Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain | Amino acid transport and metabolism [E] | 0.64
COG1003 | Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain | Amino acid transport and metabolism [E] | 0.64



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 37.58%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 17.83%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 8.92%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 7.01%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 4.46%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.18%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.55%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.55%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.27%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.27%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.27%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 1.27%
Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil | 1.27%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 1.27%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.64%
Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Soil | 0.64%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.64%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.64%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2088090004 | Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Permafrost Layer P1 | Environmental
3300000597 | Forest soil microbial communities from Amazon forest - 2010 replicate II A1 | Environmental
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006864 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 3 DNA2013-193 | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011991 | Permafrost microbial communities from Nunavut, Canada - A34_65cm_12M | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300014056 | Permafrost microbial communities from Nunavut, Canada - A20_5cm_0M | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026291 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049 (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026308 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental
3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental
3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental
3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental
3300027546 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M3 (SPAdes) | Environmental
3300027565 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes) | Environmental
3300027591 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes) | Environmental
3300027629 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M3 (SPAdes) | Environmental
3300027633 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes) | Environmental
3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300028803 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120 | Environmental
3300031671 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-1 | Environmental
3300031672 | Soil microbial communities from Risofladan, Vaasa, Finland - OX-2 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031770 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
P1_DRAFT_00191840 | 2088090004 | Soil | VESATGSSSPSPELAAALHRLRSTLARLKAELELAQVDGTLPPAERLLGDLNEALVLLRAVEQAAFGLVAILVVDDDARLAELTARGLRRLGXXXRQVEGGA
AF_2010_repII_A1DRAFT_101790281 | 3300000597 | Forest Soil | VGSATGSSSRAELAAALHRLRSTLARARAELELVEAEGGVPPVAALVADLREALELLGDVESLGLGIVRVLVVDDDERLGELTARGLRRLGYEAER
JGI12635J15846_101215571 | 3300001593 | Forest Soil | VGLATGSSPSPELAAALHRLRSTLARLKAELELSQSDEIAPPVKRALEDLNEAFTLLRAVEVAAFGLIEVLVVDDDERLAELTARGLRRSGYEAESAGALRELRPN
JGI25390J43892_100900051 | 3300002911 | Grasslands Soil | VGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRL
Ga0066674_100089281 | 3300005166 | Soil | VGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDASERGTLKAAHPIVVTGA
Ga0066672_106104541 | 3300005167 | Soil | VGLVTGSSSRPETAAALHRLRSTLARMRAELEIAQSDRAAPPVDRLIEDLGEALELLGEVESVALAIVHVLVVDDDERLGELTARGLRRLGFD
Ga0066688_100650561 | 3300005178 | Soil | MHRLRSTLARARAELELARTDGDPLPVDKLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDADQLVALRAARPIVVTGAADPASRALADDLGASD
Ga0066688_106108391 | 3300005178 | Soil | VGLVTGSSSRPETAAALHRLRSTLARMRAELEIAQSDREAPPVDRLIEDLGEALELLGEVESVALAIVHVLVVDDDERLGELTARGLRRLG
Ga0066678_100764081 | 3300005181 | Soil | VGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSSRLRALRPREVVVFDLGL
Ga0066675_100548365 | 3300005187 | Soil | LSSNPELAGALHRLRSTLARMRAELELAHEDGSGPPVDRLLEDVSDALEQLGRVEAVALEVVSVLVVDDDERLAELTARGLRRM
Ga0066675_103505911 | 3300005187 | Soil | LHRLRSTLARLRAELEVAESDGSVPPVARLLSDLREALDLLENVESAALGVVRVLVVDDDERLGELTARSLRRLGFDAEAADRLRALRAGEVVVFDLGVSASLTSADRAVLRAARPVVVTGAVDS
Ga0066675_104879881 | 3300005187 | Soil | LATGSSPGPEARAELAEGLHRLRSTLARVRAELEVAIGDGEGPPVERLLSDLREALELVGDVEAAALGVVRVLVVDDDERLGELTARGLRRMGYDAEASGRLR
Ga0070705_1012493391 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | VGSGTGSSSRTELAAALHRLRSTLARARAELELAEADGAPPPVDRLLGDLTEALGLLGAVEAAAFAIVPVLVVDDDQRLAELTARGLRRLGYEAESAGRMRALRPREVVVFDLGLYLGLDSAERAALKAARPIVVTGAADPTSRALAADLGA
Ga0070694_1003931802 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VGSGTGSSSRTELAAALHRLRSTLARARAELELAEADGAPPPVDRLLGDLTEALGLLGAVEAAAFAIVPVLVVDDDQRLAELTARGLRRLGYEAESAG
Ga0070694_1012458402 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VGLATGSSSRPELAAALHRLRSTLARAKAELELAEADGEAPNAKRLLGDLSEALDLLGQVEAAALSIVPVLVLDDDERLAELTARGLRRLGYDAEASSRLRPLKPGEVVVLDLGMTASL
Ga0066686_100255301 | 3300005446 | Soil | VGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGKVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGVLSSLNPAERDVLKAARPVVLTGATDPA
Ga0066686_107210911 | 3300005446 | Soil | VGLATGSSSRPELAAALHRLRSTLARAKAELELAEADSETLLKERLLGDLSEALQLLGQVEAAALSIVPVLVLDDDERLGELMARGLRRLGYDAESASRLRPL
Ga0066682_106068782 | 3300005450 | Soil | VGSATSSSSRPELAAALHRLRSTLARARAELELARPDGDPVLVERLLGDLREALDVLGLVETAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAARLRALRPREVVVFDLGVLSSLNPAERDVLKAARPVVLTGATDPASRAL
Ga0066687_104305942 | 3300005454 | Soil | LSSNPELAGALHRLRSTLARMRAELELAHEDGLGPPVDRLLEDVSDALEQLGRVEAVALEVVSVLVVDDDERLAELTARGLRRMGYDAESSARLRPLRAGEVVVFDLGVTGSLDAVGKERLRSARPIIVTGAADPASRAMAEDLDAS
Ga0070707_1003242861 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | LSSNPELASALHRLRSTLARMRAELELAHDDASTPPVDRLLTDLSESLQLLGRVEVAALGLVPVLVVDDDERLAELTARGLRRLGYEAESAGRLRPLRHREVVVFDLGVSTSLDVSERAA
Ga0070707_1005784561 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | LHRLRSTLARVRAELEVAETGGEAPPVVRLLADLREALETLGNVESAALGVVRVLVVDDDERLGELTARGLRRLGFDAESMTGLRSLRPREVVVFDLGVAGTLSRTERAVLRGSRPVVLTGAVDSA
Ga0070707_1009968332 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VGSDIASSSRTELAGALHRLRSTLARARAELELAQTDGEPVPADRLVGDLGEALELLGAVEAAAFAIVPVLVLDDDDRLAELTARGLRRLGYEAESAGRLRALRPREVVVFDLG
Ga0070707_1016720051 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | LHRLRSTLARAKAELEFAAEDGEPLPAERVLGDLREALDLVAQVESAALAIVPVLVLDDDARLAELTARGLRRLGYEAEPAGRLRALRPRELVVFDLGMSGSLDT
Ga0070698_1002488083 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | LHRLRSTLARVRAELEVAETGGEAPPVVRLLADLREALETLGNVESAALGVVRVLVVDDDERLGELTARGLRRLGFDAESMTGLRSLRPREVVVFDLGVAGTLSRTERAVLRGSRPVVLTGAVDSASRALAED
Ga0070698_1010460611 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | VESATGSSPSPELPAALHRLRSTLARLKAELELVQADGKGAPAERLLGDLNEALVLLQAVEQAAFGVVPILVLDDDARLGDLTARGLRRLG
Ga0066697_108048541 | 3300005540 | Soil | VGLATGSSSRPELAAALHRLRSTLARAKAELELAQADGETPLKERLLGDLSEALQLLGQVEAAALSIVPVLVLDDDERLGELIARGLRRLGYDAESASRLRPLKPGEIVILDLSLTASLTAADHVALRSARPIVVTGSADPHSRAMAED
Ga0070732_102293562 | 3300005542 | Surface Soil | MKAEVELAGLDGTQVTVDRLQGDLREALSLLGAVESAAYAVGPVLVVDDDERLGELTARGLRRLGFEADSRASLRPLRPREVVVFDLGLAASLSAGDGTALRAARPIVVTGAADPASRALAAS
Ga0070732_107664532 | 3300005542 | Surface Soil | MKAELELADPDAETPTSRRLISDLREALELLSAVEAAALGLVRVLVIDDDERLAELTARGLRRLGYDAESGSRFRAPRPREVVVFDLGLAAGLAAGERAALKAARPIVVTGATDAASRAV
Ga0066701_100429344 | 3300005552 | Soil | VPVDRLLGDLQEALDLLSRVETAAFAIVSVLVVDDDERLAELTARGLRRMGYEADSAPRLRKLRSREVVVFDLGACASLDASQRSALKTARPIIVTGSSDPASRAIAEDL
Ga0066695_104909771 | 3300005553 | Soil | VGSATSSSSRPELAAALHRLRSTLARARAELELARSDGDSVPVDRLLGDLQEALDLLSRVETAAFAIVSVLVVDDDERLAELTARGLRRMGYEADSAPRLRKLRSREVVVFDLGASASLDASQRSALKTARPIIVTGSSDPASRA
Ga0066695_105162782 | 3300005553 | Soil | VGLATGSSSRPELAAALHRLRSTLARAKAELELAEADSETLLKERLLGDLSEALQLLGQVEAAALSIVPVLVLDDDERLGELMARGLRRLGYDAESAS
Ga0066661_100441735 | 3300005554 | Soil | MHRLRSTLARARAELELARTDGDPLPVDKLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGL
Ga0066707_101019333 | 3300005556 | Soil | VGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDASERGTLKAAHPIVVT
Ga0066707_103132672 | 3300005556 | Soil | VESVTGSSSRAELSGALHRLRSTLARLRAELEVAESNGEPPPVDRLLSDLREALELLGNVESAALEVVRILVVDDDERLAELTARGLRRLGYDAEAAGRLRALRPREVVVFDLGVSAALSPAERAALRAAHPIVVTGAADPASRALARDL
Ga0066670_101512723 | 3300005560 | Soil | VGSGTGSSSRTELAAALHRLRSTLARVRAELELAEADGDSPPVGRLRADLSEALELLGAVEAAAFGIVPVLVLDDDERLAELTARGLRRLGYEAESAGRIRALRPHEVVVFDLGLSASLDSAQRATLK
Ga0066670_103770291 | 3300005560 | Soil | LSSNPELAGALHRLRSTLARMRAELELAHEDGSRPPVDRLLEDVSDALEQLGRVEAVALEVVSVLVVDDDERLAELTARGLRRMGYDAESSARLRPLRPGEVVVFDLGVTGSLDV
Ga0066699_101816101 | 3300005561 | Soil | MSSGTEPGPELAAALHRLRSTLARMRAELELAQSDLQGPPVERVLGDLGEALGYLGEVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAGRLRALRPREVVVFDLSLSDSLDSAGLAAIRAARPIVVTGATDPGSRA
Ga0066693_102996451 | 3300005566 | Soil | MRAELELAHEDGSGPPVDRLLEDVSDALEQLGRVEAVALEVVSVLVVDDDERLAELTARGLRRMGYDAESSARLRPLRAGEVVVFDLGVTRSLDVVGMELLRSARPIIVTGAADPASRAMAEDLDASAYLVK
Ga0066702_100816891 | 3300005575 | Soil | LSSNPELAGALHRLRSTLARMRAELELAHEDGLGPPVDRLLEDVSDALEQLGRVEAVALEVVSVLVVDDDERLAELTARGLRRMGYDAESSARLRPLRPGEVVVFDLGVTRSLDVVGMELLRSARPIIVTGAADPAS
Ga0066706_105935871 | 3300005598 | Soil | LGTGSPSRTEAELAAALHRLRSTLARMRAELELAQSDGEDPPIDRVLGDLGEALHYLGDVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAASLRALRPREVVVLDLSLCDSLDKAGLAT
Ga0066903_1041288871 | 3300005764 | Tropical Forest Soil | MRAELELARSDEASPLVDRLLGDLQEALTLLGRVESAALGIVPVLVLDDDSRLGELTARSLRRAGFDADSADCFRELRPGEVVVFDLGLIASLDVNERSALTASRPIVVTGAADSGSRALAEEIGASDYLI
Ga0066651_104555992 | 3300006031 | Soil | LATGSSSRPELAAALHRLRSTLARAKAELELAQADGETPLKERLLGDLSEALHLLGQVEAAALSIVPVLVLDDDERLGELIARGLRRLGYDAESASRLRPLKPGE
Ga0066696_102481971 | 3300006032 | Soil | VRAEIELAHSDGDPPPVDRLLVDLHEALDLLGKVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDASERGTLKAAHPIVVTGATDPGSRAVADDL
Ga0066656_102143571 | 3300006034 | Soil | LATGSSSSPEPRAELAQALHRLRSTLARMRAELEVAEGDGGALPVDRLLSDMREALEALGNVESTALGVVRVLVVDDDERLGELTARGLRRLGYDAESSLAMRTLRPREVVVFDLSVAPSLDEPSRAALR
Ga0066656_103629281 | 3300006034 | Soil | VGSATSSSSRPELAAALHRLRSTLARARAELELARPDGDPVLVERLLGDLREALDVLGLVETAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAAR
Ga0066656_104664771 | 3300006034 | Soil | VAIGDGEGPPVERLLSDLREALELVGDVEAAALGVVRVLVVDDDERLGELTARGLRRMGYDAEASGRLRTLRAREVVVFDLGVLNSLEAEEHAALTVARPIVVTGAADPGARALADNLGASDYL
Ga0079222_104381502 | 3300006755 | Agricultural Soil | MRAELELAQDDASTPPVDRLLADLSESLQLLGRVEAVALGLVPVLVVDDDERLAELTARGLRRLGYEAESAGRLRPLRQKEVVVFDLGVSTSLDVSERAAL
Ga0066659_111471141 | 3300006797 | Soil | MSSGTEPGPELAAALHRLRSTLARMRAELELAQSDLQGPPVERVLGDLGEALGYLGEVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAGRLRALRPREVVVFDLSLSDSLDS
Ga0066659_112971851 | 3300006797 | Soil | VGSDTGSSSRTELAGALHRLRSTLARARAELELAEADGDAPPVDKLLGDLGEALDLLGAVETAAFAIVHVLVLDDDDRLAELMARGLRRRGYEAESAGRMRQ
Ga0066660_107075941 | 3300006800 | Soil | VGSATSSSSRPELAAALHRLRSTLARARAELELARPDGDPVLVERLLGDLREALDVLGLVETAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAARLRALRPREVVVFDLGVLSSLNPAERDVLKAARPVVLTGATDPASRALADEI
Ga0066660_110825091 | 3300006800 | Soil | LVTGSSSRPETAAALHRLRSTLARMRAELEIAQSDREAPPVDRLIEDLGEALELLGEVESVALAIVHVLVVDDDERLGELTARGLRRLGFDAESAGRLRTLRPREVV
Ga0075425_1012915652 | 3300006854 | Populus Rhizosphere | VGLATGSSSRPELAAALHRLRSTLARARAELELAQADGKAPSADRLLADLTEALHFLGQVEAAALSIVPVLVLDDDERLGELTARGLRRLGYDAESSGRLRALKPGEV
Ga0075425_1017022701 | 3300006854 | Populus Rhizosphere | MRAELEVAEGDGGAPPVDRLLSDMREALELLGNVESTALGVVRVLVVDDDERLGELTARGLRRLGYDAESSVGMRTLRPREVVVFDLSVAPSLDDPSRAALRLSRPIVL
Ga0066797_10930432 | 3300006864 | Soil | VESATGSSPSPELAAALHRLRSTLARLKAELELAQADGTVPPAERLLGDLNEALVLLRAVEQAAFGLVSVLVVDDDVRLAELTARGLRRLGYEADSADAFRELRPGEV
Ga0075434_1009066292 | 3300006871 | Populus Rhizosphere | VGSATSSSSRSELAAALHRLRSTLARARAELELARSDSDSVPVDRLLGDLQEALDLLGRVEAAAFAIVSVLVVDDDERLAELTARGLRRMGFEADSAPRLRALRPGEVVVFDLGASASLDASERRALKTARPIIVTGSSDPASRAIAEDLDASAYLVKP
Ga0079219_116458952 | 3300006954 | Agricultural Soil | VGSGTGSSSGHSRGSELAAALHRLRSTLARVRAELEVAETGGEAPPVVRLLADLREALETLGNVESAALGVVRVLVVDDDERLGELTARGLRRLGFDAESMTGLRSLRPREVVVFDLGVAGTLSGTDRAILRGSRPVVLTGAVDSASRALAED
Ga0099793_100188815 | 3300007258 | Vadose Zone Soil | LSSRAEPGPELAAALHRLRSTLARGRAELELAQADAEGPPVDRLLGDLREALDLLGQVESAAFAIVPVLVIDDDERLAELTARGLRRLGYEAEAAGRLRALRPREVVVLDLGVSAHMDGAERDALRAARPIVVTGAA
Ga0066710_1009078883 | 3300009012 | Grasslands Soil | VESVTGSSSRAELSGALHRLRSTLARLRAELEVAESNGEPPPVDRLLSDLREALELLGNVESAALEVVRILVVDDDERLAELTARGLRRLGYDAEAAGRLRALR
Ga0099828_104625351 | 3300009089 | Vadose Zone Soil | MRAEVELADSDGAAPPIERLLSDLREALEMLGQVESAAFDLVRVVVLDDDERLGELTARGLRRLGYEAESSISMRPLRPRDVVVLDLGLVESFDPAQRAAVKKARPIVVTGAADPGSRA
Ga0099828_109244782 | 3300009089 | Vadose Zone Soil | VRAELELAQADAEGPPVERLLGDLREALDLLGQVESAAFAIVPVLVIDDDERLAELTARGLRRLGYEAEAAGRLRALRPREVVVLDLGVSAHMDGAERDAL
Ga0099827_112381171 | 3300009090 | Vadose Zone Soil | MSWVIEPAPELAAALHRLRSTLARMRAELELAQSDGQGPPVERVLGDLGEALQHLGEVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAGRLRALRPHEVVVFDLSLS
Ga0066709_1000061021 | 3300009137 | Grasslands Soil | VGSGTGSSSRTELAAALHRLRSTLARVRAELELAEADGDSPPVGRLRADLSEALELLGAVEAAAFGIVPVLVLDDDERLAELTARGLRRMGYEAESAGRIRALRPHEVVVFDLSLSASLDSAQRATLKTARPIVVTGAADPRSRAIA
Ga0066709_10191214723300009137Grasslands SoilLDTGSSSRTELAGALHRLRSTLARARAELELADADGAAPPVDKLLGDLGEALDLLGAVETAAFAIVHVLVLDDDERLAELMARGLRRLGYEAESAGRMRQLRPREVVVFDLGLLASLDSAQRAAL
Ga0066709_10343639423300009137Grasslands SoilMHRLRSTLARARAELELARTDGDPVPVDRLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALR
Ga0075423_1079147413300009162Populus RhizosphereVGSATSSSSRSELAAALHRLRSTLARARAELELARSDSDSVPVDRLLGDLQEALDLLGRVEAAAFAIVSVLVVDDDERLAELTARGLRRMGFEADSAPRLRALRPGEVVVFDLGASASLDASERRTLKTARPIIVTGSSDPASRAIAEDLDASAYLVKPVELDEL
Ga0134109_1031924313300010320Grasslands SoilVRAELEVAIGDGEAPPVERLLSDLRQALEVLGDVEAAALGVVRVLVVDDDERLGELTARGLRRMGYDAEASGRLRSLRPREVVVFDLGVLNSLEAEEHAALTVARPIVVTGAADPGARALADD
Ga0134065_1022254423300010326Grasslands SoilLGTGSSSRPELAAALHRLRSTLARAKAELELAEAGSETPFKERLLGDLSEALQLLGQVEAAALSIVPVLVLDDDERLG
Ga0134111_1006970113300010329Grasslands SoilVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDASERGTLKAAHPIVVTGA
Ga0134111_1021075623300010329Grasslands SoilLDTGSSFRTELAGALHRLRSTLARARAELELADADGDAPPVDKLLGDLGEALDLLGAVETAAFAIVHVLVLDDDERLAELMARALRRLGYEAESAARMRTLRPREVVVFDLGVYPSLDAT
Ga0134063_1006027133300010335Grasslands SoilVGSATSSSSRPELAAALHRLRSTLARARAELELARPDGDPVLVERLLGDLREALDVLGLVETAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAARLRALRPREVVVCDTTPMSNTTTARGPNTRSRAADSAP*
Ga0134128_1212397313300010373Terrestrial SoilLRSTLARARAELELAEADGAPPPVDRLLGDLTEALGLLGAVEAAAFAMVPVLVVDDDQRLAELTARGLRRLGYEAESAGRMRALRPREVVVFDLGLYLDPDSTERAALKAARPIVVTG
Ga0134121_1170244723300010401Terrestrial SoilLATGSSSRPKLAAALHRLRSTLARAKAELELAEADDGESPLKQRLLGDLSEALQLLGDVEAAALSIVPVLVLDDDERLGELTARGLRRLGYDAESASR
Ga0137392_1120878113300011269Vadose Zone SoilVRAELELAQADAEGPPVERLLGDLREALDLLGQVESAAFAIVPVLVIDDDERLAELTARGLRRLGYEAEAAGRLRALRPREVVVLDLGVSAHMDGAERDA
Ga0120153_102142913300011991PermafrostVESATGSSPSPELAAALHRLRSTLARLKAELELSQSDETIFPVERALADLADAFSLLRAVEVAAFGLVAVLVVDDDERLAELTARGLRRRGYDAES
Ga0137389_1073433923300012096Vadose Zone SoilVRAELELAQADAEGPPVERLLGDLREALDLLGQVESAAFAIVPVLVIDDDERLAELTARGLRRLGYEAEAAGRLRALRPREVVVLDLGVSAHMDGAERDALRAARP
Ga0137364_1031205923300012198Vadose Zone SoilLATGSSPGPEARAELAEGLHRLRSTLARVRAELEMAIGDGEGPPVERLLSDLRQALEVLGDVEAAALGVVRVLVVDDDERLGELTARGLRRMGYDAEASGRLRSLRPREVVVFDLGVLNSLEAEEH
Ga0137364_1037078113300012198Vadose Zone SoilVGSGTGSSSRAELAAALHRLRSTLARVRAELELAEADGDSPPVGRLRADLSEALELLGAVEAAAFAIVPVLVLDDDERLAELTARGLRRMGYESESAG
Ga0137374_1100777923300012204Vadose Zone SoilLATGSSSRTELAAALHRLRSTLARARAELEVAQADGESPPADRLLGDLGEALALLEKVEAAAFSIVSVLVLDDDERLGELTVRSLRRLGYEAEFASRMRKLRPREIVVFDLGLSSSLDAGELATLRA
Ga0137380_1003007463300012206Vadose Zone SoilMSWVIEPGPELAAALHRLRSTLARMRAELELAQSDGQGPPVERVLGDLGEALQHLGEVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAGRLRALRPHEVVVFDLSLS
Ga0137380_1047332813300012206Vadose Zone SoilLALSDGDPPPVDRLLGDLQEALELLGQVEAAAFAIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDGSERSTLKAARPIVVTGATDPGSRAVADDLGA
Ga0137380_1070474623300012206Vadose Zone SoilLGTGSSSKTELAGALHRLRSTLARARAELELADADGAAPPVDRLLGDLGEALDLLEAVETAAFAIVHVLVLDDDERLAELTARGLRRLGYEAESA
Ga0137376_1027532213300012208Vadose Zone SoilLSEALDVLGLVEAAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAARLRALRPREVVVFDLGLLSSLNPAERDVLKAARPVVLTGATDPASRALADEI
Ga0137379_1010353463300012209Vadose Zone SoilLDTGSSFRTELAGALHRLRSTLARARAELELADADGAAPPVDRLLGDLGEALDLLEAVETAAFAIVHVLVLDDDERLGELTARGLRRLGYEAE
Ga0137377_1033363913300012211Vadose Zone SoilMRAELEIAQSDGQEPPVDRVLGDLGEALQHLGDVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAGRLRALRPREVVVFDLSLSDSLDSAGLAAIRAARPIVVTGATDPGSRAL
Ga0137377_1183889013300012211Vadose Zone SoilDTGSSFRTELAGALHRLRSTLARARAELELADADGAAPPVDRLLGDLGEALDLLEAVETAAFAIVHVLVLDDDERLGELTARGLRRLGYEAESAGRMRQLRPHDVVVFDLGLLASLDSAQRAALKAARPIVVTGAADPRSRAIADGIGAFDYLLKPIEMEELATAISRRIAEES
Ga0137387_1045568013300012349Vadose Zone SoilLATGSSSRPELAAALHRLRSTLARAKAELELAQADGETPLKDRLLGDLSEALQLLGQVEAAALSIVSVLVLDDDERLGELTARGLRRMGYDAEAASRLRALKPGEVVVLDLGLT
Ga0137369_1026180113300012355Vadose Zone SoilMGDLGEALALLEKVEAAAFSIVSVLVLDDDERLGELTARSLRRLGYEAEFASRMRTLRPREIVVLDLGLSSSLDAGELATLRATRPIVVTGAADPASRSLAEDLGAAAYLIKPVETADLA
Ga0137385_1037111413300012359Vadose Zone SoilLHRLRSTLARVRAELEIAEADGEAPPVDRLLADLREALETLGNVEAAALGVVRVLVVDDDERLGELTARGLRRLGFDAEWTTGLRSLRPREVVVFDLGVAGSLSGTDRAILRGSRPVVLTGAVDSASRALAE
Ga0137375_1013495443300012360Vadose Zone SoilLATGSSSRTELAAALHRLRSTLARARAELEVAQADGESPPADRLLGDLGEALALLEQVETAAFSIVSVLVLDDDERLGELTARSLRRLGYEAEFASRMRTLRPREIVVLDLGLSSSLDAGELATLRATRPIVVTGAADPASRSLAEDLGAAAYL
Ga0137390_1044305823300012363Vadose Zone SoilVGSDTGSPSSHSRGVELAAALHRLRSTLARVRAELEVAEADGETPPVVRLLADLREALETLGNVESAALGVVRVLVVDDDERLGELTARGLRRLGFDAEWTTGLRSLRPREVVVFDLGVAGSLSGTDRAILRGAR
Ga0137373_1078093713300012532Vadose Zone SoilLATGSSSRPELAAALHRLRSTLARAKAELELAQADGETPLKDRLLGDLSEALQLLGQVEAAALSIVSVLVLDDDERLGELTARGLRRMGYDAEAASRLRALKPGEVVVLDLGLTGSLTAADHVTLRSVRPIVVTG
Ga0137419_1011128213300012925Vadose Zone SoilVESATGSSPSPELAAALHRLRSTLARLKAELELAEADGRAAPTERLLGDLNEALVLLREVEQAAFGTVSILVVDDDARLAELTARGLRRLGYDAESTEALRDIRPREVVVFDL
Ga0137410_1146336213300012944Vadose Zone SoilLSPSPELAASLHQLRSTLARLKAEVELAQIDGAPASTERLLGDLNEALVLLREVEQAAFGTVSILVVDDDA
Ga0134077_1048799523300012972Grasslands SoilVPADRLVGDLGEALELLGAVEAAAFAIVPILVLDDDDRLAELTARGLRRLGYEAESAGRLRALRPREVVVFDLGLSASLDAAQRGALRAARPIVVTGAADPGSRAIAEDLGAFDYMVK
Ga0120125_107264423300014056PermafrostVASATGSSPSPELAAALHRLRSTLARLKAELELALVDGVEPPVERLLGDLNEAFGQLRRVEEAAFGIVSVLVVDDDARLAELTGG
Ga0137409_1113569013300015245Vadose Zone SoilLSPSPELAAALHRLRSTLARLKAEVELAQIDGAPASTERLLGDLNEALVLLREVEQAAFGTVSILVVDDDARLAEL
Ga0137403_1157998713300015264Vadose Zone SoilVESATGSSPSPELAEALHRLRSTLARVKAELELAQADGTPVPAERHLGDLNEALGLLRAVEQSAFGTFSVLVVDDDARLAELTARGLRRLGY
Ga0134085_1049580813300015359Grasslands SoilVRAELEVAIGDGEAPPVERLLSDLRQALEVLGDVEAAALGVVRVLVVDDDERLGELTARGLRRMGYDAEASGRLRSLRPREVVVFDLGVLNSLEAEEHAALTVARPIVVTGAADPGA
Ga0066655_1007186833300018431Grasslands SoilLATGSSSSPEPRAELAQALHRLRSTLARMRAELEVAEGDGGAPPVDRLLSDMREALEALGNVESTALGVVRVLVVDDDERLGELTARGLRRLGYDAESSLVMRTLRPREVVVFDLSVAPSLDRSARASLRASRPIVWTGAS
Ga0066655_1023287913300018431Grasslands SoilLDTGSSSRTELAGALHRLRSTLARARAELELADADGAAPPVDRLLGDLGEALDLLEAVETAAFAIVHVLVLDDDERLGELTARGLRRLGYEAESASRMRPLRPGEVVVFDLGLLASLDSAQRGALKAARPIVVTGAADLRSRAIADGIGAFDYL
Ga0066669_1007038943300018482Grasslands SoilVGSATSSSSRPELAAALHRLRSTLARARAELELARPDGDPVLVERLLGDLREALDVLGLVEAAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAARLRALRPR
Ga0066669_1162163223300018482Grasslands SoilVGSDTGSSSRTELAGALHRLRSTLARARAELELADADGDAPPVDKLLGDLGEALDLLGAVETAAFAIVHVLVLDDDERLAELMARGLRRLGYEAESAGRMRQLRPREVVVFDLGLLASIDSAQRA
Ga0066669_1190807013300018482Grasslands SoilLDTGSSSRTELAGALHRLRSTLARARAELELADADGAAPPVDKLMADLREALDLLGAVESAAFAIVQVLVLDDDPRLAELMARGLRRVGYEAESSGQMRQLRPREVVVFDL
Ga0215015_1086566923300021046SoilVARDTGSPSSPDLAAALHRLRSTLARLKAEAELAEEDGEAHPPPHLLGGLREALDLVAAVEEASLGVVRVLVVDDDKRLGELTARGLRRRGYEAESVGALR
Ga0179596_1037421913300021086Vadose Zone SoilVESATGSSPSPELAEALHRLRSTLARLKAELELAQADGTPVPAERHLGDLNEALGLLRAVEQSAFGTFSVLVVDDDARLAELTARGLRRLGYDAESSD
Ga0210408_1080671423300021178SoilLSSNPELAAALHRLRSTLARMRAELELAREDGEAPPMARMLEDVTDALGLLGRVESVALGIVPVLVVDDDERLAELTARSLRRLGYEAESSGRLRPLRSGEVVVFDLGATSSLDVAERAALRTARPIIVTGAADPGSRAMAEDLDASAYLVK
Ga0210410_1131625923300021479SoilVDSATGSSPSPELAAALHRLRSTLARLKAELELAQDEGTTPPVRRLLGDLDEALHLLGDVERTALGLVSVLVIDDDARLGELTARGLRRLGFDSDS
Ga0210410_1165087113300021479SoilVGSGTGSSPSPELAAALHRLRSTLARAKAELEAAESVDSESKPGRLLGDLNEALMLLQEVEQAAFGVVPILVVDDDTRLAELTARGL
Ga0210409_1135484013300021559SoilMRAELELAREDGKVPPVARMLEDVTDALGLLGRVESVALGIVPVLVVDDDERLAELTARSLRRLGYEAESSGRLRPLRSGEVVVFDLGATSSLDVAERAALRTARPIIVTGA
Ga0207646_1005231963300025922Corn, Switchgrass And Miscanthus RhizosphereVARDTSSSSSPELAATLHRLRSTLARLKAELELAGTDGEAPPVPRLLTDLREALDLVSAVEAATLNVVRVLVLDDDERLGELTARGLRRLGYEAESTPNLRPLRAGEVLVLDLGLVASLGADGRAALRAA
Ga0207646_1028972813300025922Corn, Switchgrass And Miscanthus RhizosphereLSSNPELASALHRLRSTLARMRAELELAHDDASTPPVDRLLADLSESLQLLGRVEAAALGLVPVLVVDDDERLAELTARGLRRLGYEAESAGRLRPLRQAEVVVFDLGVSTSLD
Ga0209438_119827713300026285Grasslands SoilVESATGSSPSPELAEALHRLRSTLARLKAELELAQADGTPVPAERHLGDLNEALGLLRAVEQSAFGTFSVLVVDDDARLAELTARGLRRLGYDAESSDALRDIRPREVV
Ga0209890_1022720933300026291SoilVDSATGSSPSPELAAALHRLRSTLARLKAELELAQVDGTAPPVQGLLGDLNEALTLLRAVEQAAFGLVSVLVVDDDARLAELTARGLRRLGYE
Ga0209237_116263413300026297Grasslands SoilVGSATSSSSRPELAAALHRLRSTLARARAELELARSDGDSVPVDRLLGDLQEALDLLSRVETAAFAIVSVLVVDDDERLAELTARGLRRMGYEADSAPRLR
Ga0209236_113161113300026298Grasslands SoilLARARAELELAQADGDPPPVDQLLTDLREALDLLGRVEAATFQIVRVLVLDDDERLAELTARGLRRLGYEAESASRMRELRPREVVVLDLGLSALLGAHELTVL
Ga0209265_114883813300026308SoilMHRLRSTLARARAELELARTDGDPLPVDRLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLS
Ga0209055_101280313300026309SoilMHRLRSTLARARAELELARTDGDPLPVDKLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDADQLVAL
Ga0209055_106446713300026309SoilMSSGTEPGPELAAALHRLRSTLARMRAELELAQSDLQGPPVERVLGDLGEALGYLGEVESAALAIVPVLVLDDDERLGELTARGLRRLGYEAESAGRLRALRPR
Ga0209055_120351513300026309SoilVGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTSRGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDASERGTLKAAHPI
Ga0209761_1001101173300026313Grasslands SoilVGSDTGSSSRTELAGALHRLRSTLARARAELELAQTDGEPVPADRLVGDLGEALELLGAVEAAAFAIVPVLVLDDDDRLAELTARGLR
Ga0209471_114950313300026318SoilVGSVTGSSSKPELAGALHRLRSTLARLKAELELADSAGDPAPVERMLGDLTEALELLGEVESAALNVVRVLVLDDDERLGELTARGLRRLGYDAESSTVMRPLRSAEVMVFDLSL
Ga0209472_131126723300026323SoilVGSDTGSSSRTELAGALHRLRSTLARARAELELAQTDGEPVPADRLVGDLGEALELLGAVEAAAFAIVPVLVLDDDDRLAELTARGLRRLGYEAESAGRLRALRRLPRRLPQER
Ga0209470_108097813300026324SoilLATGSSSSPEPRAELAQALHRLRSTLARMRAELEVAEGDGGALPVDRLLSDMREALEALGNVESTALGVVRVLVVDDDERLGELTARGLRRLGYDAESSLAMRTLRPREVVVFDLSVAPSLDEPSRAALRLSRPIVLTGASDPSSRAVAEDLD
Ga0209470_137597223300026324SoilLDTGSSSRTELAGALHRLRSTLARARAELELADADGAAPPVDRLLGDLGEALDLLEAVETAAFAIVNVLVLDDDERLGELTARGLRRLGYEAESASRMRPLRPGEVVVFDLGLLASLDSA
Ga0209152_1027248813300026325SoilMPWLASSCKRPATSSSSRPELAAALHRLRSTLARARAELELARPDGDPVLVERLLGDLREALDVLGLVETAAFSIVSVLVVDDDERLAELTARGLRRMGYVAESAARLRALRPREVVVFDIGVLSSLNPAERDVLKAARPVVLTGAT
Ga0209801_131454613300026326SoilVGSATSSSSRPELAAALHRLRSTLARARAELELARPDGDSVLIERLLGDLSEALDVLGLVEAAAFSIVSVLVVDDDERLAELTARGLRRMGYV
Ga0209377_113743913300026334SoilVGSATSSSSRPELAAALHRLRSTLARARAELELARSDGDSVPVDRLLGDLQEALDLLSRVETAAFAIVSVLVVDDDERLAELTARGLRRMGYEA
Ga0209808_100240513300026523SoilVGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRRREPRVA
Ga0209808_110158223300026523SoilMHRLRSTLARARAELELARTDGDPLPVDRLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDADQLV
Ga0209808_121311123300026523SoilLSSNPELAGALHRLRSTLARMRAELELAHEDGSGPPVDRLLVDVSDALEQLGRVEAVALEVVSVLVVDDDERLAELTARGLRRMGYDAESSAR
Ga0209690_103086953300026524SoilVESVTGSSSRAELSGALHRLRSTLARLRAELEVAESNGEPPPVDRLLSDLREALELLGNVESAALEVVRILVVDDDERLAELTARGLRRLGYDAEAAGRLRALRPREVVVFDLGVSAALSPAERAALRAAHP
Ga0209059_105376113300026527SoilLHRLRSTLARQRAELEVAESDGSVPPVARLLSDLREALDLLENVESAALGVVRVLVVDDDERLGELTARSLRRLGFDAEAADHLRALRTGEVVVFDLGVLASLTSADRAVLRAARPVVVTGA
Ga0209059_132943013300026527SoilLARARAELELAQADGDPPPVDRLLTDLREALDLLGRVEAAAFQIVRVLVLDDDERLAELTARGLRRLGYDAEAASRVRELRPREVVVLDLGLSGSLDAHQLSVLRAARPIVVTGATDPASRAV
Ga0209806_102523413300026529SoilMHRLRSTLARARAELELARTDGDPLPVDKLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDADQLVALR
Ga0209806_110807633300026529SoilVGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGQVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVV
Ga0209160_120338623300026532SoilMHRLRSTLARARAELELARTDGDPLPVDKLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRP
Ga0209056_1063255123300026538SoilMHRLRSTLARARAELELARTDGDPLPVDRLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSAS
Ga0209805_107517833300026542SoilMHRLRSTLARARAELELARTDGDPLPVDRLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDADQLVALRAARPIVVTGASDPASRALADDLGAS
Ga0209156_1003543213300026547SoilMHRLRSTLARARAELELARTDGDPLPVDKLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDADQLVALRAARPIVVTGAADPASRALAD
Ga0209161_1019162613300026548SoilMRAELEIAQSDGDAPPVDRLIEDLGEALELLGEVESVALAIVHVLVVDDDERLGELTARGLRRLGFDAESAGRLRTLRPREVVVFDLGVSGSLTAAELAALKVAKPIVVTGAADP
Ga0209474_1047926813300026550SoilVGSATSSPSRPELTAALHRLRSTLARVRAEIELAHSDGDPPPVDRLLVDLHEALDLLGKVEAAAFSIVSVLVVDDDERLAELTARGLRRLGYEAESSGRLRALRPREVVVFDLGLFASLDVSERGMLKAAHPIVVTGATDPGS
Ga0208984_107395223300027546Forest SoilVESVTGSSHSPELAAALHRLRSTLARLKAELELAEADGVAPQPQRLLGDLNEALRLLHAVEQVAFGMVRVLVVDDDARLAELTARGLRRLGYEADSTSALPDL
Ga0209219_117365823300027565Forest SoilVESATGSSPSPEVAAALHRLRSTLARLKAELERAQEDGAAPPLGRLQGDVEEALALLREVERAALGLVSVLVVDDDARLAELTARGLRRMGFDSDSTDAFREPRAGEVV
Ga0209733_102053433300027591Forest SoilVESATGSSRNPELAAALHRLRSTLARLKAELELAEADGAAATPGRLLGDLNEALMLLQAVEQAAFGVVPVLVVDDDARLGELTARGL
Ga0209422_112494913300027629Forest SoilVESATGSSPSPELAAALHRLRSTLARLKAELELAQESEGPESLPGRMLGDLNEALMLLQEVEQAAFGVVPILVVDDDARLGELTARGLRRFLCLRR
Ga0208988_106395923300027633Forest SoilMSLAGSRRFVGSATGSSPSPELAAALHRLRSTLARLKAELELSQTDGTTPPVERVLGDLNEAFSLLRDVERAAFGLVPVLVVDDDVRLAELTARGLRRLGYEANAASALRE
Ga0209076_121262613300027643Vadose Zone SoilMHRLRSTLARARAELELARTDGDPLPVDQLLGDLREALDLLGRVEAAAFQIVAVLVLDDDERLAELTARGLRRLGYEAESASRMRALRPREVVVLDLGLSASLDAD
Ga0209580_1007430633300027842Surface SoilMKAEVELAGLDGTQVTVDRLQGDLREALSLLGAVESAAYAVGPVLVVDDDERLGELTARGLRRLGFEADSRASLRPLRPREVVVFDLGLAASLSAGDGTALRAARPIVVTGAADPASRALAASFD
Ga0209580_1064375723300027842Surface SoilVGSATDLPARAEVASALHRLRSTLARLKAELELAELDGKPPPVDRLLDDLQEALALLGALEAATYSAGPVLVVDDDERLGELTARGLRRLGYEAGSRNTLRPLRPGEVVV
Ga0209068_1060395613300027894WatershedsVASDTGSSRKAELAAALHRLRSTLARLKAELELAASDGALPPADVLLGDLNEALGLLGAAEHAAYGAVCVLVVDDDERLGELTARGLRRLGFEAEAARRAQR
Ga0307281_1009116713300028803SoilVESVTGSSPSPELPAALHRLRSTLARLKAELELAQLDGTATPTERVVADLNEALVLLRAVEQAAFSLVSVLVVDDDARLAELTARGLRRLG
Ga0307372_1019958313300031671SoilMKAELELARADGAPLPAGALLGDLQEALETLAGVEAVALGLARVLVFDDDGRLAELTARGLRRLGYDAEPGSRFRPPRPREVVVFDLGLAAHLTVPERAALKAARPIVV
Ga0307373_1019653113300031672SoilMKAELELARADGAPLPAGALLGDLQEALETLAGVEAVALGLARVLVFDDDGRLAELTARGLRRLGYDAEPGSRFRPPRPREVVVFDLGLAAHLTVPERAALKAARPIV
Ga0307468_10066818323300031740Hardwood Forest SoilLGTDSSSRPELAAALHRLRSTLARARAELELAQADDTPVPGERLLGDLSEALDLLGRVEAAALSISSVLVIDDDERLGELTARGLRRLGYDAQSSSRMRNLKPREIVVLDLGITASLDAEARASLKESRPIVVTGAADPASRV
Ga0307477_1066786913300031753Hardwood Forest SoilVGSATGSSRSPELAAALHRLRSTLARLKAELELAQADGSAPPVERLLGDLDDALVLLRAVERVALGLVTVLVVDDDARLADLTARGLRRLGYESDSAAAFRESRPGEVVVFDLS
Ga0318521_1084309413300031770SoilVNQAGSRRSAGLATGSSSRAELAAALHRLRSTLARAKAELDLAGSDGELVERLRGDLSEALELLGQVESSALRIVPVLVLDDDERLGELTARGLRRLGFDAESAVAMRKLRTGEVVVFDLSLAGGLDASDRALLAAARPIVV
Ga0307471_10286041013300032180Hardwood Forest SoilVGLGTGSSSRPELPASLHRLRSTLARAKAELELAQADGETPPLERLLGDLSEALRLLGEVEAAALSIVPVLVLDDDERLAELTARGLRRLGYDAESSSRLRALKPGEVVVLDLGL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice