NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F050449

Metagenome Family F050449

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F050449
Family Type Metagenome
Number of Sequences 145
Average Sequence Length 108 residues
Representative Sequence ARVSDAGFSDAEGLLTSVRGALEPILPFFDRHVVHQSADANPAQPHPILRPLDNGDPVGLRPLSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Number of Associated Samples 119
Number of Associated Scaffolds 145

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 103
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(35.172 % of family members)
Environment Ontology (ENVO) Unclassified
(52.414 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.966 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 31.39%    β-sheet: 0.00%    Coil/Unstructured: 68.61%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 145 Family Scaffolds
PF00830Ribosomal_L28 90.34
PF00072Response_reg 1.38
PF16916ZT_dimer 0.69
PF05598DUF772 0.69

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 145 Family Scaffolds
COG0227Ribosomal protein L28Translation, ribosomal structure and biogenesis [J] 90.34


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil35.17%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil20.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.72%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil4.83%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.45%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.76%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.76%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.07%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.38%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.38%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.38%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil1.38%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.38%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.69%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.69%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.69%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.69%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.69%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.69%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.69%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.69%
FenEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Fen0.69%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.69%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.69%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.69%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.69%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001160Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2EnvironmentalOpen in IMG/M
3300001305Grasslands soil microbial communities from Hopland, California, USAEnvironmentalOpen in IMG/M
3300002906Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005949Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil leachate replicate DNA2013-051EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012964Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300018060Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026216Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil leachate replicate DNA2013-051 (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027181Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300029989III_Fen_N1 coassemblyEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032074Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R1EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M
3300032892Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12654J13325_101215013300001160Forest SoilSDAEGLLSSIRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
C688J14111_1000398653300001305SoilPVLPFFDRHVVHQSGDVNPAQPHLILRAQEDGDTIGLRPVSDASERILFASAATYPGFGLEGQLLAARAASGQALALSGRKTVSAT*
JGI25614J43888_1008968333300002906Grasslands SoilIVHQSADLNPAPGHPILRPHDDAEAIGLRPLSDAHERALFASAATYPGFGLEGQIVAARAAAGQALLLSGRKSVSAV*
JGI25390J43892_1009191413300002911Grasslands SoilGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVRAT*
JGI25389J43894_103297413300002916Grasslands SoilGHALGPLLLTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066672_1024185313300005167SoilSGGERLLTVARISDAGFSDAEGLLQSIRSALEPVLPFFDRHIVHQTADVSPAQLHTVLRPHDDASPIGLRPSSAAHDRVLFASAATYPGFGLEGQILAARAAAEQALALSGRKAVAAT*
Ga0066677_1014299933300005171SoilGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066677_1042878433300005171SoilESLLTSVKNALEPVLPFFDRHIVHQSADLNPLYSHVILRPHDDGEAIGLRPVSEAHDRILFASAAAYPGFGLEGQLLAARGAADQALVISGRKPVSAT*
Ga0066680_1019818513300005174SoilLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0066673_1042521323300005175SoilLGHTLGPLLLTTLPARRARGESTGERLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066679_1007612933300005176SoilGPLLLTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVLHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066679_1024776713300005176SoilRVRGEAEDERLLTVARVSDAGFSDEQGLLNSVRTALEPVLPFFDRHILHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV*
Ga0066690_1003844913300005177SoilSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVRAT*
Ga0066690_1009162233300005177SoilFSDAEGLLQSIRSALEPVLPFFDRHIVHQTADVSPAQLHTVLRPHDDASPIGLRPSSAAHDRVLFASAATYPGFGLEGQILAARAAAEQALALSGRKAVAAT*
Ga0066690_1050573513300005177SoilHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV*
Ga0066688_1030394913300005178SoilLPQALEDAALVLGHALGPLLLTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066688_1066466413300005178SoilEAALLLGHAMGPLVISALPARRTRGEAENERLLTVARVSDAGFSDEQGLLNSVRAALAPVLPFFDRHIVHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV*
Ga0066684_1045685533300005179SoilFFDRHVLHQAADLNPLQPHTILHPHEDADPIGLRATSEGHERVLFASEATYPGFGLEGQILAARVAAEQALAMSGRKTVSAT*
Ga0066684_1081836713300005179SoilRRAKGEPAGERLLTVARVSDAGFSDAESLLTSVKNALEPVLPFFDRHIVHQSVDLNPLYSHVILRPHDDGEAIGLRPVSEAHDRILFASAAAYPGFGLEGQLLAARGAADQALVISGRKPVSAT*
Ga0066678_1038209333300005181SoilVLGHALGPLLLTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066689_1071388033300005447SoilHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066682_1035823613300005450SoilQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV*
Ga0066687_1020179513300005454SoilPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0070730_1039587313300005537Surface SoilVRGDASGERVLTVARVTDAAYTDGPGLLKAVRGALEPVLPFFERHVLHQFADVTPVPGHPILRPHEDAEAIGLRPHTEAHDRVLFASSSTYPGFGLEGQFLAARAAADQALALSGRKTISAT*
Ga0070693_10086762413300005547Corn, Switchgrass And Miscanthus RhizosphereEGLLTSIRNLLEPILPFFDRHIVHQAADVNPWQPHPILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT*
Ga0070704_10033054433300005549Corn, Switchgrass And Miscanthus RhizosphereTTLPARRARGDSTGEKLLTVARVSDAGFSDEEGLLTSIRNLLEPILPFFDRHIVHQAADVNPWQPHLILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT*
Ga0066701_1002813813300005552SoilSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066701_1079318323300005552SoilRHVVHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066661_1062140533300005554SoilRHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066700_1082900013300005559SoilGHAMGPLVISALPARRVRGEAEDERLLTVARVSDAGFSDEQGLLNSVRTALEPVLPFFDRHILHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV*
Ga0066708_1021661013300005576SoilPQALEDAALLLGHTLGPLLLTTLPARRARGESTGERLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066654_1011743413300005587SoilERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0066706_1153541213300005598SoilPFFDRHIVHEAADLSPSPPHVLIRPHDDVEPIGLRPVSAAHERVLFASAATYPGFGIEGQLLAARGAAEQAHALSGRKTIAV*
Ga0066791_1001884113300005949SoilISALQARKVRGEAPGERVLTVARVTEVTYADGPAFLQSVRAALEPVLPFFERHVLHQFVDVTPIPGHPILRPHEDAEAIGLRPHTEAHDRVFFASSSTYPGFGLEGQFLAARAAADQALALSGRKSISAT*
Ga0066651_1049447723300006031SoilLLTVARVSDAGFSDAEGLLSSMRAILEPIFPFFDRHVVHQAADVNPLQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0066696_1011714433300006032SoilFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0066696_1097944113300006032SoilLPFFDRHVVHQSADLNPSQIHTILRPHEDAEPIGLRPVSEVHERVLFASAATYPGFGLEGQILAARAAAEQAVALSGRKSVSAT*
Ga0066656_1061095113300006034SoilADAGFSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0066652_10209326613300006046SoilVARVSDAGFSDADGLLHSVRAALEPVLPFFDKHIVHQSADLNPVGGHPLLRPHADGEPIGLRPVSNTHERVLFASAATYPGFGLEGQILAARAAADQAQALSGRKSVSAT*
Ga0097621_10043814613300006237Miscanthus RhizosphereDAGFSDEEGLLTSIRNLLEPILPFFDRHIVHQAADVNPWQPHLILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT*
Ga0066659_1004284113300006797SoilEGLPQALEDAALLLGNAMGPLLLTTLPARRARGEGTGEKLLTVARISDAGFSDAEGLLATIRATLEPIFPFFDRHLVHQAADVNPAQLHPILRTPEGNDPIGLRPISDASDHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0066659_1137090323300006797SoilVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV*
Ga0066660_1096745013300006800SoilVSDAGFSDEQGLLTSIRGALEPIFPFFERHVLHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0075429_10185135913300006880Populus RhizospherePQALEDAALLLGSAMGPLLLTTLPARRARGESTGEKLVTVARVADAGYSDAEALLASVRSTLEPLFPFFDRHIVHQAADVNPAQPHSILRAPDGHDPVGLRPISEADEHVLFASSSTYPGFGLEGQILAGRAAAGQALALSGRKSVSAT*
Ga0079219_1057160933300006954Agricultural SoilLEPILPFFDRHIVHQAADVNPWQPHPILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT*
Ga0099791_1060973313300007255Vadose Zone SoilEDAALLLGNAMGPLLLTTLPARRARGEGPGEKLLTVARVSDAGFSDADGLLATVRATLEPIFPFFDRHLVHQTADVNPAQPHPILRAPEGNDPIGLRPISAASEHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0099791_1066924313300007255Vadose Zone SoilLEPICPFFDRHLVHQTADVNPAQPHPILRAPEGNDPIGLRPISATDDHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0066710_10106265313300009012Grasslands SoilFSDEQGLLNSVRAALAPVLPFFDRHIVHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV
Ga0066710_10218579333300009012Grasslands SoilADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0066710_10296394923300009012Grasslands SoilPLALAAAARGHGNAMGPLLLESRPARRARGESAGEKLLTVARVSDAGFSDAEGLLSSMRAILEPIFPFFDRHVVHQAADVNPLQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0066710_10316070113300009012Grasslands SoilQSVRSALEPVLPFFDRHIVHQSADLNPIPAHPLLRHPEEYAELIGLRPISDVNDRVFFASAATYPGFGLEGQFLAARAAADQALVISGRKAVSAT
Ga0066710_10431801713300009012Grasslands SoilHVVHQSADLNPSQIHTILRAHEDGEPIGLRPISEAHERVLFASAATYPGFGLEGQLLAARAAAEQAVALSGRKSVSAT
Ga0099830_1046118233300009088Vadose Zone SoilVLPFFDRHIVIQSADLNPSHGHPILRPHEDAEPIGLRPLSDAHERVLFASAATYPGFGLEGQILAARAVAEQALALSGRKSVSAV*
Ga0099830_1099754813300009088Vadose Zone SoilALLLGPPPAPLIISALPARRARGETSGERLLTVGRVSDAGFSDAEGLLQSVRAALEPVLPFFDKHIVHQSADVNPVQGHLLLRPHDDGEPIGLRPLSQAHERVLFASAATYPGFGLEGQILAARGAADQALALSGRKSVSAT*
Ga0099828_1137309313300009089Vadose Zone SoilFFDRHIVHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKSVSAV*
Ga0066709_10016118213300009137Grasslands SoilADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0066709_10077271013300009137Grasslands SoilSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0066709_10109086233300009137Grasslands SoilFFDRHIVHQSVDLNPLYSHVILRPHDDGEAIGLRPVSEAHDRILFASAAAYPGFGLEGQLLAARGAADQALVISGRKPVSAT*
Ga0066709_10463103523300009137Grasslands SoilSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0099792_1074292513300009143Vadose Zone SoilIRATLEPICPFFDRHLVHQTADVNPAQPHPILRAPEGNDPIGLRPISATDDHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0111538_1374094213300009156Populus RhizosphereLLLTTLPARRARGESTGEKLLTVARVADAGYSDAEALLASVRSTLEPLFPFFDRHIVHQAADVNPAQPHSILRAPDGHDPVGLRPISEADEHVLFASSSTYPGFGLEGQILAGRAAAGQALALSGRKSVSAT*
Ga0126374_1184743913300009792Tropical Forest SoilGEADGERLLTVARVSDAGFSDEQGLLNSIRSALEPVLPFFDRHILHQSADLNPPLSHPILRPNDDAEPIGLRPISDAHERVLFASAATYPGFGLEGQILAARAAAGQALALSGRKVVSAV
Ga0126384_1072391213300010046Tropical Forest SoilSALEPVLPFFDRHILHQSADLNPPLSHPILRPNDDAEPIGLRPISDAHERVLFASAATYPGFGLEGQILAARAAAGQALALSGRKVVSAV*
Ga0126384_1103461133300010046Tropical Forest SoilLTVARISDAGFSDPEGLLATVRATLEPIFPFFDRHLIHQTSDVNPAQPHPILRTPEGSDPIGLRPISEASEHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKMVSAT*
Ga0126373_1183519013300010048Tropical Forest SoilALEEAALLLGHAMGPLLVSALPARRAKGEAAGERLLTVARVCDAGFSDEGGLLASVRTALEAVLPFFDRHIVHQSADLNPPQPHAILTAHEDAEPIGLRPISDAHERVLFASSATYPGFGIEGQILAARAAAGQAMVLSGRKSVSAT*
Ga0134088_1020761233300010304Grasslands SoilLEPIFPFFDRHLVHQAADVNPAQLHPILRTPEGNDPIGLRPISDASDHVLFASASTYPGFGLEVQILAGRAAAGQALALSGRKTVSAT*
Ga0134067_1027261113300010321Grasslands SoilPMGPLLLTTLPARRARGESSGEKLLTVARVADAGFSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0134084_1008885633300010322Grasslands SoilFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0134065_1003688533300010326Grasslands SoilPARRARGESSGEKLLTVARVADAGFSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0134062_1043482923300010337Grasslands SoilEPGRARGESTGEKLLTVARISDAGFSDAEGLLATVRATLEPIFPFFDRHLVHQTADVNPAQPHPILRAPDGNDPIGLRPISAASDHVLFASSATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0134123_1169369913300010403Terrestrial SoilTSIRNLLEPILPFFDRHIVHQAADLNPWQPHPILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT*
Ga0137392_1029916013300011269Vadose Zone SoilAPLIISALPARRARGETSGERLLTVGRVSDAGFSDAEGLLQSVRAALEPVLPFFDKHIVHQSADVNPVQGHLLLRPHDDGEPIGLRPLSEAHERVLFASASTYPGFGLEGQILAARGAADKALALSGRKAVSAT*
Ga0137391_1031721233300011270Vadose Zone SoilLTVARVSDAGFSDEQGLLASVRAALEPVLPFFDRHIVIQSADLNPSHGHPILRPHEDAEPIGLRPLSDAHERVLFASAATYPGFGLEGQILAARAVAEQALALSGRKSVSAV*
Ga0137393_1086838633300011271Vadose Zone SoilEPVLPFFDKHIVHQSADVNPVQGHLLLRPHDDGEPIGLRPLSEAHERVLFASASTYPGFGLEGQILAARGAADKALALSGRKAVSAT*
Ga0137388_1167298213300012189Vadose Zone SoilEEAALLLGPPPAPLLIAAIPARRARGESGGERLLTVARISDAGFSDAEGLLQSIRSALEPVLPFFDRHIVHQTADVSPAQPHTVLRPHDDGSPIGLRPSSAAHDRVLFASAATYPGFGLEGQILAARAAAEQALALSGRKAVAAT*
Ga0137399_1009346513300012203Vadose Zone SoilFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0137399_1107251733300012203Vadose Zone SoilDAGFSDAEGLLSSMRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNEPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0150985_12073468933300012212Avena Fatua RhizosphereMFSALPARKARGEAAGERLLTVARVADAGFSDEQSLLASLRAALEPVLPFFDRHVVHQSGDVNPAQPHLILRAQEDGDTIGLRPVSDASERILFASAATYPGFGLEGQLLAARAASGQALALSGRKTVSAT*
Ga0137360_1004052553300012361Vadose Zone SoilESSGEKLLTVARVSDAGFSDAEGLLSSTRAALEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNEPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137360_1117970323300012361Vadose Zone SoilLEDAALLLGNAMGPLLLTNLPARRARGESSGEKLLTVARVSDAGFSDAEGLLSSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137361_1048654233300012362Vadose Zone SoilRRARGESTGERLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0137361_1098663313300012362Vadose Zone SoilALGPLLLTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT*
Ga0150984_11620533733300012469Avena Fatua RhizosphereSGDVNPAQPHLILRAQEDGDTIGLRPVSDASERILFASAATYPGFGLEGQLLAARAASGQALALSGRKTVSAT*
Ga0137358_1054047233300012582Vadose Zone SoilAGFSDAEGLLSSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNEPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137397_1100271123300012685Vadose Zone SoilEKLLTVARVSDAGFSDAEGLLSSMRAILEPIFPFFDRHVVHQAADVNPLQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137394_1133810213300012922Vadose Zone SoilRGESSGEKLLTVARVSDAGFSDAEGLLSSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNEPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137359_1089350413300012923Vadose Zone SoilGLPQALEDAALLLGSPMGPLLLTTLPARRARGESTGEKLLTVARVADAGFSDADGLLSSIRATLEPICPFFDRHVVHQTADVNPAQPHPILRAPEGNDPVGLRPISETDDHVLFASSSTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137419_1121600133300012925Vadose Zone SoilRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNEPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137410_1007945843300012944Vadose Zone SoilEDAALLLGNAMGPLLLTNLPARRARGESSGEKLLTVARVSDAGFSDAEGLLSSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNEPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0153916_1049135313300012964Freshwater WetlandsPLVISALPARRIRGEAKNERLLTVARISDAGFSDEEALLQSIRGALEPVLPFFDKHIVHQAADLNPAQSHPILRPHEDAEPIGLRPHSEAHERVMFASAATYPGFGLEGQILAARAAAAQALDLSGRKSVKAV*
Ga0134110_1054833913300012975Grasslands SoilTLPARRARGESTGERLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVRAT*
Ga0137411_116976513300015052Vadose Zone SoilVARVARVSDAGFSDAEGLLSSMRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNEPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0137412_1036387713300015242Vadose Zone SoilTVARVSDAGFSDAEGLLSSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNDPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT*
Ga0134073_1002647913300015356Grasslands SoilLEDAALVLGHPMGPLLLTTLPARRARGESSGEKLLTVARVADAGFSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT*
Ga0187779_1122644513300017959Tropical PeatlandEKLLTVARVADAGFADDQGLLTGIRSALEPVLPFFERHVVHQAADVNPAQPHPILRIPDEGDAVGLRPLSSADQHVTFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0187765_1036208533300018060Tropical PeatlandRLLTVARVSDAGFSDAPGVLSSVRNALEPVLPFFDRHIVHQGADLDPAQPWPILTPHEDAEPIGLQPLSDAHERVLFASCATYPGFGLEGQILAARAAAAQALHLSGRKVVSAV
Ga0184619_1025249833300018061Groundwater SedimentGPLLLTTLPARRARGESTGEKLLTVARVADAGFSDEQGLMGSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0066667_1003436413300018433Grasslands SoilTLGPLLLTTLPARRARGESTGERLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVRAT
Ga0066662_1257820813300018468Grasslands SoilSIRAALEPVLPFFDRHLVYQAGDVNPAQPHLILRAQDDLMGLRPVSEASERILFASAATYPGFGLEGQILAARAASAQALALSGRKSVSAT
Ga0137408_100212933300019789Vadose Zone SoilSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNDPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0179594_1026522713300020170Vadose Zone SoilLVLGNAMGPLLLTNLPARRARGESSGEKLLTVARVSDAGFSDAEGLLSSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNEPIGLRPISESSEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0210402_1112838213300021478SoilVRTEGLPQALEDAALLLGNAMGPLLLTTLPARRARGEASGEKLLTVARISDAGFSDAEGLLATVRAALEPIFPFFDRHLVHQAADVNPAQPHPILRTPEGNDPIGLRPVSETSEHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKIVSAT
Ga0207664_1163931213300025929Agricultural SoilRIADAGFSDEASLASSIRAALEPVLPFFDRHVVHQAADANPAQPHLILRVAENAEAVGLRPISEASERLLFASAATYPGFGLEGQLLAARAASVQALALSGRKTVSAT
Ga0207677_1059454233300026023Miscanthus RhizospherePQALEDAALLLGHAMGPLLLTTLPARRARGDSTGEKLLTVARVSDAGFSDEEGLLTSIRNLLEPILPFFDRHIVHQAADVNPWQPHPILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT
Ga0207708_1158976523300026075Corn, Switchgrass And Miscanthus RhizospherePLLLTTLPARRARGDSTGEKLLTVARVSDAGFSDEEGLLTSIRNLLEPILPFFDRHIVHQAADVNPWQPHPILRPAENGDPVGLRPMSGADDHVVFASAATYPGFGLEGQILAARAAASQALALSGRKSVSAT
Ga0209903_105585023300026216SoilISALQARKVRGEAPGERVLTVARVTEVTYADGPAFLQSVRAALEPVLPFFERHVLHQFVDVTPIPGHPILRPHEDAEAIGLRPHTEAHDRVFFASSSTYPGFGLEGQFLAARAAADQALALSGRKSISAT
Ga0209235_120039933300026296Grasslands SoilLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209055_108097933300026309SoilADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209055_117025113300026309SoilRARGESGGERLLTVARISDAGFSDAEGLLQSIRSALEPVLPFFDRHIVHQTADVSPAQLHTVLRPHDDASPIGLRPSSAAHDRVLFASAATYPGFGLEGQILAARAAAEQALALSGRKAVAAT
Ga0209154_117516833300026317SoilDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209471_110081733300026318SoilTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVLHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209647_102425313300026319Grasslands SoilFDRHIVHQSADLNPAPGHPILRPHDDAEAIGLRPLSDAHERALFASAATYPGFGLEGQIVAARAAAGQALLLSGRKSVSAV
Ga0209647_118250613300026319Grasslands SoilRSALEPVLPFFDRHVVHQSADLNPSQIHTILRPHDDAEPIGLRPVSEAHERVLFASAATYPGFGLEGQILAARAAAEQAVALSGRKSVSAT
Ga0209687_124298113300026322SoilEGLPQALEDAALVLGHALGPLLLTTLPARRARGESTGEKLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVLHQSADANPAQPHPILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209470_130557833300026324SoilDANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209152_1013355813300026325SoilARGESTGERLLTVARVSDAGFSDEQGLLTSIRGALEPIFPFFERHVVHQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209473_130116523300026330SoilFFDRHVLHQAADLNPLQPHTILHPHEDADPIGLRATSEGHERVLFASEATYPGFGLEGQILAARVAAEQALAMSGRKTVSAT
Ga0209267_119215813300026331SoilARISDAGFSDAEGLLQSIRSALEPVLPFFDRHIVHQTADVSPAQLHTVLRPHDDASPIGLRPSSAAHDRVLFASAATYPGFGLEGQILAARAAAEQALALSGRKAVAAT
Ga0209377_125236713300026334SoilRHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0257165_102923233300026507SoilLTVGRVSDAGFSDAEGLLSSIRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALSLSGRKTVSAT
Ga0209808_124893613300026523SoilVHQAADVNPLQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0209808_130993723300026523SoilNSVRAALAPVLPFFDRHIVHQSADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV
Ga0209690_123308633300026524SoilIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0209378_119250123300026528SoilMGPLLLTTLPARRARGESAGEKLLTVARVADAGFSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0209806_132520123300026529SoilQSADANPAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVRAT
Ga0209058_102172323300026536SoilMGPLLLTTLPARRARGESSGEKLLTVARVADAGFSDDQALVSSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0209805_128339113300026542SoilRSALEPVLPFFDRHIVHQTADVSPAQLHTVLRPHDDASPIGLRPSSAAHDRVLFASAATYPGFGLEGQILAARAAAEQALALSGRKAVAAT
Ga0209161_1008838433300026548SoilAQPHSILRPLDNGDPVGLRPFSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0209161_1055935623300026548SoilPVLPFFDRHIVHEAADLSPSPPHVLIRPHDDVEPIGLRPVSAAHERVLFASAATYPGFGIEGQLLAARGAAEQAHALSGRKTIAV
Ga0209474_1065033713300026550SoilLLTTLPARRARGESSGEKLLTVARVADAGFSDDQGLMGSIRAVLEPIFPFFERHIVHQGADVNPPQPHPILRPLENGDPVGLRPASEADEHIVFASAATYPGFGLEGQILAARAAAGQALALSGRKTVSAT
Ga0179593_120723733300026555Vadose Zone SoilMRRCSWAARWGPLLLTTLPARRARGESTGEKILTVARVADAGFSDADGLLSSIRATLEPICPFFDRHLVHQTADVNPAQPHPILRAPEGNDPIGLRPISETDDHVLFASSSTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0208997_103271713300027181Forest SoilSMRATLEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNEPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0208981_103114513300027669Forest SoilEPIFPFFDRHLVHQAADVNPAQPHPILRAPEGNDPIGLRPISASNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0209011_105580113300027678Forest SoilRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0209328_1006347613300027727Forest SoilGFSDAEGLLSSIRATLEPIFPFFDRHVVHQAADVNPAQPHPILRAPEGNDPIGLRPISESNEHVLFASAATYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0209689_127245213300027748SoilADLNPPLSHPILRPHDDAEPIGLRPISEAHDRVLFASAATYPGFGLEGQILAARAAADQALAISGRKTVSAV
Ga0209701_1023975733300027862Vadose Zone SoilALLLGPPPAPLIISALPARRARGETSGERLLTVGRVSDAGFSDAEGLLQSVRAALEPVLPFFDRHIVIQSADLNPSHGHPILRPHEDAEPIGLRPLSDAHERVLFASAATYPGFGLEGQILAARAVAEQALALSGRKSVSAV
Ga0209488_1040866313300027903Vadose Zone SoilFPFFDRHLVHQTADVNPAQPHPILRAPEGNDPIGLRPISDASEHVLFASASTYPGFGLEGQILAGRAAAGQALALSGRKTVSAT
Ga0307310_1057965413300028824SoilHIVHQGADANPAQPHSILRPAENGDPVGLRPMSETDDHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKSVSAT
Ga0307308_1047594613300028884SoilTLPARRARGDSTGEKLLTVARVSDAGFSDEQGLLSSIRGVLEPIFPFFERHIVHQGADANPAQPHSILRPAENGDPVGLRPMSETDDHVVFASAATYPGFGLEGQILAARAAAGQALALSGRKSVSAT
Ga0311365_1162995913300029989FenIVHQRADLDPPLPHPLFQPREDGEPLGLRPQSEAHDRALFASAAVYPGFGLEGQLIAARACADAALVLSGRKQVAAV
Ga0307469_1011029513300031720Hardwood Forest SoilARVSDAGFSDAEGLLTSVRGALEPILPFFDRHVVHQSADANPAQPHPILRPLDNGDPVGLRPLSEADEHVVFASAATYPGFGLEGQILAARAAAGRALALSGRKTVSAT
Ga0308173_1037545433300032074SoilIADAGFSDEASLASSIRAALEPVLPFFDRHVVHQAADANPAQPHLILRVAENAEAVGLRPISEASERLLFASAATYPGFGLEGQLLAARAASVQALALSGRKTVSAT
Ga0335082_1078106933300032782SoilARVADAGFADDQGLLNGIRSALEPVFPFFERHVVHQAADVNPAQPHPILRIPEDGEAVGLRPLSAADQHVTFASAATYPGFGLEGQILAARAAAGQALALSRRKTVSAT
Ga0335081_1007183853300032892SoilPVLPFFDRHIVHQGADVNPAQPWPILTPHEDADPIGLQPHSEAHERVLFASCATYPGFGLEGQILAARAAAAQALHLSGRKSVSAT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.