NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F066753


Overview

Basic Information
Family ID F066753
Family Type Metagenome / Metatranscriptome
Number of Sequences 126
Average Sequence Length 75 residues
Representative Sequence MATSHEARDDGARAAIQAILHELPEDHPAREAYRAGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAA
Number of Associated Samples 106
Number of Associated Scaffolds 126

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 79.37 %
% of genes near scaffold ends (potentially truncated) 29.37 %
% of genes from short scaffolds (< 2000 bps) 84.92 %
Associated GOLD sequencing projects 99
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63


Most Common Taxonomy
Group Bacteria (65.873 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere
(8.730 % of family members)
Environment Ontology (ENVO) Unclassified
(23.016 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(42.063 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 54.81%, β-sheet: 0.00%, Coil/Unstructured: 45.19%

Predicted 3D Structure

pTM-score: 0.63

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
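
The low-confidence rule stated above (pTM < 0.7) amounts to a simple threshold check. The following is a minimal Python sketch of that rule; the constant and function names are hypothetical and are not part of NMPFamsDB.

```python
# Cutoff taken from the note above: models with pTM < 0.7 are flagged as low confidence.
# PTM_CUTOFF and is_low_confidence are illustrative names, not NMPFamsDB identifiers.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a predicted model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# Family F066753 has a reported pTM-score of 0.63.
print(is_low_confidence(0.63))
```

For this family the check returns True, which matches the "Low Quality Model" label.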



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 126 Family Scaffolds
PF02774	Semialdhyde_dhC	22.22
PF12849	PBP_like_2	10.32
PF00528	BPD_transp_1	2.38
PF02522	Antibiotic_NAT	1.59
PF01554	MatE	1.59
PF00072	Response_reg	0.79
PF13426	PAS_9	0.79
PF13749	HATPase_c_4	0.79
PF04055	Radical_SAM	0.79
PF16655	PhoD_N	0.79
PF13751	DDE_Tnp_1_6	0.79
PF01553	Acyltransferase	0.79
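
The frequencies in the Pfam table above are percentages of the 126 family scaffolds, so each one can be converted back to an approximate scaffold count. The following is a minimal Python sketch, assuming the listed percentages were rounded to two decimals; the helper name `percent_to_count` is hypothetical.

```python
# Total number of scaffolds associated with family F066753 (from the Overview).
TOTAL_SCAFFOLDS = 126


def percent_to_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Convert a percentage of scaffolds into the nearest whole scaffold count."""
    return round(percent * total / 100)


# A few rows copied from the Pfam table above.
pfam_frequencies = {
    "PF02774": 22.22,
    "PF12849": 10.32,
    "PF00528": 2.38,
    "PF02522": 1.59,
    "PF00072": 0.79,
}

for pfam_id, pct in pfam_frequencies.items():
    print(pfam_id, percent_to_count(pct))
```

Under that assumption, 0.79 % corresponds to a single scaffold, so the domains in the last rows of the table each appear next to one family member only.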

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 126 Family Scaffolds
COG0002	N-acetyl-gamma-glutamylphosphate reductase	Amino acid transport and metabolism [E]	22.22
COG0136	Aspartate-semialdehyde dehydrogenase	Amino acid transport and metabolism [E]	22.22
COG2746	Aminoglycoside N3'-acetyltransferase	Defense mechanisms [V]	1.59



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	65.87 %
Unclassified	root	N/A	34.13 %

Associated Scaffolds


Scaffold	Taxonomy	Length
2088090014|GPIPI_17025641	All Organisms → cellular organisms → Bacteria	1017
3300000364|INPhiseqgaiiFebDRAFT_101779991	All Organisms → cellular organisms → Bacteria → Proteobacteria	1454
3300000559|F14TC_102585086	All Organisms → cellular organisms → Bacteria	898
3300000881|JGI10215J12807_1123302	All Organisms → cellular organisms → Bacteria	643
3300001139|JGI10220J13317_10734730	All Organisms → cellular organisms → Bacteria	616
3300004022|Ga0055432_10188004	Not Available	589
3300004025|Ga0055433_10048681	All Organisms → cellular organisms → Bacteria	854
3300004156|Ga0062589_100593556	All Organisms → cellular organisms → Bacteria	959
3300004156|Ga0062589_101157932	Not Available	736
3300004157|Ga0062590_100028029	All Organisms → cellular organisms → Bacteria	2798
3300004463|Ga0063356_101996160	All Organisms → cellular organisms → Bacteria → Proteobacteria	877
3300004463|Ga0063356_104843378	Not Available	578
3300004479|Ga0062595_101741579	Not Available	589
3300004480|Ga0062592_101657446	Not Available	620
3300004643|Ga0062591_100680820	All Organisms → cellular organisms → Bacteria	926
3300004798|Ga0058859_11796243	Not Available	590
3300004800|Ga0058861_10473188	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia → unclassified Chloroflexia → Chloroflexia bacterium	628
3300005093|Ga0062594_100015487	All Organisms → cellular organisms → Bacteria → Proteobacteria	2980
3300005104|Ga0066818_1009447	All Organisms → cellular organisms → Bacteria	703
3300005332|Ga0066388_100053914	All Organisms → cellular organisms → Bacteria → Proteobacteria	4172
3300005332|Ga0066388_101508837	All Organisms → cellular organisms → Bacteria	1176
3300005334|Ga0068869_100424519	All Organisms → cellular organisms → Bacteria	1097
3300005336|Ga0070680_100737766	All Organisms → cellular organisms → Bacteria	848
3300005340|Ga0070689_100208277	Not Available	1599
3300005354|Ga0070675_102066020	Not Available	525
3300005444|Ga0070694_100857244	All Organisms → cellular organisms → Bacteria	748
3300005445|Ga0070708_100107841	All Organisms → cellular organisms → Bacteria	2557
3300005447|Ga0066689_10686863	All Organisms → cellular organisms → Bacteria	641
3300005526|Ga0073909_10122800	Not Available	1054
3300005543|Ga0070672_100207465	Not Available	1640
3300005545|Ga0070695_100494631	All Organisms → cellular organisms → Bacteria	945
3300005561|Ga0066699_11162684	Not Available	530
3300005764|Ga0066903_103290449	All Organisms → cellular organisms → Bacteria	873
3300005841|Ga0068863_100095150	All Organisms → cellular organisms → Bacteria → Proteobacteria	2827
3300005841|Ga0068863_100442901	All Organisms → cellular organisms → Bacteria → Proteobacteria	1274
3300006049|Ga0075417_10016102	All Organisms → cellular organisms → Bacteria → Proteobacteria	2903
3300006579|Ga0074054_11977583	Not Available	658
3300006604|Ga0074060_11109595	Not Available	512
3300006844|Ga0075428_101331792	All Organisms → cellular organisms → Bacteria	755
3300006846|Ga0075430_101091658	Not Available	657
3300006854|Ga0075425_100755300	Not Available	1115
3300006871|Ga0075434_101907131	All Organisms → cellular organisms → Bacteria	600
3300006881|Ga0068865_101640489	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	579
3300006904|Ga0075424_100994620	Not Available	894
3300006954|Ga0079219_10822437	All Organisms → cellular organisms → Bacteria	734
3300007004|Ga0079218_11043400	All Organisms → cellular organisms → Bacteria → Proteobacteria	825
3300007076|Ga0075435_100899222	All Organisms → cellular organisms → Bacteria	772
3300009094|Ga0111539_10279946	All Organisms → cellular organisms → Bacteria	1941
3300009094|Ga0111539_11527242	All Organisms → cellular organisms → Bacteria	774
3300009137|Ga0066709_101086980	All Organisms → cellular organisms → Bacteria	1175
3300009137|Ga0066709_101167475	Not Available	1133
3300009137|Ga0066709_102757886	All Organisms → cellular organisms → Bacteria	653
3300009147|Ga0114129_11989576	Not Available	703
3300009610|Ga0105340_1009449	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria	3863
3300009802|Ga0105073_1032952	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae	615
3300010047|Ga0126382_11430236	All Organisms → cellular organisms → Bacteria	632
3300010047|Ga0126382_11823324	All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Competibacteraceae → Plasticicumulans → Plasticicumulans lactativorans	573
3300010147|Ga0126319_1176210	Not Available	600
3300010147|Ga0126319_1191493	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	512
3300010359|Ga0126376_12023958	All Organisms → cellular organisms → Bacteria → Proteobacteria	618
3300010373|Ga0134128_10184163	All Organisms → cellular organisms → Bacteria	2356
3300010375|Ga0105239_10853292	Not Available	1044
3300010396|Ga0134126_10929507	All Organisms → cellular organisms → Bacteria	977
3300010397|Ga0134124_11305404	Not Available	749
3300010397|Ga0134124_13158488	Not Available	504
3300010398|Ga0126383_13384236	Not Available	521
3300010400|Ga0134122_10006720	Not Available	8545
3300010400|Ga0134122_10596851	All Organisms → cellular organisms → Bacteria	1019
3300010401|Ga0134121_10388517	All Organisms → cellular organisms → Bacteria	1258
3300010403|Ga0134123_10508260	All Organisms → cellular organisms → Bacteria	1136
3300010403|Ga0134123_11748095	All Organisms → cellular organisms → Bacteria	674
3300010403|Ga0134123_11997747	Not Available	638
3300011398|Ga0137348_1097546	Not Available	509
3300011420|Ga0137314_1033013	All Organisms → cellular organisms → Bacteria	1262
3300011440|Ga0137433_1242207	Not Available	589
3300012212|Ga0150985_103670040	All Organisms → cellular organisms → Bacteria	702
3300012212|Ga0150985_107089002	All Organisms → cellular organisms → Bacteria	889
3300012212|Ga0150985_109186102	All Organisms → cellular organisms → Bacteria	1893
3300012469|Ga0150984_102017123	Not Available	645
3300012486|Ga0157331_1018484	All Organisms → cellular organisms → Bacteria	598
3300012505|Ga0157339_1020142	All Organisms → cellular organisms → Bacteria	702
3300012685|Ga0137397_10207304	All Organisms → cellular organisms → Bacteria	1459
3300012958|Ga0164299_10266931	All Organisms → cellular organisms → Bacteria	1032
3300012960|Ga0164301_11173690	All Organisms → cellular organisms → Bacteria	615
3300012987|Ga0164307_11391363	All Organisms → cellular organisms → Bacteria	590
3300015371|Ga0132258_12557509	All Organisms → cellular organisms → Bacteria → Proteobacteria	1276
3300015371|Ga0132258_12710576	All Organisms → cellular organisms → Bacteria	1236
3300015374|Ga0132255_102288255	All Organisms → cellular organisms → Bacteria	824
3300018083|Ga0184628_10017490	All Organisms → cellular organisms → Bacteria	3505
3300018469|Ga0190270_13345668	All Organisms → cellular organisms → Bacteria	508
3300021082|Ga0210380_10148499	Not Available	1051
3300024177|Ga0247686_1002521	All Organisms → cellular organisms → Bacteria	1925
3300024224|Ga0247673_1022158	All Organisms → cellular organisms → Bacteria	850
3300025318|Ga0209519_10028482	All Organisms → cellular organisms → Bacteria	3045
3300025521|Ga0210083_1023782	All Organisms → cellular organisms → Bacteria	869
3300025913|Ga0207695_10322880	Not Available	1433
3300025917|Ga0207660_10878073	Not Available	732
3300025949|Ga0207667_12036559	Not Available	534
3300026035|Ga0207703_10016482	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria	5762
3300026542|Ga0209805_1333877	Not Available	576
3300027573|Ga0208454_1171939	Not Available	504
3300027778|Ga0209464_10037124	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	1576
3300027821|Ga0209811_10012780	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2702
3300027873|Ga0209814_10015552	All Organisms → cellular organisms → Bacteria → Proteobacteria	3045
3300028792|Ga0307504_10013913	All Organisms → cellular organisms → Bacteria	1881
3300028812|Ga0247825_10131096	All Organisms → cellular organisms → Bacteria	1711
3300028812|Ga0247825_10173522	Not Available	1485
3300028814|Ga0307302_10339817	Not Available	740
3300031184|Ga0307499_10001109	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria	8418
3300031538|Ga0310888_10196589	Not Available	1106
3300031720|Ga0307469_10528891	All Organisms → cellular organisms → Bacteria	1040
3300031740|Ga0307468_100026676	All Organisms → cellular organisms → Bacteria → Proteobacteria	2672
3300031740|Ga0307468_100548212	All Organisms → cellular organisms → Bacteria	930
3300031820|Ga0307473_10014202	All Organisms → cellular organisms → Bacteria	3027
3300031908|Ga0310900_10260413	All Organisms → cellular organisms → Bacteria	1254
3300032075|Ga0310890_11653731	All Organisms → cellular organisms → Bacteria	530
3300032174|Ga0307470_10035799	All Organisms → cellular organisms → Bacteria → Proteobacteria	2411
3300032180|Ga0307471_101059186	Not Available	977
3300032180|Ga0307471_104192168	All Organisms → cellular organisms → Bacteria	509
3300032211|Ga0310896_10566119	Not Available	630
3300033550|Ga0247829_10102515	Not Available	2155
3300033551|Ga0247830_10730537	Not Available	787
3300033551|Ga0247830_10827823	All Organisms → cellular organisms → Bacteria	737
3300034115|Ga0364945_0200310	Not Available	609
3300034149|Ga0364929_0071664	All Organisms → cellular organisms → Bacteria	1070
3300034664|Ga0314786_190104	Not Available	504




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	8.73%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	7.94%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	7.94%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	6.35%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	5.56%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	5.56%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	5.56%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	3.97%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	3.17%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	3.17%
Natural And Restored Wetlands	Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands	2.38%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	2.38%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	2.38%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	2.38%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	2.38%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere	2.38%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	2.38%
Soil	Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil	1.59%
Surface Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil	1.59%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	1.59%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	1.59%
Corn Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere	1.59%
Sediment	Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment	1.59%
Host-Associated	Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated	1.59%
Arabidopsis Thaliana Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere	1.59%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere	1.59%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere	1.59%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere	1.59%
Groundwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment	0.79%
Wetland Sediment	Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment	0.79%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	0.79%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	0.79%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	0.79%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere	0.79%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	0.79%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	0.79%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere	0.79%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere	0.79%




Associated Samples

Taxon OID	Sample Name	Habitat Type
2088090014	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000364	Soil microbial communities from Great Prairies - Iowa, Native Prairie soil	Environmental
3300000559	Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly	Environmental
3300000881	Soil microbial communities from Great Prairies - Wisconsin Restored Prairie soil	Environmental
3300001139	Soil microbial communities from Great Prairies - Wisconsin, Switchgrass soil	Environmental
3300004022	Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1	Environmental
3300004025	Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1	Environmental
3300004156	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1	Environmental
3300004157	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2	Environmental
3300004463	Combined assembly of Arabidopsis thaliana microbial communities	Host-Associated
3300004479	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs	Environmental
3300004480	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4	Environmental
3300004643	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3	Environmental
3300004798	Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - roots SR-2 (Metagenome Metatranscriptome)	Host-Associated
3300004800	Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-1 (Metagenome Metatranscriptome)	Host-Associated
3300005093	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks	Environmental
3300005104	Soil and rhizosphere microbial communities from Laval, Canada - mgHAC	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005334	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2	Host-Associated
3300005336	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG	Environmental
3300005340	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG	Environmental
3300005354	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG	Host-Associated
3300005444	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005447	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138	Environmental
3300005526	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1	Environmental
3300005543	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG	Host-Associated
3300005545	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG	Environmental
3300005561	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148	Environmental
3300005764	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)	Environmental
3300005841	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2	Host-Associated
3300006049	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1	Host-Associated
3300006579	Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLAB (Metagenome Metatranscriptome)	Environmental
3300006604	Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLMB (Metagenome Metatranscriptome)	Environmental
3300006844	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2	Host-Associated
3300006846	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4	Host-Associated
3300006854	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4	Host-Associated
3300006871	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3	Host-Associated
3300006881	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2	Host-Associated
3300006904	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3	Host-Associated
3300006954	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control	Environmental
3300007004	Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost	Environmental
3300007076	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4	Host-Associated
3300009094	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)	Host-Associated
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300009147	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)	Host-Associated
3300009610	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700	Environmental
3300009802	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60	Environmental
3300010047	Tropical forest soil microbial communities from Panama - MetaG Plot_30	Environmental
3300010147	Soil microbial communities from California, USA to study soil gas exchange rates - BB-CA-RED metaT (Metagenome Metatranscriptome)	Environmental
3300010359	Tropical forest soil microbial communities from Panama - MetaG Plot_15	Environmental
3300010373	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4	Environmental
3300010375	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG	Host-Associated
3300010396	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2	Environmental
3300010397	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300010400	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2	Environmental
3300010401	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1	Environmental
3300010403	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3	Environmental
3300011398	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT600_2	Environmental
3300011420	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT199_2	Environmental
3300011440	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT840_2	Environmental
3300012212	Combined assembly of Hopland grassland soil	Host-Associated
3300012469	Combined assembly of Soil carbon rhizosphere	Host-Associated
3300012486	Unplanted soil (control) microbial communities from North Carolina - M.Soil.8.old.120510	Environmental
3300012505	Arabidopsis rhizosphere microbial communities from North Carolina - M.Col.10.yng.090610	Host-Associated
3300012685	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG	Environmental
3300012958	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG	Environmental
3300012960	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG	Environmental
3300012987	Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG	Environmental
3300015371	Combined assembly of cpr5 and col0 rhizosphere and soil	Host-Associated
3300015374	Col-0 rhizosphere combined assembly	Host-Associated
3300018083	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1	Environmental
3300018469	Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T	Environmental
3300021082	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redo	Environmental
3300024177	Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK27	Environmental
3300024224	Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14	Environmental
3300025318	Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1	Environmental
3300025521	Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1 (SPAdes)	Environmental
3300025913	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)	Host-Associated
3300025917	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)	Environmental
3300025949	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes)	Host-Associated
3300026035	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)	Host-Associated
3300026542	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)	Environmental
3300027573	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 (SPAdes)	Environmental
3300027778	Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh (SPAdes)	Environmental
3300027821	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)	Environmental
3300027873	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)	Host-Associated
3300028792	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S	Environmental
3300028812	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48	Environmental
3300028814	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183	Environmental
3300031184	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_S	Environmental
3300031538	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1	Environmental
3300031720	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515	Environmental
3300031740	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05	Environmental
3300031820	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515	Environmental
3300031908	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1	Environmental
3300032075	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3	Environmental
3300032174	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05	Environmental
3300032180	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515	Environmental
3300032211	Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1	Environmental
3300033550	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4	Environmental
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300034115Sediment microbial communities from East River floodplain, Colorado, United States - 29_s17EnvironmentalOpen in IMG/M
3300034149Sediment microbial communities from East River floodplain, Colorado, United States - 20_j17EnvironmentalOpen in IMG/M
3300034664Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPIPI_00414330 | 2088090014 | Soil | MARSSEARDDGARAAIQAILHELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRPGSAS
INPhiseqgaiiFebDRAFT_1017799912 | 3300000364 | Soil | MARSSEARDDGARAAIQAILHELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRPGSAS*
F14TC_1025850862 | 3300000559 | Soil | MATKLRAARDDVQRATFKAILSELPEDHPARLAYRAGADPIELMNLVEREDLAEKLNQAWLDWYSTRLRLQRTGRVA*
JGI10215J12807_11233022 | 3300000881 | Soil | MARSHEARDEQKAAYMAILGELPEDHPARAAYTAGADPIQFMHLVEREDLVEKLNQAWLDWYATRLRRQRAGSAS*
JGI10220J13317_107347302 | 3300001139 | Soil | MARSHEARDEQKAAYMAILGELPEDHPARAAYTAGADPIQFMHLVEREDLVEKLNQAWLDWYATRLRR
Ga0055432_101880042 | 3300004022 | Natural And Restored Wetlands | TMAKSSAARDDSQRAAFKAILSELPEDHPARAAYVAGADAIQLMNLVEREDLVEKLNQAWLDWYFTRLRLQRTGGAA*
Ga0055433_100486812 | 3300004025 | Natural And Restored Wetlands | MAKSSAARDDSQRAAFKAILSELPEDHPARAAYVAGADAIQLMNLVEREDLVEKLNQAWLDWYFTRLRLQRTGGAA*
Ga0062589_1005935562 | 3300004156 | Soil | MARPSEARDDEQRAAFQAILGELPEDHPAREAYRAGADPMRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAL*
Ga0062589_1011579322 | 3300004156 | Soil | MAISHKAHDDRTRAAIQAILHELPEDHPAHEAYRSGADPIRLINLVEREDVAEKLNQAWQDWYTRRLLRQRR*
Ga0062590_1000280291 | 3300004157 | Soil | MARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG*
Ga0063356_1019961602 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MPKSLEAHDDEERAAIQAILDELPADHPAREAHRAGADAIRLTHLVEREDLVEKLTQAWLDGYGRLLRRQGGFRPNVGAPRHE*
Ga0063356_1048433782 | 3300004463 | Arabidopsis Thaliana Rhizosphere | DDEERAAIQAILGELPENHPAREAYNAGANPIQLVNLVDREDLAEMLNHAWVDWYTRRLRRQRRGSVT*
Ga0062595_1017415791 | 3300004479 | Soil | CMARSQWARDDDARAAIQAILRELPEDHPAREAYRTGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0062592_1016574462 | 3300004480 | Soil | MARPHEARDDEERAAIQAILGELPENHPAREAYNAGANPIQLVNLVDREDLAEMLNHAWVDWYTRRLRRQRRGSVT*
Ga0062591_1006808202 | 3300004643 | Soil | MARSNEARDDEARAAIQAILHELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRQGSAL*
Ga0058859_117962431 | 3300004798 | Host-Associated | KKGLGAGMARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG*
Ga0058861_104731882 | 3300004800 | Host-Associated | LGAGMARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG*
Ga0062594_1000154874 | 3300005093 | Soil | MARSQWARDDDARAAIQAILRELPEDHPAREAYRTGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0066818_10094471 | 3300005104 | Soil | MATSNEARDDGARAAIQAILHELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAP*
Ga0066388_1000539142 | 3300005332 | Tropical Forest Soil | MSRSYVEARDEEQRAAIQAILGELPEDHPAREAYRAGADPIRLIALVEREDLVEKLNQAWMDWYTRRLQRQRSGNAA*
Ga0066388_1015088372 | 3300005332 | Tropical Forest Soil | MARSHGAHDDNEARAAIQAILGELPEDHPAREAYRAGADPIRLINLVDREDLAEKLNQAWLDWYSTRLRQQRAGSAS*
Ga0068869_1004245191 | 3300005334 | Miscanthus Rhizosphere | MARSQWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0070680_1007377662 | 3300005336 | Corn Rhizosphere | MARSHEARDEQKAAYMAILGELPEDHPARAAYTAGADPIQFMHLVEREDLVEKLNQAWLDWYATRLRRQRAGSTS*
Ga0070689_1002082771 | 3300005340 | Switchgrass Rhizosphere | GHKKIGGCMARSQWARDDDARAAIQAILRELPEDHPAREAYRTGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0070675_1020660201 | 3300005354 | Miscanthus Rhizosphere | RDDDARAAIQAILRELPEDHPAREAYRAGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0070694_1008572442 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MARPHEAHDEEERVAIQAILGELPENHPAREAYNAGANSIQLMNLVDREDLAEKPNQAWVDWYTRRLRRQRRGSAT*
Ga0070708_1001078414 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTR
Ga0066689_106868632 | 3300005447 | Soil | MARPNQAHDDEQRAAIQAILGELPRDHPAREAYEAGADAIELTHLVDREDLAEKLNQAWLDWYARRLQRQRP*
Ga0073909_101228002 | 3300005526 | Surface Soil | MATSHEARDDGARAAIQAILHELPEDHPAREAYRAGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAA*
Ga0070672_1002074651 | 3300005543 | Miscanthus Rhizosphere | GMARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG*
Ga0070695_1004946312 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MARPHEAHDEEERVAIQAILGELPENHPAREAYNAGANPIQLVNFVDREDLAEKLNHAWVDWYTRRLRRQRKGSVT*
Ga0066699_111626842 | 3300005561 | Soil | MARPTQVHDDEQRAAIQAILGELPNDHPAHEAYKAGADPIELTSLVDREDLVEKLNQAWLDWYARRLRRQRP*
Ga0066903_1032904491 | 3300005764 | Tropical Forest Soil | MGQDHEARVAIRAILRELPEDHPAWEADRAGADVTRLINLVEREDLVEKLNQAWQDWYTGRLQRQRLGSAG*
Ga0068863_1000951502 | 3300005841 | Switchgrass Rhizosphere | MARSNEAHDDGARAAIQAILYELPEDHPAREAYRSGADPIRLMNLVEREDLAEKLNQAWQDWYTRRLLRQRRGTAS*
Ga0068863_1004429011 | 3300005841 | Switchgrass Rhizosphere | KRWGRCMARPSEARDDEQRAAFQAILGELPEDHPAREAYRAGADPMRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAL*
Ga0075417_100161022 | 3300006049 | Populus Rhizosphere | MARSDEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRRGTAS*
Ga0074054_119775832 | 3300006579 | Soil | MARSNEARDDGARAAIQAILYELPEDHPAREAYRAGADPIRLINLVDREDLAEQLNQAWQDWYTRRLLRQRQGSAP*
Ga0074060_111095951 | 3300006604 | Soil | MARSNEARDDGARAAIQAILYELPEDHPAREAYRAGADPIRLINLVDREDLAEQLNQAWQDWYTRRLLRQRQGRAP*
Ga0075428_1013317922 | 3300006844 | Populus Rhizosphere | MAKLAARDDAQRATFKSILSELPEDHPARAAYRAGADAIRLMNLVDREDLVEKLNQAWLDWYSTRLRLQRAGRPA*
Ga0075430_1010916582 | 3300006846 | Populus Rhizosphere | MTRPHEAHDDEQQAAIRAILGELPEGHPAREAYDAGANPIQLMSLVEREDLVEKLNQAWLDWYTRRLRQAQPKRGLTLG*
Ga0075425_1007553003 | 3300006854 | Populus Rhizosphere | MARSQWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLVTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0075434_1019071312 | 3300006871 | Populus Rhizosphere | MARSHEARDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVEREDLGEKLNQAWQDWYTRRLLRQRWGSAGQ*
Ga0068865_1016404891 | 3300006881 | Miscanthus Rhizosphere | MARSHWARDDDARAAIQAILRELPEDHPAREAYRDGTDPARLITLVEREDLVEKLNQAWLDWYSRRLRWQRAGTAL*
Ga0075424_1009946202 | 3300006904 | Populus Rhizosphere | MARSDEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLMNLVEREDLAEKLNQAWQDWYTRRLLRQRWGSAGQ*
Ga0079219_108224372 | 3300006954 | Agricultural Soil | MARSQWARDDDARAAIQAILRELPEDHPAHEAYRAGADPTRLVTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0079218_110434002 | 3300007004 | Agricultural Soil | MPRSQRARDDGQQAAIRVILSELPEDHPARAAHRGGADAIELTHLVGREDLAEKLSQAWLDWYTARVRRA*
Ga0075435_1008992222 | 3300007076 | Populus Rhizosphere | MARSHEARDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRWGSAGQ*
Ga0111539_102799463 | 3300009094 | Populus Rhizosphere | MARPNEARDDGARAAIQAILHELPEDHPAREAYRSGADPIRLINLVEREDVAEKLNQAWQDWYTSRLLRQRLGGAT*
Ga0111539_115272422 | 3300009094 | Populus Rhizosphere | MARSNEAHDDGARAAIQAILYELPEDHPAREAYRSGADPIRLMNLVEREDLAEKLNQAWQDWYTRRLLRQRWGSAGQ*
Ga0066709_1010869802 | 3300009137 | Grasslands Soil | MARPTQVHDDEQRAAIQAILGELPKDHPAHEAYKAGADPIELTYLVDREDLVEKLNQAWLDWYARRLRRQRP*
Ga0066709_1011674751 | 3300009137 | Grasslands Soil | HEARDDEEMAAIQAILCELPQDHPAREAYAAGADPIQLTYLVDREDLVEKLNQAWLDWYSRRLRRQRKGSAS*
Ga0066709_1027578862 | 3300009137 | Grasslands Soil | MARPNQAHDDEQRSAIQAILGELPRDHPAREAYEAGADAIELTHLVDREDLAEKLNQAWLDWYARRLQRQRP*
Ga0114129_119895762 | 3300009147 | Populus Rhizosphere | MARPTQAHDDEQRAAIHAILRELPEDHPAREAYNAGADAIELTHLVDREDLAEKLNQAWLDWYARRLHLQRS*
Ga0105340_10094495 | 3300009610 | Soil | MAKSRAARDDVQRATFKAILSELPEDHPARAAYRAGADPIQLMNLVEREDLAEKLNQAWLDWYSTQLRLQRTGRAV*
Ga0105073_10329521 | 3300009802 | Groundwater Sand | MRSHEARDDEQKAAIQAILRELPEDHPAREAYAAGADPIRLTHLVDREDLVEKLNQAWLDWYTRRLRRQRRGSAS*
Ga0126382_114302362 | 3300010047 | Tropical Forest Soil | MARSHWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLITLVEREDLVEKLNQAWLDWYSTRLRQQRAGSVG*
Ga0126382_118233242 | 3300010047 | Tropical Forest Soil | MAQSHEAGNDEERAAIQAILGELPEDHPARQAYRAGADPIELTHLIDREDLVEKLNQAWVDWYTKRLRSQLRSSAA*
Ga0126319_11762101 | 3300010147 | Soil | MARPREGRDDEQKAAIQAILCELPEDHPARQAYRAGADPIELTHLVDREDLAEKLNDAWLDWYTRRLRRQRLGSTA*
Ga0126319_11914932 | 3300010147 | Soil | WGRGMARSNEARDDGARAAIQAILYELPEDHPAREAYRAGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRQGSAV*
Ga0126376_120239581 | 3300010359 | Tropical Forest Soil | RDNDARLAIQAILRELPEDHPAREAYRAGADPTRLITLVDREDLVEKLNQAWIDWYSSRLRQQRADSVC*
Ga0134128_101841631 | 3300010373 | Terrestrial Soil | MARSQWARDDDARAAIQAILRELPGDPPAREAYRTGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGN
Ga0105239_108532921 | 3300010375 | Corn Rhizosphere | SQWARDDDARAAIQAILRELPEDHPAREAYRTGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0134126_109295072 | 3300010396 | Terrestrial Soil | MARSQWARDDDARAAIQAILRELPEDHPAREAYRTGADPTRLLTLVDREDLVEKLNQAWLDCYSRRLRWQRAGNVL*
Ga0134124_113054041 | 3300010397 | Terrestrial Soil | HDDRTRAAIQAILHELPEDHPAHEAYRSGADPIRLINLVEREDVAEKLNQAWQDWYTRRLLRQRR*
Ga0134124_131584881 | 3300010397 | Terrestrial Soil | AAVKAILGELPDDHPAREAYEAGVNPTRLMSLVEREDLVEKLNQAWLDWYTRRLLQQRSAHEG*
Ga0126383_133842362 | 3300010398 | Tropical Forest Soil | MARSHAARDDGARAAIRAILRELPEDHPAREADRAGADVTRLINLVEREDLVEKLNQAWQDWYTGRLQRQRLGSAG*
Ga0134122_100067209 | 3300010400 | Terrestrial Soil | MATSNEARDDGARAAIQAILHELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAL*
Ga0134122_105968512 | 3300010400 | Terrestrial Soil | MARSQWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGTVL*
Ga0134121_103885172 | 3300010401 | Terrestrial Soil | MARPHEAHDEEERVAIQAILGELPENHPAREAYNAGANPIQLVNFVDREDLAEKLNHAWVDWYTRRLRRQRKGS
Ga0134123_105082602 | 3300010403 | Terrestrial Soil | MARSQWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQR
Ga0134123_117480952 | 3300010403 | Terrestrial Soil | MARPHEAQDEEERFAIQAILGELPENHPAREAYNAGANPIQLVNFVDREDLAEKLNHAWVDWYTRRLRRQRRGSVT*
Ga0134123_119977472 | 3300010403 | Terrestrial Soil | MARSHWARDDDARAAIQAILHELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAL*
Ga0137348_10975462 | 3300011398 | Soil | MAKSRAAHDDAQRATFKAILSELPEDHPARAAYRAGADPIQLMNLVEREDLAEKLNQAWLDWYSTQLRLQRTGRAV*
Ga0137314_10330131 | 3300011420 | Soil | MARHEARDNEERAAIQTILHELPESHPAREAYEAGADAIELMSLVDREDLAEKLNQAWLDWYSRRLQRQRKAGMA*
Ga0137433_12422072 | 3300011440 | Soil | MAKLRAARDDVQRATFKAILSELPEDHPARAAYRAGADPIQLMNLVEREDLAEKLNQAWLDWYSTQLRLQRTGRAV*
Ga0150985_1036700402 | 3300012212 | Avena Fatua Rhizosphere | MAISHEARDGARAAIQAILYELPEDHPAREAYRSGVDPIRLINLVDREDLVEKLNQAWQDWYTRRLLRQRR*
Ga0150985_1070890022 | 3300012212 | Avena Fatua Rhizosphere | MARPREARDDEQRAAIQAILGELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRLGRAS*
Ga0150985_1091861022 | 3300012212 | Avena Fatua Rhizosphere | MAISHKAHDDGTRAAIQAILHELPEDHPAREAYRSGADPIRLINLVDREDVAEKLNQAWQDWYTRRLLRQRR*
Ga0150984_1020171231 | 3300012469 | Avena Fatua Rhizosphere | MERPRDARDDEQRLAIHAILAELPEDHPAREAYRAGADPIRLVNLVDREDLAEKLNQAWQDWYTRRLLRQRQGSAA*
Ga0157331_10184842 | 3300012486 | Soil | MARSNEAHDDGARAAIQAILYELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRRGTAS*
Ga0157339_10201421 | 3300012505 | Arabidopsis Rhizosphere | MARSNEAHDEGARAAIQAILYELPEDHPAREAYRSGADPIRLMNLVEREDLAEKLNQAWQDWYTRRLLRQRRGTAS*
Ga0137397_102073042 | 3300012685 | Vadose Zone Soil | MARSNEARDDGARAAIQAILYELPEDHPAREAYRAGADPIRLINLVEREDLAEKLNHAWQDWYTRRLLRQRQGSAV*
Ga0164299_102669312 | 3300012958 | Soil | MATSHEARDDGARAAIQAILHELPEDHPAREAYRAGADPIRLINLVDREDLADKLNQAWQDWYTRRLLRQRQGSAA*
Ga0164301_111736902 | 3300012960 | Soil | MARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRRGSAG*
Ga0164307_113913632 | 3300012987 | Soil | MARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLR
Ga0132258_125575093 | 3300015371 | Arabidopsis Rhizosphere | DDARAAIQAVLRELPEDHPAREAYRAGADPTRLITLVDREDLVEKLNQAWLDWYSTRLRQQRASRAV*
Ga0132258_127105762 | 3300015371 | Arabidopsis Rhizosphere | MARSQWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLITLVEREDLVEKLNQAWLDWYSRRLRWQRAGNVL*
Ga0132255_1022882552 | 3300015374 | Arabidopsis Rhizosphere | MARSDEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQLRGAS*
Ga0184628_100174904 | 3300018083 | Groundwater Sediment | MATSHEARDDGARAAIQAILYELPEDHPAREAYRAGADPIRLINLVDREDLAEQLNQAWQDWYTRRLLRQRQGSAM
Ga0190270_133456681 | 3300018469 | Soil | MTKVRAARDGIHRATFKSILSELPEDHPARAAYRAGADAIRLMNLVDREDLLEKLNQAWLDWYSTRLRLQRTGRAA
Ga0210380_101484993 | 3300021082 | Groundwater Sediment | IKKAWGRGMATSHEARDDGARAAIQAILYELPEDHPAREAYRAGADPIRLINLVDREDLAEQLNQAWQDWYTRRLLRQRQGSAM
Ga0247686_10025212 | 3300024177 | Soil | MARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG
Ga0247673_10221581 | 3300024224 | Soil | MARSQWARDDDARAAIQAVLRELPEGHPAREAYRAGADPTRLITLVDREDLVEKLNQAWLDWYSTRLRQQRAGRAL
Ga0209519_100284823 | 3300025318 | Soil | MRRRSEARDDEQRAAIQAILGELPQDHPAVQAFRAGADVIRLVHLVEREDLADKLHHVWLDWYARRLRRQPTQPIDKPVT
Ga0210083_10237822 | 3300025521 | Natural And Restored Wetlands | MAKSSAARDDSQRAAFKAILSELPEDHPARAAYVAGADAIQLMNLVEREDLVEKLNQAWLDWYFTRLRLQRTGGAA
Ga0207695_103228801 | 3300025913 | Corn Rhizosphere | KGLGAGMARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG
Ga0207660_108780732 | 3300025917 | Corn Rhizosphere | MARSHEARDEQKAAYMAILGELPEDHPARAAYTAGADPIQFMHLVEREDLVEKLNQAWLDWYATRLRRQRAGSTS
Ga0207667_120365592 | 3300025949 | Corn Rhizosphere | MARSQWARDDDARAAIQAILRELPEDHPAREAYRAGADPTRLLTLVDREDLVEKLNQAWLDWYSRRLRWQRAGNVL
Ga0207703_100164821 | 3300026035 | Switchgrass Rhizosphere | RAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSAG
Ga0209805_13338771 | 3300026542 | Soil | MARPTQVHDDERRAAIQAILGELPNDHPAHEAYKAGADPIELTSLVDREDLVEKLNQAWLDWYARRLRRQRP
Ga0208454_11719391 | 3300027573 | Soil | MARHEARDNEERAAIQTILHELPESHPAREAYEAGADAIELMSLVDREDLAEKLNQAWLDWYSRRLQRQRKAGMA
Ga0209464_100371242 | 3300027778 | Wetland Sediment | MAKSRAARDDEQRAAVKAILCELPEEHPAREAYRAGADPVRVISLVEREDLVEKLYQAWLDSYTS
Ga0209811_100127803 | 3300027821 | Surface Soil | MATSHEARDDGARAAIQAILHELPEDHPAREAYRAGADPIRLINLVDREDLADKLNQAWQDWYTRRLLRQRQGSAA
Ga0209814_100155522 | 3300027873 | Populus Rhizosphere | MARSDEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRRGTAS
Ga0307504_100139132 | 3300028792 | Soil | MARSHGARDDDGARAAIQAILRELPEDHPAREAYRAGAAPTRLINLVDREDLAEKLNQAWQDWYTRRLLRQRWGSAS
Ga0247825_101310962 | 3300028812 | Soil | MAKSSAARDDSQRAAFKAILSELPEDHPARAAYVAGADAIQLMNLVEREDLVEKLNQAWLDWYSTRLRLQRTGGAA
Ga0247825_101735221 | 3300028812 | Soil | MGTRNVRDREGRAAIKAIVSELPKNHPARAAYEAGADAMRLMSLVDREDLAEKLNQAWLDWYSRRLLMQRRANAA
Ga0307302_103398172 | 3300028814 | Soil | TRDDEARAAIQAIVDELPDGHPAREAYKAGADAIELTYLVGREDLAEKLNQAWLEWYARRLRQRLSAP
Ga0307499_100011096 | 3300031184 | Soil | MTRPREARDDEQRAAIQAILGELPEDHPAREAYRSGADPIRLINLVDREDLAEKLNQAWQDWYTRRLLRQRSGRAS
Ga0310888_101965892 | 3300031538 | Soil | MARSDEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRRGAS
Ga0307469_105288912 | 3300031720 | Hardwood Forest Soil | MAKLRAARDDAQRATFKAILSELPEDHPARAAYSAGADPIELMNLVEREDLAEKLNQAWLDWYSTRLRLQRTGRVA
Ga0307468_1000266765 | 3300031740 | Hardwood Forest Soil | MAKLRAARDDVQRATFKAILSELPEDHPARVAYRAGADPIELMNLVEREDLAEKLNQAWLDWYSTRLLLQRTGRVA
Ga0307468_1005482122 | 3300031740 | Hardwood Forest Soil | MARPGEARDDEQRAAIQAILGELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRLGRAS
Ga0307473_100142025 | 3300031820 | Hardwood Forest Soil | MARSHEARDHGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLAEKLNQAWQDWYMSRLLRQRRGSAA
Ga0310900_102604132 | 3300031908 | Soil | MARSHEVRDDGARAAIQAILYELPEDHPAREAYKAGADPIRLINLVDREDLSEKLNQAWQDWYTRRLLRQRWGSA
Ga0310890_116537312 | 3300032075 | Soil | MATSHEARDDGARAAIQAILHELPEDHPAREAYRAGADPIRLINLVDREDLADKLNQAWQDWYTRR
Ga0307470_100357993 | 3300032174 | Hardwood Forest Soil | MATKLRAARDDVQRATFKAILSELPEDHPARVAYRAGADPIELMNLVEREDLAEKLNQAWLDWYSTRLRLQRTGRVA
Ga0307471_1010591861 | 3300032180 | Hardwood Forest Soil | MATKLRAARDDVQRATFKAILSELPENHPARVAYRAGADPIELMNLVEREDLAEKLNQAWLDWYSTRLRLQRT
Ga0307471_1041921681 | 3300032180 | Hardwood Forest Soil | MARSNEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLINLVDREDVAEKLNQAWQDWYTRRLLRQRLGSAS
Ga0310896_105661192 | 3300032211 | Soil | QKAWGRGMARSDEARDDGARAAIQSILHELPEDHPAREAYRSGADPIRLINLVEREDLAEKLNQAWQDWYTRRLLRQRRGAS
Ga0247829_101025151 | 3300033550 | Soil | KSSAARDDSQRAAFKAILSELPEDHPARAAYVAGADAIQLMNLVEREDLVEKLNQAWLDWYSTRLRLQRTGGAA
Ga0247830_107305372 | 3300033551 | Soil | MAKSSAARDDSQRVAFKAILSELPEDHPARAAYVAGADAIQLMNLVEREDLVEKLNQAWLDWYSTRLRLQRTGGAA
Ga0247830_108278232 | 3300033551 | Soil | MAKLRAARDDEQRATFKAILSELPEDHPARAAYGAGADPIQLMYLVEREDLVEKLNQAWLDWYSTQLQLQRAGRAA
Ga0364945_0200310_131_361 | 3300034115 | Sediment | MAKSRAARDDVQRATFKAILSELPEDHPARAAYRAGADPIQLMNLVEREDLAEKLNQAWLDWYSTQLRLQRTGRAV
Ga0364929_0071664_32_262 | 3300034149 | Sediment | MAKSRAAHDDAQRATFKAILSELPEDHPARAAYRAGADPIQLMNLVEREDLAEKLNQAWLDWYSTQLRLQRTGRAV
Ga0314786_190104_271_501 | 3300034664 | Soil | MARPGEARDDEQTAAIQAILHELPEDHPARAAYRAGADPIRLINLVDREDLADKLNQAWQDWYTRRLLRQRQGSAL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice