NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F098843

Metagenome Family F098843

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F098843
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 103 residues
Representative Sequence MDVLTEAQVAKVYELTDGLLLNRDWVVVPLVGSPRGMEMLMPDGKVLIRPAGGDRFDAWFADLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYV
Number of Associated Samples 84
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 39.81 %
% of genes near scaffold ends (potentially truncated) 33.01 %
% of genes from short scaffolds (< 2000 bps) 79.61 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.67

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (66.990 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere
(12.621 % of family members)
Environment Ontology (ENVO) Unclassified
(25.243 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(30.097 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 33.08%    β-sheet: 10.77%    Coil/Unstructured: 56.15%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.67
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF07859Abhydrolase_3 15.53
PF02082Rrf2 15.53
PF08669GCV_T_C 8.74
PF01408GFO_IDH_MocA 3.88
PF13453zf-TFIIB 3.88
PF00135COesterase 2.91
PF00266Aminotran_5 1.94
PF09339HTH_IclR 1.94
PF00355Rieske 0.97
PF03588Leu_Phe_trans 0.97
PF06491Disulph_isomer 0.97
PF09471Peptidase_M64 0.97
PF01592NifU_N 0.97
PF10518TAT_signal 0.97
PF01458SUFBD 0.97
PF14446Prok-RING_1 0.97
PF00072Response_reg 0.97
PF00583Acetyltransf_1 0.97
PF00440TetR_N 0.97
PF02634FdhD-NarQ 0.97
PF00005ABC_tran 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG0640DNA-binding transcriptional regulator, ArsR familyTranscription [K] 15.53
COG0657Acetyl esterase/lipaseLipid transport and metabolism [I] 15.53
COG1414DNA-binding transcriptional regulator, IclR familyTranscription [K] 15.53
COG1725DNA-binding transcriptional regulator YhcF, GntR familyTranscription [K] 15.53
COG1959DNA-binding transcriptional regulator, IscR familyTranscription [K] 15.53
COG2186DNA-binding transcriptional regulator, FadR familyTranscription [K] 15.53
COG2188DNA-binding transcriptional regulator, GntR familyTranscription [K] 15.53
COG2378Predicted DNA-binding transcriptional regulator YobV, contains HTH and WYL domainsTranscription [K] 15.53
COG2524Predicted transcriptional regulator, contains C-terminal CBS domainsTranscription [K] 15.53
COG2272Carboxylesterase type BLipid transport and metabolism [I] 2.91
COG0719Fe-S cluster assembly scaffold protein SufBPosttranslational modification, protein turnover, chaperones [O] 0.97
COG0822Fe-S cluster assembly scaffold protein IscU, NifU familyPosttranslational modification, protein turnover, chaperones [O] 0.97
COG1526Formate dehydrogenase assembly factor FdhD, a sulfurtransferaseEnergy production and conversion [C] 0.97
COG2360Leu/Phe-tRNA-protein transferasePosttranslational modification, protein turnover, chaperones [O] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms66.99 %
UnclassifiedrootN/A33.01 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000364|INPhiseqgaiiFebDRAFT_100700893All Organisms → cellular organisms → Bacteria2614Open in IMG/M
3300000956|JGI10216J12902_109255951All Organisms → cellular organisms → Bacteria1413Open in IMG/M
3300004157|Ga0062590_100992355All Organisms → cellular organisms → Bacteria797Open in IMG/M
3300004463|Ga0063356_100518372All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium1583Open in IMG/M
3300004463|Ga0063356_101298946Not Available1064Open in IMG/M
3300005337|Ga0070682_101431064Not Available592Open in IMG/M
3300005340|Ga0070689_101640693All Organisms → cellular organisms → Bacteria584Open in IMG/M
3300005444|Ga0070694_100935965All Organisms → cellular organisms → Bacteria717Open in IMG/M
3300005536|Ga0070697_100518601All Organisms → cellular organisms → Bacteria1043Open in IMG/M
3300005536|Ga0070697_101149869All Organisms → cellular organisms → Bacteria691Open in IMG/M
3300005537|Ga0070730_10042062All Organisms → cellular organisms → Bacteria3360Open in IMG/M
3300005538|Ga0070731_10001932All Organisms → cellular organisms → Bacteria20544Open in IMG/M
3300005538|Ga0070731_10191712All Organisms → cellular organisms → Bacteria → Proteobacteria1357Open in IMG/M
3300005538|Ga0070731_10878925All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300005544|Ga0070686_100575047All Organisms → cellular organisms → Bacteria884Open in IMG/M
3300005545|Ga0070695_101329458Not Available594Open in IMG/M
3300005615|Ga0070702_101027612Not Available654Open in IMG/M
3300005618|Ga0068864_100308576All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1483Open in IMG/M
3300005829|Ga0074479_10164705All Organisms → cellular organisms → Bacteria1166Open in IMG/M
3300005829|Ga0074479_10819044All Organisms → cellular organisms → Bacteria1231Open in IMG/M
3300005829|Ga0074479_11154875All Organisms → cellular organisms → Bacteria1641Open in IMG/M
3300005836|Ga0074470_11692104All Organisms → cellular organisms → Bacteria90518Open in IMG/M
3300006046|Ga0066652_100071132All Organisms → cellular organisms → Bacteria2710Open in IMG/M
3300006845|Ga0075421_101921906Not Available633Open in IMG/M
3300006854|Ga0075425_100768356All Organisms → cellular organisms → Bacteria1105Open in IMG/M
3300006871|Ga0075434_100428390All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes1344Open in IMG/M
3300006871|Ga0075434_101243772Not Available756Open in IMG/M
3300006904|Ga0075424_100849197All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes974Open in IMG/M
3300006904|Ga0075424_102829974Not Available505Open in IMG/M
3300006954|Ga0079219_10825973Not Available734Open in IMG/M
3300007072|Ga0073932_1182397All Organisms → cellular organisms → Bacteria840Open in IMG/M
3300007076|Ga0075435_102011774Not Available507Open in IMG/M
3300009012|Ga0066710_101013334All Organisms → cellular organisms → Bacteria1282Open in IMG/M
3300009094|Ga0111539_10251619All Organisms → cellular organisms → Bacteria2057Open in IMG/M
3300009100|Ga0075418_10560313Not Available1229Open in IMG/M
3300009146|Ga0105091_10082284All Organisms → cellular organisms → Bacteria1460Open in IMG/M
3300009147|Ga0114129_11176393All Organisms → cellular organisms → Bacteria956Open in IMG/M
3300009162|Ga0075423_11986114All Organisms → cellular organisms → Bacteria629Open in IMG/M
3300010391|Ga0136847_11211440All Organisms → cellular organisms → Bacteria2237Open in IMG/M
3300010399|Ga0134127_10579778All Organisms → cellular organisms → Bacteria1147Open in IMG/M
3300010399|Ga0134127_12441940Not Available602Open in IMG/M
3300010400|Ga0134122_10150016All Organisms → cellular organisms → Bacteria → Proteobacteria1895Open in IMG/M
3300010400|Ga0134122_10198251All Organisms → cellular organisms → Bacteria1664Open in IMG/M
3300010400|Ga0134122_11232874Not Available751Open in IMG/M
3300010401|Ga0134121_12765919Not Available536Open in IMG/M
3300010403|Ga0134123_10576687All Organisms → cellular organisms → Bacteria1076Open in IMG/M
3300010403|Ga0134123_13176856All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium528Open in IMG/M
3300011403|Ga0137313_1063749Not Available655Open in IMG/M
3300011435|Ga0137426_1049418All Organisms → cellular organisms → Bacteria1100Open in IMG/M
3300011437|Ga0137429_1120761All Organisms → cellular organisms → Bacteria804Open in IMG/M
3300012231|Ga0137465_1094487Not Available895Open in IMG/M
3300012899|Ga0157299_10164261All Organisms → cellular organisms → Bacteria639Open in IMG/M
3300012913|Ga0157298_10184756All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium655Open in IMG/M
3300012944|Ga0137410_10000970All Organisms → cellular organisms → Bacteria19749Open in IMG/M
3300012971|Ga0126369_13558168Not Available510Open in IMG/M
3300015199|Ga0167647_1068344All Organisms → cellular organisms → Bacteria947Open in IMG/M
3300017961|Ga0187778_10047792All Organisms → cellular organisms → Bacteria2612Open in IMG/M
3300017965|Ga0190266_10465835All Organisms → cellular organisms → Bacteria725Open in IMG/M
3300017965|Ga0190266_11029990All Organisms → cellular organisms → Bacteria554Open in IMG/M
3300018468|Ga0066662_11929363Not Available618Open in IMG/M
3300019360|Ga0187894_10263095Not Available811Open in IMG/M
3300019458|Ga0187892_10012406All Organisms → cellular organisms → Bacteria10135Open in IMG/M
3300019487|Ga0187893_10004724All Organisms → cellular organisms → Bacteria25521Open in IMG/M
3300020140|Ga0179590_1073405All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium904Open in IMG/M
3300020202|Ga0196964_10044189All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes1871Open in IMG/M
3300021384|Ga0213876_10131196All Organisms → cellular organisms → Bacteria1331Open in IMG/M
3300021432|Ga0210384_10155969All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium2049Open in IMG/M
3300024232|Ga0247664_1173087Not Available510Open in IMG/M
3300024245|Ga0247677_1075914Not Available500Open in IMG/M
3300024284|Ga0247671_1034304All Organisms → cellular organisms → Bacteria778Open in IMG/M
3300024288|Ga0179589_10089495All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1240Open in IMG/M
3300025324|Ga0209640_10094600All Organisms → cellular organisms → Bacteria2577Open in IMG/M
3300025936|Ga0207670_11259413Not Available627Open in IMG/M
3300027857|Ga0209166_10200104All Organisms → cellular organisms → Bacteria1074Open in IMG/M
3300027869|Ga0209579_10001339All Organisms → cellular organisms → Bacteria20550Open in IMG/M
3300027907|Ga0207428_10171741All Organisms → cellular organisms → Bacteria1641Open in IMG/M
3300027909|Ga0209382_11727678Not Available613Open in IMG/M
3300031538|Ga0310888_11024314Not Available520Open in IMG/M
3300031576|Ga0247727_10488347All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium961Open in IMG/M
3300031716|Ga0310813_10003140All Organisms → cellular organisms → Bacteria9998Open in IMG/M
3300031716|Ga0310813_10007006All Organisms → cellular organisms → Bacteria7084Open in IMG/M
3300031716|Ga0310813_10036934All Organisms → cellular organisms → Bacteria3503Open in IMG/M
3300031716|Ga0310813_10115051All Organisms → cellular organisms → Bacteria2115Open in IMG/M
3300031716|Ga0310813_10256379All Organisms → cellular organisms → Bacteria1458Open in IMG/M
3300031716|Ga0310813_10419408All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes1155Open in IMG/M
3300031718|Ga0307474_10108873Not Available2075Open in IMG/M
3300031740|Ga0307468_101426771Not Available638Open in IMG/M
3300031754|Ga0307475_10719340Not Available795Open in IMG/M
3300031765|Ga0318554_10673523Not Available581Open in IMG/M
3300031768|Ga0318509_10382948Not Available787Open in IMG/M
3300031782|Ga0318552_10494777Not Available624Open in IMG/M
3300031912|Ga0306921_12552882Not Available529Open in IMG/M
3300031944|Ga0310884_10994469All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium521Open in IMG/M
3300031962|Ga0307479_10121721All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium2546Open in IMG/M
3300032039|Ga0318559_10530976Not Available549Open in IMG/M
3300032068|Ga0318553_10535003Not Available614Open in IMG/M
3300032075|Ga0310890_10780734Not Available755Open in IMG/M
3300032211|Ga0310896_10563368Not Available632Open in IMG/M
3300032421|Ga0310812_10227513All Organisms → cellular organisms → Bacteria815Open in IMG/M
3300032421|Ga0310812_10348974All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300033412|Ga0310810_10189731All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes2337Open in IMG/M
3300033805|Ga0314864_0042080All Organisms → cellular organisms → Bacteria1023Open in IMG/M
3300034268|Ga0372943_0798502All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium625Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil12.62%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere12.62%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil9.71%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil7.77%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil5.83%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.85%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.88%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)3.88%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil3.88%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil2.91%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil2.91%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere2.91%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.94%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.94%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.94%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.97%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment0.97%
Hot Spring SedimentEnvironmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment0.97%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.97%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.97%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.97%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.97%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.97%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.97%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Sand → Desert → Soil0.97%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze0.97%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.97%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.97%
Plant RootsHost-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005337Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3L metaGEnvironmentalOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300005618Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2Host-AssociatedOpen in IMG/M
3300005829Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBCEnvironmentalOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007072Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Dewar Creek DC9 2012 metaGEnvironmentalOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009146Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011403Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT166_2EnvironmentalOpen in IMG/M
3300011435Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2EnvironmentalOpen in IMG/M
3300011437Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2EnvironmentalOpen in IMG/M
3300012231Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT828_2EnvironmentalOpen in IMG/M
3300012899Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S058-202B-2EnvironmentalOpen in IMG/M
3300012913Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S043-104R-2EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300015199Arctic soil microbial communities from a glacier forefield, Storglaci?ren, Tarfala, Sweden (Sample st-2c, rock/snow interface)EnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300017965Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 TEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300020140Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020202Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10EnvironmentalOpen in IMG/M
3300021384Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9Host-AssociatedOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300024232Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK05EnvironmentalOpen in IMG/M
3300024245Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK18EnvironmentalOpen in IMG/M
3300024284Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK12EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025936Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300031538Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031765Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22EnvironmentalOpen in IMG/M
3300031768Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22EnvironmentalOpen in IMG/M
3300031782Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f20EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031944Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032039Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f21EnvironmentalOpen in IMG/M
3300032068Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f21EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032211Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033805Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_50_10EnvironmentalOpen in IMG/M
3300034268Forest soil microbial communities from Eldorado National Forest, California, USA - SNFC_MG_FRD_1.2EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10070089333300000364SoilMMMPMDQVLTEAQVSKVYELTDGLLLNRDWVVVPLIGMVDGMEMLMPDGKILVRPAGGSKFDGWFAGLKTRLESLDISRALRASQLERHYVRTPATAAPGSGARKYVK*
JGI10216J12902_10925595123300000956SoilMDDILSAERVERIYRLTDGLLLNRDWVVVPLKGSENGLEMVLPDGKLLIRPPSGPGFDTWLTGLKERLEILDLDRALRASQLERHYVRTPAAAPPGSGARKYTSWKA*
Ga0062590_10099235523300004157SoilMNILSESQVRKVYELTSSLLLNPDWVVVPLVGSPQGMEMLMPDGKILIRPAGGDAFDAWFSGLKTRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYLK*
Ga0063356_10051837223300004463Arabidopsis Thaliana RhizosphereMNVLTEAQVQKVYELTDGLLLNRDWVVVPLVGSSQGMEMLMPDGKILIRPAGGGQFDSWFADLKTRLESLDLSRALRASQLERH
Ga0063356_10129894613300004463Arabidopsis Thaliana RhizosphereMDILTEAQVRRVYELTDSLLLNRDWVVVPLVGSANGIEMLMPDGKVLIRPAGGPAFDPWFADLKMRLESLDLSRALRASQLERHYVRT
Ga0070682_10143106413300005337Corn RhizosphereMHVLSEAQVRKVYELTDALMLNPDWVVVPLVGAPQGMEMLMPDGKVLIRPAGGEAFDAWFSGLKTRLDALDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0070689_10164069323300005340Switchgrass RhizosphereDALMLNPDWVVVPLVGAPQGMEMLMPDGKVLIRPAGGDAFDAWFSGLKTRLDSLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0070694_10093596523300005444Corn, Switchgrass And Miscanthus RhizosphereMDVLSESQVRRVYELTDALLLNPDWVVVPLVGAPLGMEMLMPDGKILIRPAGGEGFEAWFSGLKTRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVQP*
Ga0070697_10051860123300005536Corn, Switchgrass And Miscanthus RhizosphereMDVLSEAQVRKVYELSDALLLNRDWVVVPLVGSPQGMEMLMPDGKILIRPPGGDRFDPWFADLRTRLEALDLSRALRASQLERHYVRTPASAAPGSGARKYLK*
Ga0070697_10114986923300005536Corn, Switchgrass And Miscanthus RhizosphereLGAPARYHGAAMDVLSEAQVRKVYELTDSLLLNQDWVVVPLVGSPRGMEMLMPDGKILIRPVGGAAFEPWFAGLKTRLEALDLSRALRASQLERHYVRTPAAAAPGSGAR
Ga0070730_1004206253300005537Surface SoilMDSVLSEAQVAKIYHLTDSLLLNRDWIVVPLVGSAEGMELLMPDGKVLIRPAGGLKFDAWFSGLKNRLESLDLSRALRASQLERHYVRTPATAAPGSGARRYTK*
Ga0070731_1000193223300005538Surface SoilMDTLLTESQVSKVYELTDALLLNRDWVVVPLVGSPDGMEMLMPDGKILIRPAGGPKFDAWFSGLKTRLEALDISRALRASQLERHYVRTPAAAAPGSGARRYVK*
Ga0070731_1019171223300005538Surface SoilMDTVLTQAQVGKVYELTDSLLLNRDWVVVPLVGSTDGMEMLMPDGKILIRPAGGVKFDGWFSGLKNRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0070731_1087892523300005538Surface SoilMDPVLTEAQVARIYELTDSLLLNRDWVVVPLVGTAEGMEMLMPDGKILIRPAGGAKFDAWFSGLPGRLASLDLSRALRASQLERHYVRTPATAAPGSGARRYVKG*
Ga0070686_10057504713300005544Switchgrass RhizosphereMDVLTEAQVAKVYELTDGLLLNRDWVVVPLVGSAHGMEMLMPDGKILIRPAGGDQFDAWFTDLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0070695_10132945823300005545Corn, Switchgrass And Miscanthus RhizosphereMDVLSESQVRRVYELTDALLLNPDWVVVPLVGAPLGMEMLMPDGKILIRPAGGEGFEAWFSGLKTRLESLDLSRALRASQLERHYVR
Ga0070702_10102761223300005615Corn, Switchgrass And Miscanthus RhizosphereMDTVLTEAQVAKVYDLTDTLLLNRDWVVVPLIGSPEGLEMLMPDGKILIRPAGSGRYDGWFAGLKTRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK*
Ga0068864_10030857643300005618Switchgrass RhizosphereMDTVLTEAQVAKVYELTDGLELNRDWVVVPLIGAPDGLEMLMPDGKILIRPAGGGRYDAWFAGLKTRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK*
Ga0074479_1016470523300005829Sediment (Intertidal)MVPFPGFVNLLVLREAEGYDARMQVLTAEQVEKVYAVTDALLLNRDWVVVPLVGSTDGVEMVLPDGKLLIRPVGGEGFRAWFTGLKERLELLDLDRALRASQLERHYVRTPSTAAPGSGARKYVK*
Ga0074479_1081904433300005829Sediment (Intertidal)MLDIGVGEGIMRGMSVLTAEQVEKVYQLTDALLLNRDWVVVPLVGSPTGLETVLPDGKLLIRPVGGAGFPGWFQGLKERLELLDLDRALRASQLERHYVR
Ga0074479_1115487533300005829Sediment (Intertidal)MGPEGYPFQEPEERSANCWRRGRDLGYHPRMSVLKEEQVEKVYQLTDSLLLNRDWVVVPLVGSPTGLETVLPDGKLLIRPVGGEGFSGWFGGLKERLEMLDLDRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0074470_1169210493300005836Sediment (Intertidal)MDTVLTEAQVAKVYELTDGLQLNRDWVVVPLVGAAQGLEMLMPDGKILIRPVGGAFYDAWFAGLKTRLEALDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0066652_10007113223300006046SoilMHVLSEAQVRKVYELTDALMLNPDWVVVPLVGAPQGMEMLMPDGKVLIRPAGGDAFDAWFSGLKTRLDLLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0075421_10192190623300006845Populus RhizosphereMDVLSEAQVQMVYELTDGLLLNRDWVVVPLVGSPTGMEMLMPDGKILIRPPGGDRFDAWFADLKTRLESLDLSRALRASQLERHY
Ga0075425_10076835613300006854Populus RhizosphereDLTDSLLLNRDWVVVPLVGSPRGIEMLMPDGKILIRPAGGDGFDAWFSGLKTRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK*
Ga0075434_10042839023300006871Populus RhizosphereMDVLTEAQVAKVYELTDSLLLNRDWVVVPLVGSPQGMEMLMPDGKILIRPAGGDRFDSWFTDLKLRLESLDISRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0075434_10124377213300006871Populus RhizosphereMDVLSEAQVRKVYELTDALLLNRDWVVVPLVGSPHGMEMLMPDGKILIRPAGGDRFDAWFADLKIRLESLDLSRALRASQLERHYVRTP
Ga0075424_10084919723300006904Populus RhizosphereSLRRVGYHGTSMDVLTEAQVAKVYELTDSLLLNRDWVVVPLVGSPQGMEMLMPDGKILIRPAGGDRFDSWFTDLKLRLESLDISRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0075424_10282997413300006904Populus RhizosphereMDVLTEAQVAKVYELTDGLLLNRDWVVVPLIGSPHGMEMLMPDGKVLIRPAGGDRFDAWFTDLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0079219_1082597313300006954Agricultural SoilELTDGLLLNRDWVVVPLVGSVDGIEMLMPDGKILIRPAGASKFDGWFAGLKIRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK*
Ga0073932_118239723300007072Hot Spring SedimentMNVLTAEQVERVYRLTDSLDLHRDWVVVPLAAHPTGLEMTLPDGKILIRPPAGPDFEPWFAGLRERLGTLDLARALRAGQAERPCLRTPADALPAFGTRRYLRP*
Ga0075435_10201177423300007076Populus RhizosphereMDVLSEAQVSKVYDLTDSLLLNRDWVVVPLVGSPRGIEMLMPDGKILIRPAGGDGFDAWFSGLKTRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK*
Ga0066710_10101333423300009012Grasslands SoilMSVLTAEQVEKVYQVTDGLHLNRDWVVVPLVGSPNGLETVLPDGKVLIRPVPGEGFSAWFAALQSRLEQLDLDRALRASQIERYYVRTPATAAPGSGARRYVR
Ga0111539_1025161923300009094Populus RhizosphereMEILTEAQVRKVYELTDSLLLHPDWVVVPLVGSAQGLEMLMPDGKILIRPAGGPGFEPWFADLKTRLELLDLSRALRAGQMERHYVRTPATAAPGSGARKYVK*
Ga0075418_1056031313300009100Populus RhizosphereDSLLLHPDWVVVPLVGSPRGVEMLMPDGKILIRPAGGAGFDAWFSGLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVKP*
Ga0105091_1008228423300009146Freshwater SedimentMEILTEAQVRKVYELTDSLLLNPDWVVVPLVGSTQGIEMLMPDGKVLIRPAGGPGFDPWFTDLKTRLESLDLSRALRASQMERHYVRTPATAAPGTGARKYVK*
Ga0114129_1117639313300009147Populus RhizosphereMEVLTEAQVRKVYELTDSLLLNPDWVVVPLVGSANGIEMLMPDGKVLIRPAGGPGFEPWFADLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0075423_1198611413300009162Populus RhizosphereSLHLNPDWVVVPLVGSSRGLEMLMPDGKVLIRPAGGDGFDAWFVGLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0136847_1121144023300010391Freshwater SedimentMQVLTAEQVEKVYQLTDALLLNRDWVVVPLVGSDTGLEMVLPDGKLLIRPVGGEGFSGWFGGLKERLEQLDLDRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0134127_1057977833300010399Terrestrial SoilMPLQHPTGADSSRERLLDSSAAGRYHGTSMDVLSEAQVAKVYELTDHLLLNRDWVVVPLVGSSHGMEMLMPDGKVLIRPAGGGRFDAWFADLKTRLESLDLSRALRASQVERHYVRTPSTAAPGSGA
Ga0134127_1244194013300010399Terrestrial SoilMEILTEAQVRKVYELTDSLLLHPDWVVVPLVGSAQGLEMLMPDGKILIRPAGGPGFDPWFADLKTRLELLDLSRALRAGQMERHYVRTPATAAPGSGARKYVK*
Ga0134122_1015001633300010400Terrestrial SoilMMSAMDAVLTEAQVQKVYELTDGLLLNRDWVVVPLIGCVDGLEMLMPDGKILIRPAGGARFDAWFGGLKTRLESLDISRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0134122_1019825123300010400Terrestrial SoilMDVLSEAQVRKVYELTDSLLLNQDWVVVPLVGSPRGMEMLMPDGKILIRPVGGAAFEPWFAGLKTRLEALDLSRALRASQLERHYVRTPAAAAPGSGARKYVK*
Ga0134122_1123287413300010400Terrestrial SoilMDTVLTEAQVAKVYELTDGLELNRDWVVVPLIGAPDGLEMLMPDGKILIRPAGGGRYDAWFVGLKTRLESLDLSRALRASQLERHYVRTPATAAPGSSARRYVK*
Ga0134121_1276591913300010401Terrestrial SoilMDTVLTEAQVAKVYDLTDSLELNRDWVVVPLVGAPQGLEMLMPDGKILIRPVGGSRYDAWFAGLKIRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0134123_1057668723300010403Terrestrial SoilMDVLSEAQVAKVYELTDHLLLNRDWVVVPLVGSSQGMEMLMPDGKVLIRPAGGGRFDAWFADLKTRLESLDLSRALRASQVERHYVRTPATAAPGSGARKYVK*
Ga0134123_1317685623300010403Terrestrial SoilRKMEMDGRSRYDREPMDVLTEAQVARVYELTDGLLLNRDWVVVPLVGSPHGMEMLMPDGKILIRPAGGGRYDAWFVGLKTRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0137313_106374923300011403SoilMEILTEAQVRQVYELTDSLLLNPDWVVVPLVGSAQGIEMLMPDGKVLIRPAGGPGFDPWFTDLKMRLESLDLSRALRASQMERHYVRTPATAAPGSGARKYVK*
Ga0137426_104941823300011435SoilMDDILSADQVERVYRVTDGLLLNRDWVVVPLRGSELGLELIQPDGKLLVRPPAGPGFDLWCTGLKERLQALDLDRALRASQLERHYVRTPAAAPPGSGARKYTPWTK*
Ga0137429_112076123300011437SoilPSMEVLTEAQVRKVYELTDSLLLNPDWVVVPLVGSANGIEMLMPDGKVLIRPAGGPGFEPWFADLKIRLESLDLSRALRASQLERHYVRTPATSAPGSGARKYVK*
Ga0137465_109448723300012231SoilMEVLTEAQVRRVYELTDGLLLNRDWVVVPLVGAPQGMEMLMPDGKILIRPVGGNRFDSWFADLKTRLESLDLSRALRAGQLERHYVRTPAAAAPGSGARKYVK*
Ga0157299_1016426113300012899SoilMDTVLTEAQVAKVYELTDGLELNRDWVVVPLIGAPDGLEMLIPDGKILIRPAGGGRYDAWFAGLKTRLESLDLSRALRASQLERHY
Ga0157298_1018475623300012913SoilMEILTEAQVRRVYEITDSLLLNRDWVVVPLVGSANGIEMLMPDGKVLIRPAGGPAFDPWFSDLKTRLESLDLSRALRASQLERHY
Ga0137410_10000970193300012944Vadose Zone SoilMDSVLTEAQVHKVYELTDGLLLNRDWVVVPLIGSLDGLEMLMPDGKILIRPPGLAKFEPWFAGLKTRLESMDISRALRASQLERHYVRTPATAAPGSGARKYVK*
Ga0126369_1355816813300012971Tropical Forest SoilMDVLTEAQVAKVYELTDGLLLNRDWVVVPLVGSPRGMEMLMPDGKVLIRPAGGDRFDAWFADLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYV
Ga0167647_106834423300015199Glacier Forefield SoilMDTLLTEAQVAKVYDLTDGLELNRDWVVVPLVAAAQGLEMLMPDGKILIRPVGGARYDAWFAGLKTRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVK*
Ga0187778_1004779233300017961Tropical PeatlandMKVLTAEQVARVYDLTDALLLNRDWVVVPLVGSDSGLEMVLPDGKILIRPAGGEAFAGWFGGLKERLEMLNLDRALRASQLERHYVRTPASAAPGSGARKYVKP
Ga0190266_1046583523300017965SoilMDPVLTEAQVSKVYELTDGLLLNRDWVVVPLVGSIDGLEMLMPDGKILIRPAGGSTFAGWFVGLKTRLESMDISRALRASQLERHYVRTPATAAPGSGARKYYDK
Ga0190266_1102999013300017965SoilMDSVLTEAQVSKVYELTDGLLLNRDWVVVPLVASLDGMEMLMPDGKILIRPAGGSTFDGWFAGLKVRLESLDLSRALRASQLERH
Ga0066662_1192936323300018468Grasslands SoilMSVLTAEQVEKVYQVTDGLHLNRDWVVVPLVGSPNGLETVLPDGKVLIRPVPGEGFSAWFAGLQNRLEQLDLDRALRASQLERHYVRTPATAAPGSGA
Ga0187894_1026309523300019360Microbial Mat On RocksMDVLTEAQVRKVYELTDGLLLNPDWVVVPLVGSPQGMEMLMPDGKILIRPAGGDRFDAWFADLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVKEL
Ga0187892_1001240643300019458Bio-OozeMDVLTAGQVDRVYALTDSLELNRDWVVVPLGAHETGIEMTLPDGKILIRPPAGDAFDAWFAGLRRRLEGLDLERALRASHLERHVLRTPAEAAPGSGARRYVNPPK
Ga0187893_10004724123300019487Microbial Mat On RocksMDDVLSAEQVERVYRLTDSLLLNRDWVVVPLRSSGAGLEMAMPDGKILVRPPAGPAFEPWFAGLRSRLEALDLDRALRASQLERHYPHVSARAAPGSGARRYVPPPPHPNHPG
Ga0179590_107340513300020140Vadose Zone SoilGLLLNRDWVVVPLVGSVDGLEMLMPDGKILIRPAGASKFDGWFVGLKVRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK
Ga0196964_1004418923300020202SoilMHDVLTADQIEKVYRITDGLLLNRDWVVVPLRGHVQGLELTMPDGKILIRPAYGAAFDAWYQNLRERLEALDLDRALRASQLERHYARTPAGSAPGTGARKYVPWAQPPFPTPPPKAASTASPT
Ga0213876_1013119633300021384Plant RootsMPAMDLVLTEAQVARVYELTDGLLINRDWVVVPLVGSPEGMEMLMPDGKILIRPAGGSNFDAWFVGLKNRLEALDISRALRASQLERHYVRTPAAAAPGSGARKYYGK
Ga0210384_1015596923300021432SoilMRGMQVLTAEQVEKVYQVTDALLLNRDWVVVPLVGSDTGLEVVLPDGKLLIRPAGGEGFGGWFAGLKERLGMLDLDRALRASQLERHYVRTPASAAPGSGARKYVK
Ga0247664_117308723300024232SoilMNVLSEAQVRKVYELTSSLLLNPDWVVVPLVGSPQGMEMLMPDGKVLIRPAGGDGFEAWFSGLKTRLESLDLSRALRASQLERHYVRTPATAAPGTGARKYVK
Ga0247677_107591413300024245SoilAMDAVLTEAQVQKVYELTDSLLLNRDWVVVPLIGSIDGMEMLMPDGKILIRPAGGSRFDPWFGGLKTRLESLDISRALRASQLERHYVRTPASAAPGSGARKYVK
Ga0247671_103430423300024284SoilMVAMDVLSEAQVKKVYELSDALLLNRDWVVVPLVGSPQGMEMLMPDGKILIRPPGGDRFDPWFADLRTRLEALDLSRALRASQLERHYVRTPASAAPGSGARKYVK
Ga0179589_1008949523300024288Vadose Zone SoilMDAVLSEAQVSKVYELTDGLLLNRDWVVVPLVGSVDGLEMLMPDGKILIRPAGASKFDGWFVGLKVRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVK
Ga0209640_1009460033300025324SoilVSPPAPPGGRWDEDFGLRTEDWGLWTVDPAGSIMPLMSVLTAEQVEKVYVLTDSLLLNRDWVVVPLVGSPTGLETVLPDGKLLIRPVGGEGFAGWFQGLKERLEMLDLDRALRASQLERHYVRTPSTAPPGSGARKYVK
Ga0207670_1125941323300025936Switchgrass RhizosphereVLTEAQVAKVYELTDGLLLNRDWVVVPLVGSAHGMEMLMPDGKILIRPAGGDQFDAWFTDLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0209166_1020010423300027857Surface SoilMDSVLSEAQVAKIYHLTDSLLLNRDWIVVPLVGSAEGMELLMPDGKVLIRPAGGLKFDAWFSGLKNRLESLDLSRALRASQLERHYVRTPATAAPGSGARRYTK
Ga0209579_1000133923300027869Surface SoilMDTLLTESQVSKVYELTDALLLNRDWVVVPLVGSPDGMEMLMPDGKILIRPAGGPKFDAWFSGLKTRLEALDISRALRASQLERHYVRTPAAAAPGSGARRYVK
Ga0207428_1017174123300027907Populus RhizosphereMEILTEAQVRKVYELTDSLLLHPDWVVVPLVGSAQGLEMLMPDGKILIRPAGGPGFEPWFADLKTRLELLDLSRALRAGQMERHYVRTPATAAPGSGARKYVK
Ga0209382_1172767823300027909Populus RhizosphereMDVLSEAQVQMVYELTDGLLLNRDWVVVPLVGSPTGMEMLMPDGKILIRPPGGDRFDAWFADLKTRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVK
Ga0310888_1102431413300031538SoilMEILTEAQVRKVYELTDSLLLNPDWVVVPLVGSTQGLEMLMPDGKVLIRPAGGPGFDPWFADLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0247727_1048834723300031576BiofilmRGGTLTEDRGLWTEDPAGDIMPPMSVLTAEQVEKVYQLTDSLLLNRDWVVVPLVGSPTGLEMVLPDGKLLIRPVGGEGFAGWFQGLKERLESLELDRALRASELERHYVRTPAAAPPGSGARKYVK
Ga0310813_1000314083300031716SoilMNILSESQVRKVYELTGSLLLNPDWVVVPLVGTPQGMEMLMPDGKVLIRPASGDAFDAWFSGLKTRLEALDLSRALRASQIERHYVRTPATAAPGSGARKYVK
Ga0310813_1000700683300031716SoilMDVLTEAQVAKVYELTDGLLLNRDWVVVPLIGSPHGMEMLMPDGKVLIRPAGGDRFDAWFTDLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0310813_1003693463300031716SoilMDVLTEAQVAKIYELTDGLLLNRDWVVVPLVGSAHGMEMLMPDGKILIRPAGGDRFDAWFTDLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0310813_1011505133300031716SoilMDVLSESQVRRVYELTDSLLINPDWVVVPLVGSPRGIEMLMPDGKILIRPAGGDGFDAWFSGLKTRLESLDLSRALRASQLERHYVRTPAAAAPGSGARKYVQ
Ga0310813_1025637913300031716SoilLTDGLLLNRDWVVVPLVGSPHGMEMLMPDGKILIRPAGGDRFDAWFADLKIRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVK
Ga0310813_1041940813300031716SoilVRKIYELTDSLLLNRDWVVVPLVGSANGIEMLMPDGKVLIRPAGGPAFDPWFSDLKMRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0307474_1010887333300031718Hardwood Forest SoilMVDSQESAGYDAGRMDIVLTEAQVSKVYELTDGLLLNRDWVVVPLVGSSDGMEMLMPDGKILIRPAGNAKFDGWFSGLKSRLESMDISRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0307468_10142677123300031740Hardwood Forest SoilMIVLTEAQVQKVYELTDGLLLNRDWVVVPLVGSTHGMEMLMPDGKVLIRPAGGDRFDSWFADLKVRLESLDLSRALRASQLERHYVRTPAAAAPGSGARRYVK
Ga0307475_1071934023300031754Hardwood Forest SoilPGSSRYDVRAMDAVLTEAQVAKVYDLTDSLLLNRDWVVVPLVGAPGGMEMLMPDGKILIRPPGGTKFDGWFSGLKARLDALDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0318554_1067352313300031765SoilFQAARAPLGPRPILEIGAPPRYDGRPMTILSEAQVRKVYELTGSLLLNPDWVVVPLVGSPQGTEMLMPDGKILIRPAGGDAFDAWFAGLRTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVN
Ga0318509_1038294813300031768SoilSLLLNRDWVVVPLVGSSEGLEMLMPDGKILIRPAGGLRFDAWFSGLKGRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0318552_1049477713300031782SoilMDLVLTESQVRKVYELTDGLLLNRDWVVVPLVGSPQGVEMLMPDGKVLIRPAGGDRFDSWFADLGTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVQ
Ga0306921_1255288213300031912SoilQVRKVYELTDGLLLNRDWVVVPLVGSPQGVEMLMPDGKVLIRPAGGDRFDSWFADLGTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVQ
Ga0310884_1099446923300031944SoilMDDILSAERVERIYQVTDGLKLNRDWVVVPLRGSEHGIAMVLPDGKLLIRPPGGPAFDSWFSGLKERLEILDLDRALRASQLERHYVRTPAAAPPGSGARKYTPWSR
Ga0307479_1012172123300031962Hardwood Forest SoilMDAVLTEAQVAKVYDLTDSLLLNRDWVVVPLVGAPGGMEMLMPDGKILIRPPGGTKFDGWFSGLKARLDALDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0318559_1053097613300032039SoilMDSVLSEAQVAKVYELTDSLLLNRDWVVVPLVGSAYGVEMLMPDGKVLVRPAGGEAFDAWFSGLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0318553_1053500323300032068SoilSVLSEAQVAKVYELTDSLLLNRDWVVVPLVGSAYGVEMLMPDGKVLVRPAGGEAFDAWFSGLKTRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0310890_1078073423300032075SoilMEILTEAQVRKVYELTDSLLLHPDWVVVPLVGSAQGLEMLMPDGKVLIRPAGGPGFDPWFADLKTRLELLDLSRALRAGQMERHYVRTPATAAPGSGARKYVK
Ga0310896_1056336823300032211SoilMEILTEAQVRRVYEITDSLLLNRDWVVVPLVGSANGIEMLMPDGKVLIRPAGGPAFDPWFSDLKMRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0310812_1022751323300032421SoilAKIYELTDGLLLNRDWVVVPLVGSAHGMEMLMPDGKILIRPAGGDRFDAWFTDLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0310812_1034897413300032421SoilAQVARVYELTDGLLLNRDWVVVPLVGSPHGMEMLMPDGKILIRPAGGDRFDAWFADLKIRLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVK
Ga0310810_1018973123300033412SoilMDGRSRYDREPMDVLTEAQVARVYELTDGLLLNRDWVVVPLVGSPHGMEMLMPDGKILIRPAGGDRFDAWFADLKIRLESLDLSRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0314864_0042080_423_7373300033805PeatlandMDTVLTEAQVAKVYELTDSLLLNRDWVVVPLVGSPDGMEMLMPDGKILIRPAGGGKFDPWFSGLKTRLESLDISRALRASQLERHYVRTPATAAPGSGARKYVK
Ga0372943_0798502_257_5773300034268SoilMDIVLTESQVAKIYDLTDSLLLNRDWVVVPLIGSPDGLEMLMPDGKILIRPAGAGRYDAWFAGLKARLESLDLSRALRASQLERHYVRTPASAAPGSGARKYVKER


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.