NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F095970


Overview

Basic Information
Family ID F095970
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 63 residues
Representative Sequence MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAAEGKEGWSRCDHGRPGTITVVCWKR
Number of Associated Samples 92
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 75.24 %
% of genes near scaffold ends (potentially truncated) 27.62 %
% of genes from short scaffolds (< 2000 bps) 69.52 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.36


Most Common Taxonomy
Group Unclassified (57.143 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere (20.952 % of family members)
Environment Ontology (ENVO) Unclassified (28.571 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (47.619 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 41.30%, β-sheet: 10.87%, Coil/Unstructured: 47.83%

Predicted 3D Structure

pTM-score: 0.36

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 105 Family Scaffolds
PF00692 | dUTPase | 18.10
PF00072 | Response_reg | 11.43
PF02653 | BPD_transp_2 | 11.43
PF09423 | PhoD | 5.71
PF13541 | ChlI | 1.90
PF16655 | PhoD_N | 0.95
PF04452 | Methyltrans_RNA | 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds
COG0717 | dCTP deaminase | Nucleotide transport and metabolism [F] | 18.10
COG0756 | dUTP pyrophosphatase (dUTPase) | Defense mechanisms [V] | 18.10
COG1385 | 16S rRNA U1498 N3-methylase RsmE | Translation, ribosomal structure and biogenesis [J] | 0.95
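
The frequency columns above are percentages of the 105 family scaffolds, so each value should map back to a whole number of scaffolds. The following Python sketch shows that arithmetic for the Pfam rows. The helper name `percent_to_count` is hypothetical, and the sketch assumes the published percentages are simply rounded to two decimals.

```python
# Hypothetical helper: recover a scaffold count from a rounded percentage.
# Assumption: each percentage is (count / total) * 100, rounded to two decimals.
def percent_to_count(percent: float, total: int = 105) -> int:
    """Return the nearest integer count for `percent` of `total`."""
    return round(percent * total / 100)


# Pfam frequencies copied from the Neighboring Pfam domains table.
pfam_frequencies = {
    "PF00692": 18.10,
    "PF00072": 11.43,
    "PF02653": 11.43,
    "PF09423": 5.71,
    "PF13541": 1.90,
    "PF16655": 0.95,
    "PF04452": 0.95,
}

pfam_counts = {
    pfam_id: percent_to_count(percent)
    for pfam_id, percent in pfam_frequencies.items()
}

for pfam_id, count in pfam_counts.items():
    print(f"{pfam_id}: about {count} of 105 scaffolds")
```

Under this assumption, 18.10 % corresponds to 19 scaffolds and 0.95 % to a single scaffold.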



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 57.14 %
All Organisms | root | All Organisms | 42.86 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300003324|soilH2_10039370 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4918
3300004114|Ga0062593_100534008 | Not Available | 1098
3300004479|Ga0062595_101481474 | All Organisms → cellular organisms → Bacteria | 624
3300005341|Ga0070691_10089412 | All Organisms → cellular organisms → Bacteria | 1517
3300005341|Ga0070691_11037386 | Not Available | 513
3300005406|Ga0070703_10163918 | Not Available | 846
3300005434|Ga0070709_10247902 | Not Available | 1282
3300005439|Ga0070711_100071072 | All Organisms → cellular organisms → Bacteria | 2452
3300005445|Ga0070708_100014911 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6405
3300005445|Ga0070708_100100218 | All Organisms → cellular organisms → Bacteria | 2651
3300005456|Ga0070678_101054268 | Not Available | 749
3300005467|Ga0070706_100032641 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4803
3300005468|Ga0070707_100141952 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2337
3300005471|Ga0070698_100469607 | Not Available | 1194
3300005518|Ga0070699_100462956 | Not Available | 1150
3300005529|Ga0070741_10000387 | All Organisms → cellular organisms → Bacteria | 163029
3300005536|Ga0070697_101026067 | Not Available | 733
3300005542|Ga0070732_10686175 | Not Available | 623
3300005543|Ga0070672_100326491 | All Organisms → cellular organisms → Bacteria | 1305
3300005545|Ga0070695_101564816 | Not Available | 550
3300005842|Ga0068858_100101810 | All Organisms → cellular organisms → Bacteria | 2679
3300005876|Ga0075300_1015782 | Not Available | 910
3300005879|Ga0075295_1009583 | Not Available | 992
3300005880|Ga0075298_1018397 | Not Available | 645
3300005985|Ga0081539_10353398 | Not Available | 617
3300006047|Ga0075024_100371174 | Not Available | 720
3300006173|Ga0070716_100058472 | All Organisms → cellular organisms → Bacteria | 2219
3300006755|Ga0079222_10112162 | All Organisms → cellular organisms → Bacteria | 1463
3300006852|Ga0075433_10016229 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium elkanii | 6127
3300006854|Ga0075425_100588857 | Not Available | 1279
3300006903|Ga0075426_10334817 | Not Available | 1110
3300006904|Ga0075424_100520972 | Not Available | 1269
3300007255|Ga0099791_10000716 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 12438
3300007265|Ga0099794_10762921 | Not Available | 517
3300007788|Ga0099795_10566506 | Not Available | 537
3300009038|Ga0099829_10147673 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1874
3300009143|Ga0099792_10017515 | All Organisms → cellular organisms → Bacteria | 3138
3300009148|Ga0105243_10153529 | All Organisms → cellular organisms → Bacteria | 1978
3300009174|Ga0105241_10446757 | Not Available | 1143
3300010400|Ga0134122_10367669 | Not Available | 1259
3300010400|Ga0134122_11773195 | Not Available | 648
3300011270|Ga0137391_10689057 | Not Available | 850
3300012202|Ga0137363_10055577 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2856
3300012205|Ga0137362_10006520 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium elkanii | 8167
3300012205|Ga0137362_10069050 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2922
3300012906|Ga0157295_10285651 | Not Available | 569
3300012917|Ga0137395_10006136 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6246
3300012917|Ga0137395_10184945 | Not Available | 1444
3300012918|Ga0137396_10960373 | Not Available | 622
3300012927|Ga0137416_10972581 | Not Available | 757
3300012986|Ga0164304_10003259 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 6164
3300013297|Ga0157378_12694068 | Not Available | 550
3300017930|Ga0187825_10033673 | Not Available | 1728
3300017936|Ga0187821_10113112 | Not Available | 1007
3300017936|Ga0187821_10378497 | Not Available | 576
3300017993|Ga0187823_10140105 | Not Available | 756
3300019789|Ga0137408_1331996 | All Organisms → cellular organisms → Bacteria | 2583
3300019865|Ga0193748_1014403 | Not Available | 735
3300019877|Ga0193722_1059821 | Not Available | 956
3300019881|Ga0193707_1108147 | Not Available | 823
3300020002|Ga0193730_1179870 | Not Available | 534
3300020021|Ga0193726_1131364 | Not Available | 1107
3300020070|Ga0206356_11894236 | Not Available | 952
3300020579|Ga0210407_10075471 | All Organisms → cellular organisms → Bacteria | 2534
3300021168|Ga0210406_10276983 | Not Available | 1370
3300021479|Ga0210410_10509361 | Not Available | 1074
3300024310|Ga0247681_1085985 | Not Available | 505
3300025885|Ga0207653_10027658 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1821
3300025906|Ga0207699_10036393 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Oceanospirillales → Halomonadaceae → Halomonas → Halomonas halophila | 2806
3300025910|Ga0207684_10008401 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 9180
3300025910|Ga0207684_10013663 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium elkanii | 7015
3300025910|Ga0207684_10023128 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5307
3300025910|Ga0207684_10484923 | Not Available | 1060
3300025916|Ga0207663_10895034 | Not Available | 709
3300025928|Ga0207700_10150806 | All Organisms → cellular organisms → Bacteria | 1921
3300026015|Ga0208286_1002853 | Not Available | 1059
3300026023|Ga0207677_10135986 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1874
3300026304|Ga0209240_1084541 | All Organisms → cellular organisms → Bacteria | 1163
3300026340|Ga0257162_1002151 | All Organisms → cellular organisms → Bacteria | 2165
3300026351|Ga0257170_1000177 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4376
3300026507|Ga0257165_1111071 | Not Available | 511
3300026515|Ga0257158_1018525 | Not Available | 1151
3300026551|Ga0209648_10171364 | All Organisms → cellular organisms → Bacteria | 1691
3300026551|Ga0209648_10217641 | All Organisms → cellular organisms → Bacteria | 1442
3300026551|Ga0209648_10563011 | Not Available | 634
3300026557|Ga0179587_10799877 | Not Available | 622
3300027645|Ga0209117_1012067 | All Organisms → cellular organisms → Bacteria | 2899
3300027765|Ga0209073_10399168 | Not Available | 564
3300027894|Ga0209068_10014995 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium elkanii | 3725
3300027915|Ga0209069_10614175 | Not Available | 628
3300028047|Ga0209526_10089460 | All Organisms → cellular organisms → Bacteria | 2167
3300028784|Ga0307282_10451516 | Not Available | 624
3300028792|Ga0307504_10012659 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1944
(restricted) 3300031248|Ga0255312_1075000 | Not Available | 817
3300031538|Ga0310888_10389307 | Not Available | 816
3300031716|Ga0310813_10027792 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3962
3300031720|Ga0307469_10543457 | Not Available | 1027
3300031820|Ga0307473_10004487 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4378
3300031820|Ga0307473_11096366 | Not Available | 586
3300032174|Ga0307470_10843661 | Not Available | 714
3300033432|Ga0326729_1006148 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2247
3300033433|Ga0326726_10283862 | Not Available | 1549
3300033513|Ga0316628_100205486 | All Organisms → cellular organisms → Bacteria | 2370
3300033513|Ga0316628_103909547 | Not Available | 533
3300034090|Ga0326723_0070241 | Not Available | 1494

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
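
Each scaffold identifier in this section appears to join a sample Taxon OID and a scaffold name with a `|` character; the Taxon OID part matches the Taxon OID column of the Associated Samples table. The Python sketch below splits identifiers on that assumption and counts scaffolds per sample. The function name `split_scaffold_id` is hypothetical, the format is inferred from this page rather than from a documented specification, and the example does not handle the "(restricted)" prefix.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the text before the first '|' is a numeric Taxon OID.
    """
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    if not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# A few identifiers copied from the Associated Scaffolds listing.
scaffold_ids = [
    "3300003324|soilH2_10039370",
    "3300005341|Ga0070691_10089412",
    "3300005341|Ga0070691_11037386",
    "3300025910|Ga0207684_10008401",
]

# Count how many family scaffolds come from each sample (Taxon OID).
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, count in scaffolds_per_sample.items():
    print(taxon_oid, count)
```

Grouping by Taxon OID in this way is one explanation for why 105 scaffolds map to only 92 associated samples: some samples contribute more than one scaffold.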




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 20.95%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.29%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 9.52%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.62%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 3.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.81%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.81%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 3.81%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.81%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.86%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 2.86%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.90%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.90%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.90%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.90%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.90%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.95%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.95%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.95%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.95%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.95%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.95%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.95%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere | 0.95%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.95%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.95%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.95%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.95%




Associated Samples


Taxon OID | Sample Name | Habitat Type
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005456 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG | Host-Associated
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005543 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG | Host-Associated
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated
3300005876 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401 | Environmental
3300005879 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301 | Environmental
3300005880 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 | Environmental
3300005985 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2 | Host-Associated
3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012906 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S212-509R-1 | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300019865 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1s1 | Environmental
3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020070 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300024310 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK22 | Environmental
3300025885 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental
3300026015 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401 (SPAdes) | Environmental
3300026023 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes) | Host-Associated
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026340 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A | Environmental
3300026351 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-B | Environmental
3300026507 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B | Environmental
3300026515 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental
3300031538 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1 | Environmental
3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300033432 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fraction | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental
3300033513 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_C | Environmental
3300034090 | Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
soilH2_100393704 | 3300003324 | Sugarcane Root And Bulk Soil | MLSRFGPSGLLRLVAVLGIGASLLLAGCATSAPETATQSAGTEKEGRMRCDHPRAGGITVVCWKR*
Ga0062593_1005340082 | 3300004114 | Soil | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0062595_1014814741 | 3300004479 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHESPQTAADQKEGWSRCDHGRPGTITVVCWKR*
Ga0070691_100894122 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | MLQRLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAADEGWSRCAHGRPGALTVVCWQR*
Ga0070691_110373861 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDDGITVVCWKR*
Ga0070703_101639181 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDQGRPGTITLVCWKR*
Ga0070709_102479022 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAMISVAASLLLAGCAASTPAPRQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0070711_1000710721 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLAGCAASTPAPRQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0070708_1000149118 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTTAEGKEGWSRCDHGRPGMITVVCWKR*
Ga0070708_1001002182 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLTGCAASTPAPPQTAAEEKAGWSRCEHGRPDTITVVCWKR*
Ga0070678_1010542681 | 3300005456 | Miscanthus Rhizosphere | MLSHFGPSGLLRLAAVTGIGASLLLAGFATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0070706_1000326414 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGRPGTITIVCWKR*
Ga0070707_1001419524 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | SEEAADMLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDQGRPGTITLVCWKR*
Ga0070698_1004696072 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | SEEAADMLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGRPGTITIVCWKR*
Ga0070699_1004629561 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | HMLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0070741_10000387147 | 3300005529 | Surface Soil | MLTRFGPSGLLRLTAVIGIGASLLLAGCAASTPETAQAAVEDRDGWTRCDHPRAGGITVVCWKR*
Ga0070697_1010260672 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LSEEAADMLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDQGRPGTITLVCWKR*
Ga0070732_106861751 | 3300005542 | Surface Soil | MLERLGPSGLLRLAAVITLGASLLLSGCTATTHEATQTAAEDRDGWSRCAHGRPGAITVVCWQR*
Ga0070672_1003264913 | 3300005543 | Miscanthus Rhizosphere | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVEEKPGWTRCDHPRDGGITVVCWKR*
Ga0070695_1015648162 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | LAAVISVAASLLLAGCAASTPAPRQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0068858_1001018101 | 3300005842 | Switchgrass Rhizosphere | SGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0075300_10157822 | 3300005876 | Rice Paddy Soil | MLQRLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAADGGWSRCAHGRPGALTVACWQR*
Ga0075295_10095832 | 3300005879 | Rice Paddy Soil | MLQRLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAADGGWSCCAHGRPGALTVACWQR*
Ga0075298_10183972 | 3300005880 | Rice Paddy Soil | MLQRLGPSGLLRLAAVITLGASLLLSGCTASTHEATQTAADEGWSRCAHGRPGALTVVCWQR*
Ga0081539_103533982 | 3300005985 | Tabebuia Heterophylla Rhizosphere | MLTRFGPSGLLRLAAMIGLSASLLLAGCATSAHETTQATVDEKDGWTRCDHPRAGGITVVCWKR*
Ga0075024_1003711742 | 3300006047 | Watersheds | MLERLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAAEDREGWSRCAHGRPGALTVVCWQR*
Ga0070716_1000584722 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLAGCATSTPAPRQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0079222_101121622 | 3300006755 | Agricultural Soil | MLSRFGPSGLLRLVAMLGIGASLLLAGCATYAPETATQTAGAEKEGWTRCDHPRAGGITVVCWKR*
Ga0075433_100162294 | 3300006852 | Populus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLAGCAASTPAPPQTAAEEKGGWSRCEHGRPDTITVVCWKR*
Ga0075425_1005888571 | 3300006854 | Populus Rhizosphere | MLERLGPSGLLRLAAVISVAASLLLAGCATSTPKPPQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0075426_103348172 | 3300006903 | Populus Rhizosphere | PEEAADMLERLGPSGLLRLAAVISVAASLLLAGCATSTPKPPQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0075424_1005209721 | 3300006904 | Populus Rhizosphere | MLERLGPSGLLRLAAVISVAASLLLAGCATATPKPPQTAAEEKEGWSRCEHGRPDTITVVCWKR*
Ga0099791_100007166 | 3300007255 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGHPGTITIVCWKR*
Ga0099794_107629211 | 3300007265 | Vadose Zone Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAAEGKEGWSRCDHGRPGTITVVCWKR*
Ga0099795_105665061 | 3300007788 | Vadose Zone Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAAEGKEGWSRCDHGRPGTITIVCWKR*
Ga0099829_101476731 | 3300009038 | Vadose Zone Soil | GLLRLATVITLGASLLLSGCAVSTHEATQTAADEKEGWSRCDHGHPGTITIVCWKR*
Ga0099792_100175153 | 3300009143 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAVSTHEATQTAADEKEGWSRCDHGHPGTITIVCWKR*
Ga0105243_101535293 | 3300009148 | Miscanthus Rhizosphere | SVRDTGGWIFEEAHMLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0105241_104467572 | 3300009174 | Corn Rhizosphere | MGGWIFEEAHMLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0134122_103676692 | 3300010400 | Terrestrial Soil | MLERLGPSGLLRLATVITLGASLLLTGCTTSTHEANQAAAEEKEGGWSRCDHARPGAITVVCWKR*
Ga0134122_117731952 | 3300010400 | Terrestrial Soil | MLQRLGPSGLLRLAAVITLGASLLLSGCAASIHESPQTAAEEKAGWSRCDHGRSGTIILVCWKR*
Ga0137391_106890572 | 3300011270 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAVSTHEATQTAADEKEGWSRCDHGRPGMITVVCWKR*
Ga0137363_100555772 | 3300012202 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHGATQTAADEKEGWSRCDHGHPGTITIVCWKR*
Ga0137362_100065205 | 3300012205 | Vadose Zone Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTTAEGKEGWSRCDHGRPGMITIVCWKR*
Ga0137362_100690501 | 3300012205 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHGATQTAADEKEGWSRCDHGRPGTITLVCWKR*
Ga0157295_102856511 | 3300012906 | Soil | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSPHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0137395_100061363 | 3300012917 | Vadose Zone Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAAEGKEGWSRCDHGRPGMITIVCWKR*
Ga0137395_101849452 | 3300012917 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGRPGTITLVCWKR*
Ga0137396_109603731 | 3300012918 | Vadose Zone Soil | RLAAVITLGASLLLSGCAASTHEVTQTAAEGKEGWSRCDHGRPGTITVVCWKR*
Ga0137416_109725813 | 3300012927 | Vadose Zone Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAVEGKEGWSRCDHGRPGTITVVCWKR*
Ga0164304_100032599 | 3300012986 | Soil | LLRLAAVTGIGASLLLAGCATSTHETTQAAVEEKPGWTRCDHPRDGGITVVCWKR*
Ga0157378_126940682 | 3300013297 | Miscanthus Rhizosphere | LAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR*
Ga0187825_100336731 | 3300017930 | Freshwater Sediment | MLERMGPSGLLRLAAVITLGASLLLSGCGASTHEATQTAAQDREGWSRCAHGRPGAITVVCWQR
Ga0187821_101131122 | 3300017936 | Freshwater Sediment | GPSGLLRLAAVITLGASLLLSGCGASTHEATQTAAEEREGWSRCAHGRPGAITVVCWQR
Ga0187821_103784971 | 3300017936 | Freshwater Sediment | PSGLLRLAAVITLGASLLLSGCGASTHEATQTAAQDREGWSRCAHGRPGAITVVCWQR
Ga0187823_101401052 | 3300017993 | Freshwater Sediment | MLEWLGPSGLLRLAAVITLGASLLLSGCGASTHEATQTAAEEREGWSRCAHGRPGAITVVCWQR
Ga0137408_13319964 | 3300019789 | Vadose Zone Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHGATQTAADEKEGWSRCDHGRPGTISIVCWKR
Ga0193748_10144032 | 3300019865 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHESPQTAADQKEGWSRCDHGRPGTITVVCWKR
Ga0193722_10598211 | 3300019877 | Soil | MLERLGPSGLLRLAAVITLGASLLLAGCAASTRESPQTAADEKEGWSRCDRGRPGMITLVCWKR
Ga0193707_11081472 | 3300019881 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCSASTRESPQTAADEKEGWSRCDHGRPGTITLVCWKR
Ga0193730_11798702 | 3300020002 | Soil | AAEMLERLGPSGLLRLAAVITLGASLLLSGCAASTRESPQTAADEKEGWSRCDHGRPGTITLVCWKR
Ga0193726_11313642 | 3300020021 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHESPQTAADEKEGWSRCDHGRPGTITLVCWKR
Ga0206356_118942361 | 3300020070 | Corn, Switchgrass And Miscanthus Rhizosphere | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGGITVVCWKR
Ga0210407_100754713 | 3300020579 | Soil | MLERMGPAGLLRLAAMISVAASLLLAGCATSTPKPPQTAAEEKEGWSRCERGRPDTITVVCWKR
Ga0210406_102769831 | 3300021168 | Soil | MLERMGPAGLLRLAAMISVAASLLLAGCATSTPKPPQTAAEEKEGWSRCEHGRPDTITVVCWKR
Ga0210410_105093611 | 3300021479 | Soil | LLRLAAMISVAASLLLAGCATSTPKPPQTAAEEKEGWSRCEHGRPDTITVVCWKR
Ga0247681_10859852 | 3300024310 | Soil | PSGLLRLATVITLGASLLLTGCTTSTHEANQAAAEEKEGGWSRCDHARPGAITVVCWKR
Ga0207653_100276584 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | DMLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGRPGTITIVCWKR
Ga0207699_100363932 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAMISVAASLLLAGCAASTPAPRQTAAEEKEGWSRCEHGRPDTITVVCWKR
Ga0207684_100084017 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDQGRPGTITLVCWKR
Ga0207684_100136636 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGRPGTITIVCWKR
Ga0207684_100231285 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLTGCAASTPAPPQTAAEEKAGWSRCEHGRPDTITVVCWKR
Ga0207684_104849232 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTTAEGKEGWSRCDHGRPGMITVVCWKR
Ga0207663_108950341 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLAGCATSTPKPPQTAAGEKEGWSRCEHGRPDTITVVCWKR
Ga0207700_101508061 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MLERMGPSGLLRLAAVISVAASLLLAGCATSTPAPRQTAAEEKEGWSRCEHGRPDTITVVCWKR
Ga0208286_10028532 | 3300026015 | Rice Paddy Soil | MLQRLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAADGGWSRCAHGRPGALTVACWQR
Ga0207677_101359861 | 3300026023 | Miscanthus Rhizosphere | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVDEKPGWTRCDHPRDGDATVRRGRRGRIVG
Ga0209240_10845413 | 3300026304 | Grasslands Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAAEGKEGWSRCDHGRPGTITVVCWKR
Ga0257162_10021511 | 3300026340 | Soil | MLERLGPSGLLRLATVITLGASLLLSGCAVSTHEATQTAADEKEGWSRCDHGHPGTITIVCWKR
Ga0257170_10001775 | 3300026351 | Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGHPGTITIVCWKR
Ga0257165_11110711 | 3300026507 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHEVTQTAVEGKEGWSRCDHGRPGTITVVCWKR
Ga0257158_10185252 | 3300026515 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASSHESPQTAADEKEGWSRCDHGRPGTITLVCWKR
Ga0209648_101713641 | 3300026551 | Grasslands Soil | EEAADMLEGLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWSRCDHGRPGTITIVCWKR
Ga0209648_102176411 | 3300026551 | Grasslands Soil | MLERLGPSGLLRLATAITLGASLLLSGCAASTHEATQTAAGEKEGWSRCDHGRPGTITIVCWKR
Ga0209648_105630112 | 3300026551 | Grasslands Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDHGRPGTITLVCWKR
Ga0179587_107998771 | 3300026557 | Vadose Zone Soil | RLGAVITLGASLLLSGCAASSHESPQTAADEKEGWSRCDHGRPGTITLVCWKR
Ga0209117_10120674 | 3300027645 | Forest Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHESPQTAADEKEGWSRCDHGRPGMITLVCWKR
Ga0209073_103991682 | 3300027765 | Agricultural Soil | MLSHFGPSGLLRLAAVTGIGASLLLAGCATSTHETTQAAVEEKPGWTRCDHPRDGGITVVCWKR
Ga0209068_100149952 | 3300027894 | Watersheds | MQERLGPAGLLRLATVITLGASLLLSGCAASTHETAQAAPEEKEGWSRCDHGRPGTITVVCWKR
Ga0209069_106141751 | 3300027915 | Watersheds | MLERLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAAEDREGWSRCAHGRPGALTVVCWQR
Ga0209526_100894603 | 3300028047 | Forest Soil | MLERLGPSGLLRLAAVITLSASLLLSGCAASTHESPQTAAEEKEGWSRCDHGRPGMITIVCWKR
Ga0307282_104515161 | 3300028784 | Soil | MLQRLGPSGLLRLAAVITLGASLLLSGCAASTHESPQTAADQKEGWSRCDHGRPGTITVVCWKR
Ga0307504_100126593 | 3300028792 | Soil | MLERLGPSGLLRLAAVITLGASLLLSGCAASTHESPQTAATEKEGWSRCDHGRPGMITLVCWKR
(restricted) Ga0255312_10750002 | 3300031248 | Sandy Soil | MLERLGPSGLLRLAAVITLGASLLLSGCGAATHEPTQTAAEDREGWSRCAHGRPGALTVVGWQR
Ga0310888_103893072 | 3300031538 | Soil | MLQRLGPSGLLRLAAVITLGASFLLSGCTASTHEATQTAADEGWSRCAHGRPGALTVVCWQR
Ga0310813_100277924 | 3300031716 | Soil | MLERLGPSGLLRLTAVITLGASLLLSSCTATTHETTRTAAEDREGWSRCAHGRPGVLTVVCWQR
Ga0307469_105434572 | 3300031720 | Hardwood Forest Soil | PSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDQGRPGTITLVCWKR
Ga0307473_100044872 | 3300031820 | Hardwood Forest Soil | MLERMGPSGLLRLAAVISVAASLLLTGCAASTPAPPQTAAEEKEGWSRCEHGRPDTITVVCWKR
Ga0307473_110963662 | 3300031820 | Hardwood Forest Soil | MLERLGPSGLLRLATVITLGASLLLSGCAASTHEATQTAADEKEGWRRCDQGRPGTITL
Ga0307470_108436612 | 3300032174 | Hardwood Forest Soil | ERLGPSSLLRLAAVITLGASLLLSGCAASTHESPQTAADEKEGWSRCEHGRPDTITVVCWKR
Ga0326729_10061484 | 3300033432 | Peat Soil | MLERLGPSGLLRLAAVITLGASFLLSGCAASTHEATQTAAEDREGWSRCAHGRPGALTVVCWQR
Ga0326726_102838623 | 3300033433 | Peat Soil | RLAAVITLGASFLLSGCAASTHETTQTAAEDREGWSRCAHGRPGALTVVCWQR
Ga0316628_1002054862 | 3300033513 | Soil | MLQRLGPSGLLRLAAVITLGASLLLSGCTASTHEATQTAADEGWSRCAHGRPGALTVVCWQR
Ga0316628_1039095471 | 3300033513 | Soil | MLARLGPSGLLRLAAVFTLGASFLLSGCSASTHEATRTAAEDKEGWSRCAHGRPGALTV
Ga0326723_0070241_34_228 | 3300034090 | Peat Soil | MLERLGPSGLLRLAAVITLGASFLFSGCAASTHEATQTAAEDRDGWSRCAHGRPGALTVVCWQR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice