NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F072662

Metagenome / Metatranscriptome Family F072662

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F072662
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 114 residues
Representative Sequence MSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Number of Associated Samples 88
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 7.44 %
% of genes near scaffold ends (potentially truncated) 35.54 %
% of genes from short scaffolds (< 2000 bps) 88.43 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score0.62

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (61.983 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(18.182 % of family members)
Environment Ontology (ENVO) Unclassified
(39.669 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.942 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 66.21%    β-sheet: 0.00%    Coil/Unstructured: 33.79%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.62
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 121 Family Scaffolds
PF04679DNA_ligase_A_C 9.92
PF02586SRAP 4.96
PF13302Acetyltransf_3 2.48
PF12680SnoaL_2 1.65
PF01909NTP_transf_2 1.65
PF01022HTH_5 1.65
PF01068DNA_ligase_A_M 1.65
PF00072Response_reg 0.83
PF13847Methyltransf_31 0.83
PF00140Sigma70_r1_2 0.83
PF04542Sigma70_r2 0.83
PF00375SDF 0.83
PF13515FUSC_2 0.83
PF00589Phage_integrase 0.83
PF00144Beta-lactamase 0.83
PF13414TPR_11 0.83
PF01272GreA_GreB 0.83
PF00239Resolvase 0.83
PF00701DHDPS 0.83
PF12762DDE_Tnp_IS1595 0.83
PF04198Sugar-bind 0.83
PF00069Pkinase 0.83
PF13676TIR_2 0.83
PF02018CBM_4_9 0.83
PF13193AMP-binding_C 0.83
PF09948DUF2182 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 121 Family Scaffolds
COG1793ATP-dependent DNA ligaseReplication, recombination and repair [L] 11.57
COG2135ssDNA abasic site-binding protein YedK/HMCES, SRAP familyReplication, recombination and repair [L] 4.96
COG0515Serine/threonine protein kinaseSignal transduction mechanisms [T] 3.31
COG03294-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyaseCell wall/membrane/envelope biogenesis [M] 1.65
COG0568DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)Transcription [K] 1.65
COG1423ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) familyReplication, recombination and repair [L] 1.65
COG0782Transcription elongation factor, GreA/GreB familyTranscription [K] 0.83
COG1191DNA-directed RNA polymerase specialized sigma subunitTranscription [K] 0.83
COG1595DNA-directed RNA polymerase specialized sigma subunit, sigma24 familyTranscription [K] 0.83
COG1680CubicO group peptidase, beta-lactamase class C familyDefense mechanisms [V] 0.83
COG1686D-alanyl-D-alanine carboxypeptidaseCell wall/membrane/envelope biogenesis [M] 0.83
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 0.83
COG2367Beta-lactamase class ADefense mechanisms [V] 0.83
COG2390DNA-binding transcriptional regulator LsrR, DeoR familyTranscription [K] 0.83
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 0.83
COG4941Predicted RNA polymerase sigma factor, contains C-terminal TPR domainTranscription [K] 0.83


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A61.98 %
All OrganismsrootAll Organisms38.02 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000955|JGI1027J12803_102444924Not Available564Open in IMG/M
3300001593|JGI12635J15846_10294255Not Available1019Open in IMG/M
3300001593|JGI12635J15846_10639463Not Available616Open in IMG/M
3300002245|JGIcombinedJ26739_100252599Not Available1651Open in IMG/M
3300002245|JGIcombinedJ26739_100530894All Organisms → cellular organisms → Bacteria1054Open in IMG/M
3300003219|JGI26341J46601_10008526All Organisms → cellular organisms → Bacteria3366Open in IMG/M
3300003219|JGI26341J46601_10047810Not Available1344Open in IMG/M
3300003368|JGI26340J50214_10006218All Organisms → cellular organisms → Bacteria3858Open in IMG/M
3300004080|Ga0062385_10723878Not Available644Open in IMG/M
3300004092|Ga0062389_101044379All Organisms → cellular organisms → Bacteria1002Open in IMG/M
3300004092|Ga0062389_102624833All Organisms → cellular organisms → Bacteria670Open in IMG/M
3300004092|Ga0062389_103157926All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium IMCC26134617Open in IMG/M
3300004152|Ga0062386_100319402Not Available1241Open in IMG/M
3300004152|Ga0062386_101768441Not Available516Open in IMG/M
3300004635|Ga0062388_102317528All Organisms → cellular organisms → Bacteria561Open in IMG/M
3300005338|Ga0068868_102402293Not Available503Open in IMG/M
3300005439|Ga0070711_101873727All Organisms → cellular organisms → Bacteria → Proteobacteria527Open in IMG/M
3300005548|Ga0070665_100472706Not Available1264Open in IMG/M
3300005712|Ga0070764_10961995Not Available537Open in IMG/M
3300006052|Ga0075029_100533738Not Available778Open in IMG/M
3300006059|Ga0075017_101178159Not Available600Open in IMG/M
3300006162|Ga0075030_101616001Not Available507Open in IMG/M
3300006173|Ga0070716_100905460All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium IMCC26134691Open in IMG/M
3300006175|Ga0070712_100077094Not Available2402Open in IMG/M
3300006175|Ga0070712_100160442Not Available1736Open in IMG/M
3300006175|Ga0070712_101106732Not Available688Open in IMG/M
3300006176|Ga0070765_101848427Not Available566Open in IMG/M
3300006354|Ga0075021_10370006All Organisms → cellular organisms → Bacteria895Open in IMG/M
3300006893|Ga0073928_10089014All Organisms → cellular organisms → Bacteria2621Open in IMG/M
3300006893|Ga0073928_10357621All Organisms → cellular organisms → Bacteria1079Open in IMG/M
3300006893|Ga0073928_10557998Not Available814Open in IMG/M
3300009176|Ga0105242_13283555Not Available503Open in IMG/M
3300009545|Ga0105237_10989374Not Available848Open in IMG/M
3300009553|Ga0105249_13078211Not Available536Open in IMG/M
3300010339|Ga0074046_10034017All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae3463Open in IMG/M
3300010339|Ga0074046_10119654All Organisms → cellular organisms → Bacteria1691Open in IMG/M
3300010339|Ga0074046_10289233All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Azospirillaceae1009Open in IMG/M
3300010341|Ga0074045_10256133Not Available1156Open in IMG/M
3300010343|Ga0074044_10090916All Organisms → cellular organisms → Bacteria2057Open in IMG/M
3300010343|Ga0074044_10330687Not Available1001Open in IMG/M
3300010343|Ga0074044_10989564Not Available551Open in IMG/M
3300010379|Ga0136449_100988778All Organisms → cellular organisms → Bacteria1356Open in IMG/M
3300010379|Ga0136449_101651719All Organisms → cellular organisms → Bacteria969Open in IMG/M
3300010379|Ga0136449_103776742Not Available570Open in IMG/M
3300012361|Ga0137360_11583681Not Available560Open in IMG/M
3300012924|Ga0137413_11847164Not Available500Open in IMG/M
3300012957|Ga0164303_10184037Not Available1141Open in IMG/M
3300012986|Ga0164304_11418949Not Available571Open in IMG/M
3300012987|Ga0164307_10566828Not Available870Open in IMG/M
3300012989|Ga0164305_11012607Not Available707Open in IMG/M
3300012989|Ga0164305_12219683Not Available505Open in IMG/M
3300013296|Ga0157374_11625657All Organisms → cellular organisms → Bacteria670Open in IMG/M
3300013297|Ga0157378_10095595All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Chelatococcaceae → Chelatococcus2706Open in IMG/M
3300013308|Ga0157375_11904073Not Available706Open in IMG/M
3300014969|Ga0157376_12278423Not Available581Open in IMG/M
3300020579|Ga0210407_10316118All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae1220Open in IMG/M
3300020580|Ga0210403_10414770Not Available1099Open in IMG/M
3300020581|Ga0210399_11168143Not Available613Open in IMG/M
3300020583|Ga0210401_10081609All Organisms → cellular organisms → Bacteria3043Open in IMG/M
3300021046|Ga0215015_10812951Not Available576Open in IMG/M
3300021046|Ga0215015_10838117All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. DH1547Open in IMG/M
3300021046|Ga0215015_10861021All Organisms → cellular organisms → Bacteria3556Open in IMG/M
3300021171|Ga0210405_10111117All Organisms → cellular organisms → Bacteria2164Open in IMG/M
3300021171|Ga0210405_10161576All Organisms → cellular organisms → Bacteria1774Open in IMG/M
3300021171|Ga0210405_10362415All Organisms → cellular organisms → Bacteria1143Open in IMG/M
3300021171|Ga0210405_10503629Not Available949Open in IMG/M
3300021171|Ga0210405_10650327Not Available818Open in IMG/M
3300021180|Ga0210396_11344217Not Available593Open in IMG/M
3300021181|Ga0210388_10716010Not Available871Open in IMG/M
3300021405|Ga0210387_10612214Not Available967Open in IMG/M
3300021405|Ga0210387_10905984Not Available776Open in IMG/M
3300021405|Ga0210387_10935746Not Available761Open in IMG/M
3300021407|Ga0210383_10812087All Organisms → cellular organisms → Bacteria800Open in IMG/M
3300021407|Ga0210383_11479262Not Available562Open in IMG/M
3300021420|Ga0210394_10805827All Organisms → cellular organisms → Bacteria820Open in IMG/M
3300021474|Ga0210390_10034395All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae4138Open in IMG/M
3300021478|Ga0210402_11624557Not Available573Open in IMG/M
3300021479|Ga0210410_10171280All Organisms → cellular organisms → Bacteria1940Open in IMG/M
3300022557|Ga0212123_10241630Not Available1303Open in IMG/M
3300025898|Ga0207692_10353313All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae907Open in IMG/M
3300025915|Ga0207693_10324646Not Available1205Open in IMG/M
3300025915|Ga0207693_10444900Not Available1013Open in IMG/M
3300025916|Ga0207663_11716998All Organisms → cellular organisms → Bacteria505Open in IMG/M
3300025939|Ga0207665_11220897Not Available600Open in IMG/M
3300026551|Ga0209648_10142237All Organisms → cellular organisms → Bacteria1906Open in IMG/M
3300027109|Ga0208603_1029628All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia861Open in IMG/M
3300027521|Ga0209524_1056735All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae824Open in IMG/M
3300027562|Ga0209735_1012714Not Available1651Open in IMG/M
3300027583|Ga0209527_1154689Not Available507Open in IMG/M
3300027603|Ga0209331_1124832Not Available620Open in IMG/M
3300027629|Ga0209422_1106403Not Available646Open in IMG/M
3300027651|Ga0209217_1125755Not Available720Open in IMG/M
3300027660|Ga0209736_1106889Not Available758Open in IMG/M
3300027667|Ga0209009_1121785Not Available662Open in IMG/M
3300027667|Ga0209009_1125401Not Available652Open in IMG/M
3300027684|Ga0209626_1129983Not Available661Open in IMG/M
3300027698|Ga0209446_1006134All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae2945Open in IMG/M
3300027727|Ga0209328_10196475All Organisms → cellular organisms → Bacteria609Open in IMG/M
3300027812|Ga0209656_10027710Not Available3394Open in IMG/M
3300027812|Ga0209656_10064505All Organisms → cellular organisms → Bacteria2015Open in IMG/M
3300027812|Ga0209656_10216992Not Available919Open in IMG/M
3300027812|Ga0209656_10233187Not Available878Open in IMG/M
3300027824|Ga0209040_10356432Not Available692Open in IMG/M
3300027908|Ga0209006_10543248Not Available965Open in IMG/M
3300027908|Ga0209006_10911276Not Available705Open in IMG/M
3300028047|Ga0209526_10183387Not Available1455Open in IMG/M
3300028047|Ga0209526_10262905Not Available1177Open in IMG/M
3300028906|Ga0308309_10413698All Organisms → cellular organisms → Bacteria1158Open in IMG/M
3300029636|Ga0222749_10739842Not Available538Open in IMG/M
3300029943|Ga0311340_10615984All Organisms → cellular organisms → Bacteria946Open in IMG/M
3300030503|Ga0311370_11802758Not Available622Open in IMG/M
3300031023|Ga0073998_10059586Not Available728Open in IMG/M
3300031231|Ga0170824_114176443Not Available635Open in IMG/M
3300031234|Ga0302325_10544580All Organisms → cellular organisms → Bacteria1734Open in IMG/M
3300031236|Ga0302324_101005234All Organisms → cellular organisms → Bacteria1133Open in IMG/M
3300031708|Ga0310686_113973931All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1138Open in IMG/M
3300031715|Ga0307476_11199476Not Available556Open in IMG/M
3300031718|Ga0307474_11441791Not Available542Open in IMG/M
3300031753|Ga0307477_10425293All Organisms → cellular organisms → Bacteria908Open in IMG/M
3300031754|Ga0307475_11543388Not Available509Open in IMG/M
3300031962|Ga0307479_10854214All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Isosphaerales → Isosphaeraceae → Aquisphaera → Aquisphaera giovannonii884Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil18.18%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil16.53%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil13.22%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.26%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil7.44%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil5.79%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.13%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring3.31%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds3.31%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa3.31%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere3.31%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil2.48%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil2.48%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.83%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.83%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.83%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300003219Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM3EnvironmentalOpen in IMG/M
3300003368Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2EnvironmentalOpen in IMG/M
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300004635Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006059Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012EnvironmentalOpen in IMG/M
3300006162Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010339Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3EnvironmentalOpen in IMG/M
3300010341Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM2EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014969Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaGHost-AssociatedOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027109Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF008 (SPAdes)EnvironmentalOpen in IMG/M
3300027521Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027562Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027603Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027629Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027698Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027727Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027812Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027824Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300029943I_Palsa_N3 coassemblyEnvironmentalOpen in IMG/M
3300030503III_Palsa_E3 coassemblyEnvironmentalOpen in IMG/M
3300031023Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil TCEFA (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031234Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2EnvironmentalOpen in IMG/M
3300031236Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J12803_10244492413300000955SoilELEASKQSRLRAWNVLQKLRSVLSEAGNVAIPPPNQKTFDAEGEALEHALTKSFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQI*
JGI12635J15846_1029425533300001593Forest SoilMSITLKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSELGSVAIPPPAQKTFDAEGEALEHVLRKSFRLRNDAIKSLCSSVRRFRDATIKDECKGDYPHALQALWKALDRAEDLIQN*
JGI12635J15846_1063946313300001593Forest SoilMSITPKAIQDLLDELEASKQSRLRAWNVLQRLRSVLSEAGNVAIPPLAHKTFDGEGEVLEDALTKSLRIRNEAIKSLCSSVRRFRDATIKEECKRDYPHALQALWKALDRAENLIQN*
JGIcombinedJ26739_10025259923300002245Forest SoilMEITPNAIQDLLDELEASKRSRLRAWNALQRLRLVLSKLGNISIPPPAQKTFDSEGEILEHALTKTFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
JGIcombinedJ26739_10053089423300002245Forest SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMLLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN*
JGI26341J46601_1000852653300003219Bog Forest SoilMSITPRAIQDLLDELETSKQSRLRAWNVLQXLRLVLSELGNVAIPSPAQKTFDAEGEILEHALTKCFRARSDAIKNLCSAVRRFRDATLKESCKGDYPQALQAMLKALDRAEDLI
JGI26341J46601_1004781023300003219Bog Forest SoilMSITPKAIQDLLDELEASKQSRLRAWRVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN*
JGI26340J50214_1000621833300003368Bog Forest SoilMSITPRAIQDLLDELETSKQSRLRAWNVLQRLRLVLSELGNVAIPSPAQKTFDAEGEILEHALTKCFRARSDAIKNLCSAVRRFRDATLKESCKGDYPQALQAMLKALDRAEDLI*
Ga0062385_1072387823300004080Bog Forest SoilMSITPKAIQDLLEELEASKQSRLRAWSVLQRLRVVLSELGNVAIPAAKRKTFDAEGEILEHALTKCFRARNDAIKSLCSSVRRFRDATIKEKCKGDYPQALQAMFR*AKTGR*
Ga0062389_10104437913300004092Bog Forest SoilMSITPKAIQDLLDELEASKQSRLRAWKVLQRLRLVLSETGNIAIPPPAQKTFDAEGEILEHALKKSLRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLTQS*
Ga0062389_10262483313300004092Bog Forest SoilMLGMAITPKAIQDLLDELESSKQSRVRAWNVLQRLRLVLSDTGNVAIPPPVQKTFDAEGEALEHALTKTFRIRNEAIKSLCSSERRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0062389_10315792613300004092Bog Forest SoilMNTQRMSITPKAIQDLLDELEASKQSRLRAWNVLQKLRLVLSELGNVPIPPPAQKTFDAEGEILEHALTKSFRIRNEAIKSLCSSVRRFRDATIKEECKRVIRTLFRRFGKH*
Ga0062386_10031940223300004152Bog Forest SoilLEASKQSRLRAWNALQRIRALLSDLGKTEIAPAAQKTFDAEGEILEHALTKSLHLQNESIKSLCSSVRRFPDATIKEECKADYPHAPQALWKALDRAEDLIQS*
Ga0062386_10176844113300004152Bog Forest SoilLRAWNALQRLRAILSETGNAAIPSPAQKTFDAEGEILEHALTKSFRIRNEAIKKLCSAARRFRDATVKEECKRDYPHALQALWKALDRAEDSIQN*
Ga0062388_10231752813300004635Bog Forest SoilDELESSKQSRLRAWNVLQRLRLILSDTGNVAIPPPAQKTFDAEGEALEHALTKTFRIRNEAIKSLCSSERRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
Ga0068868_10240229313300005338Miscanthus RhizosphereDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSAQKTFDAEGEILEHALAKSYRIRDEAIKSLCASVRRFRDATIKEERKSDFPHALQALWKALDQAENLIQS*
Ga0070711_10187372723300005439Corn, Switchgrass And Miscanthus RhizosphereIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGSVAIPPPAQKTFDPEGEALEHALRKSFRRRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN*
Ga0070665_10047270623300005548Switchgrass RhizosphereMGITPQAIQDLLDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSAQKTFDAEGEILEHALAKSYRIRDEAIKSLCASVRRFRDATIKEERKSDFPHALQALWKALDQAENLIQS*
Ga0070764_1096199523300005712SoilMAITPKAIQDLLDELESSRQSRLRAWNVLQRLRLVLSDNGNVAVPPPAQKAFDAEGEALEHALKKSLRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALD
Ga0075029_10053373813300006052WatershedsMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSELGSVAIPPPAQKTFYAEGETLEQSLKKCLRLRNDALKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQNE*
Ga0075017_10117815923300006059WatershedsMSFTSKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSELGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPQALQALWKALDRAEDLIQN*
Ga0075030_10161600123300006162WatershedsMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSELGSVAIPPPAQKTFDAEGEALEHALRKSFHLRNDAIKSLCSSVRRFRDATIKEECKGDYPQALQALW
Ga0070716_10090546013300006173Corn, Switchgrass And Miscanthus RhizosphereEASKQSRLRAWNVLQKLRSVLSEAANVAIPPPNQKTFDAEGEALEHALTKSFRIRNEAIKTLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
Ga0070712_10007709423300006175Corn, Switchgrass And Miscanthus RhizosphereMNTSRMSITPKAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMGNVAIPAPTQKTFEAEGEILENVLTKSFRIRNEAIKSLCSSVRRFRDAIIKEECRSDYPHALQALWKALDRAEDLIQS*
Ga0070712_10016044213300006175Corn, Switchgrass And Miscanthus RhizosphereMLITPKAIQDLLDELEASKQSRMRAWNALQRFRSVLSEAGHVAIPASVQKTFDEEGAILENALTKSFRIRNDAIKSLCSSVRRFRDATIKEECKCDYPHALQ
Ga0070712_10110673223300006175Corn, Switchgrass And Miscanthus RhizosphereMSITPKAIQDLLDELEASKQSRLRAWKALQRLRSVLSEVGNVAIPAPAQKTFDAEGELLEHALTKSFRIRNEAMKSLCSSVRRFRDATIKEECKSDYANALQALWKALDRAEDLIQN*
Ga0070765_10184842723300006176SoilTPKAIQDLLDELEASKQNRLRAWNVLQRLRSVLCEVANVAIPPPAQQTFDAEGEVLEHALTKSLHLRNEAIKSLCSSVRRFRDATIQKECKSDYPRALQALWKALDRAEDLIQS*
Ga0075021_1037000613300006354WatershedsGHSREVAGCWIPRVGACNALQRLRLVLSENGNVVVPLPAQKTFEADGEILEHALTKSFRIRNEAIKSLCSSVRRFLDATVKEECKRDYPHALQALWKAPDRGEDLIQN*
Ga0073928_1008901423300006893Iron-Sulfur Acid SpringMSITPKAIQDLLDELEASKQSRLRAWNVLQKLRLVLSELGNVPIPPPAQKTFDTEGEILEHALTKSFRIRNEAIKSLFSSVRRFRDATIEEECKRDYPHALQAWLKALDWAEDLIQN*
Ga0073928_1035762123300006893Iron-Sulfur Acid SpringMNTTSMSITPKAIQDLLDDLEASKQSRLRAWSVLQRLRMVLSEAGSVAIPPPAQKTFDAEGEALEHALRKSLRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN*
Ga0073928_1055799823300006893Iron-Sulfur Acid SpringLEASKQSRLRAWNVLQRIRAILSEFGNATIPPATQKTFDAEGEVLEHAITRSFRIRNDAIKSMCSAVRRFRDATIKEDCKGDYPQAREALWKALDRAEDLIQN*
Ga0105242_1328355513300009176Miscanthus RhizosphereMSITPKAIQDLLEELEASKQSRLRAWNVLQKLRSVLSEAGNVAIPPPNQKTFDAEGEALEHALTKSFRIRNEAIKTLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
Ga0105237_1098937413300009545Corn RhizosphereMGITPQAIQDLLDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSAQKTFDAEGEILEHALAKSYRIRDEGIKSLCASVRRFRDATIKEERKSDFPHALQALWKALDQAENLIQS*
Ga0105249_1307821113300009553Switchgrass RhizosphereKARRKVVGAIERIQDLLDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSTQKTFEAEGEILEHALAKSYRIRDEAIKSLCASVRRFRDAAIKEDRKSDFPHALQALWKALDQAENLIQS*
Ga0074046_1003401733300010339Bog Forest SoilMSITPRAIQDLLDELETSKQSRLRAWNVLQRLRLVLSELGNVAIPSPAQKTFDAEGEILEHALTKCFRARSDAIKNLCSAVRRFRDATLKESCKGDYPQALQAMLKALDRAEDLIQN*
Ga0074046_1011965423300010339Bog Forest SoilMCSYEHCINVDHTEGNSRLLDELEASKQSRLLAWSVLQRLRMVLSEMGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKNLCSSVRRFRDATIKDECKGDYPHALQALWKALDRAEDLIQNRTM*
Ga0074046_1028923313300010339Bog Forest SoilLLEELEASKQSRLRAWSVLQRLRVVLSALGNVAIPAPKQKTFNAEGEILEHALTKCFRARNDAIKSLCSSVRRFRDATIKEKCKGDYPQALQAMLKALDRAEDLIQN*
Ga0074045_1025613323300010341Bog Forest SoilMSITPKAIQDLLDELEASKQSRLRAWNVLQKLRLVLSELGNVPIPPPAQKTFDAEGEILEHALTKSFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQAL*
Ga0074044_1009091623300010343Bog Forest SoilMSITPKAIQDLLDELEASKQSRLRAWNVLQKLRLVLSELGNVPIPPPAQKTFDAEGEILEHALTKSFRIRNEAIKSLCSSVRRFRDATIKAECRSDYPHALQALWKALDRADDLIQN*
Ga0074044_1033068713300010343Bog Forest SoilMSITPEAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDHPHALQALWKALDRAEDLIQN*
Ga0074044_1098956413300010343Bog Forest SoilLLEELEASEQSRLPAWSVLQRLRVVLSELGNVAIPAPKQKTFNAEGEILEHALTKCFRARSDAIKNLCSAVRRFRDATLKESCKGDYPQALQAMLKALDRAEDL
Ga0136449_10098877813300010379Peatlands SoilLEVSKENRLRAWYVLQRLCSVLSENGNITIPPPAQKTFDAEGEILEHALTKTLHLRNEAIKSLCSADRHFRDATVKPECRSDYPHALQAFWKALDRAEDLIQS
Ga0136449_10165171913300010379Peatlands SoilLRAWNVLQRLRAILSETGNVAIPPPAQEKFDAEGKILEHALTKSFMIRDEAIKSLCSSVRRFRDITIKAECRSDYPHALQALWKALDRAEDLIQN*
Ga0136449_10377674213300010379Peatlands SoilVSAGFVCVGISIILDLEYVHMNTLAMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEVGSVAIPLPAQKTFDAEGEALEHALRKSFRLRNVAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN*
Ga0137360_1158368123300012361Vadose Zone SoilMSITPKSIQDVLDELEAPKQSRLRAWNALQRLRSILSEMGNAPIPQPSQKTFEAQGEILEHALTKSLRIRNDAIKSLCFAVRRFRGATLKEDCKGDYPKALQAMLKALSSSSSD
Ga0137413_1184716413300012924Vadose Zone SoilRMSITPKAIQDLLEELEASKQSRLRAWNVLQKLRSVLSEAGNVAIPPPRQKTFDAEGEALEHALTKSFRIRNEAIKNLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
Ga0164303_1018403723300012957SoilMRAWNALQRFRSVLSEAGHVAIPASVQKTFDEEGAILENALTKSFRIRNDAIKSLCSSVRRFRDATIKEECKCDYPHALQALWKALDRAEDLIQN*
Ga0164304_1141894913300012986SoilMGITPQAIQDLLDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSAQKTFDVEGEILEHALAKSYRIRDEAIKSLCASVRRFRDATIKEERKSDFPHALQALWKALDQAENLIQS*
Ga0164307_1056682823300012987SoilMGITPQAIQDLLDELEASKQSRLRAWNVLQRLRLILTEAGNVPVPPSPQKTFDAEGEILEHALAKSYRIRDEAIKSLCASVRRFRDATIKEERKSDFPHALQALWKALDQAENLIQS*
Ga0164305_1101260713300012989SoilQSRMRAWNALQRFRSVLSEAGHVAIPASVQKTFDEEGAILENALTKSFRIRNDAIKSLCSSVRRFRDATIKEECKCDYPHALQALWKALDRAEDLIQN*
Ga0164305_1221968323300012989SoilESSKQSRLRAWNVLQRCRLVLSDSGNVAIPPPAQKTFDAEGEALEHALAKTFRTRSEAIKSLCSSVRRFRDATIKEECKSDYPHALQGFVESVR*
Ga0157374_1162565723300013296Miscanthus RhizosphereMGITPQAIQDLLDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSTQKTFEAEGEILEHALAKSYRIRDEAIKSLCASVRRFRDATIKEDRKSDFPQALQALWKALDQAENLIQS*
Ga0157378_1009559533300013297Miscanthus RhizosphereVNTFGMSITPKAIQDLLEELEASKQSRLRAWNVLQKLRSVLSEAANVAIPPPNQKTFDAEGEALEHALTKSFRIRNEAIKTLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
Ga0157375_1190407313300013308Miscanthus RhizosphereMGITPQAIQDLLDELEASKQSRLRAWNVLQRLRLVLTEAGNVAVPPSAQKTFDAEGEILEYALAKSYRIRDEAIKSLCASVRRFRDATIKEERKSDFPHALQALWKALDQAENLIQS*
Ga0157376_1227842313300014969Miscanthus RhizosphereMSITPKAIQDLLEELEASKQSRLRAWNVLQKLRSLLSEAGNVAIPPPNQKTFDAEGEALEHALTKSFRIRNEAIKTLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN*
Ga0210407_1031611823300020579SoilMAITPQAIQDLLDELEASKQSRLRAWHVLQRLRAVLSELGNIAIPPPKQKTFDLEGEILEHALTKCLQTRNDALRNLCSSVRRFRDATLKEECKGDYPQALQSMLKALDRAEDVIQT
Ga0210403_1041477023300020580SoilMSITPKAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMGNVAIPAPTQKTFEAEGEILENVLTKSFRIRNEAIKSLCSSVRRFRDAAIKEECKSDYPHALQALWKALDRAEDLIKD
Ga0210399_1116814313300020581SoilMSITPKAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMGNVAIPAPTQKTFEAEGEILENALTKSLHLRNEAIKSLCSSVRRLRDATIREECKSDYPHALQALWKALDRAEDLIKD
Ga0210401_1008160933300020583SoilMSITPKAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMGNVAIPAPTQKTFEAEGEILENVLTKSFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALSKALDRAEDLIQN
Ga0215015_1081295113300021046SoilYHMSITPKAIQDLLDELEASKRSRLRAWTVLQRLRLVLSENGNVAIPPPAQKTFNAEGEILEHALTKSFRIRDDAIKSLCSSVRRFRDATIKEECKRDYPHALQALWKALDRAEDLIQNWNDV
Ga0215015_1083811713300021046SoilLDQKCVRENTISMSITPKAIQDLLDELEASKQSRLRAWSLLQRLRMVLSELESVAIPSPAKKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATVKEECKGDYPHALQALWKALDRAEDLIQN
Ga0215015_1086102153300021046SoilMSITPKAIQDLLDELESSKQSRLRAWNVLQRLRAVLSDLGDAAIPSPAQKTFDTEGEILKHALTKCLRARNDAIKSLCSAVRRFRDATSKEDCKSDYPRALQAMLKALS
Ga0210405_1011111723300021171SoilMPITPKAIQDLLDELEASKQNRLRAWNVLQRLRSVLCEVANVAIPPPAQQTFDAEGEDLEHALTKSLHLRNEAIKSLCSSVRRFRDATIQKECKSDYPRALQALWKALDRAEDLIQS
Ga0210405_1016157633300021171SoilMSITPKAIQNLLDELEASKQSRLRAWNVLQRLRLVLSEAGNVAIPPPAQKTFDAEGEILEQALTKTLHLRNEAIKSLCSSVRRFRDATMRQECKSDYPHALQALWKALDRAEDLISELRIFL
Ga0210405_1036241523300021171SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGSVAIPPPVQKTFDAEGETLEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGD
Ga0210405_1050362923300021171SoilMSITPTAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMANVAIPAPTQKTFEAEGEILENALTKSLHLRNEAIKSLCSSVRRFRDATIREECKSDYPHALQALWKALDRAEDLIKD
Ga0210405_1065032723300021171SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGSVVIPPAAQKTFDTEGQALEHALRKSFRLRNDAIKNLCSSVRRFRDATIKEECKGDYPHAV
Ga0210396_1134421723300021180SoilLDELEASKQSRLRAWNALQRLRSVLSEMANVAIPAPTQKTFEAEGEILENALTKSLHLRNEAIKSLCSSVRRLRDATIREECKSDYPHALQALWKALDRAEDLIKD
Ga0210388_1071601023300021181SoilMSITPKAIQDLLDELEASKQSRLRAWKALQRLRSVLSEMGNVAIPAPAQKTFEAEGEILEHALTRSFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0210387_1061221423300021405SoilMSITPTAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMANVAIPAPTQKTFEAEGEILENALTKSLHLRNEAIKSLCSSVRRLRDATIREECKSDYPHALQALWKALDRAEDLIKD
Ga0210387_1090598423300021405SoilSDEHYADMSITPKAIQDLLDELEASKQSRLRAWNALQRIRALLSDLGKIEISPPADKTFDAEGEILEHALSKSLRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0210387_1093574613300021405SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEAGNVAIPAPAQMTFNAEGEILEHALTKSFRIRNEAMKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0210383_1081208713300021407SoilMAITPKAIQDSLDGLESSKQSRLRAWNVLQRLRLVLSDAGNVAISPPAQKTFDAEGEALEHALTKACRIRNEAIKSLSSSVRRFRDATIKEECKSDYPHAL
Ga0210383_1147926213300021407SoilMSITPTAIQDLLDELEASKQSRLRAWNALQRLRSVLSEMANVAIPAPTQKTFEAEGEILENALTKSLHLRNEAIKSLCSSVRRFRDATSREECKSDYPHALQALWK
Ga0210394_1080582713300021420SoilMSITPKAIQNLLDELEASKQSRLRAWNVLQRLRLVLSEAGNVAIPPPAQKTFDAEGEILEQALTKTLHLRNEAIKSLCSSVRRFRDATMRQECKSDYPHALQALWKALDRAEDLISELRI
Ga0210390_1003439523300021474SoilMSITPKAIQDLLDELEASKQSRLRAWKVLQRLRLVLSETGDIAIPPSAQKTFDAEGEILEHALTKSFRIRNDAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0210402_1162455713300021478SoilLEASKQSRLRAWNVLQRIRALLSDLGKTAIPPPAHKTFDAEGEILEYTLTKSLRIRNDAIKSLCSAVRRFRDATLKEDRKGDYPQALQALLKALDRAEDLIQD
Ga0210410_1017128023300021479SoilMSITPKAIQDLLDELEASKQSRLRAWNALQRFRSVLSEMGNVAIPAPTQKTFEAEGEILENVLTKSFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALSKALDRAEDLIQN
Ga0212123_1024163013300022557Iron-Sulfur Acid SpringMSITPKAIQDLLDELEASKQSRLRAWNVLQKLRLVLSELGNVPIPPPAQKTFDTEGEILEHALTKSFRIRNEAIKSLFSSVRRFRDATIEEECKRDYPHALQAWLKALDWAEDLIQN
Ga0207692_1035331333300025898Corn, Switchgrass And Miscanthus RhizosphereASKQSRLRAWKALQRLRSVLSEVGNVAIPAPAQKTFDAEGELLEHALTKSFRIRNEAMKSLCSSVRRFRDATIKEECKSDYPNALQALWKALDRAEDLIQN
Ga0207693_1032464613300025915Corn, Switchgrass And Miscanthus RhizosphereMLITPKAIQDLLDELEASKQSRMRAWNALQRFRSVLSEAGHVAIPASVQKTFDEEGAILENALTKSFRIRNDAIKSLCSSVRRFRDATIKEECKYDYPHSHSSGFVESVGSAGLPRTTT
Ga0207693_1044490023300025915Corn, Switchgrass And Miscanthus RhizosphereMSITPKAIQDLLDELEASKQSRLRAWKALQRLRSVLSEVGNVAIPAPAQKTFDAEGELLEHALTKSFRIRNEAMKSLCSSVRRFRDATIKEECKSDYANALQALWKALDRAEDLIQN
Ga0207663_1171699813300025916Corn, Switchgrass And Miscanthus RhizosphereASKQSRLRAWSVLQRLRMVLSEVGSVAIPPPAQKTFDPEGETLEHALRKSFRLRNDAIKSLCSSVRRFRDASIKEECKGDYPHALQALWKAPDRAEDLIQN
Ga0207665_1122089713300025939Corn, Switchgrass And Miscanthus RhizospherePKAIQDLLDELEASKQSRMRAWNALQRFRSVLSEAGHVAIPASVQKTFDEEGAILENALTKSFRIRNNAIKSLCSSVRRFRDATVKPECKSDYPHALQALWKALDRAEDLIQNSAMCGRYRRTTAEEEPTRRNIKESC
Ga0209648_1014223723300026551Grasslands SoilMTITPKAIQDLLDELESSKQSRLRAWNVLQRLRLVLSDTGYVAIPPPAQKTFDAEGQALEHALTKSFRIRNEAIKSLCSSVHRFRDATIKGECKSDYPHALQALWKALDRAEDLIQS
Ga0208603_102962813300027109Forest SoilMNTLLMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGSVAIPPPAQKTFDAEGEVLEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQK
Ga0209524_105673523300027521Forest SoilVNTISISITPKAIQDLLDELEASKQSRLRAWSVLQRLRMLLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRRRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0209735_101271433300027562Forest SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSELGSVAIPPPAQKTFDAEGEALEHALTKPFRLRYDAIKSLCSSVRRFRDATIKEECKGDYPHALQ
Ga0209527_115468923300027583Forest SoilEASKQNRLRAWNVLQRLRSVLCEVANVAIPPPAQQTFDAEGEVLEHALTKSLHLRNEAIKSLCSSVRRFRDATIQKECKSDYPRALQALWKALDRAEDLIQS
Ga0209331_112483223300027603Forest SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMLLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0209422_110640313300027629Forest SoilMSITLKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSELGSVAIPPPAQKTFDAEGEALEHVLRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0209217_112575513300027651Forest SoilMEITPNAIQDLLDELEASKRSRLRAWNALQRLRLVLSKLGNISIPPPAQKTFDSEGEILEHALTKTFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQD
Ga0209736_110688923300027660Forest SoilMNTISISITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSETGGVAIPPPAQKTFDAEGEVLENALRKSFRLRNDAIKSLCSSVRRFRDAIIKEECKGDYPHAL
Ga0209009_112178513300027667Forest SoilMSITPKAIQDLLDELESSKQSRLRAWNVLQRLRSVLSDTGNVAIPPPAQKTFDAEGEALEHALTKSFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDL
Ga0209009_112540113300027667Forest SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLRSSVRRFRDATIKEECKGDYPHALQALWKALDRAEELIQN
Ga0209626_112998313300027684Forest SoilMNTISMSIIPKAIQDLLDELQASKQSRLRAWSVLQRLRMVLSEMGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0209446_100613423300027698Bog Forest SoilMSITPRAIQDLLDELETSKQSRLRAWNVLQRLRLVLSELGNVAIPSPAQKTFDAEGEILEHALTKCFRARSDAIKNLCSAVRRFRDATLKESCKGDYPQALQAMLKALDRAEDLI
Ga0209328_1019647523300027727Forest SoilMSITPKAIQDLLDELEASKQSRLRAWNVLQRLRSVLSEAGNIEIPPPAQKTFDAEGEALEHALTKSFRIRNEAIKSLFSSVRRFRDATIKAECKSDYPHALQALWKALDRAENLIKG
Ga0209656_1002771053300027812Bog Forest SoilMSITPKAIQDLLDELEASKQSRLRAWRVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0209656_1006450513300027812Bog Forest SoilMCSYEHCINVDHTEGNSRLLDELEASKQSRLLAWSVLQRLRMVLSEMGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKNLCSSVRRFRDATIKDECKGDYPHALQALWKALDRAEDLIQNRTM
Ga0209656_1021699213300027812Bog Forest SoilMSITPKAIQDLLDELEGSKQSRLRAWRVLQKLRFALSELGNVEIPPPAQKTFDAEGEILEHALTKSFQIRNEAIKSLCTSVRRFRDATIKEECKRDYPHALQALWKALDRAEDLIQN
Ga0209656_1023318713300027812Bog Forest SoilLRAWNALQRLRAILSETGNAAIPSPAQKTFDAEGEILEHALTKSFRIRNEAIKKLCSAARRFRDATVKEECKRDYPHALQALWKALDRAEDSIQN
Ga0209040_1035643213300027824Bog Forest SoilWPSKCSHEPMSITPKAIQDLLEELEASKQSRLRAWSVLQRLRVVLSELGNVAIPAAKRKTFDAEGEILEHALTKCFRARNDAIKSLCSSVRRFRDATIKEKCKGDYPQALQAMFR
Ga0209006_1054324813300027908Forest SoilMAITPKAIQDLLDELESSKQSRLRAWNVLQRFRLVLSETGNVAVPPSAQKTFDAEGEALEHSLTKSLRIRNEAIRSLCSSVRRFRDATIKEDCKSDYPHALQALWKALDRAEDLIQN
Ga0209006_1091127613300027908Forest SoilVLMNTFQPSITPKAIQDLRDELEASKQSRLRAWKVLQSLRWVLSEAGNVTIPPRAQKTFDAEGEILEHALTKSFRLQNDAIKSLCSSVRRFRDATIKEECRSDYPHALQALWKALDRAEDLIRD
Ga0209526_1018338723300028047Forest SoilMEITPNAIQDLLDELEASKRSRLRAWNALQRLRLVLSKLGNISIPPPAQKTFDSEGEILEHALTKTFRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0209526_1026290513300028047Forest SoilMPITPKAIQDLLDELEASKQNRLRAWNVLQRLRSVLCEVANVAIPPPAQQTFDAEGEVLEHALTKSFRIRNDAIESLCSRVRRFRDATIKEECKRDYPHALQALWKAL
Ga0308309_1041369813300028906SoilMTITPKAIRDLLDKLEASRQSRLRAWSVLQRLRMVLSDMGSVAIPPPAQKTFYAEGETLEQSLKKSLRLRNDALKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQNE
Ga0222749_1073984223300029636SoilMSVSITPKAIQDLLDELEASKQSRLRAWSVLQRLCMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHAVQALWKALDRAEDLIQN
Ga0311340_1061598423300029943PalsaMCSYEHYIMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGNVAIPPAAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATSKEECKGDYPHALQALWKALDRAEDLIQN
Ga0311370_1180275813300030503PalsaMAVTPKAIQDLLDELESSKQSRLRAWKVLQRLRSVLSDSGNVAIPPPAQRTFDAEGEALEHALAKTLRIRNEAIKSLCSSVRRFRDATIKEECKSDYPHALQALWKALDRAEDLIQN
Ga0073998_1005958613300031023SoilEASKQSRLRARNVLQRLRLVLSEAGNVTIPPPAQKTFDAEGEALEHALTRSFRIRNEAIKSLCSSVRRFRDATINEECKYDYPHSPSSGFVEALDRLGLPRTST
Ga0170824_11417644323300031231Forest SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFCLRTDAIKSLCSSVRRFRDATTKEECKGDYPHALQALWKALDRAEDLIQN
Ga0302325_1054458013300031234PalsaMCSYEHYIMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGNVAIPPAAQKTFDAEGEALEHALRKSFRLRNDAIKSQALWKALDRAEDLIQN
Ga0302324_10100523423300031236PalsaMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEMGNVAIPPAAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATSKEECKGDYPHALQALWKALDRAEDLIQN
Ga0310686_11397393123300031708SoilMSITPKAIQDLLDELETSKQSRLRAWNVLQRLRLVLSELGNVAIPPPAQKTFDAEGEILEHALTKCFRARNDAIKNLCSAVRRFRDATLKENCKGDYPQALQAMLKA
Ga0307476_1119947613300031715Hardwood Forest SoilMSITPKAIQDLLDELEASKQSRLRPWSMLQRLRMVLSEMGSVAIAPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRVRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0307474_1144179113300031718Hardwood Forest SoilMAITPEAIQDLLDELESSKQSRLRAWNVLQRLRLVLSDTGNVAVPPPAQKTFDAEGEVLEHALTKSFRIRNEAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQ
Ga0307477_1042529323300031753Hardwood Forest SoilMSNTPKAIQDLLDELEASKQSRLRAWNVLQRLRSVLSEAGNVAIPPPAQKTFDAEGEALEHALTRSLRIRNEAIKSLCSSVRRFRDATIRQECKSDYPHALQAL
Ga0307475_1154338813300031754Hardwood Forest SoilMSITPKAIQDLLDELEASKQSRLRAWSVLQRLRMVLSEVGSVAIPPPAQKTFDAEGEALEHALRKSFRLRNDAIKSLCSSVRRFRDATIKEECKGDYPHALQALWKALDRAEDLIQN
Ga0307479_1085421423300031962Hardwood Forest SoilMNTIPKPITPKAIQDLVDELEASKQSRLRAWNVLQRLRLVLSVAGNVTIPPPAQKTFDAEGEALEHALTKSFRHRNDAIKSLCSSVRRFRDATIKEECKHDYPHALQALWKALDRAEDLIQD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.