NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F075131


Overview

Basic Information
Family ID F075131
Family Type Metagenome / Metatranscriptome
Number of Sequences 119
Average Sequence Length 109 residues
Representative Sequence MQTIEEHRAHHGSINLVSLKTEILSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAADGQQIQVEWVARGEATFCEGHANIARQAPTTSKRERR
Number of Associated Samples 81
Number of Associated Scaffolds 119

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 57.14 %
% of genes near scaffold ends (potentially truncated) 28.57 %
% of genes from short scaffolds (< 2000 bps) 69.75 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.57


Most Common Taxonomy
Group Bacteria (59.664 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (37.815 % of family members)
Environment Ontology (ENVO) Unclassified (34.454 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (64.706 % of family members)
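
The percentages reported in the Overview can be read back as whole numbers of family members. The Python sketch below does that conversion. It assumes, which the page does not state explicitly, that each percentage uses the family's Number of Sequences (119) as its denominator; the name percent_to_count and the dictionary labels are illustrative only.

```python
# Assumption: every percentage in the Overview is a fraction of the
# 119 sequences listed under "Number of Sequences".
FAMILY_SIZE = 119


def percent_to_count(percent, total=FAMILY_SIZE):
    """Convert a reported percentage into the nearest whole number of members."""
    return round(percent / 100 * total)


# Values copied from the Overview; the labels are shortened for readability.
reported = {
    "genes with valid RBS motifs": 57.14,
    "genes near scaffold ends": 28.57,
    "genes from short scaffolds (< 2000 bps)": 69.75,
    "members assigned to Bacteria": 59.664,
    "members from the most common GOLD ecosystem": 37.815,
    "members in EMPO Soil (non-saline)": 64.706,
}

for label, percent in reported.items():
    print(f"{label}: {percent_to_count(percent)} of {FAMILY_SIZE}")
```

Rounding to the nearest integer is a deliberate choice here: the page reports two or three decimal places, so the recovered counts are only as reliable as the assumed denominator.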




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 35.38%, β-sheet: 12.31%, Coil/Unstructured: 52.31%

Predicted 3D Structure

pTM-score: 0.57

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 119 Family Scaffolds
PF13519 | VWA_2 | 4.20
PF13581 | HATPase_c_2 | 4.20
PF07110 | EthD | 4.20
PF00072 | Response_reg | 3.36
PF11008 | DUF2846 | 1.68
PF00092 | VWA | 1.68
PF00296 | Bac_luciferase | 1.68
PF13444 | Acetyltransf_5 | 1.68
PF10282 | Lactonase | 1.68
PF12704 | MacB_PCD | 1.68
PF13088 | BNR_2 | 0.84
PF13564 | DoxX_2 | 0.84
PF08281 | Sigma70_r4_2 | 0.84
PF13592 | HTH_33 | 0.84
PF04820 | Trp_halogenase | 0.84
PF04542 | Sigma70_r2 | 0.84
PF07992 | Pyr_redox_2 | 0.84
PF00665 | rve | 0.84
PF09836 | DUF2063 | 0.84
PF04224 | DUF417 | 0.84
PF00248 | Aldo_ket_red | 0.84
PF02321 | OEP | 0.84
PF05114 | DUF692 | 0.84
PF00082 | Peptidase_S8 | 0.84
PF12697 | Abhydrolase_6 | 0.84
PF01047 | MarR | 0.84
PF08447 | PAS_3 | 0.84
PF13185 | GAF_2 | 0.84
PF13490 | zf-HC2 | 0.84

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 1.68
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 1.68
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 0.84
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 0.84
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.84
COG2801 | Transposase InsO and inactivated derivatives | Mobilome: prophages, transposons [X] | 0.84
COG2826 | Transposase and inactivated derivatives, IS30 family | Mobilome: prophages, transposons [X] | 0.84
COG3059 | Reactive chlorine resistance protein RclC/YkgB, DUF417 family | Defense mechanisms [V] | 0.84
COG3220 | Uncharacterized conserved protein, UPF0276 family | Function unknown [S] | 0.84
COG3316 | Transposase (or an inactivated derivative), DDE domain | Mobilome: prophages, transposons [X] | 0.84
COG4584 | Transposase | Mobilome: prophages, transposons [X] | 0.84
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 0.84



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 59.66 %
Unclassified | root | N/A | 40.34 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300001087|JGI12677J13195_1002665 | All Organisms → cellular organisms → Bacteria | 1677
3300001471|JGI12712J15308_10181961 | Not Available | 549
3300002245|JGIcombinedJ26739_100015727 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6356
3300003505|JGIcombinedJ51221_10231636 | Not Available | 750
3300004082|Ga0062384_100476459 | Not Available | 822
3300004092|Ga0062389_100734980 | Not Available | 1163
3300005435|Ga0070714_100081111 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Silvibacterium → Silvibacterium bohemicum | 2824
3300005436|Ga0070713_100145321 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Silvibacterium → Silvibacterium bohemicum | 2104
3300005445|Ga0070708_100000363 | All Organisms → cellular organisms → Bacteria | 34322
3300005467|Ga0070706_100006366 | All Organisms → cellular organisms → Bacteria | 11148
3300005471|Ga0070698_100002693 | All Organisms → cellular organisms → Bacteria | 19540
3300005536|Ga0070697_100003031 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 12910
3300005538|Ga0070731_10520632 | Not Available | 792
3300005541|Ga0070733_10158517 | All Organisms → cellular organisms → Bacteria | 1471
3300005542|Ga0070732_10066250 | All Organisms → cellular organisms → Bacteria | 2100
3300005542|Ga0070732_10974184 | Not Available | 518
3300005602|Ga0070762_10100701 | Not Available | 1672
3300005610|Ga0070763_10483087 | Not Available | 706
3300005921|Ga0070766_10135812 | Not Available | 1492
3300006176|Ga0070765_100293417 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1501
3300006176|Ga0070765_101693871 | Not Available | 594
3300007265|Ga0099794_10122338 | All Organisms → cellular organisms → Bacteria | 1310
3300007982|Ga0102924_1297522 | Not Available | 643
3300009088|Ga0099830_10026583 | All Organisms → cellular organisms → Bacteria | 3851
3300009089|Ga0099828_11033574 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 731
3300009143|Ga0099792_10138025 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 1329
3300009635|Ga0116117_1201240 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 526
3300011269|Ga0137392_11034093 | Not Available | 674
3300011269|Ga0137392_11266717 | Not Available | 596
3300011270|Ga0137391_10732869 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 819
3300011270|Ga0137391_11125246 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 633
3300011271|Ga0137393_10001232 | All Organisms → cellular organisms → Bacteria | 16153
3300011271|Ga0137393_10015473 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 5428
3300011271|Ga0137393_10536796 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1003
3300012189|Ga0137388_10867974 | All Organisms → cellular organisms → Bacteria | 835
3300012202|Ga0137363_11471111 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 572
3300012205|Ga0137362_10083320 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2670
3300012361|Ga0137360_10152651 | All Organisms → cellular organisms → Bacteria | 1832
3300012362|Ga0137361_10120829 | All Organisms → cellular organisms → Bacteria | 2308
3300012362|Ga0137361_10412980 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1240
3300012917|Ga0137395_10680912 | Not Available | 744
3300014200|Ga0181526_10295214 | All Organisms → cellular organisms → Bacteria | 1032
3300014501|Ga0182024_10079940 | All Organisms → cellular organisms → Bacteria | 4880
3300018042|Ga0187871_10026606 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium 13_1_20CM_2_69_21 | 3661
3300020579|Ga0210407_10031162 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3966
3300020579|Ga0210407_11182353 | Not Available | 576
3300020580|Ga0210403_10006938 | All Organisms → cellular organisms → Bacteria | 9473
3300020580|Ga0210403_10226469 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1534
3300020581|Ga0210399_10674461 | Not Available | 851
3300020581|Ga0210399_10862938 | Not Available | 736
3300020581|Ga0210399_11340880 | Not Available | 562
3300020582|Ga0210395_10129003 | All Organisms → cellular organisms → Bacteria | 1880
3300020582|Ga0210395_10395996 | Not Available | 1039
3300020583|Ga0210401_10007460 | All Organisms → cellular organisms → Bacteria | 10966
3300020583|Ga0210401_10015277 | All Organisms → cellular organisms → Bacteria | 7431
3300020583|Ga0210401_10034786 | All Organisms → cellular organisms → Bacteria | 4792
3300020583|Ga0210401_10126278 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2398
3300020583|Ga0210401_10185419 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1937
3300020583|Ga0210401_10291293 | All Organisms → cellular organisms → Bacteria | 1493
3300020583|Ga0210401_10415703 | All Organisms → cellular organisms → Bacteria | 1207
3300020583|Ga0210401_10519701 | All Organisms → cellular organisms → Bacteria | 1053
3300020583|Ga0210401_11339999 | Not Available | 573
3300021046|Ga0215015_10442242 | Not Available | 602
3300021171|Ga0210405_10002128 | All Organisms → cellular organisms → Bacteria | 20932
3300021171|Ga0210405_10006366 | All Organisms → cellular organisms → Bacteria | 10553
3300021171|Ga0210405_10495967 | All Organisms → cellular organisms → Bacteria | 957
3300021178|Ga0210408_10747826 | All Organisms → cellular organisms → Bacteria | 768
3300021180|Ga0210396_10254381 | Not Available | 1561
3300021180|Ga0210396_10924720 | Not Available | 742
3300021180|Ga0210396_11350439 | Not Available | 591
3300021402|Ga0210385_10627865 | Not Available | 820
3300021403|Ga0210397_10264056 | Not Available | 1254
3300021404|Ga0210389_10259063 | Not Available | 1361
3300021406|Ga0210386_10031434 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4171
3300021406|Ga0210386_10673740 | Not Available | 892
3300021407|Ga0210383_10292185 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1401
3300021407|Ga0210383_10597459 | Not Available | 951
3300021407|Ga0210383_10616048 | All Organisms → cellular organisms → Bacteria | 935
3300021420|Ga0210394_10353518 | Not Available | 1291
3300021420|Ga0210394_10824744 | Not Available | 809
3300021432|Ga0210384_10186937 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1862
3300021432|Ga0210384_11581146 | Not Available | 561
3300021432|Ga0210384_11669748 | Not Available | 542
3300021433|Ga0210391_10888469 | Not Available | 695
3300021474|Ga0210390_10279590 | Not Available | 1413
3300021474|Ga0210390_10306928 | Not Available | 1344
3300021477|Ga0210398_10751716 | Not Available | 787
3300021479|Ga0210410_10002610 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 16123
3300021559|Ga0210409_10004613 | All Organisms → cellular organisms → Bacteria | 14769
3300021559|Ga0210409_10838869 | Not Available | 792
3300023030|Ga0224561_1004013 | Not Available | 1030
3300025434|Ga0208690_1063360 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 575
3300025910|Ga0207684_10005369 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 11834
3300026551|Ga0209648_10043919 | All Organisms → cellular organisms → Bacteria | 3854
3300026551|Ga0209648_10580198 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 620
3300027069|Ga0208859_1007778 | All Organisms → cellular organisms → Bacteria | 1158
3300027545|Ga0209008_1010828 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2156
3300027567|Ga0209115_1026416 | All Organisms → cellular organisms → Bacteria | 1303
3300027842|Ga0209580_10223397 | Not Available | 936
3300027853|Ga0209274_10343641 | Not Available | 768
3300027855|Ga0209693_10116820 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1321
3300027889|Ga0209380_10207927 | Not Available | 1149
3300027889|Ga0209380_10223320 | All Organisms → cellular organisms → Bacteria | 1106
3300028536|Ga0137415_11442060 | Not Available | 512
3300028906|Ga0308309_10027617 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3844
3300031090|Ga0265760_10186632 | Not Available | 695
3300031708|Ga0310686_111677412 | All Organisms → cellular organisms → Bacteria | 10691
3300031708|Ga0310686_115521555 | All Organisms → cellular organisms → Bacteria | 2510
3300031715|Ga0307476_10001965 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 12370
3300031715|Ga0307476_10308338 | Not Available | 1162
3300031718|Ga0307474_10000907 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 21364
3300031718|Ga0307474_10012240 | All Organisms → cellular organisms → Bacteria | 6212
3300031753|Ga0307477_10080681 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 2254
3300031820|Ga0307473_10056018 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 1898
3300031823|Ga0307478_10429913 | Not Available | 1096
3300031823|Ga0307478_11239025 | Not Available | 621
3300031962|Ga0307479_10730997 | Not Available | 968
3300032205|Ga0307472_100360552 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1198
3300032515|Ga0348332_14194008 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1842
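
Each scaffold identifier in the table above joins a taxon OID and a scaffold name with a "|" character, and the taxon OID is the same number that appears in the Taxon OID column of the Associated Samples table. The Python sketch below splits identifiers on that character and groups scaffolds by sample. The function names are hypothetical, and only three identifiers from the table are used.

```python
from collections import defaultdict


def split_scaffold_id(identifier):
    """Split 'taxon_oid|scaffold_name' into its two parts."""
    taxon_oid, scaffold = identifier.split("|", 1)
    return taxon_oid, scaffold


def group_by_sample(identifiers):
    """Map each taxon OID to the list of scaffold names found under it."""
    groups = defaultdict(list)
    for identifier in identifiers:
        taxon_oid, scaffold = split_scaffold_id(identifier)
        groups[taxon_oid].append(scaffold)
    return dict(groups)


# A few identifiers copied from the Associated Scaffolds table.
example_ids = [
    "3300001087|JGI12677J13195_1002665",
    "3300005542|Ga0070732_10066250",
    "3300005542|Ga0070732_10974184",
]

for taxon_oid, scaffolds in group_by_sample(example_ids).items():
    print(taxon_oid, scaffolds)
```

Grouping this way shows which samples contribute more than one scaffold to the family; sample 3300005542, for example, contributes two.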




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 37.82%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 15.97%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.40%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 8.40%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 5.88%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.04%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.20%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 4.20%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.68%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 1.68%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.68%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.84%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.84%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 0.84%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.84%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.84%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.84%
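
The habitat table lists every GOLD ecosystem path at its finest level. To see the distribution at a coarser level, the percentages can be summed by one element of the path. The Python sketch below does this for the rows of the table; the function name rollup and its 0-based level argument are illustrative choices, not part of NMPFamsDB.

```python
from collections import defaultdict

# (GOLD ecosystem path, % of family members), copied from the habitat table.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 37.82),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 15.97),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 8.40),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil", 8.40),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 5.88),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 5.04),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 4.20),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 4.20),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 1.68),
    ("Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland", 1.68),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 1.68),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland", 0.84),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog", 0.84),
    ("Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring", 0.84),
    ("Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost", 0.84),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil", 0.84),
    ("Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter", 0.84),
]


def rollup(rows, level):
    """Sum percentages by the path element at the given 0-based level."""
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += percent
    # Round to two decimals to match the precision used in the table.
    return {name: round(total, 2) for name, total in totals.items()}


print(rollup(HABITATS, 1))  # Terrestrial vs. Aquatic
print(rollup(HABITATS, 2))  # Soil, Freshwater, Thermal Springs, Plant Litter
```

At level 1 the rows sum to 94.11 % Terrestrial and 5.88 % Aquatic. The total falls just short of 100 % because the table's values are already rounded.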




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001087 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O3 | Environmental
3300001471 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300004082 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007982 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009635 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_11_10 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300014200 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_30_metaG | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300018042 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_10 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021402 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300023030 | Soil microbial communities from Bohemian Forest, Czech Republic - CSU2 | Environmental
3300025434 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_16_10 (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027069 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF002 (SPAdes) | Environmental
3300027545 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O3 (SPAdes) | Environmental
3300027567 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O2 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027853 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027889 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300031090 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome) | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300032515 | FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12677J13195_10026652 | 3300001087 | Forest Soil | VAQVVTVGTNSGLEGNMETIGEHRAHPGSINLVILKTEILSALSAVGSATTVEELAQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL*
JGI12712J15308_101819611 | 3300001471 | Forest Soil | VAQVVTVGTNSGLEGNMETIAEHRAHPGSINLVILKIEILSALSAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL*
JGIcombinedJ26739_1000157275 | 3300002245 | Forest Soil | METIGEHRAHPGSINLVILKTEILSALSAVGSATTVEELAQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL*
JGIcombinedJ51221_102316361 | 3300003505 | Forest Soil | MEPIEEHRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYRKPLYAARSIGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKKERR*
Ga0062384_1004764591 | 3300004082 | Bog Forest Soil | MVSIEPGENTCGASRYRAYECIGSKGNMETIGERRALLGSINLVTLKTEILSALSSVNSATTTEELQQADAAYKKALCTARSLGYRPHIRAAGGQQIHVEWVSLRQAPCWD
Ga0062389_1007349802 | 3300004092 | Bog Forest Soil | MVSIEPGENTCGASRYRAYECIGSKGNMETIGERRALLGSINLVTLKTEILSALSSVNSATTTEELQQADAAYKKALCTARSLGYRPHIRAAGGQQIHVEWVSLRQAPCWDPANIARQGL
Ga0070714_1000811114 | 3300005435 | Agricultural Soil | MEPIEEHRARNGSINLVILKAEIPSALSGVGSATTTEELRQADAAYRKALYAARSIGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKRERR*
Ga0070713_1001453213 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MEPIEEHRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYRKALYAARSIGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKRERR*
Ga0070708_10000036320 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | METIGEHNANSGSINLIILKAEILSALSSVSSATTTEELQQADAAYKKVLCTARRMGYRPHIRAVDGQQIQLEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0070706_1000063669 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VKKLVAQVVTVHTGAGLEGNMETIGEHNANSGSINLIILKAEILSALSSVSSATTTEELQQADAAYKKVLCTARRMGYRPHIRAVDGQQIQLEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0070698_10000269319 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | METIGEHNANSGSINLIILKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHIRAVDGQQIQLEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0070697_1000030316 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VKKLVAQVVTVHTGAGLEGNMETIGEHNANSGSINLIILKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHIRAVDGQQIQLEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0070731_105206322 | 3300005538 | Surface Soil | FNDVYVSGSGRHRTWGNHRLAQVVVVRTNAGQEGKMQTIEKHRAHHGSINLVSLKTEIVSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIARQVPTTSKRERR*
Ga0070733_101585173 | 3300005541 | Surface Soil | MQTIEEHRSHHGSINLVSLKTEILSALSAVASARSMEELRQADSAYKKALNAARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANMARQAPMTSKRERR*
Ga0070732_100662501 | 3300005542 | Surface Soil | MYRGQEVVLLETCGNHRLAQIVAVRSNAGKEEKMQTIIEEHIAHHGSMNLVSLKTEILSALSAVGSATSIEERRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIAGQVPTTSKRERR*
Ga0070732_109741841 | 3300005542 | Surface Soil | MEPIEEHRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYKKALYAARSVGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSK
Ga0070762_101007011 | 3300005602 | Soil | MQTIEEHRAHHGSMNLVSLKTEILAAISAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIARQAPATSKTERR*
Ga0070763_104830871 | 3300005610 | Soil | TAPTGWFNDVYVSGSGPCRTWGNHRLAQVVAVRTNTGQEGKMQTIEEHRAHHGSINLVSLKTEILSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGKATFCEGHANIARQAPTTSKRERR*
Ga0070766_101358122 | 3300005921 | Soil | MTKVVTVRTNAGLEGNIETIGEHRAHPGSINLVILKTEILFALSAVGSATSMEELRQADAAYKKALCTARSMGYRPHIRAADGHQIQVEWVALGEATFCEGHGNIARQAPMTSKSEGG*
Ga0070765_1002934172 | 3300006176 | Soil | METIEEHRAHHGSINLAMLKTEVVSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGKATFCEGHANIARQVPTTSKRERR*
Ga0070765_1016938711 | 3300006176 | Soil | RTNAGLEGNMETLGEHRAHPGSINLVILKTEILFALSAVGSATSMEELRQADAAYKKALCTARSMGYRPHIRAADGHQIQVEWVALGEATFCEGHGNIARQAPMTSKSEGG*
Ga0099794_101223382 | 3300007265 | Vadose Zone Soil | LKAEILSALSLVGTATTTEELRQANWAYKKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCKAHARISRQAPMTSKE*
Ga0102924_12975221 | 3300007982 | Iron-Sulfur Acid Spring | MKTSMAQIVTVRTDIDSEGNMKTIREPKALPRDINLITLKTEILSALFSVSSATTTEELQQADAAYKKALYTARSVGYRPHIRAADGQQVQVEWVALGEVPFWEDPANVARQSAITSNAERG*
Ga0099830_100265833 | 3300009088 | Vadose Zone Soil | METIGQRRAHWSGGANLVALKAEILSALSLVGTATTTEELRQANWAYQKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCEAHARISRQAPMTSKE*
Ga0099828_110335741 | 3300009089 | Vadose Zone Soil | ILKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0099792_101380251 | 3300009143 | Vadose Zone Soil | VKAPVAQVVTVHTGVGLEGDMETIGEHNEHCGSINLVILKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNRERG*
Ga0116117_12012402 | 3300009635 | Peatland | ESTNLVILKTEIVSALSAVGSATSIEELRQADAAKKKALCTARSMDYRPHIRAGDGQQIKVEWVPLGEATFCEGHANIARQAPTTSKRERR*
Ga0137392_110340931 | 3300011269 | Vadose Zone Soil | VKTPVAQGVTVHTGAGLEGNMETIGERRAHSGSINLVILKAEIQSALSSVGSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNRERG*
Ga0137392_112667172 | 3300011269 | Vadose Zone Soil | LKAEILSALSLVGTATTTEELRQANWAYQKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCEAHARISRQASMTSKE*
Ga0137391_107328691 | 3300011270 | Vadose Zone Soil | VKAPVAQVVTVHTGVGLEGDMETIGEHNEHCGSINLVILKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0137391_111252462 | 3300011270 | Vadose Zone Soil | VKTPVAQGVTVHTGAGLEGNMETIGERRAHSGSINLVILKAEIQSALSSVGSATTTEELQQADAAYKKALCTARSMGYRPHIRAADGRQIQVEWVALGEA
Ga0137393_1000123224 | 3300011271 | Vadose Zone Soil | VKTPVAQVVTVHTGAGLEGNMEAIGERRAHSGSINLVILKAEILSALSSVGSATTTEELQQADAAYKRALSTARSMGYRPHIRAGDGQQIQVELVALGEETFFASHANILRERRG*
Ga0137393_100154732 | 3300011271 | Vadose Zone Soil | METIGQRRAHWSGGANLVALKAEILSALSLVGTATTTEELRQANWAYKKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCEAHARISRQASMTSKE*
Ga0137393_105367962 | 3300011271 | Vadose Zone Soil | VKTPVAQGVTVHTGAGLEGNMETIGERRAHSGSINLVILKAEIQSALSSVGSATTTEELQQADAAYKKALCTARSMGYRPHIRAADGRQIQVEWVALGEATFCEWSVTTLAPIPPSA*
Ga0137388_108679742 | 3300012189 | Vadose Zone Soil | METIGQRRAHWSGGTNLVALKAEILSALSLVGTATTTEELRQANWAYQKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCKAHARISRQAPMTSKE*
Ga0137363_114711111 | 3300012202 | Vadose Zone Soil | METIGQRRAHWSGGANLVALKAEILSALSLVGTATTTEELRQANWAYQKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCKAHARISRQAPMTSKE*
Ga0137362_100833202 | 3300012205 | Vadose Zone Soil | VKTPLAQVATVRTGAGLEGNMETIGERRAHSGSINLVILKVEIQSALSSVSSATTTEELQQADGAYEKALCTARRMGYRPHIRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNAERG*
Ga0137360_101526511 | 3300012361 | Vadose Zone Soil | ILSALSLVGTATTTEELRQANWAYQKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCKAHARISRQAPMTSKE*
Ga0137361_101208293 | 3300012362 | Vadose Zone Soil | MNLIVLKAEILSALSLVGTATTTEELRQANWAYQKALSTARSMGYRPHIRAADNQQIRVDWVAIDEGTFCKAHARISRQAPMTSKE*
Ga0137361_104129801 | 3300012362 | Vadose Zone Soil | VKAPVAQVVTVHTGAGLEGDMETIGEHNAHCGSINLVILKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNAERG*
Ga0137395_106809122 | 3300012917 | Vadose Zone Soil | VKAPVAQVVTVHTGSGLEGDMETIGEHNAHCGSINLAIKKAEILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNGERG*
Ga0181526_102952141 | 3300014200 | Bog | ILKTEIVSALSAVGSATSIEELRQADAAKKKALCTARSMDYRPHIRAGDGQQIKVEWVPLGEATFCEGHANIARQAPTTSKRERR*
Ga0182024_100799408 | 3300014501 | Permafrost | MKTSMAQIVTVRTDIDSEGNMKTIREPKAPPGDINLATLKTEILSALSSVSSATTTEELQQADAAYKKALCIARSVGYRPHIRAADGQQVQVEWVALGKVPFWEDPANVARQGL*
Ga0187871_100266062 | 3300018042 | Peatland | RLGKKNGTIEEHRTHHESTNLVILKTEIVSALSAVGSATSIEELRQADAAKKKALCTARSMDYRPHIRAGDGQQIKVEWVPLGEATFCEGHANIARQAPTTSKRERR
Ga0210407_100311624 | 3300020579 | Soil | METIEEQRTHHGSINLVSLKTEILSALSTVGSATSIEELRQADAAYKKALCTARSMGYRPHIRAGDSQQIKVEWVALGEATFCEGHANIARQVPTTSKRERR
Ga0210407_111823531 | 3300020579 | Soil | METIGKHREHAGSINLIVLKTEILSALSAVGSATSIEEFRQADAAYKKALYTARSIGYRPRIRAADGQQIKVEWVALGEATSCEGHANIARQVPRTSKRERR
Ga0210403_100069386 | 3300020580 | Soil | MQTIEEHRAHHGSINLVSLKTEVLSALSAVGSATSIEERRQADAAYSKALCTARGMGYRPNIRAGDGQQIQVEWVALGEATFCEGHANIARQVPTTSKRERR
Ga0210403_102264693 | 3300020580 | Soil | PPSIVSILQGNKVPTGWFNDVYVSGSGRHRTWGNHRLAQVVVVRTNAGQEGKMQTIDRAHHGSSNLVSLKREILPALSAVGSATSMEELRQADAAYKKALCTARSMGYRPHIRATDGQQIQVDWVALGKVTFCEGRANIAHQAPMTSKRLDVAREHG
Ga0210399_106744611 | 3300020581 | Soil | MTKVVTVRTNAGLEGNMETIGEHKAHPGSINLVILKTEILSALSAVGSATSMEELRQADAAYKKALCTARSLGYRPHIRAADGRQIQVEWVALGEATFCKATPILRARRQ
Ga0210399_108629381 | 3300020581 | Soil | MATIGEHRAHPGSINLVILKIEILSALFAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL
Ga0210399_113408801 | 3300020581 | Soil | KASNLVKLPMALVVTVGTNIDSEGKMKTIGEPRALLGSISLVTLKTEILSALSSVSSATTTEELQQADAACKKALCTARSVGFRPHIRAADGQQVQVEWVALGKVPSWEDPANVARQGL
Ga0210395_101290034 | 3300020582 | Soil | METIEEHRAHHGSINLAMLKTEVVSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEETFCEGHANIARQAPTTSKRERR
Ga0210395_103959962 | 3300020582 | Soil | MQTIEEHRAHHGSINLVSLKTEILSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAADGQQIQVEWVARGEATFCEGHANIARQAPTTSKRERR
Ga0210401_100074609 | 3300020583 | Soil | MPVWKGTWKQQENTEHTRSINLVILKIEILSALSAVGSATIVEELEQADAAYKKALRTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHANIERQAAMTSKRERL
Ga0210401_100152775 | 3300020583 | Soil | MYTCQGLVLIEWGKPPMTQVVTVGTNAGLEGNMETIGEYRAHPGSINLVILKTEILSALSAVGSATTMEELQQADAAYKKALCTARSMGYRPHIRATGGQQIQVEWVAVGEATFCEGHVNIARRAPMTSKRDSG
Ga0210401_100347866 | 3300020583 | Soil | MQTIEEHRSHHGSINLVSLKTEILSALSAVGSATSIEERRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIARQVPTTSKRERR
Ga0210401_101262784 | 3300020583 | Soil | MQTIEEHRSHHGIINLVSLKTEILSALSAVASARSMEELRQADSAYKKALNAARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANMARQAPMTSKRERR
Ga0210401_101854191 | 3300020583 | Soil | MKSIEPGETPMAQVVTVRTNADLEGNMETIGECRAHPGSISLVILKTEILSALSSVSGATTVEELQKADAAYKKALCTARSMGYRPHIRAADGQQIQVEWVALGKA
Ga0210401_102912932 | 3300020583 | Soil | MNLVSLKTEILAAISAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAADGQQIQVEWVALGEATFCEGHANIARQAPATSKTERR
Ga0210401_104157032 | 3300020583 | Soil | AIEEQRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYRKALYAARSIGYCPHIRAADGQQIKVEWVALGEATFCEGHANSARQAPKTSKTERR
Ga0210401_105197011 | 3300020583 | Soil | METIEEQRTHHGSINLVGLKTEILSALSTVGSATSIEELRQADAAYKKALCTARSMGYRPHIRAGDSQQIKVEWVALGEATFCEGHANIARQVPTTSKRERR
Ga0210401_113399991 | 3300020583 | Soil | YVSGSGPHRTWGSHRLTQVVAVHTNAGQEGKMQTIDRAHHGSSNLVSLKREILPALSAVGSATSMEELRQADAAYKKALCTARSMGYRPHIRAGDGHQIQVEWVALGEATFCERHANMARQAPMTSKRERR
Ga0215015_104422421 | 3300021046 | Soil | VKTLVAQVVTVHTGAGLEGNMETIGEHNAHSGSINLIILKAEILSALSSVSRATTTEELQQADAAYKKALCTARRMGYRPHIRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSNGERG
Ga0210405_100021284 | 3300021171 | Soil | MQTIEEHRAQHGSIKLVGLKTEILSALSAVGSATSIEERRQADAAYQKALCTARGMGYRPHIRAGDGQQIQVEWVALGGAFCEGHANIARQVPATSKRERR
Ga0210405_1000636614 | 3300021171 | Soil | MAELVTVRTDIDLEGNMKTIREPKALPGDINLVRLKTEILSALSSVRSATTTEELQQADAAYKKALCAARSVGYRPHIRAADGQQIQVEWVALGKVLFWEDPANIARQRL
Ga0210405_104959671 | 3300021171 | Soil | HPGSINLVILKIEILSALSAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHANIERQAAMTSKRERL
Ga0210408_107478262 | 3300021178 | Soil | VVTVRTNIDSEETMKTIEERRALPGSINLVTLKTEILSALSSVSTATTTEELQQADAAYRKALCTARSVGYRPHIRAADGQQIQVEWVALARVPFWENPADIARQGL
Ga0210396_102543811 | 3300021180 | Soil | MQTIEEHRAHHGSINLVSLKTEILSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAADGQQIQVEWVARGEATFCEGHANIAR
Ga0210396_109247201 | 3300021180 | Soil | IINLVSLKTEILSALSAVASARSMEELRQADSAYKKALNAARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANMARQAPMTSKRERR
Ga0210396_113504391 | 3300021180 | Soil | MQTIEEHRAHHGSINLVSLKTEILSALSVVGSATSMEELRQADSAYKKALCTARGMGYRPHIRAGYGQQIQVEWVALGKATFCEGHANIARQAPTTSKRER
Ga0210385_106278651 | 3300021402 | Soil | SLKREILPALSAVGSATSMEELRQADAAYKKALCTARSMGYRPHIRAADGHQIQVEWVALGEATFCEGHGNIARQAPMTSKSEGG
Ga0210397_102640562 | 3300021403 | Soil | REPPSIVSILQGNKVPTGRFNDVCASGSGPPRTWGNHRQAQIVAVRTNAGQEGKMQTIEEHRAHHGSINLVSLKTEILSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAADGQQIQVEWVARGEATFCEGHANIARQAPTTSKRERR
Ga0210389_102590632 | 3300021404 | Soil | DVDVSGSGPCRTWGNHRLAQVVAVRTNTGQEGKMQTIEEHRAHHGSMNLVSLKTEILSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAADGQQIQVEWVARGEATFCEGHANIARQAPTTSKRERR
Ga0210386_100314343 | 3300021406 | Soil | MYTCQGLVLIEWGKPPMTQVVTVGTNAGLEGNMETIGEYRAHPGSINLVILKTEILSALSAVGSATTMEELQQADAAYKKALCTARSIGYRPHIRATGGQQIQVEWVAVGEATFCEGHVNIARRAPMTSKRDSG
Ga0210386_106737401 | 3300021406 | Soil | MEPIEEHRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYRKPLYAARSIGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKKERR
Ga0210383_102921853 | 3300021407 | Soil | METIEEHRAHHGSINLAMLKTEVVSALSAVGSATSIEELRQANAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGGATFCEGHANIARQAPTASKRERR
Ga0210383_105974591 | 3300021407 | Soil | VSGGPVLIEAGEPPVAQVVTVGTNSGLERNMATIGEHRAHPGSINLVILKIEILSALFAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL
Ga0210383_106160481 | 3300021407 | Soil | MKTSMAQVVTVRTDIDSEGNVKTIREPKALPGDINLVTLKTEILPALSSVSSATTTEELQQADAACKKALCTARSVGFRPHIRAADGQQVQVEWVALGKVPSWEDPANVARQGL
Ga0210394_103535182 | 3300021420 | Soil | METIEEHRAHHGSINLAMLKTEVVSALSAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGGATFCEGHANIARQAPTASKRERR
Ga0210394_108247441 | 3300021420 | Soil | MPVWKGTWKQQENTEHTRSINLVILKIEILSALSAVGSATIVEELEQADAAYKKALRTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHAN
Ga0210384_101869372 | 3300021432 | Soil | MYRGQEVVLLETCGNHRLAQIVAVRSNAGKEEKMQTIIEEHIAHHGSMNLVSLKTEILSALSAVGSATSIEERRQADAAYTKALCAARGMGYRPHIRAGDGQQIQVEWVALGKATF
Ga0210384_115811461 | 3300021432 | Soil | METIEEHRAHHGSINLVVLKAEILSALSAVGRATTLEELRQADAAYKTALYAARSIGYRPHIRAADGQQIKVDWVGLGEATFREGHARIARQVGHKSPSEGSSLS
Ga0210384_116697481 | 3300021432 | Soil | VYVSGSGPHRTWGDHRLTQVVAVHTNAGQEGKMQTIEEHRAHHGSINLVSLKTEILSALSVVGSATSMEELRQADSAYKKALCTARGMGYRPHIRAGYGQQIQVEWVALGKATFCEGHANIARQAPTTSKRERH
Ga0210391_108884691 | 3300021433 | Soil | MQTIEEHRSHHGIINLVSLKTEILSALSAVASARSMEELRQADSAYKKALNAARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANMARQAPMT
Ga0210390_102795902 | 3300021474 | Soil | METTEEHRATHGSINLVILKTEIVSALSAVGSATSIEELRQADAAYKRALCTARGMGYRPHIRAGDGQQIQVEWVALREATFCEGHANIARQAPTTSKRERR
Ga0210390_103069282 | 3300021474 | Soil | MTNVVTERTNAGLEGNMETLGEHRAHPGSINLVILKTEILFALSAVGSATSMEELRQADAAYKENLCTARSMGYRPHIPAADGHQIQVEWVALGEATFCEGHANIARQAPLTSKSEGG
Ga0210398_107517162 | 3300021477 | Soil | METIEEHRAHHGSINLAMLKTEVVSALSAVGSATSIEELRQANAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGGATFCEGHANIARQAPTASKR
Ga0210410_100026108 | 3300021479 | Soil | MAQVVTVRTNADLEGNMETIGECRAHPGSISLVILKTEILSALSSVSGATTMEELQKADAAYKKALCTARSMGYRPHIRAADGQQIQVEWVALGKAPFCEAHASVARQTRMTFKRERG
Ga0210409_100046137 | 3300021559 | Soil | MQTIEEHRAHHGSINLVSLKTEILSALSVVGSATSMEELRQADSAYKKALRTARGMGYRPHIRAGYGQQIQVEWVALGKATFCEGHANIARQAPTTSKRERH
Ga0210409_108388691 | 3300021559 | Soil | MAQVVTVRTNADLEGNMETIGECRAHPGSISLVILKTEILSALSSVSGATTMEELQKADAAYKKALCTARSMGYRPHIRAADGQQIQVEWVALGKAPFCEAHASVARQTRMTSKGKEVECHERA
Ga0224561_10040132 | 3300023030 | Soil | VAQVVTVGTNSGLEGNMETIGEHRAHPGSINLVILKIEILSALSAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAMTSKRERL
Ga0208690_10633601 | 3300025434 | Peatland | TIEEHRTHHESTNLVILKTEIVSALSAVGSATSIEELRQADAAKKKALCTARSMDYRPHIRAGDGQQIKVEWVPLGEATFCEGHANIARQAPTTSKRERR
Ga0207684_1000536910 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VKKLVAQVVTVHTGAGLEGNMETIGEHNANSGSINLIILKAEILSALSSVSSATTTEELQQADAAYKKVLCTARRMGYRPHIRAVDGQQIQLEWVALGEATFSDTHANIARKAPMTSNGERG
Ga0209648_100439192 | 3300026551 | Grasslands Soil | VHTGAGLEGNMETIGERRAHSGSINLVILKAEILSALSSVGSATTTEELQQADAAYKKALCTARSMGYRPHIRAADGRQIQVEWVALGEATFCEAHANIARKASMTSNGERG
Ga0209648_105801981 | 3300026551 | Grasslands Soil | VKAPVAQVVTVHTGVGLEGDMETIGEHNAHCGSINLVILKADILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHVRAVDGQQIQVEWVALGEATFSDTHANIARKAPMTSDGERG
Ga0208859_10077783 | 3300027069 | Forest Soil | MQTIEEHRAHHGSMNLVSLKTEILAAISAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGNATFCEGHANIERQAAMTSKRERL
Ga0209008_10108282 | 3300027545 | Forest Soil | VAQVVTVGTNSGLEGNMETIAEHRAHPGSINLVILKIEILSALSAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL
Ga0209115_10264162 | 3300027567 | Forest Soil | VAQVVTVGTNSGLEGNMETIGEHRAHPGSINLVILKTEILSALSAVGSATTVEELAQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL
Ga0209580_102233971 | 3300027842 | Surface Soil | MEPIEEHRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYKKALYAARSVGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKKERR
Ga0209274_103436411 | 3300027853 | Soil | MTNVVTERTNAGLEGNMETLGEHRAHPGSINLVILKTEILFALSAVGSATSMEELRQADAAYKKNLCTARSMGYRPHIPAADGHQIQVEWVALGEAT
Ga0209693_101168201 | 3300027855 | Soil | LAQVVAVRTNTGQEGKMQTIEEHRAHHGSMNLVSLKTEILAAISAVGSATSIEELRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGKATF
Ga0209380_102079271 | 3300027889 | Soil | MTKVVTVRTNAGLEGNIETIGEHRAHPGSINLVILKTEILFALSAVGSATSMEELRQADAAYKKALCTARSMGYRPHIRAADGHQIQVEWVALGEATFCEGHGNIARQAPMTSKSEGG
Ga0209380_102233201 | 3300027889 | Soil | MAEVVTVRTDIDSEGNMKTIREPKAPPGDINLVTLKTEILSALSSVGSATTTEELQQADAAYKKALCTARSVGYRPHIRAADGQQIQVEWVALGKAPFCEAPADTARQGP
Ga0137415_114420601 | 3300028536 | Vadose Zone Soil | VKAPVSQVVTVHTGAGLEGDMGTIGEHNAHCGSINLVILKADILSALSSVSSATTTEELQQADAAYKKALCTARRMGYRPHARAVDGQQIQVEWVTLGEATFSDTHADIARKAPMTSNGERG
Ga0308309_100276172 | 3300028906 | Soil | METMEEHRAHHGSINLVILKAEILSALSAVGSVTIMEELRQANAAYKKALYAARSIGYRPHIRAADGQQIQVELVALGEATFCEGHANIARQAPRTSKKGRTLNVTRACLP
Ga0265760_101866321 | 3300031090 | Soil | METMEEHRATHGSINLVILKTEILSALSAVGSATSIEERRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIARQAPKTSKSERR
Ga0310686_11167741215 | 3300031708 | Soil | MKTPMAQVVTVRTNVGSEGNMETIGERRALLGSINLVTLKTEILSALSSVSSATTTQELQQADAAYKKALCTARSLGYRPHIRAADGQQIHVEWVALGKAPCWDPANIVRQGL
Ga0310686_1155215553 | 3300031708 | Soil | MAEVVTVRTDIDSEGNMKTIREPKAPPGDINLVTLKTEILSALSSVSSATTTEELQQADAAYEKALCTARSVGYRPHIRAADGQQIQVEWVALGKVNAERG
Ga0307476_1000196516 | 3300031715 | Hardwood Forest Soil | METMEEHRATHGSINLVILKTEIVSALSAVGSATSIEELRQADAAYKRALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIARQAPTTSKRERR
Ga0307476_103083381 | 3300031715 | Hardwood Forest Soil | MYTCPGPVLIEAGEPPVAQVVTVGTNSGLGGNMETIGEHRAHPGSINLVILKIEILSALSVVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL
Ga0307474_1000090727 | 3300031718 | Hardwood Forest Soil | MQTIEKHRAHHGSINLVSLKTEILSALSAVGSATSIEELRQEDAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVAVGKATFCEGHANIARQVPTTSRRERH
Ga0307474_100122409 | 3300031718 | Hardwood Forest Soil | METIEENRAHYGSINLVILKTEILSALSAVGSATSIEERRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATLCEGHANIARQAPTTSKRERR
Ga0307477_100806812 | 3300031753 | Hardwood Forest Soil | MEPIEEHRARNGSINLAILKAEILSALSGVGSATTTEELRQADAAYKKALYAARSVGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKKERR
Ga0307473_100560184 | 3300031820 | Hardwood Forest Soil | MEPIEEHRARNGSINLVILKAEILSALSGVGSATTTEELRQADAAYRKALYAARSIGYCPHIRAADGQQIKVEWVALGEATFCEGHANIARQAPKTSKTERR
Ga0307478_104299131 | 3300031823 | Hardwood Forest Soil | METMEEHRATYGSINLVILKTEILSALSAVGSATSIEERRQADAAYKKALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHANIARQAPKTSKSERR
Ga0307478_112390252 | 3300031823 | Hardwood Forest Soil | METMEEHRATHGSINLVILKTEIVSALSAVGSATSIEELRQADAAYKRALCTARGMGYRPHIRAGDGQQIQVEWVALGEATFCEGHAN
Ga0307479_107309972 | 3300031962 | Hardwood Forest Soil | VAQVVTVGTNSGLEGNMETIGEHRAHPGSINLVILKIEILSALSAVGSATTVEELEQADAAYKKALCTARIMGYRPHIRATDGQQIQVEWVALGNATFCEGHNIERQAAMTSKRERL
Ga0307472_1003605522 | 3300032205 | Hardwood Forest Soil | LKAEILSALSLVGTATTTEELRQANWAYKKALSTARSMGYRPHIRAADNQQIRVDWVAIDGGTFCEAHAGISRQAPMTSKE
Ga0348332_141940081 | 3300032515 | Plant Litter | AQVVTVRTNVGSEGNMETIGERRALLGSINLVTLKTEILSALSSVSSATTTQELQQADAAYKKALCTARSLGYRPHIRAADGQQIHVEWVALGKAPCWDPANIVRQGL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice