NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F073947



Overview

Basic Information
Family ID F073947
Family Type Metagenome / Metatranscriptome
Number of Sequences 120
Average Sequence Length 75 residues
Representative Sequence MESKSPHRGITMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQVIHDKPDESQRTTLKL
Number of Associated Samples 92
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 82.35 %
% of genes near scaffold ends (potentially truncated) 17.50 %
% of genes from short scaffolds (< 2000 bps) 67.50 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.44


Most Common Taxonomy
Group Bacteria (85.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere (12.500 % of family members)
Environment Ontology (ENVO) Unclassified (21.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (40.833 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 44.23%, β-sheet: 0.00%, Coil/Unstructured: 55.77%

Predicted 3D Structure

pTM-score: 0.44

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
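
The cutoff quoted above can be written as a one-line check. This is only a sketch of the stated rule (pTM < 0.7 means low confidence); the function and constant names are hypothetical and are not part of NMPFamsDB.

```python
# Cutoff taken from the note above: models with pTM < 0.7 are flagged as low confidence.
LOW_CONFIDENCE_PTM = 0.7  # hypothetical constant name


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# Family F073947 reports a pTM-score of 0.44.
print(is_low_confidence(0.44))
```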



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 120 Family Scaffolds
PF08818	DUF1801	10.00
PF07883	Cupin_2	8.33
PF12796	Ank_2	7.50
PF04069	OpuAC	7.50
PF03473	MOSC	5.83
PF03475	3-alpha	5.00
PF10861	DUF2784	3.33
PF14310	Fn3-like	3.33
PF00127	Copper-bind	2.50
PF02739	5_3_exonuc_N	2.50
PF13637	Ank_4	1.67
PF13432	TPR_16	1.67
PF13414	TPR_11	1.67
PF00550	PP-binding	0.83
PF13564	DoxX_2	0.83
PF04055	Radical_SAM	0.83
PF00850	Hist_deacetyl	0.83
PF06983	3-dmu-9_3-mt	0.83
PF04191	PEMT	0.83
PF11373	DUF3175	0.83
PF14559	TPR_19	0.83
PF04337	DUF480	0.83
PF04203	Sortase	0.83
PF00528	BPD_transp_1	0.83
PF01367	5_3_exonuc	0.83
PF01553	Acyltransferase	0.83
PF00202	Aminotran_3	0.83
PF02861	Clp_N	0.83
PF12867	DinB_2	0.83
PF06262	Zincin_1	0.83
PF02580	Tyr_Deacylase	0.83
PF13490	zf-HC2	0.83
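
The frequencies above are percentages of the 120 family scaffolds, so they can be converted back to approximate scaffold counts. The sketch below assumes each percentage is a whole-number count divided by 120 and rounded to two decimals; the page does not state this explicitly, and the function name is hypothetical.

```python
def scaffold_count(percent: float, total_scaffolds: int = 120) -> int:
    """Convert a '% Frequency' value back to a whole number of scaffolds.

    Assumes the percentage was derived from an integer count over
    total_scaffolds and rounded to two decimal places.
    """
    return round(percent / 100 * total_scaffolds)


# A few rows from the Pfam table above.
for name, pct in [("DUF1801", 10.00), ("Cupin_2", 8.33), ("zf-HC2", 0.83)]:
    print(name, scaffold_count(pct))
```

Under that assumption, the lowest listed frequency (0.83 %) corresponds to a domain seen next to a single family member.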

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 120 Family Scaffolds
COG4430	Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 family	Function unknown [S]	10.00
COG5646	Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis)	Posttranslational modification, protein turnover, chaperones [O]	10.00
COG5649	Uncharacterized conserved protein, DUF1801 domain	Function unknown [S]	10.00
COG2258	N-hydroxylaminopurine reductase YiiM, contains MOSC domain	Defense mechanisms [V]	5.00
COG0258	5'-3' exonuclease Xni/ExoIX (flap endonuclease)	Replication, recombination and repair [L]	3.33
COG0123	Acetoin utilization deacetylase AcuC or a related deacetylase	Secondary metabolites biosynthesis, transport and catabolism [Q]	1.67
COG0542	ATP-dependent Clp protease, ATP-binding subunit ClpA	Posttranslational modification, protein turnover, chaperones [O]	0.83
COG1490	D-aminoacyl-tRNA deacylase	Translation, ribosomal structure and biogenesis [J]	0.83
COG2764	Zn-dependent glyoxalase, PhnB family	Energy production and conversion [C]	0.83
COG3132	Uncharacterized conserved protein YceH, UPF0502 family	Function unknown [S]	0.83
COG3764	Sortase (surface protein transpeptidase)	Cell wall/membrane/envelope biogenesis [M]	0.83
COG3824	Predicted Zn-dependent protease, minimal metalloprotease (MMP)-like domain	Posttranslational modification, protein turnover, chaperones [O]	0.83
COG3865	Glyoxalase superfamily enzyme, possible 3-demethylubiquinone-9 3-methyltransferase	General function prediction only [R]	0.83



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	85.00 %
Unclassified	root	N/A	15.00 %

Associated Scaffolds


Scaffold	Taxonomy	Length
3300001686|C688J18823_10587545	Not Available	711
3300002568|C688J35102_120228752	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	938
3300002914|JGI25617J43924_10149259	All Organisms → cellular organisms → Bacteria → Acidobacteria	805
3300002916|JGI25389J43894_1006725	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter	1859
3300004114|Ga0062593_100204572	All Organisms → cellular organisms → Bacteria	1576
3300004479|Ga0062595_100043897	All Organisms → cellular organisms → Bacteria	1981
3300005166|Ga0066674_10050601	Not Available	1874
3300005174|Ga0066680_10160627	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	1409
3300005179|Ga0066684_10008218	All Organisms → cellular organisms → Bacteria	4888
3300005186|Ga0066676_11085616	Not Available	528
3300005187|Ga0066675_10987594	Not Available	633
3300005434|Ga0070709_11202406	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_12_FULL_59_11	609
3300005435|Ga0070714_100097759	All Organisms → cellular organisms → Bacteria	2581
3300005435|Ga0070714_100489095	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1172
3300005435|Ga0070714_100591496	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	1065
3300005435|Ga0070714_101040829	All Organisms → cellular organisms → Bacteria	797
3300005435|Ga0070714_101106806	All Organisms → cellular organisms → Bacteria	772
3300005435|Ga0070714_101597049	All Organisms → cellular organisms → Bacteria	637
3300005436|Ga0070713_100039231	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	3842
3300005436|Ga0070713_102390852	Not Available	511
3300005437|Ga0070710_10914204	All Organisms → cellular organisms → Bacteria	634
3300005439|Ga0070711_101177841	All Organisms → cellular organisms → Bacteria	662
3300005445|Ga0070708_100014975	All Organisms → cellular organisms → Bacteria	6393
3300005467|Ga0070706_100117979	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	2472
3300005530|Ga0070679_100285561	All Organisms → cellular organisms → Bacteria	1602
3300005534|Ga0070735_10000947	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	29192
3300005537|Ga0070730_10745152	Not Available	619
3300005542|Ga0070732_10076345	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1958
3300005560|Ga0066670_10055957	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter	2065
3300005560|Ga0066670_10351146	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	899
3300005563|Ga0068855_100428517	All Organisms → cellular organisms → Bacteria	1446
3300005578|Ga0068854_101906123	All Organisms → cellular organisms → Bacteria	546
3300005587|Ga0066654_10082728	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter	1509
3300006028|Ga0070717_10005019	All Organisms → cellular organisms → Bacteria	9637
3300006028|Ga0070717_10007251	All Organisms → cellular organisms → Bacteria	8220
3300006028|Ga0070717_10744065	All Organisms → cellular organisms → Bacteria	891
3300006173|Ga0070716_100017244	All Organisms → cellular organisms → Bacteria	3740
3300006800|Ga0066660_10259553	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1364
3300006893|Ga0073928_10000372	All Organisms → cellular organisms → Bacteria	106648
3300007982|Ga0102924_1004357	All Organisms → cellular organisms → Bacteria	14308
3300009012|Ga0066710_101902601	All Organisms → cellular organisms → Bacteria	890
3300009038|Ga0099829_10060295	All Organisms → cellular organisms → Bacteria	2843
3300009088|Ga0099830_10311366	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1259
3300009093|Ga0105240_10036640	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	6308
3300009093|Ga0105240_11772626	All Organisms → cellular organisms → Bacteria	644
3300009137|Ga0066709_100354272	All Organisms → cellular organisms → Bacteria	2018
3300009137|Ga0066709_101107384	All Organisms → cellular organisms → Bacteria	1164
3300009137|Ga0066709_101648142	Not Available	914
3300009826|Ga0123355_10596082	Not Available	1314
3300010043|Ga0126380_10113701	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1655
3300010048|Ga0126373_10202980	All Organisms → cellular organisms → Bacteria → Acidobacteria	1922
3300010049|Ga0123356_10993675	Not Available	1009
3300010320|Ga0134109_10310985	All Organisms → cellular organisms → Bacteria → Acidobacteria	609
3300010371|Ga0134125_10063515	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	4108
3300010371|Ga0134125_11006943	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	913
3300010373|Ga0134128_10151323	All Organisms → cellular organisms → Bacteria → Acidobacteria	2623
3300010396|Ga0134126_10004551	All Organisms → cellular organisms → Bacteria → Acidobacteria	17340
3300010396|Ga0134126_10015612	All Organisms → cellular organisms → Bacteria	9611
3300010401|Ga0134121_10058168	All Organisms → cellular organisms → Bacteria	3181
3300011120|Ga0150983_16719662	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	705
3300011271|Ga0137393_10214975	All Organisms → cellular organisms → Bacteria	1625
3300012200|Ga0137382_11231010	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	531
3300012206|Ga0137380_10005563	All Organisms → cellular organisms → Bacteria	11210
3300012209|Ga0137379_10035399	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	4814
3300012212|Ga0150985_116141330	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	2504
3300012350|Ga0137372_10229909	All Organisms → cellular organisms → Bacteria → Acidobacteria	1468
3300012351|Ga0137386_10296616	All Organisms → cellular organisms → Bacteria → Acidobacteria	1163
3300012354|Ga0137366_10171387	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1629
3300012356|Ga0137371_10320690	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	1206
3300012359|Ga0137385_10151314	All Organisms → cellular organisms → Bacteria	2041
3300012363|Ga0137390_10088599	All Organisms → cellular organisms → Bacteria	3050
3300012363|Ga0137390_10104838	All Organisms → cellular organisms → Bacteria	2796
3300012469|Ga0150984_101475504	All Organisms → cellular organisms → Bacteria → Acidobacteria	1147
3300012469|Ga0150984_102571885	Not Available	1515
3300012469|Ga0150984_106385170	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	515
3300012960|Ga0164301_10856452	All Organisms → cellular organisms → Bacteria	701
3300012984|Ga0164309_10860895	All Organisms → cellular organisms → Bacteria	735
3300017937|Ga0187809_10136141	All Organisms → cellular organisms → Bacteria → Acidobacteria	843
3300018088|Ga0187771_11502039	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter kueseliae	572
3300018431|Ga0066655_10454104	All Organisms → cellular organisms → Bacteria → Acidobacteria	846
3300018468|Ga0066662_10428104	All Organisms → cellular organisms → Bacteria	1175
3300018468|Ga0066662_12753108	Not Available	521
3300019890|Ga0193728_1319351	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	577
3300020018|Ga0193721_1136036	Not Available	605
3300020579|Ga0210407_10333136	All Organisms → cellular organisms → Bacteria → Acidobacteria	1187
3300021362|Ga0213882_10523286	Not Available	509
3300021388|Ga0213875_10081000	All Organisms → cellular organisms → Bacteria → Acidobacteria	1515
3300021403|Ga0210397_11261299	All Organisms → cellular organisms → Bacteria	574
3300022557|Ga0212123_10000127	All Organisms → cellular organisms → Bacteria	258803
3300022557|Ga0212123_10019380	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	7918
3300025905|Ga0207685_10346919	All Organisms → cellular organisms → Bacteria	747
3300025910|Ga0207684_10199763	All Organisms → cellular organisms → Bacteria → Acidobacteria	1725
3300025910|Ga0207684_10497834	All Organisms → cellular organisms → Bacteria	1045
3300025913|Ga0207695_10034991	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	5453
3300025929|Ga0207664_10036398	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	3805
3300025929|Ga0207664_11123644	Not Available	702
3300025929|Ga0207664_11326033	Not Available	639
3300025939|Ga0207665_11399386	Not Available	557
3300026300|Ga0209027_1115090	All Organisms → cellular organisms → Bacteria	946
3300026527|Ga0209059_1145644	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	869
3300026551|Ga0209648_10000115	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	53007
3300026552|Ga0209577_10159192	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis	1768
3300027842|Ga0209580_10000546	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	20076
3300027842|Ga0209580_10007866	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	4555
3300027842|Ga0209580_10241354	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	898
3300027846|Ga0209180_10113988	All Organisms → cellular organisms → Bacteria	1548
3300027986|Ga0209168_10002015	All Organisms → cellular organisms → Bacteria	15171
3300030991|Ga0073994_12243594	Not Available	628
3300031057|Ga0170834_104235730	All Organisms → cellular organisms → Bacteria → Acidobacteria	796
3300031057|Ga0170834_107770787	All Organisms → cellular organisms → Bacteria	1413
3300031231|Ga0170824_108609069	All Organisms → cellular organisms → Bacteria	1609
3300031231|Ga0170824_112452331	All Organisms → cellular organisms → Bacteria	4103
3300031474|Ga0170818_106970384	All Organisms → cellular organisms → Bacteria	781
3300031474|Ga0170818_111576357	All Organisms → cellular organisms → Bacteria	5351
3300031715|Ga0307476_10000001	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	1135410
3300031754|Ga0307475_10120623	All Organisms → cellular organisms → Bacteria	2065
3300031823|Ga0307478_10881256	All Organisms → cellular organisms → Bacteria	749
3300031946|Ga0310910_11373699	All Organisms → cellular organisms → Bacteria	544
3300031962|Ga0307479_10641501	All Organisms → cellular organisms → Bacteria	1043
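
Each scaffold identifier above has the form `<prefix>|<scaffold name>`, and the prefix appears to match a Taxon OID in the Associated Samples table further down. That reading is inferred from the data on this page rather than from a documented format, and the helper name below is hypothetical. A minimal sketch that splits identifiers and counts family members per sample:

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# A few identifiers copied from the table above.
scaffold_ids = [
    "3300001686|C688J18823_10587545",
    "3300005435|Ga0070714_100097759",
    "3300005435|Ga0070714_100489095",
    "3300006028|Ga0070717_10005019",
]

# Number of listed scaffolds per sample (taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
print(per_sample)
```

Grouping this way is one possible explanation for why 120 scaffolds map to only 92 associated samples: several samples contribute more than one scaffold.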




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	12.50%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	11.67%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	10.00%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	9.17%
Agricultural Soil	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil	7.50%
Surface Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil	5.83%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	5.00%
Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil	5.00%
Iron-Sulfur Acid Spring	Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring	3.33%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	3.33%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	3.33%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	3.33%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere	2.50%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere	2.50%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	1.67%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	1.67%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	1.67%
Termite Gut	Host-Associated → Arthropoda → Digestive System → Gut → Unclassified → Termite Gut	1.67%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	1.67%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment	0.83%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	0.83%
Tropical Peatland	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland	0.83%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	0.83%
Corn Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere	0.83%
Exposed Rock	Environmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock	0.83%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere	0.83%
Plant Roots	Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots	0.83%



Associated Samples

Taxon OID	Sample Name	Habitat Type
3300001686	Grasslands soil microbial communities from Hopland, California, USA	Environmental
3300002568	Grasslands soil microbial communities from Hopland, California, USA - 2	Environmental
3300002914	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm	Environmental
3300002916	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm	Environmental
3300004114	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5	Environmental
3300004479	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs	Environmental
3300005166	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123	Environmental
3300005174	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129	Environmental
3300005179	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133	Environmental
3300005186	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125	Environmental
3300005187	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124	Environmental
3300005434	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG	Environmental
3300005435	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG	Environmental
3300005436	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG	Environmental
3300005437	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG	Environmental
3300005439	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005530	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG	Environmental
3300005534	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1	Environmental
3300005537	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1	Environmental
3300005542	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1	Environmental
3300005560	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119	Environmental
3300005563	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2	Host-Associated
3300005578	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2	Host-Associated
3300005587	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103	Environmental
3300006028	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG	Environmental
3300006173	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG	Environmental
3300006800	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109	Environmental
3300006893	Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG	Environmental
3300007982	Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG	Environmental
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009038	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG	Environmental
3300009088	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG	Environmental
3300009093	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG	Host-Associated
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300009826	Embiratermes neotenicus P1 segment gut microbial communities from Petit-Saut dam, French Guiana - Emb289 P1	Host-Associated
3300010043	Tropical forest soil microbial communities from Panama - MetaG Plot_26	Environmental
3300010048	Tropical forest soil microbial communities from Panama - MetaG Plot_11	Environmental
3300010049	Embiratermes neotenicus P3 segment gut microbial communities from Petit-Saut dam, French Guiana - Emb289 P3	Host-Associated
3300010320	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015	Environmental
3300010371	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1	Environmental
3300010373	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4	Environmental
3300010396	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2	Environmental
3300010401	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1	Environmental
3300011120	Combined assembly of Microbial Forest Soil metaT	Environmental
3300011271	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG	Environmental
3300012200	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG	Environmental
3300012206	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG	Environmental
3300012209	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG	Environmental
3300012212	Combined assembly of Hopland grassland soil	Host-Associated
3300012350	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG	Environmental
3300012351	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG	Environmental
3300012354	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG	Environmental
3300012356	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG	Environmental
3300012359	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG	Environmental
3300012363	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG	Environmental
3300012469	Combined assembly of Soil carbon rhizosphere	Host-Associated
3300012960	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG	Environmental
3300012984	Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG	Environmental
3300017937	Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_4	Environmental
3300018088	Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MG	Environmental
3300018431	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104	Environmental
3300018468	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111	Environmental
3300019890	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1	Environmental
3300020018	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2	Environmental
3300020579	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M	Environmental
3300021362	Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R09	Environmental
3300021388	Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R8	Host-Associated
3300021403	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O	Environmental
3300022557	Paint Pots_combined assembly	Environmental
3300025905	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)	Environmental
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025913	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)	Host-Associated
3300025929	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)	Environmental
3300025939	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)	Environmental
3300026300	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)	Environmental
3300026527	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)	Environmental
3300026551	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)	Environmental
3300026552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)	Environmental
3300027842	Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)	Environmental
3300027846	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)	Environmental
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300030991Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
C688J18823_105875451 | 3300001686 | Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGAVAVGLVIAAVLQMIHDKP
C688J35102_1202287521 | 3300002568 | Soil | MESKSPHRGIDMHRLPVGGDLPGLVFAVGTALIFVIALPALWYLVVGALAVGLVIAAVLQIVHDRPDESQRSTLKL*
JGI25617J43924_101492591 | 3300002914 | Grasslands Soil | MQNEHPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALVVGLVFAAVLQLVHKKPDETARLSIKI*
JGI25389J43894_10067253 | 3300002916 | Grasslands Soil | MESKSPHRGITMHRLPIGGDFPGLVFAVGSALIFLIALPALWYLVVGALAIGLVIAAVLQIVRDKRDGSKPLSLKI*
Ga0062593_1002045723 | 3300004114 | Soil | MHSENPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTLKI*
Ga0062595_1000438974 | 3300004479 | Soil | MESKAPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQMIHDRPEESQHSTLKL*
Ga0066674_100506011 | 3300005166 | Soil | MQNEHPHRGITMHQLRFGAGFPGLVFAVGTALIFLFAIPALWYVLAAALVVGLLIAALLQILHCNLPGENSGVLKL*
Ga0066680_101606273 | 3300005174 | Soil | MRSRGNAMQNEHPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALAVGLVIAAVLQLVHRKPDETARLSIKI*
Ga0066684_100082182 | 3300005179 | Soil | MESKSPHRGITMHRLPIGGDFPGLVFAVGSALIFLIALPALWYLVVGALAIGLVIAAVLQIVRDKRDGSKPLSLKL*
Ga0066676_110856161 | 3300005186 | Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGTALIFVIALPALWYLVVGALAVGLVIAAVLQMIHDRPDDSKRFSLKI*
Ga0066675_109875942 | 3300005187 | Soil | MQNEHPHRGITMHQLRFGAGFPGLVFAVGTALIFLFAIPALWYVLAAALVVGLLIAALLQILHRNLPGENSGVLKL*
Ga0066675_114438401 | 3300005187 | Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGL
Ga0070709_112024061 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MHQLPVGGDFPGLLFAVGSALIFLLAIPALWYVVAAALAVGCTVAAVLHFTRGKPTDSTRLSFKL*
Ga0070714_1000977592 | 3300005435 | Agricultural Soil | MESKSPHRGIDMHRLPLGGDFPGLVFAVGSALIFLIALPALWYVVVGALAVGLVIAAVLQMVHDRPDDSQRSIFKI*
Ga0070714_1004890951 | 3300005435 | Agricultural Soil | MHQLPVGSNFPGLLFAVGSSLIFLLAIPALWYVIVGAVVVGLVIAAVLQFVHQKPDESEKLTLKI*
Ga0070714_1005914962 | 3300005435 | Agricultural Soil | MENKSPHRGIDMHRLPVGGDFPGLVFAIGSALIFLIALPALWYLVVGALAVGLVIAAVLQLVHDRPDESAKLTLKI*
Ga0070714_1010408291 | 3300005435 | Agricultural Soil | MESKSPHRGIDMHRLPIGGDFPGLVFALGCALIFLIALPALWYVEMGALAVGLVVAATLQIIHDKSDKF*
Ga0070714_1011068062 | 3300005435 | Agricultural Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGTALIFLIALPALWYLVVGALAVGLVIAAVLQMVHDRRDESQRTTLKL*
Ga0070714_1015970492 | 3300005435 | Agricultural Soil | MHRLPVGGDFPGLLFAVGSALIFLIALPALWYVVAGALVVGLVVAAVLQIIHDRPDKSQGSTLKL*
Ga0070713_1000392312 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MHQLPVGSNFPGLLFAVGSSLIFLLAIPALWYVVVGAVVVGLVIAAVLQFVHQKPDESEKLTLKI*
Ga0070713_1023908521 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MESKSPHRGIDMHRLPVGGDFPGLVFAVGTALIFLIALPALWYLVVGALAVGLVIAAVLQMVHDRRDESQ
Ga0070710_109142041 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MESKSPHRGIDMHRLPVGGDFPGLVFAVGTALIFLIALPALWYLVVGALAVGLVIAAVLQMVHDRPDKSQRTTLKL*
Ga0070711_1011778411 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | HRLPVGDNLPGLVFAVGSALIFLFAIPALWYVLVAAVVIGLVVAAVLQILHREHPDETARFMLKI*
Ga0070708_1000149753 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MQNEHPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALVVGLVIAAVLQLVHRKPDETVRLSIKI*
Ga0070706_1001179792 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MHSEYPHRGITMHRLPVGDNFPGLVFAVGSALIFLFAIPALWYVLVAAVAIGLVIAAMLQVVHREHPDETARLTLKI*
Ga0070679_1002855614 | 3300005530 | Corn Rhizosphere | MHSENPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQAL
Ga0070735_1000094725 | 3300005534 | Surface Soil | MESKSPHRGIDMHRLPIGGDIPGLVFAVGCSLIFLIALPALWYVEVGALAVGLLVAATLQIIHGRSDKF*
Ga0070730_107451521 | 3300005537 | Surface Soil | MESKSPHSGITMHRLTVGGNFPGLLFAVGTALIFLLAIPALWFVLVAALAAGFAIAAVLQITHRRPTESTRLSIKI*
Ga0070732_100763453 | 3300005542 | Surface Soil | MESESPHRGITMHRLTVGGNFPGLLFAVGTALIFLLAIPALWFVIVAALAAGFAIAAVFQIVYRKPTESARLSLKI*
Ga0066670_100559575 | 3300005560 | Soil | MENKSPHCGITMHQLPVGGNFPGFVFAVGSALIFLLAIPVLWFVLAAALAVGLIAAAVLQMTRKSPTESSRLSLKI*
Ga0066670_103511462 | 3300005560 | Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQIIHDKPDDSKSFSLKI*
Ga0068855_1004285171 | 3300005563 | Corn Rhizosphere | MHSENPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAAL
Ga0068854_1019061231 | 3300005578 | Corn Rhizosphere | ENPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTLKI*
Ga0066654_100827281 | 3300005587 | Soil | MESKSPHRGITMHRLPIGGDFPGLVFAVGSALIFLIALPALWYLVVGALAIGLVIAAVLQ
Ga0070717_100050197 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MQNEHPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALVVGLVIAAVLQLVHKKPDETARLSIKI*
Ga0070717_1000725111 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MHQLPVGSNFPGLLFAVGSSLIFLLAIPALWYVVAGALAVGLVIAAVLQFVHQKPDESEKLTLKI*
Ga0070717_107440651 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MESKSPHRGITMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQVIHDRTDESQR
Ga0070716_1000172441 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MHSENPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTL
Ga0066660_102595532 | 3300006800 | Soil | MESKSPHRGIDMHRLPIGGDFPGLLFAVGSALIFLIALPALWYVVVGALAVGLVIAAVLQLIHDKSDQTERSTFKR*
Ga0073928_100003725 | 3300006893 | Iron-Sulfur Acid Spring | MESKSPHRGITMHQLQVGSNFPGLLFAVGSSLIFLLAIPALWYLVVGAPAVGLVIAAVLQMVHERPGKSERLTLKL*
Ga0102924_100435711 | 3300007982 | Iron-Sulfur Acid Spring | MRYRRTAMESKSPHRGITMHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYVVAGALVVGLVIAAVLQFVHEKPDESERLTLKL*
Ga0066710_1019026012 | 3300009012 | Grasslands Soil | MQNEHPHRGITMHQLRFGAGFPGLVFAVGTALIFLFAIPALWYVLAAALVAGLLIAALLQILHRDLPGENSGVLKI
Ga0099829_100602955 | 3300009038 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKIWRLRRLDPFNHRRSFF*
Ga0099830_103113662 | 3300009088 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFHGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKIWRLRRLDPFNHRRSFF*
Ga0105240_100366405 | 3300009093 | Corn Rhizosphere | MESKSPHRGIDMHRLPIGGDFPGLVFAVGCALIFLIALPALWYVEVGALAVGLVVAATLQIIHDKSDKF*
Ga0105240_117726261 | 3300009093 | Corn Rhizosphere | MESKSPHRGIDMHRLPVGGDFPGLVFAVGTALIFLIALPALWYLVVGALAVGLVIAAVLQVIYDKKDISRRSVFKI*
Ga0066709_1003542722 | 3300009137 | Grasslands Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQMIHDKPDDSKSFSLKI*
Ga0066709_1011073842 | 3300009137 | Grasslands Soil | MQNEHPHRGITMHQLRFGAGFPGLVFAVGTALIFLFAIPALWYVLAAALVAGLLIAALLQILHRDLPGENSGVLKI*
Ga0066709_1016481422 | 3300009137 | Grasslands Soil | MQSEYPEHPHRGITMHRLTVGADFPGLVFAVGSMLIFQLAIPALWYVVAGALVVGLVIAAVLQLVHEKPDESARLTIKG*
Ga0123355_105960821 | 3300009826 | Termite Gut | MRMMDTKFPHRGITMHRLPVGGDFPGLLFTVGSSLIFLLAIPALWYVLVGAVAVGVLIAGVLQIIRHSHESRPDEKVLFR*
Ga0126380_101137012 | 3300010043 | Tropical Forest Soil | MMMDTKSPHRGITMHRLPVGGDFPGLIFAVGSVLIFLLAIPALWYVLVGAVALGLFIAGVLQIIRHRHES
Ga0126373_102029803 | 3300010048 | Tropical Forest Soil | MGGEKEMEGKFPHRGITMHRLPVGGNFPGLVFALGTASIFLFAVPALWYVLVAALTAGFAIAAVLQLAHRKPLESKRLSLDI*
Ga0123356_109936752 | 3300010049 | Termite Gut | MHRLPVGGDFPGLLFAVGCALIFLLAIPALWYVLVGAMAAGLFVAGVLQISRRSRESRPDEKILFR*
Ga0134109_103109851 | 3300010320 | Grasslands Soil | EHPHRGITMHQLRFGAGFPGLVFAVGTALIFLFAIPALWYVLAAALVVGLLIAALLQILHRNLPGENSGVLKL*
Ga0134125_100635153 | 3300010371 | Terrestrial Soil | MEGKSPHRGIDMHRLPVGGNFPGLVFAVGTALIFLIALPALWYVVAGALAIGLVIAAVLQTMRDKADTSQHSIYKI*
Ga0134125_110069432 | 3300010371 | Terrestrial Soil | MESKSPHRGIDMHRLPIGGDIPGLVFAVGCSLIFLIALPALWYVEVGALAVGLLVAATLQIIHDRSDKF*
Ga0134128_101513234 | 3300010373 | Terrestrial Soil | MHSENPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPD
Ga0134126_1000455114 | 3300010396 | Terrestrial Soil | MHSEHPHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTLKI*
Ga0134126_100156122 | 3300010396 | Terrestrial Soil | MDGEKSMESKSPHLGIDMHRLPIGGDIPGLVFAVGCSLIFLIAVPALWYLEVGALAVGLLVAATLQIIHDRSDKF*
Ga0134121_100581683 | 3300010401 | Terrestrial Soil | MHQLPVSGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTLKI*
Ga0150983_167196621 | 3300011120 | Forest Soil | KVMESKSSHRGITMHQLPGGSNFPGLLFAVGSSLIFLLAIPALWYVVAGAVVVGLVIAAVLQLIHQKPDESKKLTLKI*
Ga0137393_102149752 | 3300011271 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDNTARLSIKIWRLRRLDPFNHRRSFF*
Ga0137382_112310102 | 3300012200 | Vadose Zone Soil | MQNEHPEHPHRGITMHRLQVGGNFPGLVFAVGSVLIFLLAIPALWVVVAGALAVGLVIAAVLQLVHSKPDESVRLTIKS*
Ga0137380_100055633 | 3300012206 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKI*
Ga0137379_100353993 | 3300012209 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAGVLQLVHRKPDKTARLSIKI*
Ga0150985_1161413304 | 3300012212 | Avena Fatua Rhizosphere | MESKSPHRGIDMHRLPVGGDLPGLVFAVGTALIFLIALPALWYVVVGALAVGLVIAAVLQIVHDRPDQSQKLTLKI*
Ga0137372_102299092 | 3300012350 | Vadose Zone Soil | MHQLPVGSSFPGLLFAVGSALIFLLAIPALWYVLAAAVVVGLLIAAVLQTIHESGKRRKTPHIVP*
Ga0137386_102966162 | 3300012351 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAMGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKI*
Ga0137366_101713872 | 3300012354 | Vadose Zone Soil | MHQLPVGSSFPGLLFAVSSALIFLLAIPALWYVLAAAVVVGLLIAAVLQTIHESGKRRKTPHIVP*
Ga0137371_103206901 | 3300012356 | Vadose Zone Soil | MEDGCSEHPHRGITMHRLSVGANFPGLLFAVGSALIFLIAIPALWYVVAGALAVGLVIAAVLQLVHNKPDEFSRLSIKSDAL
Ga0137385_101513142 | 3300012359 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKIWRLRGLDPFNHRRSFFEPRYPIGSE*
Ga0137390_100885992 | 3300012363 | Vadose Zone Soil | MQNEHPEHPHLVISMHRVSVGGDLHGLVFAVGSVLIFLLAIPALWYVVGGALVVGLVIAAVLQLVHKKPDETARLSIKI*
Ga0137390_101048383 | 3300012363 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKIWRLRRLDPFNHRRSFF*
Ga0150984_1014755042 | 3300012469 | Avena Fatua Rhizosphere | MESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQMIHDKPDDSKNFSLKI*
Ga0150984_1025718852 | 3300012469 | Avena Fatua Rhizosphere | ESKSPHRGIDMHRLPVGGDLPGLVFAVGTALIFVIALPALWYLVVGALAVGLVIAAVLQILHDRPDQSQKLTLKI*
Ga0150984_1063851701 | 3300012469 | Avena Fatua Rhizosphere | VGESMESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFVIALPALWYLVVGALAVGLVTAAVLQILHDRPDDSKRFSLKI*
Ga0164301_108564522 | 3300012960 | Soil | MHSENLHRGITMHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTLKI*
Ga0164309_108608952 | 3300012984 | Soil | MHQLPVAGNIPGLIFAVGSALIFLFAIPALWYVLVAAVAVGLVIAALLQALHQERPDETARLTLKI*
Ga0187809_101361412 | 3300017937 | Freshwater Sediment | MASEHPHRGITMHQLPVAGNFPGFLFAVGSALIFLLAIPALWYVVVAALATGLVIAALLQLLHRKYPDEAAEFHLNV
Ga0187771_115020391 | 3300018088 | Tropical Peatland | MMNGPPRESAKPHRGITMHKLPVGGDFPGLVFAVGSVAIFLIALPSLWYFVGFAALLGLGVAALLHFARPSRN
Ga0066655_104541042 | 3300018431 | Grasslands Soil | MENKSPHCGITMHQLPVGGNFPGFVFAVGSALIFLLAIPVLWFVLAAALAVGLIAAAVLQMTRKSPTESSRLSLKI
Ga0066662_104281042 | 3300018468 | Grasslands Soil | MESKPPHRGITMHRLPIGGDFPGLVFAVGSALIFLIALPALWYLVVGALAIGLVIAAVLQIVRDKRDGSKPLSLKL
Ga0066662_127531082 | 3300018468 | Grasslands Soil | MDSKYPHRGITMHQLPVSGNFPGLLFAVGSSLIFLLAIPALWYVLAGGLAVGLVIAAVLRIIHDKPKDLSTDILDGSGSSTNKT
Ga0193728_13193512 | 3300019890 | Soil | MQNEHPEHPHRGITMHRLQVGGNFPGLVFAVGSVLIFLLAIPALWVVVAGALAVGLVIAAVLQLVHEKPDKSVRLTIKS
Ga0193721_11360362 | 3300020018 | Soil | MQNEHPHRGITMHQLRFGAGFPGLVFAVGTALIFLFAIPALWYVLAAALVVGLLIAALLQILHRNLPGENSGVLKL
Ga0210407_103331362 | 3300020579 | Soil | MESKSPHRGITMHQLPVGSNFPGLLFAVGSSLIFLLAIPALWYVVAGAIVVGLVIAAVLQLVHERPDESKKLTLKI
Ga0213882_105232861 | 3300021362 | Exposed Rock | MAAQYPHRGITMHRLPVDGNYPGAIFTLASALIFLLAVPALWYLLIAAGAAGFAIAAVLDLVHQKPVEAKRLSLLV
Ga0213875_100810002 | 3300021388 | Plant Roots | VEGKYPHRGITMHRLGVGGNVPGMVFAVGTALIFLLAVPALWYVLMAALAAGFAIAAVLQFAHRKPPKSERLFLNI
Ga0210397_112612992 | 3300021403 | Soil | MESKSPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQIIHDKTNGYQGSIFKI
Ga0212123_10000127239 | 3300022557 | Iron-Sulfur Acid Spring | MESKSPHRGITMHQLQVGSNFPGLLFAVGSSLIFLLAIPALWYLVVGALAVGLVIAAVLQMVHERPGKSERLTLKL
Ga0212123_100193809 | 3300022557 | Iron-Sulfur Acid Spring | MESKSPHRGITMHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYVVAGALVVGLVIAAVLQFVHEKPDESERLTLKL
Ga0207685_103469192 | 3300025905 | Corn, Switchgrass And Miscanthus Rhizosphere | MQSEYPHRGITMHRLPVGDNLPGLVFAVGSALIFLFAIPALWYVLVAAVVIGLVVAAVLQILHREHPD
Ga0207684_101997632 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MQNEHPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALVVGLVIAAVLQLVHRKPDETVRLSIKI
Ga0207684_104978342 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MHSEYPHRGITMHRLPVGDNFPGLVFAVGSALIFLFAIPALWYVLVAAVAIGLVIAAMLQVVHREHPDETARLTLKI
Ga0207695_100349915 | 3300025913 | Corn Rhizosphere | MESKSPHRGIDMHRLPIGGDFPGLVFAVGCALIFLIALPALWYVEVGALAVGLVVAATLQIIHDKSDKF
Ga0207664_100363982 | 3300025929 | Agricultural Soil | MESKSPHRGIDMHRLPLGGDFPGLVFAVGSALIFLIALPALWYVVVGALAVGLVIAAVLQMVHDRPDDSQRSIFKI
Ga0207664_111236442 | 3300025929 | Agricultural Soil | MESKSPHRGIDMHRLPIGGDFPGLVFALGCALIFLIALPALWYVEMGALAVGLVVAATLQIIHDKSDKF
Ga0207664_113260331 | 3300025929 | Agricultural Soil | MESKAPHRGIDMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQMIHDRPEESQHSTLKL
Ga0207665_113993862 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | MESKSPHRGITMHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYVVAGALAVGLVIAAVLQ
Ga0209027_11150902 | 3300026300 | Grasslands Soil | MESKSPHRGITMHRLPIGGDFPGLVFAVGSALIFLIALPALWYLVVGALAIGLVIAAVLQIVRDKRDGSKPLSLKI
Ga0209059_11456441 | 3300026527 | Soil | MHRLPIGGDFPGLVFAVGSALIFLIALPALWYLVVGALAIGLVIAAVLQIVRDKRDGSKPLSLKL
Ga0209648_100001158 | 3300026551 | Grasslands Soil | MQNEHPEHPHRGITMHRLSVGGDFPGLVFAVGSVLIFLLAIPALWYVVGGALVVGLVFAAVLQLVHKKPDETARLSIKI
Ga0209577_101591921 | 3300026552 | Soil | MENKSPHCGITMHQLPVGGNFPGFVFAVGSALIFLLAIPVLWFVLAAALAVGLIAAAVLQ
Ga0209580_100005466 | 3300027842 | Surface Soil | MESESPHRGITMHRLTVGGNFPGLLFAVGTALIFLLAIPALWFVIVAALAAGFAIAAVFQIVYRKPTESARLSLKI
Ga0209580_100078664 | 3300027842 | Surface Soil | MESKSPHSGITMHRLTVGGNFPGLLFAVGTALIFLLAIPALWFVLVAALAAGFAIAAVLQITHRRPTESTRLSIKI
Ga0209580_102413542 | 3300027842 | Surface Soil | LESESPHRGITMHRLTVGGNFPGLLFAVGTALIFLLAIPALWFVVVAALAAGFAIAAVFQIVHRKPTESARLSLKI
Ga0209180_101139882 | 3300027846 | Vadose Zone Soil | MQNEYPEHPHRGITMHRLSVGGDFPGLVFAIGSVLIFLLAIPALWYVVGGALVLGLVIAAVLQLVHRKPDKTARLSIKIWRLRRLDPFNHRRSFF
Ga0209168_1000201518 | 3300027986 | Surface Soil | MESKSPHRGIDMHRLPIGGDIPGLVFAVGCSLIFLIALPALWYVEVGALAVGLLVAATLQIIHGRSDKF
Ga0073994_122435941 | 3300030991 | Soil | MESKSPHRGITMHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYVVAGALVVGLVIAAVLQFVHQKPDASERLTLKI
Ga0170834_1042357301 | 3300031057 | Forest Soil | MESKSPHRGITMHRLPVGGDFPGLLFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQTIHDMTNVSQGSIFKL
Ga0170834_1077707873 | 3300031057 | Forest Soil | MHSEYPHRGITMHRLPVGDNLPGLVFAVGSALIFLFAIPALWYVLVAAVVIGLVVAALLQVVHREHADETARLMLKI
Ga0170824_1086090693 | 3300031231 | Forest Soil | MHSEYPHRGITMHRLPVGDNLPGLVFAVGSALIFLFAIPALWYVLVAAVVIGLVIAALLQVVHQRNPGETSRLTLKI
Ga0170824_1124523314 | 3300031231 | Forest Soil | MHSEYPHRGITMHRLPVGDNLPGLIFAVGSALIFLFAIPALWYVMVAAVAIGLVIAAVLQVLHQRHPDETARLSLKL
Ga0170818_1069703842 | 3300031474 | Forest Soil | MHSEYPHRGITMHRLPVGDNLPGLVFAVGSALIFLFAIPALWYVLVAAVVIGLVIAALLQVVHQRNPGETSRLALKI
Ga0170818_1115763571 | 3300031474 | Forest Soil | MHNEYPHRGITMHRLPVGDNLPGLIFAVGSALIFLFAIPALWYVMVAAVAIGLVIAAVLQVLHQRHPDETARLSLKL
Ga0307476_10000001620 | 3300031715 | Hardwood Forest Soil | MESKSPHRGITMHRLPVGGDFPGLVFAVGSALIFLIALPALWYLVVGALAVGLVIAAVLQVIHDKPDESQRTTLKL
Ga0307475_101206233 | 3300031754 | Hardwood Forest Soil | MESKSPHRGITLHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYLVAGALVVGLVIAAVLQFLHHKPDESERLTLKL
Ga0307478_108812561 | 3300031823 | Hardwood Forest Soil | MESKSPHRGITLHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYLVAGALVVGLVIAAVLQFLHHKPD
Ga0310910_113736991 | 3300031946 | Soil | MMLMEPKPPHRGITMHRLPVGGNFPGLLFAIGTALIFLLAIPALWYVLVGAVAMGLFIAGVLQIIRHSRESQPDGRVLFR
Ga0307479_106415011 | 3300031962 | Hardwood Forest Soil | MESKSPHRGITLHRLPVEGNFPGLVFAVGSSLIFLLAIPALWYLVAGALVVGLVIAAVLQFVHEKPDESERLTL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice