NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099635


Overview

Basic Information
Family ID F099635
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 109 residues
Representative Sequence MNRLSLFVLGLLLFSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVATDVKQ
Number of Associated Samples 89
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 95.45 %
% of genes near scaffold ends (potentially truncated) 16.50 %
% of genes from short scaffolds (< 2000 bps) 17.48 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.50


Most Common Taxonomy
Group Unclassified (82.524 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(33.010 % of family members)
Environment Ontology (ENVO) Unclassified
(33.010 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.369 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide Yes
Secondary Structure distribution α-helix: 71.43 %, β-sheet: 0.00 %, Coil/Unstructured: 28.57 %

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.50

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
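
The confidence rules quoted above can be written as two small checks. This is a minimal sketch, not NMPFamsDB code: the function names are hypothetical, the 0.7 cutoff is the one stated in the note, and treating the pLDDT bands as closed at their upper bound (so 50.5 falls in "51-70") is an assumption.

```python
def ptm_is_low_confidence(ptm: float, threshold: float = 0.7) -> bool:
    """Return True when a model falls below the pTM cutoff quoted on the page."""
    return ptm < threshold


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four bands in the legend.

    Assumption: each band includes its upper bound, so non-integer values
    such as 50.5 are assigned to the next band up.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {plddt}")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The family's reported pTM-score is 0.50, which is below the 0.7 cutoff.
print(ptm_is_low_confidence(0.50))
print(plddt_band(65))
```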



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 103 Family Scaffolds
PF00108 | Thiolase_N | 10.68
PF07228 | SpoIIE | 1.94
PF03167 | UDG | 1.94
PF15594 | Imm50 | 1.94
PF13620 | CarboxypepD_reg | 1.94
PF14559 | TPR_19 | 1.94
PF02223 | Thymidylate_kin | 1.94
PF01022 | HTH_5 | 1.94
PF00069 | Pkinase | 1.94
PF00881 | Nitroreductase | 1.94
PF00072 | Response_reg | 0.97
PF00989 | PAS | 0.97
PF00501 | AMP-binding | 0.97
PF02347 | GDC-P | 0.97
PF00041 | fn3 | 0.97
PF00561 | Abhydrolase_1 | 0.97
PF15780 | ASH | 0.97
PF00970 | FAD_binding_6 | 0.97
PF02371 | Transposase_20 | 0.97
PF07238 | PilZ | 0.97
PF01740 | STAS | 0.97
PF00589 | Phage_integrase | 0.97
PF01638 | HxlR | 0.97
PF02915 | Rubrerythrin | 0.97
PF02700 | PurS | 0.97
PF08241 | Methyltransf_11 | 0.97
PF10431 | ClpB_D2-small | 0.97
PF01527 | HTH_Tnp_1 | 0.97
PF05746 | DALR_1 | 0.97
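
Because the frequencies above are stated relative to 103 family scaffolds, each percentage can be turned back into a scaffold count. The sketch below does that and returns `None` when no whole number of scaffolds reproduces the reported value. The function name `implied_count` is hypothetical, and the assumption that the page rounds to two decimals is inferred from the figures shown.

```python
import math
from typing import Optional


def implied_count(percent: float, total: int = 103, decimals: int = 2) -> Optional[int]:
    """Recover the integer count behind a rounded percentage of `total`.

    Returns None when the nearest integer count does not round back to
    `percent`, which suggests the percentage uses a different denominator.
    """
    count = round(percent * total / 100)
    recomputed = round(100 * count / total, decimals)
    if math.isclose(recomputed, round(percent, decimals), abs_tol=1e-9):
        return count
    return None


# Frequencies from the Pfam table, all relative to 103 scaffolds.
for pct in (10.68, 1.94, 0.97):
    print(pct, implied_count(pct))
```

Under the same assumption, the 95.45 % reported for valid RBS motifs in the Quality Assessment block yields `None`, so that figure is presumably computed over a different denominator than the 103 family members.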

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 103 Family Scaffolds
COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 10.68
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 7.77
COG0125 | Thymidylate kinase | Nucleotide transport and metabolism [F] | 1.94
COG0692 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 1.94
COG1573 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 1.94
COG3663 | G:T/U-mismatch repair DNA glycosylase | Replication, recombination and repair [L] | 1.94
COG0018 | Arginyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.97
COG0403 | Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain | Amino acid transport and metabolism [E] | 0.97
COG0751 | Glycyl-tRNA synthetase, beta subunit | Translation, ribosomal structure and biogenesis [J] | 0.97
COG1003 | Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain | Amino acid transport and metabolism [E] | 0.97
COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 0.97
COG1828 | Phosphoribosylformylglycinamidine (FGAM) synthase, PurS subunit | Nucleotide transport and metabolism [F] | 0.97
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.97



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 82.52 %
All Organisms | root | All Organisms | 17.48 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300001545|JGI12630J15595_10078191 | All Organisms → cellular organisms → Bacteria | 650
3300005181|Ga0066678_10005798 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5655
3300005598|Ga0066706_11420321 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 523
3300005944|Ga0066788_10202702 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Corynebacteriaceae → Corynebacterium → Corynebacterium kutscheri | 516
3300006175|Ga0070712_101786383 | Not Available | 538
3300006914|Ga0075436_100010853 | All Organisms → cellular organisms → Bacteria | 6249
3300010134|Ga0127484_1009056 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 830
3300012202|Ga0137363_10985314 | Not Available | 716
3300012362|Ga0137361_11221861 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus | 675
3300012685|Ga0137397_10183227 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 1556
3300012685|Ga0137397_10791490 | Not Available | 703
3300012922|Ga0137394_10336221 | All Organisms → cellular organisms → Bacteria | 1289
3300012922|Ga0137394_11421890 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus | 554
3300012925|Ga0137419_11760163 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus | 530
3300012944|Ga0137410_10304882 | All Organisms → cellular organisms → Bacteria | 1262
3300012972|Ga0134077_10247333 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 737
3300014494|Ga0182017_10000283 | All Organisms → cellular organisms → Bacteria | 39263
3300016445|Ga0182038_10291660 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1330
3300016445|Ga0182038_11550737 | Not Available | 595
3300026555|Ga0179593_1072364 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1836
3300026555|Ga0179593_1228945 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2590
3300027660|Ga0209736_1023894 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1857
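
Each scaffold reference above pairs two identifiers around a `|`. The numeric prefix appears to be the Taxon OID listed under Associated Samples, since the same 10-digit values occur there. A minimal parsing sketch under that assumption follows; `ScaffoldRef` and `parse_scaffold_ref` are hypothetical names, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric prefix, assumed to be the sample's Taxon OID
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold_ref(ref: str) -> ScaffoldRef:
    """Split a '<taxon_oid>|<scaffold_id>' reference into its two parts."""
    taxon_oid, sep, scaffold_id = ref.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a '<taxon_oid>|<scaffold_id>' reference: {ref!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold_ref("3300001545|JGI12630J15595_10078191")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids accidental arithmetic on an identifier and preserves it exactly for lookups against the sample list.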




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 33.01%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.68%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.83%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 5.83%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.83%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.88%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 2.91%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.91%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.94%
Fen | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Fen | 1.94%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 1.94%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.97%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.97%
Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Bulk Soil | 0.97%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.97%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.97%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.97%
Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil | 0.97%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.97%
Ectomycorrhiza | Host-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza | 0.97%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.97%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001545 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300004120 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF238 (Metagenome Metatranscriptome) | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005944 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 2 DNA2013-048 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010134 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012386 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014494 | Permafrost microbial communities from Stordalen Mire, Sweden - 712E3D metaG | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017961 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019788 | Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (PacBio error correction) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021444 | Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R02 | Environmental
3300024182 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10 | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027297 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF047 (SPAdes) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027660 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes) | Environmental
3300027698 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM2 (SPAdes) | Environmental
3300027812 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes) | Environmental
3300028146 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK23 | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300031469 | Fir Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033180 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 12_EM | Host-Associated
3300033402 | Lab enriched peat soil microbial communities from McLean, Ithaca, NY, United States - MB31MN | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_105389181 | 3300000364 | Soil | MNRSSILALGFLLSLTPCFGQATPSESQTLQALLSEVRQLRQDLQTTTIAVQRAQILLYRVQGQEAAVARASQRLDGARERLAAIQD
JGI12630J15595_100781912 | 3300001545 | Forest Soil | MNRSSLFALVLLLLGTTCFGQTTSGDSQTLQALLFEVRQLRQDMHTTIIASQRAQILIYRLQGQEVAVARASQRLDDVRDRLARIQDERKHVAIDVKQFEDFVSNAENPATQRKEREDRLPQLKTRLESLEYEE
JGIcombinedJ26739_1014390852 | 3300002245 | Forest Soil | MNRSSLFALVLLLLGTTCFGQTTSGDSQTLQALLLEVRQLRQDMHTTIIAAQRAQILIYRLQGQEAAVARASQRLDDIRDKLARIQDERKHVATDVKQLEDFVSNTENPAT
JGI25613J43889_101596791 | 3300002907 | Grasslands Soil | MNRSSLLVLSLLLSSTACFGQTTPGDSQTLQALLLEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAMARAAQRLDEAREKLARIQDERKHVATDLKRQ
Ga0058901_15682881 | 3300004120 | Forest Soil | MNRTSLFVLGFLVFSTSCFGQTTPGDSQTLQALLTEIRLLRQDLRTTTVAAQRSQILIYRLQGQEAAVARASQRLDEARDKLARTQDERKHVAAEVKRTEDFVSNTENPA
Ga0062386_1013743261 | 3300004152 | Bog Forest Soil | MNRSSLFVLGLLLSSAACYGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILLYRLQGQEAAVARASQRLDEAREKLAGIQAQRKYLATDVKRHEDFISNTENPPTQRK
Ga0066683_104367631 | 3300005172 | Soil | MNRSSLFVLGLLLLSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDDRKHVATDVKQ
Ga0066678_100057988 | 3300005181 | Soil | MNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFREKLARIQDERKHVAADVKRFEDSLSSSENPPTQRKEIEQGMLPQLKTRLESLENQEQ
Ga0070709_104697901 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MNRSSFLFLGLVLLPRGCFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGEEAAVARASQRLDDARDRLARVQDERKH
Ga0066686_100495304 | 3300005446 | Soil | MNRSSFLMLGLVLGFLSFSTTCFGQTTTGDSQRLQALLSEIRLLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDDARDKLARIQDERKHVS
Ga0070707_1003913522 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MLSLLLFSTACFGQTTPVDSQTLQALLMTIAGQRAQILIYRLQGQEAAVARASQRLDEARDKLARIQDQRKQVVSETKRTESFISNTDNPPTQRKELEDICKCSRSWRTSNRHNSRQD*
Ga0066697_104456471 | 3300005540 | Soil | VLGLLIFSPDCFGQSTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLTRNQDERKHVAADVKRFEDSISD
Ga0066701_103902382 | 3300005552 | Soil | MNRSSFLMLGLVPGLLSFSTACFGQTTPTDSETLQALLSEIRLLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDDARDKLARIQDE
Ga0066703_100739281 | 3300005568 | Soil | MKRSSFLVLDILSFSTTCFGQTTTTDSQTLQALLSEIRLLRQDMQTTTIAAQRAQILIYRLQGQEAAVARASQRHDDARDKLARIQDERKHVAADVKQQEDFISNLENPATQRKELEGVV
Ga0066694_102763831 | 3300005574 | Soil | MKRSSLFVLSLLLFPTACFGQSTPSDSQTLQALLLEIRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAPDKLARIQDQRKQVVGEI
Ga0066706_100425515 | 3300005598 | Soil | MNRSGLFVLGLLLFSTACFGETIPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLEEAREKLARTQDERKHVAADVKQQEEFISN
Ga0066706_114203212 | 3300005598 | Soil | MKRSFPVLGLLSFSTTCFGQTTTTDSQTLQALLSEIRLLRQDMQTTTIAAQRAQFSSTGCRGRKRLSRASQRLDDARDKLARIQDERKHVASDVKQTEDFINNTENPATQRKELEDRVRQLKTRLELL
Ga0066788_102027021 | 3300005944 | Soil | MKRWSILGVTLLLCSAGCFGQVATGDSPTLQELLSEVRQLRQDVQATFIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARTEDERKRIAAQVKRMEDFLGDNDNSGTERKQVETNLAQLKARLESLDVDEQQNQSR
Ga0070712_1011141402 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MNRLSVITVGILLFSATCFAQTTTNDSQTLQALLSEVRQMRQDLRTTTIAAQRSQILIYRLQGQEARVARASERLDEAREKQARIQDERRHVAAEVKQNEDFVNNSENPAT
Ga0070712_1017863831 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MNRSSFLFLGLVLLPRGCFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGEEAAVARASQRLDDARDRLARVQDERKHVTADIKKFDDSVNSAENPDAQRKEIEEGLLPQLKARLESLESQEQQLQTREVE
Ga0070765_1000080684 | 3300006176 | Soil | MNRSSLLTVSLLLFSAACFGQTSQTDSQTLLALLSEVRQLRQDMRVTIIAAQRAQVLIYRLQAQEADVARESQRLDEAREKLGRIQDRRKHETAELKMVEDFIANTENSATQR*
Ga0079222_108446452 | 3300006755 | Agricultural Soil | MNRSGLFVLGLLLFSTDCFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLEEAREKLARSQDERKHVAADV
Ga0079222_118559041 | 3300006755 | Agricultural Soil | MNRSSFLFLGLALLPPGCFGQTTSGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGEEAAVARASQRLDDVRDRLARVQDERKHVAADIKKFDDSVNSAENTDAQRKEIEEG
Ga0066665_102904271 | 3300006796 | Soil | MKRSFPVLGLLSFSTTCFGQTTTTDSQTLQALLSEIRLLRQDMQTTTIAAQRAQFSSTGCRGRKRLSRASQRLDDARDKLARIQDERKHVASDVKQTEDFINNTQNPATQRKELEDRVR
Ga0075436_1000108538 | 3300006914 | Populus Rhizosphere | MNRSSFLFLGLALLPPGCFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGEEAAVARASQRLDDVRDRLARVQDERKHVAADIKKFDDSVNSAENTDAQRKEIEEGLLPQLK
Ga0099794_108019861 | 3300007265 | Vadose Zone Soil | MHRSSFFLLGFLLISTAAFGQTSSTDSQTLQALLAEVRQLRQDLQNITVAAQRAQILVYRLQLQQAAVARASQRLDDARSKLEAGQAN
Ga0099828_113975511 | 3300009089 | Vadose Zone Soil | MNRSSLFVLGLLLLSTACFGQTTPADSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVATDVKQHEDF
Ga0099792_109544871 | 3300009143 | Vadose Zone Soil | MNRSSLFVLNLLLFSTACFGQTTPSDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEARDKLARI
Ga0127484_10090562 | 3300010134 | Grasslands Soil | MNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFREKLARIQDERKHVAADVKRFEDSLSSSENPPTQRKEIEQGMLPQLKTR
Ga0134067_103526521 | 3300010321 | Grasslands Soil | MNRSALFVLGLLLFSTACFGQTIPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERK
Ga0126370_111339862 | 3300010358 | Tropical Forest Soil | MNRLSLLVLGILLCSTASSGQTSPSDSQTLQVLLSEIHQLRLDLQTRIIAVQRGQILIYRLQGQEAIVARASQHLDEARDKLKKIQEERENVTTEIKQGE
Ga0137392_101333783 | 3300011269 | Vadose Zone Soil | MNRSSLVALSHLLFSAACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVATDINHH*
Ga0137391_107027762 | 3300011270 | Vadose Zone Soil | MNRLSLFVLGLLLFSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVATDVKQ
Ga0137393_114868262 | 3300011271 | Vadose Zone Soil | MNRSSLFVLGLLLFSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDE
Ga0137364_100456473 | 3300012198 | Vadose Zone Soil | MNRSALFVLGLLLFSTDCFGQSTPGDSQALQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVAADVKRFED
Ga0137383_107876912 | 3300012199 | Vadose Zone Soil | MNRSSLFVLGLLLLSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVAT
Ga0137363_109626562 | 3300012202 | Vadose Zone Soil | MKRSSLLVLSLLVFSTACFGQTTSGDSQTLQALLLELRQLRQDLQTTTIAAQRAQILIYRLQGQGAAVARASQRLDEARDKLARIQDERKHVA
Ga0137363_109853142 | 3300012202 | Vadose Zone Soil | MNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLLEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVTRASQRLDEAREKVARIQDERKHVAADVKRFEDSLSGTVSPATQRKDIEQGVLPQLKTRLE
Ga0137380_108139002 | 3300012206 | Vadose Zone Soil | MNRFSLFVLGLLLFSTACFAQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVAT
Ga0137381_108779271 | 3300012207 | Vadose Zone Soil | MNRLSLLVLGPLLFSTACFGQTTPRDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVATD
Ga0137381_110416552 | 3300012207 | Vadose Zone Soil | MNRFSLFVLGLLLFSTACFAQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDE
Ga0137379_112703881 | 3300012209 | Vadose Zone Soil | MNRLSLLVLGPLLFSTACFGQTTPRDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKH
Ga0137377_104756261 | 3300012211 | Vadose Zone Soil | MKRSFLVLGLLSFSTTCFGQTTTTDAQTLQALLSEIRLLRQDMQTTTIAAPRAQILIYQLQGQEAAVARAPRRLDDARDKLARIQDERKPVASDAKQGEDFISSTE
Ga0137371_108911082 | 3300012356 | Vadose Zone Soil | MNRLSLLVLGPLLFSTACFGQTTPRDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVAT
Ga0137385_110277192 | 3300012359 | Vadose Zone Soil | MNRSSLFVLGLLLFSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVA
Ga0137361_112218611 | 3300012362 | Vadose Zone Soil | MNRSSLVALSHLLFSAACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVATDIKQHENFMSNTENPPTQRKEVEAVLPGLKTRLESLRLTEAIQVLS*
Ga0137361_115398962 | 3300012362 | Vadose Zone Soil | MLGLVLGFLSFSTTCFGQTTTSDSQTLQALLSEIRLLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQWLDDARDKLARIQDERKRAASDAEHREDFISSKRESCNATERA*
Ga0134046_12446431 | 3300012386 | Grasslands Soil | MNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFR
Ga0137358_110616441 | 3300012582 | Vadose Zone Soil | MNPSSLFVLGFLLSSTACFGQTTPADSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEARERVARIQDQ
Ga0137397_101832272 | 3300012685 | Vadose Zone Soil | MNRSSLVVLGHLLFSAACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVAHASQRLDDGRAKLAGTQSERKRLAAEVKQQEDFISNTENPPAQRKEVEAVLPQRKTRLEWLENEEPQ*
Ga0137397_107914902 | 3300012685 | Vadose Zone Soil | MNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLFEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEAREKVERIQDERKHVAADVKRFEDSLSGTVNPATQRKDIEEGVLPQLKTRLESLGNQE
Ga0137395_100579664 | 3300012917 | Vadose Zone Soil | MNRSSLFVLNLLLFSTACFGQTTPSDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEARDKLARIEEQRQQVVTAIKQTEGLISNTDNP
Ga0137396_101439341 | 3300012918 | Vadose Zone Soil | MNRSSLFVLGFLLSSTACFGQTTPTDSQTLQALLLEVRHLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEARDKLARIQEQRQQVVTAIKRTE
Ga0137394_103362211 | 3300012922 | Vadose Zone Soil | MNRSSLVVLGHLLFSAACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVAHASQRLDDARAKLAGTQSERKRLAAEVKQQEDFISNTENPPAQRKEVEAVLPQRKTRLEWLENEEPQ*
Ga0137394_114218901 | 3300012922 | Vadose Zone Soil | MNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLFEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEAREKVERIQDERKHVAADVKRFEDSLSGAVNPATQRKDIEEGVLPQLKTRLESLGNQEQQLQTREIE
Ga0137419_115839101 | 3300012925 | Vadose Zone Soil | MNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLLEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEAREKV
Ga0137419_117601631 | 3300012925 | Vadose Zone Soil | ACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVAHASQRLDDGRAKLAGTQSERKRLAAEVKQQEDFISNTENPPAQRKEVEAVLPQRKTRLEWLENEEPQ
Ga0137416_102040711 | 3300012927 | Vadose Zone Soil | MNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLFEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEARERVARIQDQREHVAADVKRFEDSLSGTVNPATQRKDIE
Ga0137410_103048823 | 3300012944 | Vadose Zone Soil | MNRSSLFVLSLLFFPTACFGQSTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERK
Ga0134077_102473331 | 3300012972 | Grasslands Soil | MNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFREKLARIQDE
Ga0134075_100200764 | 3300014154 | Grasslands Soil | MNRLPLIVLGLLLISPDCFGQSTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRMDEARDKLARIQDERK
Ga0134078_100284641 | 3300014157 | Grasslands Soil | MNRSGLFVLGLLLFSTACFGETIPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLEEAREKLARTQDERKHVAADVKQQEEFISNMENPAAERKEVERML
Ga0182017_1000028324 | 3300014494 | Fen | MNRSSLSLLVLLLFSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILLYRLQGQEAAVARASQRIDEAREKLAGIQAQRKYSATEVKRHEDFISNTENPPTQRKEFEERHPISNQNSNRQKIWNNSNKRKK*
Ga0182036_102666401 | 3300016270 | Soil | MNPRSSLLALGCVLFATTCFGQSAPGDSQTLQALLSEVRQLRQELRTTTIAAQRSQILIYRLQGREASVARASQRLDEAREKLARIQDARKHLAADVKQTEDFVNNTDNPAAQ
Ga0182041_100750835 | 3300016294 | Soil | MNRRLSFLALGCLLFATASFGQYAPSDSQTLQALLSEVRQLRQELRQELRTTTIAAQRSQILIYRLQGQEAVVVRASQRLGETRDKLARTRDAQMHVAADIKQTEDFVNNTDNPAAQRK
Ga0182041_108397682 | 3300016294 | Soil | MYPRSSLLALGCLLFATTCFGQSAPGDSQTLQSLLSEVRQLRQELRATTIAAQRSQILIYRLQGQEASVARASQRLDEAREKLARTRDARKHLAADVKQTEDFVNNTDNPA
Ga0182035_108605692 | 3300016341 | Soil | MNRRLSFLALGCLLFATASFGQYAPSDSQTLQALLSEVRQLRQELRQELRTTTIAAQRSQILIYRLQGQEAAVVRASQRLGETRDKLARTRDAQMHVAADIKQTEDFVNNTDNPAAQRKEFER
Ga0182039_101070454 | 3300016422 | Soil | MNRRLSFLALGCLLFATASFGQYAPSDSQTLQALLSEVRQLRQELRQELRTTTIAAQRSQILIYRLQGQEAVVVRASQRLGETRDKLARTRDAQMHVAADIKQTEDFVNNTD
Ga0182038_102916603 | 3300016445 | Soil | MNPRSSLLALGCVLFATTCFGQSAPGDSQTLQALLSEVRQLRQELRTTTIAAQRSQILIYRLQGREASVARASQRLDEAREKLARIQDARKHLAADVKQTEDFVNNTDNPAAQRKALENRLSEFKTRLESSEGDEQKAQSQE
Ga0182038_1155073713300016445SoilMYPRSSLLALGCLLFATTCFGQSAPGDSQTLQSLLSEVRQLRQELRATTIAAQRSQILIYRLQGQEASVARASQRLDEAREKLARTRDARKHLAADFKQTEDFVNNTDNPIAQKKILENRHSELKTRLESLESNEQRYQS
Ga0187817_1025755613300017955Freshwater SedimentMNRSSLLALGLLLFSDPCFAQLAPSDSQTLQALLSEVRQMRQDLRMTTIAAQRSQILIYRLQRQEASVARASQRLDEVREKLARTQDERKHVVADVKQVEDFVNNRESGNPKESV
Ga0187778_1038948813300017961Tropical PeatlandMNRSSLFVLGLLLVPTTCFGQTTPGDSQTLQALLSEVRQLRQDLQTTTVAAERAQILIYRLQGQEAAVARASQRLDEVREKLTQIQVEQKHLATEVKQNEDF
Ga0066667_1055261423300018433Grasslands SoilMNRSGLFVLGLLLFSTACFGETIPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLEEAREKLARTQDERKHVAADVKQ
Ga0182028_1230099103300019788FenMNRSSIISPGPSLFSTACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILLYRLQGQEAAVARASQRIDEAGKSSPGIQAQ
Ga0179592_1036734413300020199Vadose Zone SoilMNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLFEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEAREKVARIQDQREHVVAEVKRFEDSLSGTVNPATQRKDIEQGVLPQLK
Ga0210383_1003059453300021407SoilMSGGGHMNRSSLLTVSLLLFSAACFGQTSQTDSQTLLALLSEVRQLRQDMRVTIIAAQRAQVLIYRLQAQEADVARESQRLDEAREKLGRTQDRRKHETAELKMVEDFIANTENSATQR
Ga0213878_1057290513300021444Bulk SoilMNPRSTLLLLAGLLFATTCFGQSAPGDSQTLQALLAEVRQLRQELRTTTIAAQRSQILSYRLQGQEASVARASQRLDEALGSNWAGEDR
Ga0247669_106811323300024182SoilMNRSSLFVLGLLVFSTSCFGQTTPGDSQTLQALLAEIRLLRQDLRTTTVAAQRSQILIYRLQGQEAAVGRASQRLDEAREKLARIQDERQHVAAEVKQTEDFLSNTEN
Ga0207693_1024945813300025915Corn, Switchgrass And Miscanthus RhizosphereMNRSSILALGFLLSLTPCFGQATPSESQTLQALLSEVRQLRQDLQTTTIAVQRAQILLYRVQGQEAAVARASQRLDGARERLAAIQDQREHVTADVKRQEDFVSNTENP
Ga0207646_1040928323300025922Corn, Switchgrass And Miscanthus RhizosphereMLSLLLFSTACFGQTTPVDSQTLQALLMTIAGQRAQILIYRLQGQEAAVARASQRLDEARDKLARIQDQRKQVVSETKRTESFISNTDNPPTQRKELEDICKCSRSWRTSNRHNSRQD
Ga0209235_106835333300026296Grasslands SoilMNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFREKLARIQDERKHVAADVKRFE
Ga0209267_102181613300026331SoilMNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFREKLARIQDERKHVAADVKRFEDSLSSSENPPTQR
Ga0209161_1000035913300026548SoilMNRQLLFVLSLLLFPTACLAQTTPNDSKTLQSLLLEVRQLRQDLQTTTIAGQRAQILIYRVQGQEAAVARASQRLDEFREKLARIQDERKHVAADVKR
Ga0209161_1004734213300026548SoilMNRSGLFVLGLLLFSTACFGETIPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLEEAREKLARTQDERKHVAADVKQQEEFISNME
Ga0209648_1007274413300026551Grasslands SoilMNRSSLVALGHLLFSAACFGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVAADIKQHEN
Ga0179593_107236433300026555Vadose Zone SoilMNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLLEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVTRASQRLDEAREKVARIQDERKHVAADVKRFEDSLSGTVSPATQRKDIEQGVLPQLKTRLESLGNQEQQLQTRE
Ga0179593_122894523300026555Vadose Zone SoilMNRSSLFVLSSLVFSTACFGQTTTGDSQTLQALLSEVRQLRQDLQTTTIAGQRVQILIYRLQGQETAVARASQRLDEAREKVARIQDQREHVVAEVKRFEDSLSGTVNPATQRKDIEQGVLPPTQDKTRIARKPRTTIANARNRGRATTSGGRGQTQ
Ga0179587_1042064813300026557Vadose Zone SoilMNRSSLFVLSFLLFSTACFGQTTPGDSQTLQALLLEVRQLRQDLQTTTIAGQRVQILIYRLQGQEAAVARASQRLDEAREKVARIQDQREHVAADVKRFEDSLSGTVNPATQRKDIEQG
Ga0208241_103464713300027297Forest SoilMNRTSLFVLGFLVFSTSCFGQTTPGDSQTLQALLTEIRLLRQDLRTTTVAAQRSQILIYRLQGQEAAVARASQRLDEARDKLARTQDERKHVAAEVKRTEDFVSNTENPATQRKELENRL
Ga0209625_106164123300027635Forest SoilMNRSSLFALVLLLLGTTCFGQTTSGDSQTLQALLLEVRQLRQDMHTTIIAAQRAQILIYRLQGQEAAVARASQRLDDIRDKLARIQDERKHVATDVK
Ga0209736_102389443300027660Forest SoilMNRSSLFVLGLLLFSAACFGQTTPADSQTLQALLSEVRQLRQDLQIRIIAGQRAQILIYRLQGQEAAVARASQRLDEAREKLARIQDERKHVAADVKRQEDFISNTQNPAAERKDVEGMLSQSKTRVESLENQE
Ga0209446_112416913300027698Bog Forest SoilMNRSSLFALGLLLFSTACYGQTASGDSQTLQALLLEVRQLRQDLQTTTVAAQRAQILLYRLQGEEAAVARASQRFEEAREKVTVIQDQREHVATNIKEYEDFI
Ga0209656_1006217543300027812Bog Forest SoilMNRSSLFVLGLLLSSAACYGQTTPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILLYRLQGQEAAVARASQRLDEAREKLAGIQAQRKYLATDV
Ga0247682_105839113300028146SoilMNRSSLLFLGLALLPPGCFGQTTPGDSQTLQALLSEVRQLRLDLQTTTIAGQRAQILIYRLQGEEAAVARASQRLDDVRDRLARTQDEK
Ga0308309_1008127243300028906SoilLLTVSLLLFSAACFGQTSQTDSQTLLALLSEVRQLRQDMRVTIIAAQRAQVLIYRLQAQEADVARESQRLDEAREKLGRIQDRRKHETAELKMVEDFIANTENSATQR
Ga0170819_1496337013300031469Forest SoilMNRSSILALGFLLSLTPCFGQTTPSESQTLQALLFEVRQLRQDLQTMTIAVQRAQILLYRSQGQEAAVARASQRLDGARERLAAIQDQ
Ga0306921_1122580513300031912SoilMNRLSLLVLGILLCSTASSGQTSSSDSQTLQGLLSEIRQLRLDLQTRIIAVQRGQILIYRLQGQEAIVARASQHLDEARDKLKKI
Ga0306926_1013247653300031954SoilMNRRLSFLALGCLLFATASFGQYAPSDSQTLQALLSEVRQLRQELRQELRTTTIAAQRSQILIYRLQGQEAVVVRASQRLGETRDKLARTRDAQMHVAADIKQTEDFVNNTDNPAAQRKE
Ga0306924_1014321143300032076SoilMNPRSSLLALGCVLFATTCFGQSAPGDSQTLQALLSEVRQLRQELRTTTIAAQRSQILIYRLQGREASVARASQRLDEAREKLARIQDARKHLAADVKQTEDFVNNTDNPAAQRKEFERRLSELKT
Ga0306924_1166067913300032076SoilMNRRLSFLALGCLLFATASFGQYAPSDSQTLQALLSEVRQLRQELRQELRTTTIAAQRSQILIYRLQGQEAAVVRASQRLGETRDKLARTRDAQMHVAADIKQTEDFVNNTDNP
Ga0307471_10136636423300032180Hardwood Forest SoilMNRRSSLLTFGFLFFSTDCFGQSAPGDSQTLQALLSEVRQLRQELRTTTIAAQRSQILIYRLQGQEASVGRASQRLEEAREKLARTQDARKHVAAEVKQSEDFVNNTENPATQRK
Ga0307510_1064501813300033180EctomycorrhizaMTRPTLLTLALLVSSTACFAQTSSPDSQALQALLSEVRQLRQDMQTTVIASQRAQILIYRLQMEEAAMERASKRTEDARDKLARIQDERRHIA
Ga0326728_1048669013300033402Peat SoilMNRSTLLVLGLLLVPTACSGQATPGDSQTLQALLSEVRQLRQDLQTTTIAAQRAQILIYRLQGQEAAVARASQRLDEAREKLAGIQAQRNYLATDIKRHEEFIGNTENPATQRKEFEERLPKLRAEL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice