NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F098817

Metagenome Family F098817

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F098817
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 56 residues
Representative Sequence AKVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN
Number of Associated Samples 90
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 4.85 %
% of genes from short scaffolds (< 2000 bps) 5.83 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (94.175 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(9.709 % of family members)
Environment Ontology (ENVO) Unclassified
(34.951 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.485 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 12.50%    β-sheet: 31.25%    Coil/Unstructured: 56.25%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF00005ABC_tran 85.44
PF02445NadA 0.97
PF00487FA_desaturase 0.97
PF0563523S_rRNA_IVP 0.97
PF01078Mg_chelatase 0.97
PF00111Fer2 0.97
PF12698ABC2_membrane_3 0.97
PF00437T2SSE 0.97
PF00561Abhydrolase_1 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG0379Quinolinate synthaseCoenzyme transport and metabolism [H] 0.97
COG1398Fatty-acid desaturaseLipid transport and metabolism [I] 0.97
COG3239Fatty acid desaturaseLipid transport and metabolism [I] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A94.17 %
All OrganismsrootAll Organisms5.83 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005347|Ga0070668_101091214All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium720Open in IMG/M
3300005436|Ga0070713_101637476All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium625Open in IMG/M
3300018052|Ga0184638_1120589All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium958Open in IMG/M
3300021073|Ga0210378_10396830All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium512Open in IMG/M
3300025972|Ga0207668_11720587All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium566Open in IMG/M
3300031720|Ga0307469_11185844All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium721Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil9.71%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.80%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere6.80%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.85%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil4.85%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.85%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.85%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere3.88%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands2.91%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.91%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil2.91%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland2.91%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.91%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.91%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere2.91%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere2.91%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.94%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.94%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.94%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil1.94%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.94%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.94%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.97%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil0.97%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.97%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.97%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.97%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.97%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.97%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.97%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.97%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300003371Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S PMHost-AssociatedOpen in IMG/M
3300004013Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005206Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005331Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaGHost-AssociatedOpen in IMG/M
3300005344Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaGHost-AssociatedOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005367Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaGHost-AssociatedOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005438Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaGEnvironmentalOpen in IMG/M
3300005455Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaGHost-AssociatedOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005539Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2Host-AssociatedOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005616Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2Host-AssociatedOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 (version 2)Host-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009814Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60EnvironmentalOpen in IMG/M
3300010029Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_10_20EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012951Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013832Permafrost microbial communities from Nunavut, Canada - A3_5cm_0MEnvironmentalOpen in IMG/M
3300014320Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300015162Arctic soil microbial communities from a glacier forefield, Storglaci?ren, Tarfala, Sweden (Sample st-4c, rock/ice/stream interface)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017947Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_4_20_MGEnvironmentalOpen in IMG/M
3300018032Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MGEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018060Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018070Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021413Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c1EnvironmentalOpen in IMG/M
3300025567Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025911Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025986Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300027395Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027462Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300030336Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1EnvironmentalOpen in IMG/M
3300031543Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f20EnvironmentalOpen in IMG/M
3300031640Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031799Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21EnvironmentalOpen in IMG/M
3300031805Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23EnvironmentalOpen in IMG/M
3300031860Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f25EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033416Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_OW2_C1_D5_CEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI26145J50221_100641623300003371Arabidopsis Thaliana RhizosphereDARVRARVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAAEGLPIVSGQTFVVEAGAN*
Ga0055465_1012729823300004013Natural And Restored WetlandsPQARPNAPFQAKIVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPVNATVVSGQTFVVSAGRETPRV*
Ga0066685_1071846413300005180SoilARARARVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAH*
Ga0068995_1010747423300005206Natural And Restored WetlandsVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPLNAGIVSGQTFVVEAAGGPPKA*
Ga0070670_10136892423300005331Switchgrass RhizosphereRFQAKIVSIKPRSEFANRRNWGVQSRDLQTFSVRLQPVNANVVSGQTFVVSAGSETPRA*
Ga0070661_10010644033300005344Corn RhizosphereVSIKPRAEYASRKNWGLQSRDLLTFSVRLQPINAPVVSGQTFVVEVGGQG*
Ga0070668_10097443613300005347Switchgrass RhizosphereKIVSIRPRSEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG*
Ga0070668_10109121423300005347Switchgrass RhizosphereAFTLGKTVEIWPQARPDARARAKVVSIRPRSEFATRRNWGIRERDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0070668_10203459623300005347Switchgrass RhizosphereGKTVEIWPQARPGTHSRARVVSIRPRSEFATRRNWGLQSRDLRTFSVRLVPEGLAVVSGQTFVVEAGAN*
Ga0070667_10139197013300005367Switchgrass RhizosphereARAKVVSIRPRSEFATRRNWGIRERDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0070713_10163747623300005436Corn, Switchgrass And Miscanthus RhizosphereKTVRVWPQARASGQSFPARVLSIKPRSEFATRRNWGMQSRDLKTFSVRLKPEGAPVVSGQTFVVEAGS*
Ga0070701_1016345423300005438Corn, Switchgrass And Miscanthus RhizospherePRSEFATRKNWGLQSRDLQTFSVRLQPLNATVVSGQTFVVEAGSEPPKA*
Ga0070663_10039691023300005455Corn RhizosphereRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0068867_10165039113300005459Miscanthus RhizosphereVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0068853_10181128223300005539Corn RhizosphereAKVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0070732_1078376023300005542Surface SoilPAKVLSIKPRSEFATRRNWGLQSRDLKTFSVRLKPEGAPVVSGQTFVVEAGS*
Ga0070704_10222522123300005549Corn, Switchgrass And Miscanthus RhizosphereSIKPRAEYASRKNWGLQSRDLLTFSVRLQPINASVVSGQTFVVEVGGQG*
Ga0068852_10192096413300005616Corn RhizosphereFATRRNWGIRERDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0066696_1091207813300006032SoilARIVSIKPRSEFATRKNWGLHSRDLLTFSVRLQPLQAPVIAGQTFVVEAGSGPGV*
Ga0075425_10258490713300006854Populus RhizosphereARPEGAFFARVVSVKPRSEFATRKNWGIQSRDLKTFSVRLAPLATTVISGQTFVVEAIKT
Ga0075425_10301336113300006854Populus RhizosphereDASFFARITSIKPRSEFATRRNWGLQSRDLKTFSVRLVPEGEGVVAGQTFVVEAGKR*
Ga0079215_1012487513300006894Agricultural SoilRPRSEFATRRNWGMQSRDLRTFSVRLAPEGLPIVSGQTFVVEAGS*
Ga0075424_10048374613300006904Populus RhizosphereKSVEVWPQARPDAKTRARIVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGSAVVSGQTFVVEAGAN*
Ga0097620_10203752423300006931Switchgrass RhizospherePNARFQAKIVSIKPRSEFANRRNWGVQSRDLQTFSVRLLPVNATVVSGQTFVVSAGSETPRA*
Ga0114129_1243184813300009147Populus RhizosphereRFQAKIVSIKPRSEFANRRNWGVQSRDLQTFSVRLLPVNATVVSGQTFVVSAGSETPRA*
Ga0105243_1106122013300009148Miscanthus RhizosphereVSIKPRSEFANRRNWGVQSRDLQTFSVRLLPVNATVVSGQTFVVSAGSETPRA*
Ga0075423_1151511113300009162Populus RhizosphereKVVSIRPRSEFATRRNWGIRERDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0105082_111035813300009814Groundwater SandFATRRNWGLQSRDLKTFSVRLAPQGASVIAGQTFVVEAGKS*
Ga0105074_104800313300010029Groundwater SandAKIVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPVAATVVSGQTFVVEAGTETPRA*
Ga0126384_1162381013300010046Tropical Forest SoilPRSEFATRKNWGLQSRDLKTFSVRLVPQGTTIISGQTFVVEAGRSQS*
Ga0126384_1198379023300010046Tropical Forest SoilRAEFATRKNWGLQSRDLKTFSVRLAPQNTSVISGQTFVVEAGRS*
Ga0099796_1033458213300010159Vadose Zone SoilAARVVSVKPRAEFATRKNWGLQSRDLKTFSVRLAPQNAAVISGQTFVVEAGRN*
Ga0126370_1036962913300010358Tropical Forest SoilKVVTVKPRAEFATRKNWGLQSRDLKTFSVRLAPQNTSVISGQTFVVEAGRS*
Ga0126379_1211487923300010366Tropical Forest SoilKPRAEFATRKNWGLQSRDLKTFSVRLAPQNIAVISGQTFVVEAGRN*
Ga0134128_1321264223300010373Terrestrial SoilRSEFATRRNWGLNDRDLKTFSVRLAPQGTSVVSGQTFVVEAGKS*
Ga0126383_1142177313300010398Tropical Forest SoilSIKPRSEFATRKNWGIQSRDLQTFSVRLQPLDASVVAGQTFVVEAGGAHPKA*
Ga0134121_1066318913300010401Terrestrial SoilKPRAEFATRKNWGLHSRDLKTFSVRLSPQNSGVISGQTFVVEAGRN*
Ga0137374_1103522713300012204Vadose Zone SoilAARVVTIKPRAEFATRKNWGLQSRDLKTFSVRLAPQNAGVISGQTFVVEAGRN*
Ga0137374_1110327623300012204Vadose Zone SoilRFAARVVTIKPRAEFATRKNWGLQSRDLKTFSVRLAPQNAGVISGQTFVVEAGRN*
Ga0137379_1024926233300012209Vadose Zone SoilFARVTSIKPRSEFATRRNWGLQSRDLKTFSVRLVPEGEGVVAGQTFVVEAGKH*
Ga0137379_1077544523300012209Vadose Zone SoilVTIKPRAEFATRKNWGLQSCDLKTFSVRLAPQNAGVISGQTFVVEAGRN*
Ga0137378_1068225513300012210Vadose Zone SoilIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0137366_1107850923300012354Vadose Zone SoilTVKPRAEFATRKNWGLQSRDLKTFSVRLAPQNTAVVSGQTFVIEAGRLKD*
Ga0137366_1113348713300012354Vadose Zone SoilARFAARVVTVKPRAEFATRKNWGLQSRDLKTFSVRLAPQNTSVISGQTFVVEAGRN*
Ga0137368_1019665233300012358Vadose Zone SoilSIRPRSEFATRRNWGLQSRDLRTFSVRLVPEGLPVVSGQTFVVEAGTN*
Ga0164300_1093236813300012951SoilIVSIKPRSEFATRKNWGLQSRDLLTFSVRLQPLNESVVSGQTFVVEVGRQG*
Ga0164302_1086407023300012961SoilFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0164308_1044408513300012985SoilNSRFQAKIVSIRPRSEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG*
Ga0164307_1089971823300012987SoilDARIAARVVSVKPRAEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG*
Ga0163162_1140836623300013306Switchgrass RhizosphereFQAKIVSIRPRSEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG*
Ga0120132_114962113300013832PermafrostVRVWPQARVGESFPARVISIKPRSEFATRRNWGMQSRDLKTFSVRLKPEGVPLVSGQTFVVEAGS*
Ga0075342_124262323300014320Natural And Restored WetlandsRFQAKIVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPVDATVVSGQTFVVEAGSETPRA*
Ga0167653_102233613300015162Glacier Forefield SoilVVSVKPRAEFATRKNWGLQSRDLKTFSVRLAPQNSAVISGQTFVVEAGRN*
Ga0132258_1287162623300015371Arabidopsis RhizosphereRPDARARAKVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN*
Ga0132256_10123473423300015372Arabidopsis RhizosphereRSEFATRRNWGLQSRDLKTFSVRLAPQGAAVIAGQTFVVEAGKS*
Ga0132255_10469614923300015374Arabidopsis RhizosphereRSEFATRKNWGLQSRDLQTFSVRLQPLKEGAVVSGQTFVVETGSGPPKA*
Ga0132255_10473547913300015374Arabidopsis RhizosphereQARGGSPFPARVISIKPRSEFATRRNWGLQSRDLKTFSVRLKPEGGAAVPGQTFVVEAGS
Ga0187785_1063481013300017947Tropical PeatlandPDAPFSARIMSIKPRSEYATRRNWGLQSRDLKTFSVRLQPIDVKVVSGQTFVVEAGKS
Ga0187788_1016603313300018032Tropical PeatlandPDARFQARVVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPLNGSVVSGQAFVVEAGSGPPKA
Ga0184638_112058913300018052Groundwater SedimentMPAQVPGKKVEIWPQARPDTRATARVVSIRPRSEFATRRNWGMQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGS
Ga0187765_1067744913300018060Tropical PeatlandTRFQAKIVSIKPRSEFATRKNWGIQSRDLQTFSVRLQPLNGSVVSGQTFVVEAGSGPPGA
Ga0184631_1027528313300018070Groundwater SedimentRSEFATRRNWGLQSRDLRTFSVRLVPEGLPVVSGQTFVVEAGAN
Ga0184632_1020071613300018075Groundwater SedimentQARPDVRAHARVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGA
Ga0210378_1039683023300021073Groundwater SedimentVGKSVEIWPQARPDARARARVVSIRPRSEFATRRNWGLQSRDLRTFSVRLVPEGLPVVAGQTFVVEAGAN
Ga0193750_104824213300021413SoilWPQARPDASFFARVTSIKPRSEFATRRNWGLQSRDLKTFSVRLVPEAEGVVAGQTFVVEAGKP
Ga0210076_106602813300025567Natural And Restored WetlandsPQARPNAPFQAKIVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPVNATVVSGQTFVVSAGRETPRV
Ga0207647_1071792313300025904Corn RhizosphereSVKPRAEFATRKNWGLQSRDLKTFSVRLAPQNAAVISGQTFVVEAGRN
Ga0207654_1077592013300025911Corn RhizosphereVTVWPQARPDSRFQARVVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPLNATVVSGQTFVVEAGSEPPKA
Ga0207646_1017281433300025922Corn, Switchgrass And Miscanthus RhizosphereQGKRVRVWPQARPEAPFFARIVSVKPRSEFATRKNWGLQSRDLKTFSVRLAPEGQGVVSGQTFVVEAGKT
Ga0207646_1086182523300025922Corn, Switchgrass And Miscanthus RhizosphereRVRVWPQARPDASFFARVTSIKPRSEFATRRNWGLQSRDLKTFSVRLVPEGEGVVAGQTFVVEAGKH
Ga0207706_1132688713300025933Corn RhizosphereNSRFQAKIVSIRPRSEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG
Ga0207709_1055074213300025935Miscanthus RhizosphereQAKIVSIKPRSEFANRRNWGVQSRDLQTFSVRLLPVNATVVSGQTFVVSAGSETPRA
Ga0207665_1045262523300025939Corn, Switchgrass And Miscanthus RhizosphereRGGSPFPARVLSIKPRKEFATRRNFGLQSRDLKTFSVRLKPEGGGVAVAGQTFVVEAGS
Ga0207665_1103928023300025939Corn, Switchgrass And Miscanthus RhizosphereARIVSIKPRSEFATRKNWGLQSRDLLTFSVRLQPLNESVVSGQTFVVEVGRQG
Ga0207668_1172058723300025972Switchgrass RhizosphereAFTLGKTVEIWPQARPDARARAKVVSIRPRSEFATRRNWGIRERDLRTFSVRLAPEGLPVVSGQTFVVEAGAN
Ga0207640_1041636033300025981Corn RhizosphereARAKVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVEAGAN
Ga0207658_1003203313300025986Switchgrass RhizospherePNSRFQAKIVSIRPRSEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG
Ga0207703_1007902343300026035Switchgrass RhizosphereRTVMVWPQARPDARFQAKIVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPLNATVVSGQTFVVEAGSEPPKA
Ga0207648_1107965613300026089Miscanthus RhizosphereRPRSEFATRRNWGLQSRDLRTFSVRLVPEGLAVVSGQTFVVEAGAN
Ga0209805_131965423300026542SoilVEIWPQARPDARARARVISIRPRSEFATRRNWGLQSRDLRTFSVRLAPEGLPVVSGQTFVVETGAH
Ga0209996_102438513300027395Arabidopsis Thaliana RhizosphereRPDARVRARVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAAEGLPIVSGQTFVVEAGAN
Ga0210000_102865123300027462Arabidopsis Thaliana RhizosphereRVVSIRPRSEFATRRNWGLQSRDLRTFSVRLAAEGLPIVSGQTFVVEAGAN
Ga0209590_1055368613300027882Vadose Zone SoilSEFATRRNWGLQSRDLRTFSVRLQPVDVKVVSGQTFVIEAGKS
Ga0268265_1059331323300028380Switchgrass RhizospherePQARPNARFQAKIVSIKPRSEFANRRNWGVQSRDLQTFSVRLQPVNANVVSGQTFVVSAGSETPRA
Ga0247826_1021681813300030336SoilVWPQARPNSRFQAKIVSIRPRSEFATRKNWGLQSRDLLTFSVRLQPLNAPVVSGQTFVVEVGRQG
Ga0318516_1085943813300031543SoilVSVWAQARPDAPFSARIISIKPRSEYATRRNWGLQSRDLKTFSVRLQPIDVKVVSGQTFVVEAGKS
Ga0318555_1076677113300031640SoilIKPRSEFATRKNWGIQSRDLQTFSVRLQPLNGSVVSGQTFVVEAGTGPPGA
Ga0307469_1098212013300031720Hardwood Forest SoilDTRFQAKIVSIKPRSEYATRKNWGLQSRDLQTFSVRLQPLNETVVSGQTFVVEAGSGPPG
Ga0307469_1118584413300031720Hardwood Forest SoilAFPVGQLVRAWPQARPDAPFSVRIISIKPRAEFATRKNWGLQSRDLKTFSVRLAPQGTRVISGQTFVVEAGKS
Ga0318565_1064557423300031799SoilAPFSARIISIKPRSEYATRRNWGLQSRDLKTFSVRLQPIDVKVVSGQTFVVEAGKS
Ga0318497_1026924723300031805SoilVWAQARPDAPFSARIISIKPRSEYATRRNWGLQSRDLKTFSVRLQPIDVKVVSGQTFVVEAGKS
Ga0318495_1033957923300031860SoilTRFQAKVVSIKPRSEFATRKNWGIQSRDLQTFSVRLQPLNGSVVSGQTFVVEAGSGPPGA
Ga0306923_1186612923300031910SoilTRFQAKIVSIKPRSEFATRKNWGIQSRDLQTFSVRLQPLNGSVVSGQTFVVEAGTGPPGA
Ga0306922_1042712933300032001SoilVVWPQARPDTRFEAKIVSIKPRSEFATRKNWGIQSRDLQTFSVRLQPLNGTVVSGQTFVVEAGSGSPGA
Ga0307471_10082560723300032180Hardwood Forest SoilPQARPQDRFPARIVSVKPRSEFATRRNWGLQSRDLKTFSVRLAPQGATVIAGQTFVVEAGKS
Ga0307471_10380808213300032180Hardwood Forest SoilDAPFSARVISIKPRAEFATRKNWGLQSRDLKTFSVRLSPQGVRVISGQTFVVEAGSTPPTNTSTNTPTRTSTP
Ga0307471_10417181223300032180Hardwood Forest SoilSVKPRSEFATRRNWGLQSRDLRTFSVRLQPVDVKVVSGQTFVVEAGKS
Ga0307472_10100423613300032205Hardwood Forest SoilVKPRSEFATRRNWGLQSRDLRTFSVRLQPVETKVVAGQTFIVEAGKT
Ga0335084_1004326413300033004SoilPRSEFATRRNWGLQSRDLKTFSVRLAPQGARVIAGQTFVVEAGKS
Ga0316622_10236672923300033416SoilPNARFQAKIVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPVNATVVSGQTFVVQAGSETPRA
Ga0326726_1151988723300033433Peat SoilVGGSVTVWPQARPDARFAARVVTIKPRAEFATRKNWGLQSRDLKTFSVRLAPQNAGIVSGQTFVVEAGRN
Ga0326723_0552352_391_5313300034090Peat SoilKPRAEFATRKNWGLQSRDLKTFSVRLAPQNAGVVSGQTFVVEAGRN
Ga0364934_0076324_16_1803300034178SedimentVVSIKPRSEFATRKNWGLQSRDLQTFSVRLQPLNATVVSGQTFVVEAGSEPRRA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.