NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F080173

Metagenome Family F080173

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F080173
Family Type Metagenome
Number of Sequences 115
Average Sequence Length 55 residues
Representative Sequence MKKYGSIFHILSLACACLISVGCATNQPGAATAPPPNSGHLLVYRVPNFGTDLFLVL
Number of Associated Samples 102
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 7.83 %
% of genes from short scaffolds (< 2000 bps) 7.83 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score0.31

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (93.043 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(23.478 % of family members)
Environment Ontology (ENVO) Unclassified
(33.043 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(56.522 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 23.53%    β-sheet: 0.00%    Coil/Unstructured: 76.47%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.31
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 115 Family Scaffolds
PF06742DUF1214 19.13
PF14023DUF4239 16.52
PF01914MarC 4.35
PF13505OMP_b-brl 2.61
PF06863DUF1254 1.74
PF08495FIST 1.74
PF09865DUF2092 0.87
PF01728FtsJ 0.87
PF02518HATPase_c 0.87

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 115 Family Scaffolds
COG5361Uncharacterized conserved proteinMobilome: prophages, transposons [X] 20.87
COG5402Uncharacterized protein, contains DUF1214 domainFunction unknown [S] 19.13
COG2095Small neutral amino acid transporter SnatA, MarC familyAmino acid transport and metabolism [E] 4.35
COG3287FIST domain protein MJ1623, contains FIST_N and FIST_C domainsSignal transduction mechanisms [T] 1.74
COG4398Small ligand-binding sensory domain FISTSignal transduction mechanisms [T] 1.74
COG029323S rRNA U2552 (ribose-2'-O)-methylase RlmE/FtsJTranslation, ribosomal structure and biogenesis [J] 0.87
COG1189Predicted rRNA methylase YqxC, contains S4 and FtsJ domainsTranslation, ribosomal structure and biogenesis [J] 0.87


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A93.04 %
All OrganismsrootAll Organisms6.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005167|Ga0066672_10096105All Organisms → cellular organisms → Bacteria1799Open in IMG/M
3300005331|Ga0070670_101665185All Organisms → cellular organisms → Bacteria587Open in IMG/M
3300005536|Ga0070697_101681807All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300006237|Ga0097621_101524036All Organisms → cellular organisms → Bacteria635Open in IMG/M
3300010048|Ga0126373_13288342All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium503Open in IMG/M
3300010373|Ga0134128_11357747All Organisms → cellular organisms → Bacteria783Open in IMG/M
3300010376|Ga0126381_103508452All Organisms → cellular organisms → Bacteria616Open in IMG/M
3300013102|Ga0157371_10912734All Organisms → cellular organisms → Bacteria666Open in IMG/M
3300028824|Ga0307310_10720680Not Available512Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil23.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil11.30%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.57%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil8.70%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.96%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment3.48%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.61%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.61%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.61%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.74%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere1.74%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.74%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.87%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.87%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.87%
TerrestrialEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial0.87%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.87%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.87%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.87%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.87%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.87%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.87%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.87%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.87%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.87%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.87%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.87%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.87%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.87%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.87%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.87%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001139Soil microbial communities from Great Prairies - Wisconsin, Switchgrass soilEnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005290Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Rhizosphere Soil Replicate 1: eDNA_1Host-AssociatedOpen in IMG/M
3300005293Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005331Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaGHost-AssociatedOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009092Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300012022Terrestrial microbial communites from a soil warming plot in Okalahoma, USA - C6EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013102Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaGHost-AssociatedOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300020012Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s1EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c1EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300028717Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031858Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2EnvironmentalOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J12803_10134346713300000955SoilMKKYGSIIQVLSLACAGIILIGCGTTTQPGAATVPPPKNAGHLLVYRVANFGSNLVLVLSID
JGI1027J12803_10265564933300000955SoilIHVLSLACAGIILIGCGTTTQPAAATAPPPKNSGHLLVYRVANFGSNL
JGI10220J13317_1045650513300001139SoilMKKYGSIVHILSLACVCLISVGCATNQPAAATAPPNSGHLLVYRVPNSGTGIFLVLSVDGKHVGSF
Ga0062591_10153908623300004643SoilMKKYYSIICILTLACAGIFLVGCETTQSGAAAAPPPPNSARLLVNRVANFGSDL
Ga0066672_1009610513300005167SoilMKKYGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSVDGKDVGSFTEGRNY
Ga0066673_1018338633300005175SoilMKKYGSIFHILSLACACLISVGCATNQPGAATAPPPNSGHLLVYRVPNFGTDLFLVL
Ga0066673_1069610333300005175SoilMKKYSSIVYILSLAFACLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLFLVLSVDGKDVG
Ga0066684_1092558423300005179SoilMKKYSSIVCILSLAFACLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLFLVLSVDG
Ga0066675_1110581113300005187SoilMKKYSSIIHILSLACACLIAGCATQPGAATAPPPNSGHVL
Ga0065712_1004092413300005290Miscanthus RhizosphereMKKYHSIICILTLACIGLMMVGCETTQQGASAPPPPNSARLLVNRVANFGSD
Ga0065715_1069651823300005293Miscanthus RhizosphereMKKYNSIIYILSLACACLIAVGCTTGPGAATAPPPNSGHVLIYRVANFGENLGLVVSVD
Ga0065705_1039711913300005294Switchgrass RhizosphereMKKYYSIICILILACIGLIMVGCETTQSGAAAAPPPPNSARLLVNRVA
Ga0070670_10166518513300005331Switchgrass RhizosphereMKKHGSIVHILSLACACAISVGCATNQPGAATAPPNSGHLLVNRVANFG
Ga0070660_10124315923300005339Corn RhizosphereMKKHGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSI
Ga0070711_10027322513300005439Corn, Switchgrass And Miscanthus RhizosphereMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLLVNRAANFGTNLGLILSVDGKDVGSFPE
Ga0070711_10181914423300005439Corn, Switchgrass And Miscanthus RhizosphereMKKHGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVNRAANFGTNLGLILSVDGKDVGSFPE
Ga0070708_10065471233300005445Corn, Switchgrass And Miscanthus RhizosphereMKRYKSIIHILSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGS
Ga0066686_1047018733300005446SoilMKKHSLIIHTLNLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGSNLSLV
Ga0066682_1039357923300005450SoilMKKYGSIFHILSLACACLISVGCATNQPGAATAAPPNSGHLLVYRVPNFGTDLFLV
Ga0066687_1045376313300005454SoilMGASKGVRNMKKSRSIVHILSLACACLIGAGCATQPGAATAQPPNSGHVLIYR
Ga0070707_10176420523300005468Corn, Switchgrass And Miscanthus RhizosphereMKKYYSIIYILSLACAGLIFTGCETTQSGAATAPPPNSGHLLIYRVANFGSDLSLV
Ga0070697_10168180713300005536Corn, Switchgrass And Miscanthus RhizosphereMKRYSSIIHILGLACVCLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSIDGKDVGSFTEG
Ga0066697_1077836923300005540SoilMKKYGSIVQILSLACACAISVGCATSQPGAATAPPNSGHLLVYRVPNFGTDLFLVLSVDGKDVG
Ga0070693_10148817213300005547Corn, Switchgrass And Miscanthus RhizosphereMKKHGSIVHILSLACACAISVGCATNQPGAATAPPNSGHLLVNRVANFGTDLFVVLSVDGKDV
Ga0066701_1026575413300005552SoilMKKYGSIFHILSLACACLISVGCATNQPGAATAAPPNSGHLLVYRVPNFGTDLFLVLSVDGKDVGSF
Ga0066661_1009361443300005554SoilMKKSRSIVHILSLACACLIGAGCATQPGTATAQPPNSGHVLIYR
Ga0066661_1020848813300005554SoilMKKYGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSVDGKDVGSF
Ga0066670_1091093823300005560SoilMKKYCSIIYIFSLACACLIAVGCATQPGAATAPPPNAGHLIITRVANFG
Ga0066703_1005783313300005568SoilMKKYSLIIHTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGT
Ga0066705_1045164913300005569SoilMKKYGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSV
Ga0066705_1066179823300005569SoilMKKYGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSV
Ga0066694_1019802033300005574SoilMKKYGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVNR
Ga0066708_1070778333300005576SoilMKRYSSIIHILNLACACLIFTGCATNQPGAATAPPNSGHLLVYRVPNFGTDLFLVL
Ga0066706_1125792613300005598SoilMKKYCSIINILSLAFACLIAAGCATQPGAATAPPPNSGRLLINRVANFGSDLSLVVSVDG
Ga0070717_1079332213300006028Corn, Switchgrass And Miscanthus RhizosphereMKKHGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVNRAANFGTNLGLI
Ga0097621_10152403623300006237Miscanthus RhizosphereMKKHGSIVHILSLACACAISVGCATNQPGAATAPPNSGHLLVNRVANFGSDLFLVLSVDGKDVGSF
Ga0075424_10079792923300006904Populus RhizosphereMKSYNVIIGILTLACAGMVLVGCETTQSGAAAAPPPNSGHLLIDRVANFGSNM
Ga0075424_10174118923300006904Populus RhizosphereMKKYGSIVYILSLAYACLIAAGCTTGPGAASAPPPNSGQVLINRVANFGSNLSLVVS
Ga0075435_10137913823300007076Populus RhizosphereMKKHGSIVHILSLACACAISVGCATNQPGAATAPPNSGHLLVNRVANFGTDLFVVLSVDG
Ga0099791_1006626533300007255Vadose Zone SoilMKKYSLIIHTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGTNL
Ga0105250_1018329513300009092Switchgrass RhizosphereMKKYYSIICILTLACAGLFLVGCETTQSGAATAPPPNSGHVLIYRVANFGENLAL
Ga0066709_10169803923300009137Grasslands SoilMKNNKTIIQFLGLACACLISVGCATNQPGAATASPPNSGHLLVYRVANFGTDLSLILSVDGKDVG
Ga0105241_1013971113300009174Corn RhizosphereMKKFGSTIQILSLGCACLIGVGSPTQSGAATAPPNSGHLLVYRAAKFGDRLNLALSVDGK
Ga0105242_1142530523300009176Miscanthus RhizosphereMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLLVNRAANFGTNLGL
Ga0126384_1101537313300010046Tropical Forest SoilMKKYSLIIQTLSLACACLISAGCASQPAGSASAPPPPNSARLVVDRIANFGTYVNLVL
Ga0126384_1219862223300010046Tropical Forest SoilMKKYGSIIHILSLTCACLITAGCATQPGAATAPPPPNSGRLLVNRVANFGSDLSLV
Ga0126373_1328834223300010048Tropical Forest SoilMKKYGSIIHVLSLACAGIILIGCSTTQQSTATGPPPPNSGRLIVDREANFGSGLVLVLSIDGKDVA
Ga0134070_1019529813300010301Grasslands SoilMKRYSLIIHTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGSNLSLVVSV
Ga0134070_1043737513300010301Grasslands SoilMKKYSSIIHILSLACACLIAGCATQPGAATAPPPNSGHLIVTRV
Ga0134065_1006973613300010326Grasslands SoilMTTYRSIFHILSLACACLISVGCATNQPGAATAAPPNSGHLLVYRVANFGT
Ga0134065_1014425713300010326Grasslands SoilMKKYCSIINILSLAFACLIAAGCATQPGAATAPPPN
Ga0134111_1053442513300010329Grasslands SoilMKKHSLIIHALSLACAGLIAGCATNQPGAATAPPPNSGHVIINRDANFGSNLS
Ga0134063_1026299313300010335Grasslands SoilMKKYGSIFHILSLACACLISVGCATNQPGAATAAPPNSGHLLVYRVPNFGTDLFLVLSVD
Ga0126372_1303481413300010360Tropical Forest SoilMKKYSSIINVLTLACACLITAGCATQPGAATAPPPNSGHVIINRVA
Ga0126377_1222041723300010362Tropical Forest SoilMKRYNSIVNVFSLACACLIAAGCSTQSGTANAPPPPNSGRL
Ga0134066_1002423933300010364Grasslands SoilMKKSRSIVHILSLACACLLGAGCATQPGAATAQPPNSGHVLIYRV
Ga0126379_1011713013300010366Tropical Forest SoilMKKYSSIISILSLACACLIAVGCATQPGAATAPPPNSGRVIINRVANFGADL
Ga0134128_1135774713300010373Terrestrial SoilMKKYYPIICILTLACAGMALVSCETTQSGAAAAPTNSGHVLIYRVANFGADMALVVSVDGKDVGS
Ga0126381_10350845213300010376Tropical Forest SoilMKKDRSITYILSLACAGLILSACETTGPGAASAPPPNSGHVIITRVPNFGSDLSLVVSV
Ga0126383_1177718123300010398Tropical Forest SoilMKKHIKTIYILSVACACLISVGCTTGPGAATAPPPNSGHVLIYRVANFG
Ga0120191_1010306123300012022TerrestrialMKKYGSIINILSLACACLLAVGCTTGPGAARAPPNSGHVLIY
Ga0137383_1054071133300012199Vadose Zone SoilMKKYCSIIYILSLACACLIAVGCATQPGAATAPPPNAGHLIITRVANFGSNLSLVVS
Ga0137382_1017649913300012200Vadose Zone SoilMKKSRSIVHILSLACACLIGAGCATQPGTATAQPPNSGHVLIYRVPNFGSNLSL
Ga0137363_1141325713300012202Vadose Zone SoilMKKYASIIHILSLACACLIAGCATQPGAAIAPPNSGHLIVTR
Ga0137377_1046945513300012211Vadose Zone SoilMKKHGSIVQILSLACACAISVGCATSQPGAATAPPNSGHLLVYRVPNFGTDLFLVLS
Ga0137377_1067416933300012211Vadose Zone SoilMKKYSSIIHILSLACACLIAGCATQPGAATAPPNSGHVLIYRVANFGSNLS
Ga0137366_1066992723300012354Vadose Zone SoilMKKHSLITHTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGSNLSLVVSVD
Ga0137384_1058575623300012357Vadose Zone SoilMKNHKTIIQFLGLVCACLISAGCATNQPGAATAPPPNSGHLL
Ga0137360_1166915433300012361Vadose Zone SoilMKKYGSIIHILSLACAGIILIGCETTQQGAATAPPPKNSGHLLINRVANFGSNMSLVV
Ga0137394_1115202423300012922Vadose Zone SoilMKKYGSIVQILSLACVCAISVGCATNQPGAATAPPNSGHLLVYRVANFGTDLGLILSVDG
Ga0134076_1027838913300012976Grasslands SoilMKKHSLIIHTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGSNLSMVVSVDG
Ga0134076_1039949723300012976Grasslands SoilMKRYSSIIYILSLACIGLILVACQTTGPGAATAPPPNSARLLIYRVANFGSNMA
Ga0164307_1098942723300012987SoilMKKHGSIGLILSLACACAISAGCATNQHGAATAPPNSGHLLVYRVPNFGTDIGLILSIDGKDVGSFTE
Ga0164306_1099781323300012988SoilMKKHGLIVHILSLACVCLISVGCATNQHGAATAPPNSGHLLVYRVANFGTDLGLILSIDGKDVG
Ga0164305_1042407423300012989SoilMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHL
Ga0164305_1178534713300012989SoilMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLL
Ga0157371_1091273413300013102Corn RhizosphereMKKHGSIVHILSLACACAISVGCATNQPGAATAPPNSGHLLVNRVANFGTDL
Ga0157374_1122678723300013296Miscanthus RhizosphereMKKHGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVNRAANFGTNLGLILSVDGKDVGSFP
Ga0134085_1029781113300015359Grasslands SoilMKKYCSIIYIFSLACACLIAVGCATQPGAATAPPPNSGHVLI
Ga0132256_10339406823300015372Arabidopsis RhizosphereMKKHGSIVLILSLACACVISVGCATNQPSAATAPPNSGHLLVNRVPNFGTDLGLILSI
Ga0132257_10421755923300015373Arabidopsis RhizosphereMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLLVNRAANFGTNLGLILSVDGKDVGS
Ga0134083_1004867913300017659Grasslands SoilMKKYSSIVYILSLACAGLIAGCATNQPGAVTAPPPNSGHVIINR
Ga0163161_1032546423300017792Switchgrass RhizosphereMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLLVNR
Ga0184604_1007978413300018000Groundwater SedimentMKKYNSIICILTLACVGIGLVGCETTQSGAAAAAPPNSGRVLINRVANFGADMALV
Ga0184608_1012328133300018028Groundwater SedimentMKKYGSIIHILSLACVCLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLFLVLSV
Ga0184608_1026946213300018028Groundwater SedimentMKKYSSIIYILSLACAGIILSGCETTQSGAATAPIPANSGRLTVTRVANFGTDLSLVL
Ga0184617_124566113300018066Groundwater SedimentMKKYGSIIHILSLACVCLISVGCATNQPGAATAPPNSGHLLVNRVPNF
Ga0066669_1225791423300018482Grasslands SoilMKKYGSIFHILSLACACLIAVGCATNQPGAATAAPPNSGHLLVYRVPNFGTDLFLV
Ga0173479_1060337623300019362SoilMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLLVNRAANFGTNLGLILSVDGKDVG
Ga0193732_107069713300020012SoilMKKYSSIIHILSLACVCLIFVGCATNQPGAATAPPNSGHLLVNRVPNFGTDLF
Ga0193726_121531523300020021SoilMKKYSSIIYILSLACACLISVGCATNQPGAATAPPPNSARLLVNRVANFGSNLVLI
Ga0210381_1040333723300021078Groundwater SedimentMKKHGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVHRVANFGT
Ga0193695_109486423300021418SoilMKKYSSIIHILSLACACLIGGCATQPGAATAPPPNSGHLIVTRVA
Ga0222622_1086093013300022756Groundwater SedimentMKKHGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVNRVPNFG
Ga0207663_1113290213300025916Corn, Switchgrass And Miscanthus RhizosphereMKKHGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVNRAANFGTNLGLILSV
Ga0207681_1131334323300025923Switchgrass RhizosphereMKKHGSIVLILSLACACVISVGCATNHPGAATAPPNSGHLLVNRAANFGTNLGLILSVDGKDVGSFPEGQ
Ga0207664_1172015213300025929Agricultural SoilMKKHGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDIGLILSIDGKDVGSFT
Ga0209234_108004213300026295Grasslands SoilMKKYGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLFLVVSVDGKDVGSFTEG
Ga0209472_116968313300026323SoilMKKYGSIFHILSLACACLISVGCATNQPGAATAPPPNSGHLLVYRVPNFGTDLFLVLS
Ga0209152_1033842723300026325SoilMKKYGSIVHILSLACVCLISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLFLVLSVDGKDVGSFSE
Ga0209802_101715713300026328SoilMKRYSSIIHILSLACACLIAGCATQPGAATAPPNSGHVLIY
Ga0209803_116335923300026332SoilMKKYGSIVQILSLACACVISVGCATNQPGAATAPPNSGHLLVYRVPNFGTDLGLILSVDG
Ga0209158_127079513300026333SoilMKKYSLIIHTLSLACAGLIAGCATNQPGAVTAPPPNSGHVIINRVANFGTNLSL
Ga0257162_104258713300026340SoilMKKHGSILQILSLACACAISVGCATNQPGTATAPPNSGHLLVYRVPNFGTDLFLVVSID
Ga0209160_123779813300026532SoilMKKYSLIIHTLSLACAGLIAGCATNQPGAVTAPPPNSGHVIINRVANFGTNLSLVVSVDG
Ga0209376_114598123300026540SoilMKKYCSIINILSLAFACLIAAGCATQPGAATAPPPNSGRLLINRVANFGSDLSLVVSV
Ga0307298_1019391913300028717SoilMKKYYSIICILSLACACLISVGCATNQPGAATAPPPNSARLLVNRVANFGSNL
Ga0307292_1046054213300028811SoilMKKHGSIVHILSLACVCLITVGCATNQPGAATAPPNSGHLLVNRVPNFGTDLFLVLSV
Ga0307310_1072068023300028824SoilMKKYYSIIYILGLACACLISVGCATNQPGAATAPPPNSARLLVNRVANFGSNLVLILSVDGKD
Ga0307312_1082952813300028828SoilMKKYSSIVQILSLACVCAISVGCATNQPGAATAPPNSGHLLVNRVPNFGTDLFLVLSVDGKD
Ga0307277_1005014513300028881SoilMKKHGSIVHILSLACVCLITVGCATNQPGAATAPPNSGHLLVNRVPNFGTDLFLVLSVDG
Ga0170824_10759053313300031231Forest SoilMKKYGSIIQVLSLACAGIILIGCGTTTQPAAATAPPPKNAGHLLVYRVANFGSG
Ga0310892_1094060723300031858SoilMKKYYSIICILTLACAGIFLVGCETTQSGAAAAPPPPNSARLLVNRVANFGSDLVL
Ga0310912_1096684113300031941SoilMKKHSLIIPTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGSNLSLVV
Ga0310914_1040628013300033289SoilMKKHSLIIPTLSLACAGLIAGCATNQPGAATAPPPNSGHVIINRVANFGS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.