NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101703


Overview

Basic Information
Family ID F101703
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 47 residues
Representative Sequence LMKLVEKKRKQHKDIVEVEEPERDAGKVVDLMEVLKKSLAGKRKAA
Number of Associated Samples 93
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 16.67 %
% of genes from short scaffolds (< 2000 bps) 17.65 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.29

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
[HMM logo, rendered with Skylign]

Most Common Taxonomy
Group Unclassified (90.196 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(18.628 % of family members)
Environment Ontology (ENVO) Unclassified
(40.196 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.980 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 33.78%, β-sheet 0.00%, Coil/Unstructured 66.22%

Predicted 3D Structure

[Structure viewer: predicted 3D model colored by per-residue confidence (pLDDT); bins 0-50, 51-70, 71-90, 91-100]
pTM-score: 0.29

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF00072 | Response_reg | 8.82
PF01370 | Epimerase | 4.90
PF00293 | NUDIX | 2.94
PF03465 | eRF1_3 | 1.96
PF05598 | DUF772 | 1.96
PF00903 | Glyoxalase | 1.96
PF00664 | ABC_membrane | 0.98
PF01258 | zf-dskA_traR | 0.98
PF03815 | LCCL | 0.98
PF07484 | Collar | 0.98
PF13307 | Helicase_C_2 | 0.98
PF01176 | eIF-1a | 0.98
PF02735 | Ku | 0.98
PF10116 | Host_attach | 0.98
PF03734 | YkuD | 0.98
PF01344 | Kelch_1 | 0.98
PF10601 | zf-LITAF-like | 0.98
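
The frequencies above are percentages of the 102 family scaffolds, so each one can be converted back to an approximate scaffold count. The following is a minimal Python sketch, assuming each percentage is a count out of 102 rounded to two decimals. The function name `percent_to_count` and the dictionary names are hypothetical and are not part of NMPFamsDB.

```python
# Total number of scaffolds for family F101703, as stated in the table header.
FAMILY_SCAFFOLDS = 102


def percent_to_count(percent, total=FAMILY_SCAFFOLDS):
    """Convert a percentage of `total` into the nearest whole count.

    Assumption: the page reports count / total * 100 rounded to two decimals.
    """
    return round(percent / 100 * total)


# A few Pfam rows copied from the table above (name -> % frequency).
pfam_percent = {
    "Response_reg": 8.82,
    "Epimerase": 4.90,
    "NUDIX": 2.94,
    "eRF1_3": 1.96,
    "ABC_membrane": 0.98,
}

# Approximate number of family scaffolds on which each neighboring domain occurs.
pfam_counts = {name: percent_to_count(pct) for name, pct in pfam_percent.items()}

for name, count in pfam_counts.items():
    print(f"{name}: about {count} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under this assumption, a frequency of 0.98 corresponds to a single scaffold, so most of the neighboring domains listed here were each observed only once.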

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG0361 | Translation initiation factor IF-1 | Translation, ribosomal structure and biogenesis [J] | 0.98
COG1273 | Non-homologous end joining protein Ku, dsDNA break repair | Replication, recombination and repair [L] | 0.98
COG1376 | Lipoprotein-anchoring transpeptidase ErfK/SrfK | Cell wall/membrane/envelope biogenesis [M] | 0.98
COG1734 | RNA polymerase-binding transcription factor DksA | Transcription [K] | 0.98
COG3034 | Murein L,D-transpeptidase YafK | Cell wall/membrane/envelope biogenesis [M] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 90.20 %
All Organisms | root | All Organisms | 9.80 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300005438|Ga0070701_10983764 | Not Available | 587
3300005546|Ga0070696_101033686 | All Organisms → cellular organisms → Bacteria | 688
3300005569|Ga0066705_10676807 | All Organisms → cellular organisms → Bacteria | 624
3300006800|Ga0066660_11185383 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas fluorescens group → Pseudomonas fluorescens | 600
3300006904|Ga0075424_102264297 | Not Available | 571
3300007076|Ga0075435_101914747 | Not Available | 520
3300009012|Ga0066710_101926922 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 883
3300010043|Ga0126380_10374764 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1046
3300010047|Ga0126382_11966549 | Not Available | 555
3300010400|Ga0134122_12855297 | Not Available | 536
3300010403|Ga0134123_11282975 | Not Available | 767
3300010403|Ga0134123_11453663 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 728
3300011271|Ga0137393_11602619 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 540
3300012212|Ga0150985_110958955 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 619
3300012532|Ga0137373_10152357 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1950
3300031231|Ga0170824_102425114 | Not Available | 601
3300031847|Ga0310907_10456553 | Not Available | 676
3300034147|Ga0364925_0097020 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1043




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 18.63%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 18.63%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 6.86%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 5.88%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.92%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 3.92%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.92%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.92%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 3.92%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.94%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 2.94%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.96%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.96%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.96%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.96%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.98%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.98%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.98%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005184 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 | Environmental
3300005354 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG | Host-Associated
3300005355 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG | Host-Associated
3300005356 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG | Host-Associated
3300005365 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaG | Environmental
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated
3300005544 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006881 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011443 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300024186 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK29 | Environmental
3300024290 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08 | Environmental
3300025911 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes) | Host-Associated
3300025931 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes) | Host-Associated
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300025935 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes) | Host-Associated
3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated
3300025960 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes) | Host-Associated
3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026312 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031058 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031469 | Fir Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031847 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4 | Environmental
3300033412 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NC | Environmental
3300034147 | Sediment microbial communities from East River floodplain, Colorado, United States - 44_j17 | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1027J12803_1000890551 | 3300000955 | Soil | KLLKLVEKKRKQHKNIVEVDVPEQEEGKVVDLMEALKKSLAGKRKAA*
JGI1027J12803_1092217862 | 3300000955 | Soil | LKDEQTEKLLKLVARKRKQHKDVVEVEVPEQEPGKVVDLMAALKKSLAGKRRAA*
Ga0066680_105424292 | 3300005174 | Soil | EQTEKLLKLVAKKRKQHKDIVEVEAPERTEGKVVDLMAALKKSLAGKRRVA*
Ga0066690_100815901 | 3300005177 | Soil | KTEQLLKLVEKKRKQQKDIVEVEEPEHEQGKVVDLMEVLKKSLAGKRKAA*
Ga0066671_101957401 | 3300005184 | Soil | EQTEKLLKLVEKKRKQHKDVVEVEVPEREEGKVVDLMAALKKSLAGKRRAA*
Ga0070675_1013842371 | 3300005354 | Miscanthus Rhizosphere | KLLKLVERKRRQHKAVVKVEVPERADGKVIDLMAALKKSLAGKRKAA*
Ga0070671_1006129071 | 3300005355 | Switchgrass Rhizosphere | KLLKLVEKKRKQHKDLVEVELPEQEPAKVVDLMEALKKSLAGKRQAA*
Ga0070674_1019393061 | 3300005356 | Miscanthus Rhizosphere | LKLVERKRKQHKDIIEVEVPEREAGKVIDLMAALKKSLAGKRKAA*
Ga0070688_1001457691 | 3300005365 | Switchgrass Rhizosphere | KRKQHKDVVQVEVPERQEGKVIDLMAALKKSLAGKRKAA*
Ga0070701_109837642 | 3300005438 | Corn, Switchgrass And Miscanthus Rhizosphere | LKELKDEKTEQLMKLVEKKRKQHKDIVEVTEPERDGGKVVDLMEVLKKSLAGKRKAA*
Ga0066689_101523662 | 3300005447 | Soil | VEKKRKQHKDVVEVEEPEERAQGKVVDLVEVLKRSLARKQKAA*
Ga0066689_107951782 | 3300005447 | Soil | KLVEKKRKQHKDLVEVEEPEHAEGKVIDLMEVLKKSLAGKRKAA*
Ga0066681_105405381 | 3300005451 | Soil | EKKRKQHKDIVEVEVPEREEGKVVDLMAALKKSLAGKRRAA*
Ga0068867_1018947442 | 3300005459 | Miscanthus Rhizosphere | ELKDEQTGKLLKLVEKKRKQRKDIVEVEVPEEEQGKVVDLMEALKKSLAGKRKAA*
Ga0070686_1005560223 | 3300005544 | Switchgrass Rhizosphere | LVEKKRKQRKDIVEVEVPEREEGKVIDLMEVLKKSLAGKRKAA*
Ga0070696_1010336862 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | DLKDEKTAQLLKLVEKKRKQHKDVVEVEVTERDDGKVVDLMEVLKKSLAGKRKAAA*
Ga0070696_1010805001 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | KDEKTAQLLKLVEKKRKQHKDVVEVEEPDRDEGKVVDLMEVLKKSLARKRKAA*
Ga0066698_103946543 | 3300005558 | Soil | EELLKLIEKKRKQHKDVVEVEEPEEREGGKVVDLVEVLKRSLARKQKAA*
Ga0066699_106388531 | 3300005561 | Soil | VEKKRKQHKDVVEVEEPEERAQGKVVDLVEVLKQSLARKQKAA*
Ga0066705_106768072 | 3300005569 | Soil | KDLKDEQTEKLLKLVEKKRKQHKDVVEVEVPEREEGKVVDLMAALKKSLAGKRRAA*
Ga0066706_112606661 | 3300005598 | Soil | EKTEKLLKLVAKKHKQHKDLVEVEEPEREEGKVVDLMEVLKRSLAGKRKAA*
Ga0070702_1012171282 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | ADLLKLVEKKRKQHKDIIEVEAPDREEGKVIDLMEVLKKSLAGKRKAA*
Ga0068859_1016513801 | 3300005617 | Switchgrass Rhizosphere | LKLVEKKRKQHKDLVEVELPEQEPAKVVDLMEALKKSLAGKRKAA*
Ga0068861_1026079621 | 3300005719 | Switchgrass Rhizosphere | QHKDIVEVEVPERDDGKVVDLMEVLKKSLAGKRKAA*
Ga0068862_1016406661 | 3300005844 | Switchgrass Rhizosphere | KRKQHKDVVEVEEADRDEGKVVDLMEVLKKSLAGKRKAA*
Ga0070716_1015851141 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | LIEKKRARHQDVVEVDEPDRDEGKVVDIMTALKKSLAKKRKAA*
Ga0066665_105149633 | 3300006796 | Soil | EKKRKQHKDVVQVEEPEREEGKVVDLMEVLKKSLAGKRKAA*
Ga0066659_106075401 | 3300006797 | Soil | KLIEKKRARHQDVVEVEETKRDEGKVVDIMEALKKSLARKRKAA*
Ga0066660_111853832 | 3300006800 | Soil | KLVEKKRKQHKDVVEVEEPEERAHGNVVDLVEVLKKSLARKQKAA*
Ga0079220_121586361 | 3300006806 | Agricultural Soil | LMKLVEKKRKQHKDIVEVEEPERDAGKVVDLMEVLKKSLAGKRKAA*
Ga0075433_110838432 | 3300006852 | Populus Rhizosphere | QHKDIVEVEEPERDEGKVVDLMEVLKKSLAGKRKAA*
Ga0068865_1010050751 | 3300006881 | Miscanthus Rhizosphere | EKLLKLVEKKRKQHKDMVEVEVPEREPGKVVDLMEALKKSLAGKRKAA*
Ga0075424_1022642971 | 3300006904 | Populus Rhizosphere | LTHLKDVQTDQLLKLIEKKRAHHKDVVEVELPERKQEKVVDLMEVLKRSLQGKKRKTA*
Ga0079219_115481072 | 3300006954 | Agricultural Soil | RKQHKDIVEVEEPERDAGKVVDLMEVLKKSLAGKRKAA*
Ga0075435_1019147472 | 3300007076 | Populus Rhizosphere | SLQGLKDEQTEKLLKLVERKRKQHKDIVEVEVPEEEQGKVVDLMAALKKSLAGKKRAA*
Ga0066710_1019269221 | 3300009012 | Grasslands Soil | HLSLKELKDEKAADLLQLVEKKRKQHKDVVEVEEPEERAQGKVVDLVEVLKRSLARKQKA
Ga0099829_112665991 | 3300009038 | Vadose Zone Soil | LLKLVEKKRKQHKDVVEVEEPERDEGKVVDLMEVLKKSLARKRKTAA*
Ga0099830_109545051 | 3300009088 | Vadose Zone Soil | LVEKKRKQHKDVVEVEETDRDEGKVVDLVEVLKKSLAGKRKAAA*
Ga0099827_119195382 | 3300009090 | Vadose Zone Soil | ARHKDVVEVETPRRDEGKVIDLMEVLKKSLARKKRAA*
Ga0075423_102548251 | 3300009162 | Populus Rhizosphere | LVEKKRSRHKDVVEVEVPARSQEEKVVDLMEVLKRSLAGGKKKRRAS*
Ga0126380_103747642 | 3300010043 | Tropical Forest Soil | LSLKDLKDEQTEKLLKLVDRKRKQRQNVVEVEVPERKEGKVVDLMVALKKSLAGKRKAA*
Ga0126382_104817361 | 3300010047 | Tropical Forest Soil | QHKDVVEVEVPEREEGKVVDLMAALKKSLAGKRKAA*
Ga0126382_119665492 | 3300010047 | Tropical Forest Soil | PQSLKDEQTKKLLQLVERKRKQHKDVVEVEVPDQEEGKVVDLMAALKRSLAGKRKAA*
Ga0134062_101539422 | 3300010337 | Grasslands Soil | EKKRKQHKDIVEVEEPERNKGKVIDLVEVLKRSLAGKRKAAA*
Ga0126370_109566081 | 3300010358 | Tropical Forest Soil | KQAKHKDVVEVDEPAGEGERSNVVDIMEVLKKSMASKR*
Ga0126383_103377091 | 3300010398 | Tropical Forest Soil | KQHQDVIEVEVPAQEEGKVVDLMEALKKSLAGKRRAA*
Ga0134127_131380322 | 3300010399 | Terrestrial Soil | HKDIVEVDVPEQEQGKVVDLMAALKKSLAGKRKAA*
Ga0134122_128552971 | 3300010400 | Terrestrial Soil | ELKDEKTAALLKLVEKKRKQHKDIVEVEVPEREEGKVIDLMEVLKKSLAGKRKAA*
Ga0134123_112829753 | 3300010403 | Terrestrial Soil | KELKDEKTADLLKLVEKKRKQHKDIVEVEVPEREEGKVVDLMEVLKKSLAGKRKAA*
Ga0134123_114536632 | 3300010403 | Terrestrial Soil | LSELKDEQTEKLRKLVEKKRKQHKDVVEVEVPEQEPGKVVDLMEALKKSLAGKRKAA*
Ga0137391_104257302 | 3300011270 | Vadose Zone Soil | MLVEKKRQQHKDVIEVEEPERDEGKVVDLMEVLKKSLARKRKAAA*
Ga0137393_116026191 | 3300011271 | Vadose Zone Soil | ELKDEKTDQLLKLIAKKRARHKDVVEVEEPERDEGKVVDIMEALKKSLAKKRKAA*
Ga0137457_11516921 | 3300011443 | Soil | AEKLLKLVERKRKQHKDVVQVEVPEREEGKVIDLMAALKKSLAGKRKAA*
Ga0137389_106621882 | 3300012096 | Vadose Zone Soil | HKDVVEVEEPERDEGKVVDLMEVLKKSLARKRKTAA*
Ga0137365_110939682 | 3300012201 | Vadose Zone Soil | DLKDEQTEKLLKLVDKKRKQHKDIVEVEMPEREEGKVVDLMEALKKSLASKRKAA*
Ga0137376_105089191 | 3300012208 | Vadose Zone Soil | KRARHKDIVEVEEPERDEGKVIDIMDALKKSLAKKRKAA*
Ga0150985_1109589552 | 3300012212 | Avena Fatua Rhizosphere | ALKEFKDEQTAKLLKLVAKKRKQHKDVVEVEVPEQEQGKVVDLMAALKKSLAGKKRAA*
Ga0137367_100834471 | 3300012353 | Vadose Zone Soil | KKRSRHKDVVEVEAPERDEGKVVDMMEALKKKLAQKRKAA*
Ga0137360_116959371 | 3300012361 | Vadose Zone Soil | HKDVVEIEAPERDEGKVIDIMEALKKSLARKRKAA*
Ga0137373_101523571 | 3300012532 | Vadose Zone Soil | KHLSLSELKDEKTEQLLKLVEKKRKQHKDVIEVEAPERDEGKVIDLMEVLKKSLAGKRKAAA*
Ga0137396_112688081 | 3300012918 | Vadose Zone Soil | KDQKTDQLLKLIEQKRKRHKDLVEVEAPERNEGKVVDIMDALKRSMARKRKAA*
Ga0137359_105172001 | 3300012923 | Vadose Zone Soil | DVVEVEMPARDEGKVVDLMAALKKSLVRKRKAAA*
Ga0137359_113399671 | 3300012923 | Vadose Zone Soil | HKDVVEVETPRRDEGKVVDLMEVLKKSLARKKRAA*
Ga0137419_110845291 | 3300012925 | Vadose Zone Soil | RKRHKDVVEVEAPERDEGKVVDIMDALKRSMARKRKAA*
Ga0137404_103033912 | 3300012929 | Vadose Zone Soil | DLKGQQTEKLLKLVEKKRKQNKDIVEVEVPEREEGKVVDLMAALKKSLAGKRRAA*
Ga0137407_111635512 | 3300012930 | Vadose Zone Soil | EKKRSRHKDVVKVEVPAHQREGKVVDLMEVLKRSLAGKKKRA*
Ga0164299_106141832 | 3300012958 | Soil | LVERKRKQHKDIVEVDVPEEAQGKVVDLMAALKKSLAGKKRAA*
Ga0126369_127167651 | 3300012971 | Tropical Forest Soil | KDEQTEKLLKLVERKRKDHKNLVEAEVKEREEGQVIDLMAALKKSLAGKRKAA*
Ga0126369_128167171 | 3300012971 | Tropical Forest Soil | KLVKRKRKQHKDVVEVEVPEQKEGKVVDLMEALKKSLAGKRRAA*
Ga0137405_14126561 | 3300015053 | Vadose Zone Soil | LKLVAKKHKQHKDLVEVEEPEREEGKVVDLMEVLKRSSPASAKAA*
Ga0137418_107512001 | 3300015241 | Vadose Zone Soil | RARHKDVVQVEEPEREEGKVVDIMAALKKSLAGKRKAA*
Ga0132257_1014378183 | 3300015373 | Arabidopsis Rhizosphere | LVEKKRKQHKDLVEVDEPERDEAKVVDLMEVLKKSLAGKRKAA*
Ga0132255_1030371351 | 3300015374 | Arabidopsis Rhizosphere | KQHKDIVEVEEPARDEGKVVDLMEVLKKSLARKRKAAA*
Ga0190272_115475671 | 3300018429 | Soil | HKDVVEVEEPARDEGKVVDLMEVLKKSLARKRKAA
Ga0066662_105760451 | 3300018468 | Grasslands Soil | IEKKRKGHKDVVEVEAAERDEGKVVDIMEALKKSLARKRKAA
Ga0066662_115956302 | 3300018468 | Grasslands Soil | KKRKQHKDIVEVEEPKREEGKVVDLMAALKKSLASKRKAA
Ga0247688_10902591 | 3300024186 | Soil | EKLLKLVEKKRKQRKDIVEIEVPEREAGKVVDLMEALKKSLAGKRRAA
Ga0247667_11066942 | 3300024290 | Soil | HKDVVEVEEPERDEGKVVDIMEALKKSLAKKRKAA
Ga0207654_109478271 | 3300025911 | Corn Rhizosphere | ELKDEKTEQLMKLVEKKRKQHKDIVEVAEPERDWGKVVDLMEVLKKSLAGKRKAA
Ga0207644_114498911 | 3300025931 | Switchgrass Rhizosphere | KLLKLVEKKRKQHKDLVEVELPEQEPAKVVDLMEALKKSLAGKRQAA
Ga0207686_102769071 | 3300025934 | Miscanthus Rhizosphere | KLLKLVEKKRKQHKDIVEVEASEPEQGKVVDLMEALKKSLAGKRKAA
Ga0207709_104113251 | 3300025935 | Miscanthus Rhizosphere | ELKDEKTADLLKLVEKKRKQHKDIVEVDVPEREEGKVIDLMEVLKKSLAGKRKAA
Ga0207704_106676261 | 3300025938 | Miscanthus Rhizosphere | TEKLLKLVEKKRKQHKDMVEVEVPEREPGKVVDLMEALKKSLAGKRKAA
Ga0207651_106016872 | 3300025960 | Switchgrass Rhizosphere | KKRKQHKDIVEVTEPERDGGKVVDLMEVLKKSLAGKRKAA
Ga0207651_116656722 | 3300025960 | Switchgrass Rhizosphere | ERKRKQHKDIVEVEVPEEAQGKVVDLMAAFKKSLAGKKRAA
Ga0207676_107146002 | 3300026095 | Switchgrass Rhizosphere | ARKRKQHKDIVEVEVPEQEQGKVVDLMAALKKSLAGKRRAA
Ga0209239_12097992 | 3300026310 | Grasslands Soil | QHKDVVEVEEPEEREGGKVVDLVEVLKRSLARKQKAA
Ga0209153_10981092 | 3300026312 | Soil | VEKKRKQHKDVVEVEVSEREEGKVVDLMAALKKSLAGKRRAA
Ga0209471_12620362 | 3300026318 | Soil | NLLRLVEKKRKQHKDIVEVEEPEERAQGKVVDLVEVLKRSLARKQKAA
Ga0209266_12597391 | 3300026327 | Soil | KKRKQHKDVVEVEEPEEREGGKVVDLIEVLKRSLARKQKAA
Ga0209376_12397502 | 3300026540 | Soil | KLVAKKHKQHKDLVEVEEPEREEGKVVDLMEVLKRSLAGKRKAA
Ga0209177_102210693 | 3300027775 | Agricultural Soil | EHLMKLVEKKRKQHKDVVKVEEPETGEGKVIDLMEVLKKSLAGKRRAA
Ga0268265_102137892 | 3300028380 | Switchgrass Rhizosphere | ASKKNVVHVEVPEREEGKVVDLLEVLKKSLAGKKRAA
Ga0268264_126634182 | 3300028381 | Switchgrass Rhizosphere | EKLLKLVERKRRQHKAVVKVEVPEREDGKVIDLMAALKKSLAGKRKAA
Ga0073994_123251181 | 3300030991 | Soil | QKTDQLLKLIEKKRAQHKDVVEVETAERDEGKVVDIMDALKKSLARKRKAA
Ga0308189_103586762 | 3300031058 | Soil | DEKTERMLKLIEKKRARHQNVVEVEESERDEGKVVDIMTALKQSLAKKRKAA
Ga0170824_1024251141 | 3300031231 | Forest Soil | KELKDEQTEKLLKLVEKKRKQHKDVIEVEAPKQDQGQVVDLMAALKKSLTHKRNAA
Ga0170819_149701641 | 3300031469 | Forest Soil | EKLLKLVEKKRKQHKDVIEVEAPKQDQGQVVDLMAALKKSLTHKRNAA
Ga0307473_103302341 | 3300031820 | Hardwood Forest Soil | EKKRKQDKNVVEVEAPEREEGKVVDLMAALKKSLAGKRKAA
Ga0310907_104565532 | 3300031847 | Soil | LSLKEFKDEQAEKLLKLVERKRKQHKDVVRVEVPERQEGKVIDLMAALKKSLAGKRKAA
Ga0310810_112135321 | 3300033412 | Soil | EKTEQLMKLVEKKRKQHKDIVEVEEPARDEGKVVDLMEVLKKSLAGKRKAA
Ga0364925_0097020_13_186 | 3300034147 | Sediment | LKDFKDEQAEKLLKLVERKRKQHKDVVEVEVPEREEGKVIDLMAALKKSLAGKRKAA
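
Many of the sequences above end in `*`, while others do not. The following is a minimal Python sketch for counting residues consistently across both forms. It assumes the trailing `*` is a stop marker rather than a residue, which this page does not state. The helper name `residue_length` is hypothetical and is not part of NMPFamsDB.

```python
def residue_length(sequence):
    """Return the number of residues in a protein sequence string.

    Assumption: a trailing '*' marks a stop and is not counted as a residue.
    """
    return len(sequence.strip().rstrip("*"))


# The representative sequence from the Overview section (listed there without '*').
representative = "LMKLVEKKRKQHKDIVEVEEPERDAGKVVDLMEVLKKSLAGKRKAA"

# The same sequence as it appears in the family table, with a trailing '*'.
with_stop = representative + "*"

# Both forms give the same residue count once the marker is stripped.
print(residue_length(representative), residue_length(with_stop))
```

Stripping the marker before measuring keeps per-sequence lengths comparable with the average sequence length reported in the Overview.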




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice