NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F103663



Overview

Basic Information
Family ID F103663
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 46 residues
Representative Sequence MAHWMAEHEVLAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS
Number of Associated Samples 73
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 31.68 %
% of genes near scaffold ends (potentially truncated) 19.80 %
% of genes from short scaffolds (< 2000 bps) 79.21 %
Associated GOLD sequencing projects 62
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53


Most Common Taxonomy
Group Bacteria (56.436 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(15.842 % of family members)
Environment Ontology (ENVO) Unclassified
(32.673 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(52.475 % of family members)
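
The percentages in this Overview are shares of the family's members, so they can be converted back to approximate member counts. The following is a minimal Python sketch, assuming every percentage is computed against the 101 sequences listed under Basic Information; the helper name `members_from_percent` is illustrative and is not part of NMPFamsDB.

```python
FAMILY_SIZE = 101  # "Number of Sequences" reported for family F103663


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members to a whole-number count."""
    return round(percent / 100 * family_size)


# Percentages copied from the Overview section of this page.
reported = {
    "Bacteria (most common taxonomy)": 56.436,
    "Vadose Zone Soil (GOLD)": 15.842,
    "Unclassified (ENVO)": 32.673,
    "Plant rhizosphere (EMPO)": 52.475,
}

for label, pct in reported.items():
    print(f"{label}: about {members_from_percent(pct)} of {FAMILY_SIZE} sequences")
```

Rounding is needed because the page reports percentages to a fixed number of decimals rather than exact fractions.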




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 60.00%, β-sheet: 0.00%, Coil/Unstructured: 40.00%

Predicted 3D Structure

pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
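
The rule in the note above can be written as a one-line check. This is a minimal sketch, assuming the 0.7 cutoff is applied as a strict "less than" comparison as the note states; the function name `is_low_confidence` is hypothetical and does not come from NMPFamsDB.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F103663 on this page.
print(is_low_confidence(0.53))
```

For this family the check is true, which matches the page's statement that the model was not screened against SCOPe or PDB.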



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF03572 | Peptidase_S41 | 6.93
PF14066 | DUF4256 | 5.94
PF07517 | SecA_DEAD | 4.95
PF02517 | Rce1-like | 4.95
PF13517 | FG-GAP_3 | 4.95
PF13180 | PDZ_2 | 2.97
PF04237 | YjbR | 2.97
PF14582 | Metallophos_3 | 0.99
PF13683 | rve_3 | 0.99
PF02371 | Transposase_20 | 0.99
PF00166 | Cpn10 | 0.99
PF01527 | HTH_Tnp_1 | 0.99
PF14022 | DUF4238 | 0.99
PF00902 | TatC | 0.99
PF00903 | Glyoxalase | 0.99
PF01839 | FG-GAP | 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG0793 | C-terminal processing protease CtpA/Prc, contains a PDZ domain | Posttranslational modification, protein turnover, chaperones [O] | 6.93
COG0653 | Preprotein translocase subunit SecA (ATPase, RNA helicase) | Intracellular trafficking, secretion, and vesicular transport [U] | 4.95
COG1266 | Membrane protease YdiL, CAAX protease family | Posttranslational modification, protein turnover, chaperones [O] | 4.95
COG4449 | Predicted protease, Abi (CAAX) family | General function prediction only [R] | 4.95
COG2315 | Predicted DNA-binding protein with ‘double-wing’ structural motif, MmcQ/YjbR family | Transcription [K] | 2.97
COG0234 | Co-chaperonin GroES (HSP10) | Posttranslational modification, protein turnover, chaperones [O] | 0.99
COG0805 | Twin-arginine protein secretion pathway component TatC | Intracellular trafficking, secretion, and vesicular transport [U] | 0.99
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 56.44 %
Unclassified | root | N/A | 43.56 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2199352024|deeps__Contig_107346 | Not Available | 714
2199352024|deeps__Contig_175120 | Not Available | 916
3300002568|C688J35102_120902059 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Granulicella → unclassified Granulicella → Granulicella sp. L60 | 2154
3300003321|soilH1_10260224 | All Organisms → cellular organisms → Bacteria | 2135
3300003324|soilH2_10447776 | All Organisms → cellular organisms → Bacteria | 1425
3300004479|Ga0062595_100110917 | All Organisms → cellular organisms → Bacteria | 1489
3300004479|Ga0062595_100281132 | Not Available | 1106
3300004479|Ga0062595_101008111 | Not Available | 716
3300005186|Ga0066676_11158597 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → unclassified Terriglobales → Acidobacteriales bacterium 13_1_40CM_3_55_5 | 507
3300005336|Ga0070680_101608361 | Not Available | 563
3300005338|Ga0068868_100479747 | Not Available | 1086
3300005345|Ga0070692_10146490 | Not Available | 1341
3300005345|Ga0070692_10270192 | Not Available | 1026
3300005434|Ga0070709_10330831 | All Organisms → cellular organisms → Bacteria | 1121
3300005434|Ga0070709_10970992 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 675
3300005434|Ga0070709_11125369 | Not Available | 629
3300005435|Ga0070714_101118905 | Not Available | 768
3300005436|Ga0070713_102087195 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 549
3300005437|Ga0070710_10802049 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 673
3300005439|Ga0070711_100115118 | Not Available | 1980
3300005458|Ga0070681_10212900 | All Organisms → cellular organisms → Bacteria | 1849
3300005530|Ga0070679_101115389 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 734
3300005533|Ga0070734_10007785 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 8512
3300005533|Ga0070734_10338865 | Not Available | 859
3300005535|Ga0070684_100180106 | All Organisms → cellular organisms → Bacteria | 1921
3300005537|Ga0070730_10078117 | All Organisms → cellular organisms → Bacteria | 2322
3300005537|Ga0070730_10717686 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 633
3300005539|Ga0068853_100048630 | All Organisms → cellular organisms → Bacteria | 3643
3300005542|Ga0070732_10045946 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 2517
3300005542|Ga0070732_10154937 | All Organisms → cellular organisms → Bacteria | 1364
3300005542|Ga0070732_10427944 | All Organisms → cellular organisms → Bacteria | 799
3300005547|Ga0070693_100066298 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2113
3300005568|Ga0066703_10284278 | Not Available | 1002
3300005575|Ga0066702_10220562 | Not Available | 1153
3300005575|Ga0066702_10690592 | Not Available | 609
3300005575|Ga0066702_10921963 | Not Available | 521
3300006028|Ga0070717_10777459 | Not Available | 871
3300006028|Ga0070717_11357576 | Not Available | 646
3300006175|Ga0070712_100680614 | Not Available | 876
3300007788|Ga0099795_10120166 | Not Available | 1050
3300009093|Ga0105240_10163914 | Not Available | 2638
3300009098|Ga0105245_10014319 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6911
3300009143|Ga0099792_10041269 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2202
3300009143|Ga0099792_10110473 | All Organisms → cellular organisms → Bacteria | 1460
3300009174|Ga0105241_10709565 | Not Available | 919
3300009545|Ga0105237_10650122 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1061
3300009551|Ga0105238_10020903 | All Organisms → cellular organisms → Bacteria | 6668
3300009551|Ga0105238_11331788 | Not Available | 744
3300010159|Ga0099796_10187711 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 833
3300010361|Ga0126378_11647106 | Not Available | 729
3300010373|Ga0134128_10177693 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 2402
3300010373|Ga0134128_10454502 | Not Available | 1429
3300010373|Ga0134128_10869761 | All Organisms → cellular organisms → Bacteria | 998
3300010373|Ga0134128_11406997 | Not Available | 768
3300010375|Ga0105239_10517133 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 1358
3300010375|Ga0105239_12216741 | Not Available | 639
3300010396|Ga0134126_11923488 | Not Available | 647
3300010396|Ga0134126_12977775 | Not Available | 512
3300010399|Ga0134127_10447775 | Not Available | 1290
3300011269|Ga0137392_10619040 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 898
3300011271|Ga0137393_10657110 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 898
3300012683|Ga0137398_10075221 | All Organisms → cellular organisms → Bacteria | 2069
3300012917|Ga0137395_10121644 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1759
3300012924|Ga0137413_10420795 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 965
3300012930|Ga0137407_10887130 | Not Available | 843
3300012957|Ga0164303_10695989 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 684
3300012960|Ga0164301_10269095 | Not Available | 1130
3300012960|Ga0164301_11148052 | Not Available | 621
3300012971|Ga0126369_10864557 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 988
3300012984|Ga0164309_10459800 | Not Available | 964
3300012986|Ga0164304_10060303 | All Organisms → cellular organisms → Bacteria | 2109
3300012986|Ga0164304_10607807 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 817
3300012986|Ga0164304_11199512 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 613
3300013296|Ga0157374_12867850 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 509
3300015242|Ga0137412_10507873 | Not Available | 922
3300015264|Ga0137403_11146074 | Not Available | 623
3300018468|Ga0066662_10465366 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis → Candidatus Koribacter versatilis Ellin345 | 1138
3300020140|Ga0179590_1051359 | All Organisms → cellular organisms → Bacteria | 1063
3300021170|Ga0210400_10013624 | All Organisms → cellular organisms → Bacteria | 6477
3300021170|Ga0210400_10401711 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1130
3300021560|Ga0126371_10389973 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1533
3300024288|Ga0179589_10135791 | All Organisms → cellular organisms → Bacteria | 1038
3300025912|Ga0207707_10386528 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1203
3300025912|Ga0207707_10497566 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1040
3300025913|Ga0207695_11582761 | Not Available | 536
3300025921|Ga0207652_11713370 | Not Available | 533
3300025924|Ga0207694_10012058 | All Organisms → cellular organisms → Bacteria | 6517
3300025924|Ga0207694_10105305 | All Organisms → cellular organisms → Bacteria | 2239
3300025927|Ga0207687_10012353 | All Organisms → cellular organisms → Bacteria | 5583
3300025949|Ga0207667_11913404 | Not Available | 555
3300026041|Ga0207639_11900796 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae | 556
3300027583|Ga0209527_1108079 | Not Available | 623
3300027738|Ga0208989_10238665 | Not Available | 595
3300027826|Ga0209060_10005952 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 8537
3300027842|Ga0209580_10004261 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6093
3300027842|Ga0209580_10389197 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 695
3300027903|Ga0209488_10005003 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 10239
3300027903|Ga0209488_10338266 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1120
3300031122|Ga0170822_15132768 | Not Available | 574
3300031231|Ga0170824_116501905 | Not Available | 1112
3300031938|Ga0308175_102318520 | Not Available | 602




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 15.84%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 11.88%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 9.90%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 9.90%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.93%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 6.93%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 6.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.95%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.97%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.97%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.97%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.98%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.98%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.98%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.99%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.99%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.99%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.99%



Associated Samples

Taxon OID | Sample Name | Habitat Type
2199352024 | Bare-fallow DEEP SOIL | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300003321 | Sugarcane bulk soil Sample H1 | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005535 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaG | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005539 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 | Host-Associated
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005547 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaG | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009545 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG | Host-Associated
3300009551 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG | Host-Associated
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010375 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG | Host-Associated
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020140 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly) | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300024288 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal | Environmental
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated
3300025921 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes) | Environmental
3300025924 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes) | Host-Associated
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025949 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes) | Host-Associated
3300026041 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes) | Host-Associated
3300027583 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes) | Environmental
3300027738 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes) | Environmental
3300027826 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300031122 | Oak Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031938 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1 | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
deeps_019060902199352024SoilMAHWMAEHEILAAVIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS
deeps_032532002199352024SoilVMAHWMAEHEVLAAAIFVALSLALTAIFAALERHYQQQLAVRGKNDFS
C688J35102_12090205933300002568SoilMAHWMAEHEILSAVIFVGLSLALIAFFAALERRYKEQLTARRKDYFL*
soilH1_1026022423300003321Sugarcane Root And Bulk SoilMAHWMAEHEILAAIIFVGLSLTLTAIFAALERRYKHELTARRKDNLS*
soilH2_1044777623300003324Sugarcane Root And Bulk SoilMAHWMAEHEILAAIIFVGLSLALTAIFAALERRYKHELTARRKDNLS*
Ga0062595_10011091713300004479SoilMAHWMAQHEILAAVIFVGLSLALIAFFATLERRYKEHLMTRRKDYL*
Ga0062595_10028113213300004479SoilMAHWMAEHEVLAATIFVGLSLALTAIFAAMERRYKQQLTARRKNNFS*
Ga0062595_10100811113300004479SoilMAEHEVLAAAIFVGLSLALTAIFAALERRYKQQLTARRKDNFS*
Ga0066676_1115859713300005186SoilMAEHEILSAVIFVGLSLALIAFFAALERRYKEQLTARRKD
Ga0070680_10160836113300005336Corn RhizosphereMAHWMAEHEAFAAAIFVGLSLALIAIFAAMEHRYKQKLTARRKDYFS*
Ga0068868_10047974713300005338Miscanthus RhizosphereMAQWMAEHEVLAAAFFVGLSVALIVIFAALERRYKEQLTARRKDRFS*
Ga0070692_1014649013300005345Corn, Switchgrass And Miscanthus RhizosphereMAHWMAEHEVLAAAIFVGLSLALIALFSALERRYKEQLTSRRKNYFS*
Ga0070692_1027019223300005345Corn, Switchgrass And Miscanthus RhizosphereMAHWMAEHEAFAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFT*
Ga0070709_1033083123300005434Corn, Switchgrass And Miscanthus RhizosphereMAHWMAEHEVLAAAIFVGLSLALIALFAALERRYKEQLTSRRKNDFF*
Ga0070709_1097099223300005434Corn, Switchgrass And Miscanthus RhizosphereMAHWMAEHEILAAAMFVGLSLALIALFAALERRYKEQLTSRRKNYFS*
Ga0070709_1112536913300005434Corn, Switchgrass And Miscanthus RhizosphereMAQWMAEHEIMAAAVFVGLSLALIAIFAALEHRYKEQLTARRKDYFS*
Ga0070714_10111890533300005435Agricultural SoilMAHWMAEHEVLAAAIFVGLSLALIALFAALERRYKEQLTSRRKNDFS*
Ga0070713_1020871951 3300005436 Corn, Switchgrass And Miscanthus Rhizosphere VLPFGMAHWMAEHEILAAAAFVGLSLALIAFFAALERRYKEQLTSRRKSYFS*
Ga0070710_108020491 3300005437 Corn, Switchgrass And Miscanthus Rhizosphere MAHWMADHEVLAAALFVGLSLALIAFFAALERRYKEQLTSRRKNYFS*
Ga0070711_1001151181 3300005439 Corn, Switchgrass And Miscanthus Rhizosphere MAQWMAEHEVLAAAIFVGLSLALIAIFAALEHRYKEELTARRKDHFS*
Ga0070681_102129002 3300005458 Corn Rhizosphere MAHWMAQHEVLAAAIFVGLSLALIALFAALERRYKEQLTSRRKSYFS*
Ga0070679_1011153892 3300005530 Corn Rhizosphere MAHWMAEHEAFAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS*
Ga0070734_100077859 3300005533 Surface Soil MAHWMAEHEVLAAAIFVALSFALTAFFAALERHYEQQLTSRRKNDFS*
Ga0070734_103388652 3300005533 Surface Soil MAHWMAEHEIMAAVVFVGLSLALIAIFAALEHRYKQQLTARRKNYYS*
Ga0070684_1001801062 3300005535 Corn Rhizosphere MAQWMAEHEVLAAAFFVGLSVALIVIFAALERRYKEQLTARRKDHFS*
Ga0070730_100781172 3300005537 Surface Soil MAHWMAEHEVLAAAIFVALSFALTAFFAALERHYEQQFTSRRKNDFS*
Ga0070730_107176862 3300005537 Surface Soil MAHWMAEHEVLAAAIFVGLSLALIAIFAALEHRYK
Ga0068853_1000486304 3300005539 Corn Rhizosphere MAQWMAEHEVLAAAFFVGLSVALIVIFAALERRYKEQLTARRKDYFS*
Ga0070732_100459461 3300005542 Surface Soil HTPYAYGMAHWMAEHEVLAAAIFVGLSLALTAFFSALERHYKQQLTSRRKNDFS*
Ga0070732_101549372 3300005542 Surface Soil MAHWMAEHEVLAAAVFVAVSLALTAVFAALERHYKQQLTSRRKNNFS*
Ga0070732_104279442 3300005542 Surface Soil MAHWMAEHEVLAAAVFVAVSLALTAIFAALERHYKQQLTSRRKNNFS*
Ga0070693_1000662982 3300005547 Corn, Switchgrass And Miscanthus Rhizosphere MAQWMAEHEVLAAAFFVGLSLALIVIFAALERRYKEQLTARRKDHFS*
Ga0066703_102842782 3300005568 Soil MAQWMAEHEVLAAVIFVGLSLALIAIFAALEHRYKQQLTTRRKDYFS*
Ga0066702_102205621 3300005575 Soil MAQWMAEHEVLAAVIFVGLSLALIAIFAALEHRYKQQLTARREDYFS*
Ga0066702_106905922 3300005575 Soil MAHWMAEHEVLAAAIFVGLSLGLIAIFAALEHRYKQQLTARRKDYFS*
Ga0066702_109219632 3300005575 Soil MAHWMAEHEILAAAIFVGLSLALIAFFAALERRYKEQLASR
Ga0070717_107774591 3300006028 Corn, Switchgrass And Miscanthus Rhizosphere MAHWMAEHEILAAAVFVGLSLALIALFAALERRYKEQLTSRRKNDFS*
Ga0070717_113575761 3300006028 Corn, Switchgrass And Miscanthus Rhizosphere WMAEHEVLAAAIFVGLSLALIAIFAALEHRYKEELTARRKDHFS*
Ga0070712_1006806142 3300006175 Corn, Switchgrass And Miscanthus Rhizosphere MAEHEIMAAAVFVGLSLALIAIFAALEHRYKQQLTARRKDYFS*
Ga0099795_101201661 3300007788 Vadose Zone Soil MNQHEILAAVIFVGLSLALISFFAALARRDKGQLTSRRKNY
Ga0105240_101639143 3300009093 Corn Rhizosphere MAHWMAEHEVLAAAIFVGLSLALTAFFAVLERHYKQPLTSRRKNYFS*
Ga0105245_100143192 3300009098 Miscanthus Rhizosphere MAHWMAEHEILAAAIFVGLSLALIGFFAALERRYKEQLTSRRKNYFS*
Ga0099792_100412693 3300009143 Vadose Zone Soil MASWMNQHEILAAVIFVGLSLALIAFFAALERRYKGQLTSRRKNYFP*
Ga0099792_101104732 3300009143 Vadose Zone Soil MAEHEVLAAGIFVGLSLALIALFAALEQRYKKQLTSRRKNYFS*
Ga0105241_107095652 3300009174 Corn Rhizosphere MAEHEVLAAAIFVGLSLALTAIFAALERRYKQQLTARQKDNFS*
Ga0105237_106501222 3300009545 Corn Rhizosphere FGMAHWMAQHEVLAAAIFVGLSLALIALFAALERRYKEQLTSRRKSYFS*
Ga0105238_100209031 3300009551 Corn Rhizosphere MAQWMAEHEVLAAAFFVGLSLALIVIFAALERRYKGQLTARRKDHFS*
Ga0105238_113317882 3300009551 Corn Rhizosphere AAFFVGLSVALIVIFAALERRYKEQLTARRKDHFS*
Ga0099796_101877112 3300010159 Vadose Zone Soil MNQHEILAAVIFVGLSLALIAFFAALERRYKGQLTSRRKNYFP*
Ga0126378_116471061 3300010361 Tropical Forest Soil MAQWMAEHEVLAAAVFVALSLAITAIFAALERRYGEQ
Ga0134128_101776933 3300010373 Terrestrial Soil MAQWMAEHEVLAAALFVGLSLALIAVFAVLEHRYKDELTARRKDNFS*
Ga0134128_104545022 3300010373 Terrestrial Soil MAHWMAEHEVLAAAIFVAISLALTAIFAALERHYQQQLAVRGKNDFS*
Ga0134128_108697611 3300010373 Terrestrial Soil MAHWMAQHEILAAVIFVGLSLALIAFFATLERRYKEHLMTRRKD
Ga0134128_114069972 3300010373 Terrestrial Soil EHEALAAAIFVALSLALIAVFAAVERHYKEQLTSSRKNYFS*
Ga0105239_105171332 3300010375 Corn Rhizosphere MAHWMAKNEVLSAVVFVGLSLARIALFAALERRYKERLASRRKDYFS*
Ga0105239_122167412 3300010375 Corn Rhizosphere MADWMAEHEVLAAAIFVGLSLALTAIFAALERRYKQQLTARRKDNFS*
Ga0134126_119234881 3300010396 Terrestrial Soil MAQCMAEHEVLAAAIFVGLSLALIAIFAALEHRYKEELTARRKDHFS*
Ga0134126_129777752 3300010396 Terrestrial Soil MAHWMAQHEILAAVIFVGLSLALIAFFATLERRYKEHMMTRRKDYL*
Ga0134127_104477753 3300010399 Terrestrial Soil MAQWMAEHEVLAAAFFVGLSLALIVIFAALERRYKEQLTARRKDYFS*
Ga0137392_106190401 3300011269 Vadose Zone Soil MASWMNQHEILAAVIFVGLSLALIAFFAALERRYKEQLTSRRKNYFP*
Ga0137393_106571102 3300011271 Vadose Zone Soil MASWMNQHEILAAVIFVGLSLALIAFFAALERRYKEQLTSRRKNYFS*
Ga0137398_100752212 3300012683 Vadose Zone Soil MSEHEVLAAAVFVGLSLALTVFFAFLERRYKQQLTGAAKIKLDK*
Ga0137395_101216442 3300012917 Vadose Zone Soil MNQHEILAAVIFVGLSLALIAFFAALERRYKEQLTSRRKNYFS*
Ga0137413_104207952 3300012924 Vadose Zone Soil MAHWMAEHETLAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS*
Ga0137407_108871302 3300012930 Vadose Zone Soil MASWMNQHEILAAAISVGVSLALIAFFAALERRYKEQLTGRRKG*
Ga0164303_106959892 3300012957 Soil MAHWMADHEILAAAIFVGLSLALIGFFAALERRYKEQLTSRRKNYFS*
Ga0164301_102690952 3300012960 Soil MAHWMAEHEIMAAVVFVGLSLALIAIFAALEHRYKEQLTARRKDYFS*
Ga0164301_111480522 3300012960 Soil MAQWMAEHEIMAAAVFVGLSLALIAIFASLEHRYKQQLTARRKDYFS*
Ga0126369_108645572 3300012971 Tropical Forest Soil MAHWMAQHEVFAAAVFVALSLAITAIFAALERRYGEQITARRKDNYS*
Ga0164309_104598002 3300012984 Soil EVLAAAIFVGLSLALIAIFAALEHRYKEELTARRKDHFS*
Ga0164304_100603031 3300012986 Soil MAQWMAEHEIMAAVVFVGLSLALIAIFAALEHRYKEQLTARRKDYFS*
Ga0164304_106078072 3300012986 Soil AGNQSILPSGMAHWMAEHEILAAAIFVGLSLALIGFFAALERRYKEQLTSRRKNYFS*
Ga0164304_111995121 3300012986 Soil MAHWMAENEVLAAALFVALSLALTAFFAALERHYEQQLTSRRKNNFS*
Ga0157374_128678501 3300013296 Miscanthus Rhizosphere WMAEHEVLAAAIFVGLSLALIALFSALERRYKEQLTSRRKNYFS*
Ga0137412_105078732 3300015242 Vadose Zone Soil MNQHEVLAAAIFVGVSLALMAFFSALERRYKERLTSCRKNEIP*
Ga0137403_111460742 3300015264 Vadose Zone Soil MNQHEILAAAVFVGLSLALIAFFAALERRYKEQLTGRRKD*
Ga0066662_104653661 3300018468 Grasslands Soil MAHWMAEHEVLAAAIFVGLSLGLIAIFAALEHRYKQQLTARRKDYFS
Ga0179590_10513592 3300020140 Vadose Zone Soil MNQHEVLAAAIFVGVSLALMAFFSALERRYKQQLTSCRKNEIP
Ga0210400_100136246 3300021170 Soil MNQHEILAAALFVGLSLALTVFFAALERRYKQQLTSPRKNYFSE
Ga0210400_104017111 3300021170 Soil MSQHEALAAVIFVGLSLALIAFFAALERRYKKQLTSPRKNYFSERAP
Ga0126371_103899732 3300021560 Tropical Forest Soil MAQWMVEHEILAAAIFVGLSLALVAIFAALEHRYKKQLTARRKDHFS
Ga0179589_101357912 3300024288 Vadose Zone Soil MNQHEVLAAAIFVGVSLALMAFFSALERRYKQQLTSSRKNEI
Ga0207707_103865282 3300025912 Corn Rhizosphere MAHWMAEHEVLAAAIFVGLSLALIALFSALERRYKEQLTSRRKNYFS
Ga0207707_104975662 3300025912 Corn Rhizosphere MAHWMAQHEVLAAAIFVGLSLALIALFAALERRYKEQLTSRRKSYFS
Ga0207695_115827611 3300025913 Corn Rhizosphere MAHWMAQHEILAAVIFVGLSLALIAFFATLERRYKEHLMTRRKDYL
Ga0207652_117133701 3300025921 Corn Rhizosphere MAEHEAFAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS
Ga0207694_100120583 3300025924 Corn Rhizosphere MAHWMAEHEVLAAAIFVGLSLALTAFFAVLERHYKQPLTSRRKNYFS
Ga0207694_101053054 3300025924 Corn Rhizosphere MAQWMAEHEVLAAAFFVGLSLALIVIFAALERRYKEQLTARRKDYFS
Ga0207687_100123531 3300025927 Miscanthus Rhizosphere MAHWMAEHEILAAAIFVGLSLALIGFFAALERRYKEQLTSRRKNYFS
Ga0207667_119134041 3300025949 Corn Rhizosphere FYAYVMAHWMAEHEVLAAAIFVAISLALTAIFAALERHYQQQLAVRGKNDFS
Ga0207639_119007961 3300026041 Corn Rhizosphere MAEHEVLAAAIFVGLSLALIALFSALERRYKEQLTSRRKN
Ga0209527_11080792 3300027583 Forest Soil MNQHEVLAAVIFVGLSLAITVLFSFLERRYQRKLTGRRQDKSHQQMG
Ga0208989_102386651 3300027738 Forest Soil MTEHEVLAAVAFVGLSLALTVFFAALERRYKRELTDRSNKYTP
Ga0209060_100059522 3300027826 Surface Soil MAHWMAEHEVLAAAIFVALSFALTAFFAALERHYEQQLTSRRKNDFS
Ga0209580_100042616 3300027842 Surface Soil MAHWMAEHEVLAAAVFVAVSLALTAVFAALERHYKQQLTSRRKNNFS
Ga0209580_103891972 3300027842 Surface Soil MAHWMAEHEVLAAAIFVGLSLALTAFFSALERHYKQQLTSRRKNDFS
Ga0209488_100050033 3300027903 Vadose Zone Soil MASWMNQHEILAAVIFVGLSLALIAFFAALERRYKGQLTSRRKNYFP
Ga0209488_103382662 3300027903 Vadose Zone Soil MAHWMTEHEVLAAGIFVGLSLALIALFAALEQRYKKQLTSRRKNYFS
Ga0170822_151327682 3300031122 Forest Soil MAHWMVEHEVLAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS
Ga0170824_1165019051 3300031231 Forest Soil MAHWMAEHEVLAAAIFVGLSLALIAIFAALEHRYKQQLTARRKDYFS
Ga0308175_1023185201 3300031938 Soil MAEHEALAAAIFVALSLALIAIFAALERHYKQPLTSRRKNYFS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice