NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101969



Overview

Basic Information
Family ID F101969
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 45 residues
Representative Sequence MRVLETLIVVLVVTVAFMWVILAHTTPEEQARLHDSMGESTHIAWR
Number of Associated Samples 78
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 80.39 %
% of genes near scaffold ends (potentially truncated) 31.37 %
% of genes from short scaffolds (< 2000 bps) 85.29 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction No
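
The percentages above can be converted back into approximate gene counts. The sketch below assumes that each percentage is taken over the 102 family sequences listed under Basic Information, which the page does not state explicitly. The dictionary labels and the `implied_count` helper are illustrative names, not part of NMPFamsDB.

```python
# Quality-assessment percentages as listed for family F101969.
FAMILY_SIZE = 102  # "Number of Sequences" (assumed denominator)

percentages = {
    "valid RBS motif": 80.39,
    "near scaffold end": 31.37,
    "short scaffold (< 2000 bp)": 85.29,
}


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole number of genes closest to `percent` of `total`."""
    return round(percent * total / 100)


counts = {label: implied_count(p) for label, p in percentages.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption, each count divided by 102 reproduces the listed percentage to two decimal places, which suggests the denominator is the family size.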


Most Common Taxonomy
Group Unclassified (70.588 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (32.353 % of family members)
Environment Ontology (ENVO) Unclassified (33.333 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (55.882 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 63.04%, β-sheet: 0.00%, Coil/Unstructured: 36.96%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF02357 | NusG | 3.92
PF01610 | DDE_Tnp_ISL3 | 3.92
PF13490 | zf-HC2 | 1.96
PF00072 | Response_reg | 0.98
PF03092 | BT1 | 0.98
PF04226 | Transgly_assoc | 0.98
PF04392 | ABC_sub_bind | 0.98
PF00496 | SBP_bac_5 | 0.98
PF02954 | HTH_8 | 0.98
PF13189 | Cytidylate_kin2 | 0.98
PF14534 | DUF4440 | 0.98
PF13751 | DDE_Tnp_1_6 | 0.98
PF13358 | DDE_3 | 0.98
PF14336 | DUF4392 | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG0250 | Transcription termination/antitermination protein NusG | Transcription [K] | 3.92
COG3464 | Transposase | Mobilome: prophages, transposons [X] | 3.92
COG2261 | Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein family | General function prediction only [R] | 0.98
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 70.59 %
All Organisms | root | All Organisms | 29.41 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005341|Ga0070691_10069110 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1711
3300005434|Ga0070709_10380799 | Not Available | 1049
3300005436|Ga0070713_101637071 | Not Available | 625
3300005445|Ga0070708_100068209 | All Organisms → cellular organisms → Bacteria | 3196
3300005445|Ga0070708_100080412 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2949
3300005445|Ga0070708_101738136 | Not Available | 579
3300005467|Ga0070706_100035272 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4620
3300005467|Ga0070706_100321301 | Not Available | 1443
3300005467|Ga0070706_102044557 | Not Available | 518
3300005471|Ga0070698_100615598 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1027
3300005536|Ga0070697_100300701 | Not Available | 1379
3300005545|Ga0070695_101345985 | Not Available | 591
3300005556|Ga0066707_10499747 | Not Available | 789
3300005586|Ga0066691_10831458 | Not Available | 544
3300006049|Ga0075417_10745355 | Not Available | 505
3300006163|Ga0070715_10257158 | Not Available | 915
3300006173|Ga0070716_100003637 | All Organisms → cellular organisms → Bacteria | 7290
3300006176|Ga0070765_100096731 | All Organisms → cellular organisms → Bacteria | 2541
3300006755|Ga0079222_11420856 | Not Available | 644
3300006852|Ga0075433_11007046 | Not Available | 726
3300006904|Ga0075424_100629495 | Not Available | 1146
3300007255|Ga0099791_10166433 | Not Available | 1034
3300007258|Ga0099793_10310115 | Not Available | 767
3300007265|Ga0099794_10002971 | All Organisms → cellular organisms → Bacteria | 6395
3300007788|Ga0099795_10507097 | Not Available | 563
3300009038|Ga0099829_10988217 | Not Available | 698
3300009089|Ga0099828_11648987 | Not Available | 564
3300009143|Ga0099792_10133320 | Not Available | 1349
3300009148|Ga0105243_12937844 | Not Available | 517
3300010401|Ga0134121_10711787 | Not Available | 954
3300011003|Ga0138514_100113215 | Not Available | 594
3300011269|Ga0137392_11412273 | Not Available | 555
3300011270|Ga0137391_10062958 | All Organisms → cellular organisms → Bacteria | 3177
3300011270|Ga0137391_10224216 | All Organisms → cellular organisms → Bacteria | 1632
3300011271|Ga0137393_11090754 | Not Available | 679
3300012189|Ga0137388_10983477 | Not Available | 779
3300012199|Ga0137383_10300621 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1175
3300012203|Ga0137399_10313425 | Not Available | 1298
3300012205|Ga0137362_10263576 | Not Available | 1490
3300012206|Ga0137380_10372632 | Not Available | 1271
3300012207|Ga0137381_10152419 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1989
3300012208|Ga0137376_10572080 | Not Available | 978
3300012209|Ga0137379_10350452 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1386
3300012211|Ga0137377_11966577 | Not Available | 501
3300012356|Ga0137371_11397739 | Not Available | 513
3300012361|Ga0137360_10901573 | Not Available | 762
3300012362|Ga0137361_10435196 | Not Available | 1205
3300012362|Ga0137361_11105163 | Not Available | 714
3300012362|Ga0137361_11600438 | Not Available | 572
3300012363|Ga0137390_10303894 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1578
3300012363|Ga0137390_11702521 | Not Available | 565
3300012363|Ga0137390_11840239 | Not Available | 536
3300012923|Ga0137359_10240021 | Not Available | 1617
3300012923|Ga0137359_10299637 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1431
3300012923|Ga0137359_11719017 | Not Available | 515
3300012925|Ga0137419_10277945 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1272
3300012929|Ga0137404_11247349 | Not Available | 684
3300014885|Ga0180063_1076243 | All Organisms → cellular organisms → Bacteria | 997
3300017927|Ga0187824_10092443 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → Symmachiella | 968
3300017927|Ga0187824_10097708 | Not Available | 943
3300017936|Ga0187821_10071464 | Not Available | 1259
3300019877|Ga0193722_1060010 | All Organisms → cellular organisms → Bacteria | 954
3300020579|Ga0210407_10092911 | All Organisms → cellular organisms → Bacteria | 2286
3300020581|Ga0210399_10045078 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3536
3300021170|Ga0210400_10949352 | Not Available | 701
3300021178|Ga0210408_10086073 | Not Available | 2466
3300021178|Ga0210408_10574356 | Not Available | 894
3300025910|Ga0207684_10018044 | All Organisms → cellular organisms → Bacteria | 6048
3300025910|Ga0207684_10038499 | All Organisms → cellular organisms → Bacteria | 4057
3300025910|Ga0207684_10554291 | All Organisms → cellular organisms → Bacteria | 983
3300025910|Ga0207684_10661348 | Not Available | 890
3300025922|Ga0207646_10042440 | Not Available | 4085
3300025922|Ga0207646_10087642 | Not Available | 2785
3300026005|Ga0208285_1009963 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 710
3300026285|Ga0209438_1039228 | Not Available | 1568
3300026333|Ga0209158_1273276 | Not Available | 579
3300026354|Ga0257180_1014554 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 977
3300026361|Ga0257176_1009451 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1254
3300026446|Ga0257178_1015781 | Not Available | 873
3300026475|Ga0257147_1006957 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1433
3300026475|Ga0257147_1032908 | Not Available | 749
3300026497|Ga0257164_1026765 | Not Available | 849
3300026514|Ga0257168_1051487 | Not Available | 903
3300027645|Ga0209117_1014362 | All Organisms → cellular organisms → Bacteria | 2624
3300027651|Ga0209217_1073902 | Not Available | 998
3300027651|Ga0209217_1141062 | Not Available | 671
3300028824|Ga0307310_10610342 | Not Available | 556
3300029636|Ga0222749_10055549 | Not Available | 1760
(restricted) 3300031248|Ga0255312_1115153 | Not Available | 660
(restricted) 3300031248|Ga0255312_1155968 | Not Available | 569
3300031720|Ga0307469_10468319 | Not Available | 1096
3300031720|Ga0307469_12197528 | Not Available | 537
3300031740|Ga0307468_101476492 | Not Available | 629
3300031754|Ga0307475_11421001 | Not Available | 534
3300031820|Ga0307473_10161995 | Not Available | 1286
3300032174|Ga0307470_10147152 | Not Available | 1435
3300032180|Ga0307471_102593119 | Not Available | 642
3300032180|Ga0307471_102721952 | Not Available | 627
3300032205|Ga0307472_100587285 | Not Available | 980
3300032205|Ga0307472_100645945 | All Organisms → cellular organisms → Bacteria | 942
3300032205|Ga0307472_101397091 | Not Available | 679
3300033433|Ga0326726_11385008 | Not Available | 684

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
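
Each scaffold entry above appears to combine a numeric prefix with a scaffold name, joined by `|`, and the prefix matches the Taxon OID values listed under Associated Samples. The following is a minimal sketch of splitting such entries and counting scaffolds per sample. The `ScaffoldRef` type and `parse_scaffold_ref` function are hypothetical helpers written for illustration, not an NMPFamsDB or IMG/M API, and the four input strings are copied from the table.

```python
from collections import Counter
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric prefix, e.g. "3300005445"
    scaffold: str     # scaffold name, e.g. "Ga0070708_100068209"
    restricted: bool  # True when the entry carries a "(restricted)" flag


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold>' and record a leading '(restricted)' flag."""
    text = text.strip()
    flag = "(restricted)"
    restricted = text.startswith(flag)
    if restricted:
        text = text[len(flag):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold reference: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


# A few entries copied from the Associated Scaffolds table.
rows = [
    "3300005445|Ga0070708_100068209",
    "3300005445|Ga0070708_100080412",
    "3300005445|Ga0070708_101738136",
    "(restricted) 3300031248|Ga0255312_1115153",
]

refs = [parse_scaffold_ref(row) for row in rows]
per_sample = Counter(ref.taxon_oid for ref in refs)
print(per_sample)
```

Keeping the restricted flag on the parsed record makes it straightforward to exclude those scaffolds from downstream use until the required license has been obtained.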




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 32.35%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 19.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 12.75%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 10.78%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 2.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.94%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.94%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.94%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.96%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.96%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.98%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.98%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.98%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.98%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.98%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.98%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.98%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 0.98%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.98%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011003 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300019877 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026005 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 (SPAdes) | Environmental
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026354 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-B | Environmental
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental
3300026446 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-B | Environmental
3300026475 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-A | Environmental
3300026497 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-B | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300028824 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197 | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental





Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0070691_100691102 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETLIVVLVVTVAFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0070709_103807991 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETSIVVLVVMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR*
Ga0070713_1016370712 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLDTFIVLLIATVAFLWVILAHSTPEEHARLYDTMGESTHIAWR*
Ga0070708_1000682094 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETLIVMLVVTVVFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0070708_1000804125 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MRALETLVVLTVAIVAFLWVILAHSTPEEHARLHDSLGQSTHIAWR*
Ga0070708_1017381362 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MRALEALVVLTVATVAFLWVILAHSTPEDQARLHDSLGQRTHIAWR*
Ga0070706_1000352724 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVSETFIVVLIVAVACLWVILAHSTPEEHARLYDAMGDSTHIA*
Ga0070706_1003213011 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MRGLETFIVVFIAAVAFLWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0070706_1020445571 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETSIVVLVVMVAFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0070698_1006155981 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MRGLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0070697_1003007012 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0070695_1013459852 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | EAVMRVLETSIVVLVVMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR*
Ga0066707_104997473 | 3300005556 | Soil | MRVLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0066691_108314581 | 3300005586 | Soil | MRGLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0075417_107453551 | 3300006049 | Populus Rhizosphere | MRVLETLIVVFVVTVAFMWVILAHTTPEEQARLHDSIGESTHIAWR*
Ga0070715_102571581 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHI
Ga0070716_1000036371 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0070765_1000967315 | 3300006176 | Soil | MRALETLVVLTIATVAFLWVILAHTTPEEHARLHDSIGQSTHIAWR*
Ga0079222_114208563 | 3300006755 | Agricultural Soil | MRVLETSIVLLVVMVAFMWVILTHTTPEEQARLHDSIGESTHIAWR*
Ga0075433_110070462 | 3300006852 | Populus Rhizosphere | MRVLETLIVVFVVTVAFMWVILAHTTPEEQARLHDSIGE
Ga0075424_1006294953 | 3300006904 | Populus Rhizosphere | MRVLETLIVVFVVTVAFMWVILAHTTPEEQARLHDSIGESTH
Ga0099791_101664332 | 3300007255 | Vadose Zone Soil | LIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0099793_103101152 | 3300007258 | Vadose Zone Soil | MRVLETSIVVLVVMVTFMWVILVHTIPEEQACLHDSMGESTHIAWR*
Ga0099794_100029717 | 3300007265 | Vadose Zone Soil | MRVLETCIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0099795_105070972 | 3300007788 | Vadose Zone Soil | PQRPEAVMRVLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0099829_109882171 | 3300009038 | Vadose Zone Soil | MRVLETFIVAFIAAVAFMWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0099828_116489871 | 3300009089 | Vadose Zone Soil | MRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0099792_101333201 | 3300009143 | Vadose Zone Soil | MRVLETSIVVLVVMVTFMWVILAHTTPEEHARLHDSMGESTHIAWR*
Ga0105243_129378441 | 3300009148 | Miscanthus Rhizosphere | MRVLETSIVMLVLMVAFLWAILPHTTPEEQARLHDSIGESTHIAWR*
Ga0134121_107117871 | 3300010401 | Terrestrial Soil | MRVLETSIVMLVLMVAFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0138514_1001132152 | 3300011003 | Soil | MRVLETSILMLVLMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR*
Ga0137392_114122731 | 3300011269 | Vadose Zone Soil | MRVLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0137391_100629581 | 3300011270 | Vadose Zone Soil | MRVLETFIVVLIATVAFLWVILAHSTPEEHARLYDAMGDGTHIAWR*
Ga0137391_102242162 | 3300011270 | Vadose Zone Soil | MRVLETFIVVFIAAIAFMWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0137393_110907541 | 3300011271 | Vadose Zone Soil | MRVLETSIVVLVVMVTFMWVILAHTTPEEHARLHDSMGESTH
Ga0137388_109834771 | 3300012189 | Vadose Zone Soil | MRVLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSM
Ga0137383_103006213 | 3300012199 | Vadose Zone Soil | MKVGPAEESVMRVLETFIVVLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137399_103134251 | 3300012203 | Vadose Zone Soil | MRVRETFIVVLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137362_102635763 | 3300012205 | Vadose Zone Soil | MRVLETWIVVLIVAVTFMWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0137380_103726321 | 3300012206 | Vadose Zone Soil | MRGLETFIVVFIAAVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137381_101524193 | 3300012207 | Vadose Zone Soil | MRGLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137376_105720802 | 3300012208 | Vadose Zone Soil | MRVLETSIVMLVLMVAFMWVILAHTTPEEQARLHDS
Ga0137379_103504525 | 3300012209 | Vadose Zone Soil | MKVGPAEESVMRVLETFIVLLIAAVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137377_119665771 | 3300012211 | Vadose Zone Soil | MRVLETFIVVFIAAVAFLWVILAHSTPEEHARLYDAMGEST
Ga0137371_113977391 | 3300012356 | Vadose Zone Soil | MRGLETFIVVFIAAVAFMWVILAHATPEEHARLYDAMEESTHIAWR*
Ga0137360_109015731 | 3300012361 | Vadose Zone Soil | EESVMRGLETFIVVFIAAFAFRWVILAHSTPEEHARLYDAMGGSTHIAWR*
Ga0137361_104351963 | 3300012362 | Vadose Zone Soil | MRVLETFIVVLVAKVAFLWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0137361_111051632 | 3300012362 | Vadose Zone Soil | MRVLETFIVLLIATVAFLWVILAHSTPEEHACLYDAMGDSTHIAWR*
Ga0137361_116004381 | 3300012362 | Vadose Zone Soil | MRVRETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137390_103038944 | 3300012363 | Vadose Zone Soil | ESVMRVLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0137390_117025211 | 3300012363 | Vadose Zone Soil | MRVLETFIVVFIAAVAFMWVILAHSTPEEHARLYDA
Ga0137390_118402392 | 3300012363 | Vadose Zone Soil | GSVMRVLETFIVVLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR*
Ga0137359_102400213 | 3300012923 | Vadose Zone Soil | VIRALETFIVVLVATVAFLWVILAHSTPEEHARLYDAMGDSTHIAWR*
Ga0137359_102996371 | 3300012923 | Vadose Zone Soil | MRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWQ*
Ga0137359_117190171 | 3300012923 | Vadose Zone Soil | MRVLETSIVVLVVMVTFMWVILAHTTTEEQARLHDSMGESTHIAWR*
Ga0137419_102779451 | 3300012925 | Vadose Zone Soil | VLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR*
Ga0137404_112473491 | 3300012929 | Vadose Zone Soil | MRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTH
Ga0180063_10762432 | 3300014885 | Soil | MRVLETFIVVLIATVAFLWVILAYSTSEEHARLHDSIGESTHIAWR*
Ga0187824_100924432 | 3300017927 | Freshwater Sediment | MRVLETLIVVLVVTVAFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0187824_100977082 | 3300017927 | Freshwater Sediment | MRALETLIVVLVVTVAFMWVILAHSTPEEQARLHDSMGESTHIAWR
Ga0187821_100714642 | 3300017936 | Freshwater Sediment | MRVLETLIVVLVGHGRVHVILAHTMPEEHARLYDAMGESTHIAWR
Ga0193722_10600101 | 3300019877 | Soil | MRVLETSIVMLVIMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0210407_100929113 | 3300020579 | Soil | MRVLETLIVLFVVTVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0210399_100450788 | 3300020581 | Soil | MRALETLVVLTIVTVTFLWVILAHTTPEEHARLHDSIGQSTHIAWR
Ga0210400_109493521 | 3300021170 | Soil | MRVLETAIVVLVVTVTFMWVILAHTTPEEQARLHDSMGESTRIAWR
Ga0210408_100860732 | 3300021178 | Soil | MRVLETAIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTRIAWR
Ga0210408_105743562 | 3300021178 | Soil | MRALETLVVLTIATVAFLWVILAHTTPEEHARLHDSIGQSTHIAWR
Ga0207684_100180446 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRALETLVVLTVAIVAFLWVILAHSTPEEHARLHDSLGQSTHIAWR
Ga0207684_100384993 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVSETFIVVLIVAVACLWVILAHSTPEEHARLYDAMGDSTHIA
Ga0207684_105542912 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0207684_106613481 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRGLETFIVVFIAAVAFLWVILAHSTPEEHARLYDAMGDSTHIAWR
Ga0207646_100424404 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MRVLETLIVMLVVTVVFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0207646_100876423 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MRALEALVVLTVATVAFLWVILAHSTPEDQARLHDSLGQRTHIAWR
Ga0208285_10099631 | 3300026005 | Rice Paddy Soil | MRVLETLIVVLVVTVAFMWVILAHTTPEEQARLHD
Ga0209438_10392283 | 3300026285 | Grasslands Soil | MRVRETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGDSTHIAWR
Ga0209158_12732762 | 3300026333 | Soil | MRGLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGESTHIAWR
Ga0257180_10145541 | 3300026354 | Soil | VLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR
Ga0257176_100945113300026361SoilIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR
Ga0257178_101578113300026446SoilMRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGERTHIAWR
Ga0257147_100695733300026475SoilCMRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR
Ga0257147_103290823300026475SoilRVLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0257164_102676533300026497SoilPQRQEAVMRVLETSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0257168_105148723300026514SoilMRVLETFIVVFIAAVAFMWVILAHSTPEEHARLYDAMGDSTHIAWR
Ga0209117_101436243300027645Forest SoilMRVRETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTHIAWR
Ga0209217_107390213300027651Forest SoilMRVLETSIVMLVLMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0209217_114106223300027651Forest SoilMRALETLVVLTVATVAFLWVILAHSTPEEHARLLDSIGQSTHIAWR
Ga0307310_1061034213300028824SoilMRVLETSIVMLVVMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0222749_1005554933300029636SoilMRILETSIVMLVLMVAFMWVILAHTTPEEQARLHDSMGESTRIAWR
(restricted) Ga0255312_111515333300031248Sandy SoilMRALETLIVLAVVTVGFLWAILGHSTPEEHARLYDSMGESTHI
(restricted) Ga0255312_115596813300031248Sandy SoilMRVLETLIVVLVVTAAFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0307469_1046831923300031720Hardwood Forest SoilVVLTVAIVAFLWVILAHSTPEEHARLHDSLGQSTHIAWR
Ga0307469_1219752813300031720Hardwood Forest SoilRPEAVMRVLETSIVVLVVMVAFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0307468_10147649213300031740Hardwood Forest SoilMRVLETSIVVLVVMVAFMWVILAHTTPEEQARLHDSIGESTHNAWR
Ga0307475_1142100113300031754Hardwood Forest SoilRVLETSIVVLVVMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0307473_1016199543300031820Hardwood Forest SoilMRALETLVVLTIATVAFLWVILAHTTPEEHARLHDSIGQSTHVAWR
Ga0307470_1014715223300032174Hardwood Forest SoilMRVLETSIVMLVLMVAFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0307471_10259311913300032180Hardwood Forest SoilMRVLETFIVLLIATVAFLWVILAHSTPEEHARLYDAMGESTQIAWRLGSAGP
Ga0307471_10272195213300032180Hardwood Forest SoilMRILETSIVMLVLMVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0307472_10058728523300032205Hardwood Forest SoilTSIVVLVVMVAFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0307472_10064594513300032205Hardwood Forest SoilTSIVVLVVMVTFMWVILAHTTPEEQARLHDSMGESTHIAWR
Ga0307472_10139709113300032205Hardwood Forest SoilVLETLIVVFVVTVAFMWVILAHTTPEEQARLHDSIGESTHIAWR
Ga0326726_1138500813300033433Peat SoilMRFLETLIVVLVVTVAFMWVILAHTTPEERARLHD




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice