NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F097753

Metagenome / Metatranscriptome Family F097753

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097753
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 72 residues
Representative Sequence MRRRLMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAV
Number of Associated Samples 100
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 2.88 %
% of genes from short scaffolds (< 2000 bps) 0.96 %
Associated GOLD sequencing projects 100
AlphaFold2 3D model prediction Yes
3D model pTM-score0.30

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (97.115 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(17.308 % of family members)
Environment Ontology (ENVO) Unclassified
(27.885 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.769 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 33.65%    β-sheet: 13.46%    Coil/Unstructured: 52.88%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.30
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF00578AhpC-TSA 83.65
PF06778Chlor_dismutase 2.88
PF00239Resolvase 1.92
PF08241Methyltransf_11 0.96
PF00989PAS 0.96
PF01541GIY-YIG 0.96
PF00291PALP 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG3253Coproheme decarboxylase/chlorite dismutaseCoenzyme transport and metabolism [H] 2.88
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 1.92
COG2452Predicted site-specific integrase-resolvaseMobilome: prophages, transposons [X] 1.92


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A97.12 %
All OrganismsrootAll Organisms2.88 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005434|Ga0070709_10207694All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1390Open in IMG/M
3300005435|Ga0070714_100019411All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia5538Open in IMG/M
3300012211|Ga0137377_10089105All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria2913Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil17.31%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil13.46%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil10.58%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.62%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.69%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil4.81%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil3.85%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere3.85%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.88%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.88%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.92%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.92%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.96%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.96%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.96%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.96%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.96%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.96%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.96%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.96%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.96%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil0.96%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.96%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.96%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.96%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.96%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.96%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.96%
Switchgrass, Maize And Mischanthus LitterEngineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter0.96%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2170459017Litter degradation ZMR4EngineeredOpen in IMG/M
3300000579Forest soil microbial communities from Amazon forest - Pasture72 2010 replicate I A01EnvironmentalOpen in IMG/M
3300000890Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005566Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300009036Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-4 metaGHost-AssociatedOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010128Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010142Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011003Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012380Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012398Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012400Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012908Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S089-202R-1EnvironmentalOpen in IMG/M
3300012911Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S088-202R-2EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300014745Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaGHost-AssociatedOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017944Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MGEnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300019279Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021415Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s1EnvironmentalOpen in IMG/M
3300021445Bulk soil microbial communities from the field in Mead, Nebraska, USA - 072115-187_1 MetaGEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025918Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300028714Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_196EnvironmentalOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028720Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_357EnvironmentalOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300028809Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300031082Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_193 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M
3300032211Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
4ZMR_024401102170459017Switchgrass, Maize And Mischanthus LitterMYVVLAAFVLLATGASSTFAARGSGVSTFIFNGRLLADAGSSSSLYVDVNGGNRLALKK
AP72_2010_repI_A01DRAFT_105949223300000579Forest SoilMRRRFTYVALAALMLLAAGASSTFAARGGANATYVFNGRLLVDAGNSPTVYLDVNGGNKIALKK
JGI11643J12802_1017793213300000890SoilMRRRLMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLADAGNSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQ
Ga0062593_10221535323300004114SoilMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSNTQ
Ga0063455_10024457713300004153SoilMRRRLTYVVLAALVLLAAGASSTLAARGSVQTFVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQS
Ga0062589_10206247823300004156SoilVRRQLLYLVLAGLVVLAAGASSTFAAHRSGQLTYVFNGRLITDADSSTSLYVEVSGGTKPALKKLVGQPSAQHFAVG
Ga0062590_10047243913300004157SoilVRRQLLYLVLAGLVVLAAGASSTFAAHRSGQLTYVFNGRLITDADSSTSLYVEVS
Ga0062595_10086336223300004479SoilMRRRLTYVLIAALVLLAAGASSTFAARGSGEVTFVFNGHLLADAGNSSSLYVDINGGNRLALKKLVGLSDNQNFAVGSGTQFLRWSHGVPTV
Ga0066688_1093632623300005178SoilMRRRLTYVVLAALVLLAAGASSTFAARGSVQTFVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQSDNQHFAVDSSTQFLRWSHGVPTVV
Ga0066678_1080479523300005181SoilMRRRLMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSSSLYVDVNGGNRLALKKLVGQSDNQYFAVDSSTQF
Ga0066671_1039657013300005184SoilMRRRLTYVVIAAFVLLATGASSTFAARGSGEQTLVFNGHLLADAGSSSSLYVDVNGGNRLALKKLIG
Ga0066675_1060756023300005187SoilMRRRLTYVALAALVLLATGASSTFAARSGSVTFVFNGRLLADAGNSPSLYVDVNGGNRLALKKLVGQSDSQHFAVNGSTQYLRWSHGVPT
Ga0066675_1108652423300005187SoilMRRRLTYVVLAALVLLAAGASSTFAARGSVQTFVFNGRLLADAGSSTSLYVDVNGGNR
Ga0068869_10069374923300005334Miscanthus RhizosphereMRRRLMYVVLAVFVLLATGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRW
Ga0068868_10208130713300005338Miscanthus RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSLLYVDVNGGNRLALKKLVGQSDNQHFAVDS
Ga0070709_1020769443300005434Corn, Switchgrass And Miscanthus RhizosphereMRRRFTYVVLAALVLLAAGASSTFAAGGEQTYVFNGRLLADAGSSSSVYVDINGGN
Ga0070714_10001941183300005435Agricultural SoilMRRRFTYVVLAALVLLAAGASSTFAAGGEQTYVFNGRLLADAGSSSSLYVDINGGNKPALKKLVGQSDNQYFAVGAGT
Ga0070713_10025069453300005436Corn, Switchgrass And Miscanthus RhizosphereMRRRLTYVLIAALVLLAAGASSTFAARGSGEPTFVFNGHLLADAGNSSSLYVDVFGGNRLGLKK
Ga0070700_10010057653300005441Corn, Switchgrass And Miscanthus RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQH
Ga0070707_10031500553300005468Corn, Switchgrass And Miscanthus RhizosphereMRRRFTYVVLAALVLLAAGASSTFAAGGEQTYVFNGRLLADAGSSSSLYVDINGGN
Ga0070707_10041488143300005468Corn, Switchgrass And Miscanthus RhizosphereMRRRLTYVVLAALVLLAAGASSTFAARGSGEQAYVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQ
Ga0070707_10129285223300005468Corn, Switchgrass And Miscanthus RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDS
Ga0070741_1061606313300005529Surface SoilVRRHLVYLVLAALVAVAATASSSFAAQRSGQITFIFNGRLIADAGNSSALYVDVNGGSKPALRKLVGQPDEQHFAVNGSTQ
Ga0066697_1072983623300005540SoilMRRRLTYVALAALVLLAAGASSTFAARGSGEQTYVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQSDNQHFAVDSSTQFLRWSHGVPTVV
Ga0066704_1070743523300005557SoilMRRRLTYVVLAALVLLAAGASSTFAARGNVQTFVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQSDNQ
Ga0066700_1025781733300005559SoilMRRRLTYVALAALVLLAAGASSTFAARGSGEQTYVFNGRLLADAGSSTSLYVDVNGGNR
Ga0066693_1014286333300005566SoilMRRRLTYVALAALVLLAAGASSTFAARGSGNNTYVFNGRLLADAGNSPTLYVDVNGGNRIALKKLVGLSDNQNFA
Ga0066703_1081707723300005568SoilMRRRLTYVVLAALVLLAAGASSTFAAGGNVQTFVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQSD
Ga0068860_10173826913300005843Switchgrass RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRWSH
Ga0066656_1047967523300006034SoilMRRRLTYVALAALVLLATGASSTFAARSGSVTFVFNGRLLADAGNSPSLYVDVNGGN
Ga0066652_10057500913300006046SoilMRKVLLYVGLVAVVALAAGISTTLAARGSATRAYVFNGRLLADAGNSPTLFVEVTGGNRIAL
Ga0070712_10197084413300006175Corn, Switchgrass And Miscanthus RhizosphereMRRRLTYVLIAALVLLAAGASSTFAARGSGEVTFVFNGHLLADAGNSSSLYVDINGGNRLALKKLVGLSDNQNFAVGSGTQFLRWSHGV
Ga0079222_1041967813300006755Agricultural SoilMRRRLTYVLIAALVLLAAGASSTFAARGSGEPTFVFNGHLLADAGNSSSLYVDVFGGNRLGLKKLV
Ga0105244_1020050033300009036Miscanthus RhizosphereMYLALAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAV
Ga0111539_1312870023300009094Populus RhizosphereMRRRLTYVALAALVLLAAGASSTFAARGSGNVTFVFNGRLLSDAGNSSSLY
Ga0105245_1127696813300009098Miscanthus RhizosphereMRRRLTYVLIAALVLLAAGASSTFAARGSGEPTFVFNGHLLADAGNSSSLYVDVFGGNRLGLKKLVGQSDSQHFAVDSNTQYLRWSHGVPTVV
Ga0105249_1287608113300009553Switchgrass RhizosphereMRRRLTYVLIAALVLLAAGASSTFAARGSGEPTFVFNGHLLADAGNSSSLYV
Ga0127486_107265623300010128Grasslands SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSSSLYVDVNGGNRLALKKLVGQSD
Ga0127483_128262623300010142Grasslands SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQYFAVDSSTQF
Ga0134065_1033449833300010326Grasslands SoilMRRRFTYLVLAALVLLAAGASSTFAARSGNVTYQFNGRLLADAGGSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSNTQY
Ga0134062_1044985113300010337Grasslands SoilMRRRFTYFVIAALVLVAAGASSTLAAGGGNLTYMFNGRLLTDAGNSSSLYVDVNGGNKPALRKLLGQSDNQY
Ga0126372_1311392213300010360Tropical Forest SoilMRRRLTYVALAALVLLAAGASSTFAARSGSVTYVFNGRLLADAGNASSLYV
Ga0134126_1298519613300010396Terrestrial SoilMRRRFTYVVLAALVLLAAGASSTFAAGGEQTYVFNGRLLADAGSSSSVYVDINGGNKPALKK
Ga0126383_1112970013300010398Tropical Forest SoilMRRRLTYVALAALVLLAAGASSTFAARSGSVTYVFNGRLLADAGDSPSLYVDVNGGNRIALKKL
Ga0138514_10002122733300011003SoilMFVFLAALMVFAVGASSTLAAHRGGMQTYIFNGRLLADAGNSSSLYVDVNGGNKPALKKLVGQSD
Ga0137382_1009829213300012200Vadose Zone SoilMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNKLALKK
Ga0137399_1171088023300012203Vadose Zone SoilMRRRFTYVVLAALVLLAAGASSTLAAGGEQTYVFNGRLLADAGSSSSVYVDINGGNKPALKKLVGQSDNQYFAVGAGTQ
Ga0137376_1148018323300012208Vadose Zone SoilMRRRFTYVILAAVVVLAAGASSTLAAGRGEQTYIFNGRLLADAGSSTSLYVDINGGNKPALKKLVGQSDSQYFAVG
Ga0137377_1008910563300012211Vadose Zone SoilMRRRFTYVVLAALVLLAAGASSTLAAGGEQTYVFNGRLLADAGSSSSLYVDINGGNKPAL
Ga0137372_1103472523300012350Vadose Zone SoilMRRHFTYLVLVALVLLAAGASSTLAAGGELTYVFNGRLLADAGNSSSLYVDVNGGNRPALRKLVGQSDNQHFAVGSETQYLRWSHGVPTVV
Ga0137366_1008190243300012354Vadose Zone SoilMRRRFTYVVLAALVLLAAGASSTLAAGRGEQTYVFNGRLLADAGSSTSLYVDINGGNKPALKKL
Ga0137369_1076545123300012355Vadose Zone SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSQSLYVKVNGDNTIAHKKLVGQSDNQYFAADSSPQ
Ga0137371_1113256313300012356Vadose Zone SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSSSLYVDVNGGNRLALKKLVGQSDNQYFAVDSSTQFLRWSHGVPTVV
Ga0134047_120877723300012380Grasslands SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKK
Ga0134051_117986013300012398Grasslands SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSSSLYVDVNGGNRLALKKLVGQSDNQYFAVDSSTQF
Ga0134048_105066323300012400Grasslands SoilMYVVLAAFVLLAAGASSTFAAQGSGLQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVG
Ga0137373_1104504513300012532Vadose Zone SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSPSLYVDVNG
Ga0137397_1013919853300012685Vadose Zone SoilMRRRFTYVVLAALVLLAAGASSTLAAGGEQTYVFNGRLLADAGSSSSVYVDINGGNKPAL
Ga0157286_1031267923300012908SoilMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGN
Ga0157301_1045571623300012911SoilMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGTSSSLYVEINGGNRLALKKLVGQSDNQHFAVDSNTQYLRWSHGVP
Ga0134110_1026761023300012975Grasslands SoilMRRRLTYVALAALVLLAAGASSTFAARSGSVAYMFNGRLLADAGNSPTLYVDVNGGNRIALKKLVGLSDNQNFAVGAG
Ga0134087_1026783713300012977Grasslands SoilMRRRLTYVVLAALVLLAAGASSTFAARGSVQTFVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQSDNQSFA
Ga0164308_1087530413300012985SoilMYVALAAFALLAVGASSTLAARGGQQTYIFNGRLLADAGNSSSLYVDINGGNKPALKKLVGQNDQQSF
Ga0164306_1036580233300012988SoilMRRRFTYVVLAALVLLAAGASSTFAAGGQQTYVFNGRLLADAGSSTSLYVDINGGNKPALKKLVGQSDNQYFAVGAGTQYLRWAHG
Ga0157377_1069710113300014745Miscanthus RhizosphereMYLALAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVFGGNRL
Ga0134072_1004976533300015357Grasslands SoilMRRRFAYFVIAVLVLVAAGASSTLAAGEGNLTYVFNGRLLTDAGNSSSLYVDVNGGNKPALRKLLGQSDNQYFAVGSGTQYLRWSHGVP
Ga0132256_10153403923300015372Arabidopsis RhizosphereMRRRLTYVALAALVLLAAGASSTFAARSGSVTYVFNGRLLAHAGHSSSLYVDVN
Ga0132255_10129316813300015374Arabidopsis RhizosphereMRRRLTYVALAALVLLAAGASSTFAARSGSVTYVFNGRLLADAGNSSSLYVDVNGGNRIALKKLVGMNDSQNFAVGG
Ga0187786_1063109913300017944Tropical PeatlandMRKHILPLALLTLVMLAVGASSTFAAGSGANFNYVFNGRLLADAGSSPTLAVHVNGGNKAAIKKLIGK
Ga0184621_1009806813300018054Groundwater SedimentMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLR
Ga0066655_1016867333300018431Grasslands SoilMRRRFTYLVLAALVLLAAGASSTFAARSGNVTYQFNGRLLADAGGSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSNTQYL
Ga0184642_162648323300019279Groundwater SedimentMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRWS
Ga0193747_113384613300019885SoilMRRRLAYIALAALIALAAGASATFAAQRSGELTYVFNGKLLADAGNSTSLYVEVHGGSRPALKKLVGESQNQHFA
Ga0179594_1003182313300020170Vadose Zone SoilMRRRLMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLTDAGSSSSLYVDVNGGNRLALKKLVGQRDNQYFAVDSSTQFLRW
Ga0193694_105445913300021415SoilMRRRLMYVVLAAFVLLAAGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAV
Ga0182009_1039075823300021445SoilMRRRLVYVVLAALIVVVAAASSSLAAQRGGDVTYVFNGRLLADAGNSPS
Ga0126371_1207396513300021560Tropical Forest SoilMRRRFTYVALAAFVLLAAGASSTFAARSGSVTYVFNGRLLADAGNSSTLYVDVNGGNKIALKKLVGLSDSQNFAVGAG
Ga0222622_1018290713300022756Groundwater SedimentMRRRLMYVVLAAFVLLATGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYIDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRWSHGVPTVV
Ga0207647_1044694913300025904Corn RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGG
Ga0207684_1147994513300025910Corn, Switchgrass And Miscanthus RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSNTQFLR
Ga0207662_1119928813300025918Switchgrass RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNG
Ga0207686_1131713123300025934Miscanthus RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKL
Ga0207709_1098189723300025935Miscanthus RhizosphereMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNR
Ga0209468_119522713300026306SoilMRRRLTYVALAALVLLAAGASSTFAARSGSVAYMFNGRLLADAGNSSALYVDVNGGNRLALKKLVGQ
Ga0209378_118840623300026528SoilMRRRLTYVALAALVLLAAGASSTFAARGSGEQTYVFNGRLLADAGSSTSLYVDVNGGNRPALKKLVGQSDNQHF
Ga0209807_118592823300026530SoilMRRRFTYFVIAALVLVAAGASSTLAAGGGNLTYMFNGRLLTDAGNSSSLYVDVNGGNKPA
Ga0209376_124480123300026540SoilMRRRFTYFVLAALVVLAAGASSTFAARDGEQTYVFNGRLLADAGSSTSLYVDINGGNRPALKKLVGQSD
Ga0209577_1002545383300026552SoilMRRRLTYVVIAAFVLLATGASSTFAARGSGEQTLVFNGHLLADAGSSSSLYVDVNGGNRLALKKLIGLSDNQYFAVDSSTQFLRWSH
Ga0209073_1022864223300027765Agricultural SoilMRRRLTYVLIAALVLLAAGASSTFAARGSGEVTFVFNGHLLADAGNSSSLY
Ga0209177_1037859523300027775Agricultural SoilMRRRLTYVALAALVLLAAGASSTFAARGSGNVTFVFNGRLLSDAGNSSSLYVDVNGGN
Ga0209814_1029195223300027873Populus RhizosphereMRRRLTYVLIAALVLLAAGASSTFAARGSGEVTFVFNGHLLADAGNSSSLYVDINGGNRLALKKLVGLSDSQNFAVGSGTQFLRWSHGVPTVVA
Ga0247822_1090350223300028592SoilMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSPSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLR
Ga0307309_1009414823300028714SoilMRRRLMYVALAAFALLAVGASSSLAARGGQQTYIFNGRLLADAGNSPSLYV
Ga0307307_1030217713300028718SoilMRRRLMYVVLAAFVLLATGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAV
Ga0307317_1027326113300028720SoilMRRRLMYVVLAAFVLLATGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRWSHGVPTVV
Ga0307320_1002732013300028771SoilMRRRLMYVVLAAFVLLATGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRWSHGVPTVVA
Ga0307320_1045757423300028771SoilMRRRLMYVALAAFALLAVGASSSLAARGGQQTYIFNGRLLADAGNSPSLYVDVNGGNKLALKKL
Ga0247824_1108136713300028809SoilMRRRVMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRW
Ga0307292_1008820433300028811SoilMRRRLMYVVLAAFVLLATGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFA
Ga0307277_1013237733300028881SoilMRRRLMYVVLAAFVLLAVGASSTFAARGSGVQTFIFNGRLLADAGSSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSSTQFLRWSHGVPTVVAE
Ga0308192_104818013300031082SoilMRRRLMYVALAAFALLAVGASSSLAARGGQQTYIFNGRLLADAGNSPSLYVDVNGGNKLALKKLVGHNDEQSF
Ga0310885_1035437813300031943SoilMRRRLTYVLIAALVLLAAGASSTFAARGSGEPTFVFNGHLLADAGNSSSLYVDVFGGNRL
Ga0308176_1282036923300031996SoilMRRRFTYVVLAALVLLAAGASSTFAARSGNVTYQFNGRLLADAGGSSSLYVDVNGGNRLALKKLVGQSDNQHFAVDSNTQYLRWSHGVPTVVA
Ga0310896_1052308423300032211SoilMRRRLTYVLIAALVLLAAGASSTFAARGSGEPTFVFNGHLLADAGNSSSLYVDVFGGNRLGLKKLVGQS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.