NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097983


Overview

Basic Information
Family ID F097983
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 62 residues
Representative Sequence DHVGHTVTITGVVSNATLHGMKEDAKAEAKEHGVDKHSTEHGHMTVTNLTMVSDSCKK
Number of Associated Samples 83
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 20.19 %
% of genes from short scaffolds (< 2000 bps) 20.19 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.23


Most Common Taxonomy
Group Unclassified (80.769 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil (22.115 % of family members)
Environment Ontology (ENVO) Unclassified (28.846 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (62.500 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 18.60%, β-sheet: 0.00%, Coil/Unstructured: 81.40%

Predicted 3D Structure

pTM-score: 0.23

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF05532 | CsbD | 6.73
PF04226 | Transgly_assoc | 3.85
PF12732 | YtxH | 2.88
PF01566 | Nramp | 1.92
PF05974 | DUF892 | 1.92
PF06348 | DUF1059 | 1.92
PF01268 | FTHFS | 1.92
PF03602 | Cons_hypoth95 | 1.92
PF01145 | Band_7 | 0.96
PF01564 | Spermine_synth | 0.96
PF08281 | Sigma70_r4_2 | 0.96
PF10944 | DUF2630 | 0.96
PF02276 | CytoC_RC | 0.96
PF13545 | HTH_Crp_2 | 0.96
PF00011 | HSP20 | 0.96
PF01435 | Peptidase_M48 | 0.96
PF11154 | DUF2934 | 0.96
PF02368 | Big_2 | 0.96
PF08327 | AHSA1 | 0.96
PF04909 | Amidohydro_2 | 0.96
PF10099 | RskA | 0.96
PF00999 | Na_H_Exchanger | 0.96
PF02518 | HATPase_c | 0.96
PF13852 | DUF4197 | 0.96
PF01204 | Trehalase | 0.96
PF13502 | AsmA_2 | 0.96
PF00933 | Glyco_hydro_3 | 0.96
PF01988 | VIT1 | 0.96
PF07690 | MFS_1 | 0.96
PF01814 | Hemerythrin | 0.96
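
The frequencies above are percentages of the 104 family scaffolds, so they can be turned back into approximate scaffold counts. The following is a minimal sketch, not part of NMPFamsDB: it assumes each value is 100 × count / 104 rounded to two decimals, and the helper name `pct_to_count` is hypothetical.

```python
# Hypothetical helper (not an NMPFamsDB API): recover approximate scaffold
# counts from the "% Frequency" column, assuming pct = 100 * count / total.
TOTAL_SCAFFOLDS = 104  # "Number of Associated Scaffolds" for family F097983


def pct_to_count(pct: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Return the nearest whole number of scaffolds for a percentage."""
    return round(pct * total / 100)


# A few frequencies copied from the Pfam table above.
pfam_pct = {
    "CsbD": 6.73,
    "Transgly_assoc": 3.85,
    "YtxH": 2.88,
    "Nramp": 1.92,
    "Band_7": 0.96,
}

pfam_counts = {name: pct_to_count(pct) for name, pct in pfam_pct.items()}
print(pfam_counts)
```

Under that assumption, a frequency of 0.96 corresponds to a single scaffold and 6.73 (CsbD) to seven scaffolds.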

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG3237 | Uncharacterized conserved protein YjbJ, UPF0337 family | Function unknown [S] | 6.73
COG2261 | Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein family | General function prediction only [R] | 3.85
COG0742 | 16S rRNA G966 N2-methylase RsmD | Translation, ribosomal structure and biogenesis [J] | 1.92
COG1092 | 23S rRNA G2069 N7-methylase RlmK or C1962 C5-methylase RlmI | Translation, ribosomal structure and biogenesis [J] | 1.92
COG1914 | Mn2+ or Fe2+ transporter, NRAMP family | Inorganic ion transport and metabolism [P] | 1.92
COG2242 | Precorrin-6B methylase 2 | Coenzyme transport and metabolism [H] | 1.92
COG2265 | tRNA/tmRNA/rRNA uracil-C5-methylase, TrmA/RlmC/RlmD family | Translation, ribosomal structure and biogenesis [J] | 1.92
COG2759 | Formyltetrahydrofolate synthetase | Nucleotide transport and metabolism [F] | 1.92
COG2890 | Methylase of polypeptide chain release factors | Translation, ribosomal structure and biogenesis [J] | 1.92
COG3685 | Ferritin-like metal-binding protein YciE | Inorganic ion transport and metabolism [P] | 1.92
COG0025 | NhaP-type Na+/H+ or K+/H+ antiporter | Inorganic ion transport and metabolism [P] | 0.96
COG0071 | Small heat shock protein IbpA, HSP20 family | Posttranslational modification, protein turnover, chaperones [O] | 0.96
COG0475 | Kef-type K+ transport system, membrane component KefB | Inorganic ion transport and metabolism [P] | 0.96
COG1472 | Periplasmic beta-glucosidase and related glycosidases | Carbohydrate transport and metabolism [G] | 0.96
COG1626 | Neutral trehalase | Carbohydrate transport and metabolism [G] | 0.96
COG1633 | Rubrerythrin, includes spore coat protein YhjR | Inorganic ion transport and metabolism [P] | 0.96
COG1814 | Predicted Fe2+/Mn2+ transporter, VIT1/CCC1 family | Inorganic ion transport and metabolism [P] | 0.96
COG3004 | Na+/H+ antiporter NhaA | Energy production and conversion [C] | 0.96
COG3263 | NhaP-type Na+/H+ and K+/H+ antiporter with C-terminal TrkAC and CorC domains | Energy production and conversion [C] | 0.96
COG4651 | Predicted Kef-type K+ transport protein, K+/H+ antiporter domain | Inorganic ion transport and metabolism [P] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 80.77 %
All Organisms | root | All Organisms | 19.23 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002245|JGIcombinedJ26739_101833698 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 508
3300004100|Ga0058904_1341163 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 511
3300004631|Ga0058899_10101419 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 550
3300006172|Ga0075018_10197194 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter kueseliae | 953
3300009088|Ga0099830_11392622 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 583
3300009089|Ga0099828_11458608 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 604
3300011120|Ga0150983_10324219 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1461
3300011120|Ga0150983_14420023 | All Organisms → cellular organisms → Bacteria | 1541
3300011120|Ga0150983_15441463 | All Organisms → cellular organisms → Bacteria | 678
3300012961|Ga0164302_10527995 | Not Available | 839
3300021168|Ga0210406_10927972 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 653
3300021401|Ga0210393_11008078 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 674
3300024182|Ga0247669_1025948 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 996
3300027678|Ga0209011_1204528 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 536
3300027882|Ga0209590_10091470 | All Organisms → cellular organisms → Bacteria | 1800
3300027910|Ga0209583_10386945 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 662
3300029636|Ga0222749_10263911 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 880
3300031753|Ga0307477_10194938 | All Organisms → cellular organisms → Bacteria | 1410
3300032174|Ga0307470_10061870 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1976
3300032180|Ga0307471_101207334 | All Organisms → cellular organisms → Bacteria | 920
3300032180|Ga0307471_102973446 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 601




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 22.12%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 19.23%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 15.38%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.73%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.81%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 3.85%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.85%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.88%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 2.88%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.92%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.96%
Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil | 0.96%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.96%
Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Soil | 0.96%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 0.96%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.96%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.96%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.96%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.96%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.96%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001083 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 | Environmental
3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004100 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF244 (Metagenome Metatranscriptome) | Environmental
3300004101 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF228 (Metagenome Metatranscriptome) | Environmental
3300004118 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF208 (Metagenome Metatranscriptome) | Environmental
3300004119 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF210 (Metagenome Metatranscriptome) | Environmental
3300004134 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF248 (Metagenome Metatranscriptome) | Environmental
3300004136 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF214 (Metagenome Metatranscriptome) | Environmental
3300004139 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome) | Environmental
3300004631 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome) | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental
3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005994 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049 | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009524 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_c_BC metaG | Environmental
3300009683 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_b_LC metaG | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300018007 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5 | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300024182 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10 | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026116 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) | Host-Associated
3300026118 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes) | Host-Associated
3300026377 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-B | Environmental
3300026502 | Peat soil microbial communities from Stordalen Mire, Sweden - H.B.S.T-25.r1 | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027678 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes) | Environmental
3300027681 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027826 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes) | Environmental
3300027854 | Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300030114 | I_Fen_E2 coassembly | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032515 | FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1027J12803_1023565123 | 3300000955 | Soil | HVGHTVTITGTVSNAEMHGMKEDAKAEAKEHGMDQDSTEHGHMTVKNVKMVSASCKQ*
JGI12678J13193_10057901 | 3300001083 | Forest Soil | TSDAVKLGDHVGHTVTITGVVSNAAMHGMKEDTKAEAKEHGVGKHSTEHGSMTVTALTMVSESCQK*
JGI12627J18819_103546891 | 3300001867 | Forest Soil | ALNQHVGHTVRVTGAVSNATMHGAKEDAKEKASEHGVGDSTEHGHMTVTSLKMVSESCSH
JGIcombinedJ26739_1018336981 | 3300002245 | Forest Soil | IKSDSLKLGDHIGHTVTITGVVSNAKMHGMKEDTKAEAKEHGMKKDSTEHGHLTATDVTMVSDTCKK*
Ga0058904_13411631 | 3300004100 | Forest Soil | WEVKSDSVKLAPHVGHTMTITGVVSNATMHGVKEDAKSEAKEHGVDKDSTEHGHMTVTNVKMVSNSCQN*
Ga0058896_10268661 | 3300004101 | Forest Soil | DTVKLGNHVSHTVTITGVVSNAMMHGAKEDVKDEAKEHGMDKNSTEHGHMTVTYLKMVSESCSN*
Ga0058886_13707312 | 3300004118 | Forest Soil | NGHKLSNHVGHTVTITGVVSNAELHGAKEDAKSEAKEHGMDKESTEHGHLKVTNLKMVSDSCKQ*
Ga0058887_14959912 | 3300004119 | Forest Soil | HSVTITGVVSNAKMHGMKEDAKAEAKEHGMDKDSTEHGSITVTNLTMVSDTCKK*
Ga0058906_13297651 | 3300004134 | Forest Soil | DHVSHTVTITGVVSNATLHGAKEDVKDEAKEHGMAKDSTEHGHMTVTHLKMVNDGCSK*
Ga0058889_14354971 | 3300004136 | Forest Soil | LKLGDHVGHTVTITGVVSNAKMHGMKEDAKAEAKEHGMGKNSTEHGHMTVTNLTMVSDTCKK*
Ga0058897_100124952 | 3300004139 | Forest Soil | VKLGDHVGHSVTITGVVSNAKMHGMKEDAKAEAKEHGMDKDSTEHGSITVTNLTMVSDTCKK*
Ga0058897_100431301 | 3300004139 | Forest Soil | DHVGHTVRITGVVSNAKMHGMTEDAKDEAREHGVAKNSTEHGHMTVTYVKMVSDSCSK*
Ga0058899_101014191 | 3300004631 | Forest Soil | WEVKSDSVDLAKHVGHTITVTGAVQNAALHGAKEDTKAAAKEHGVDKNATEHGHMIVTNLKMVSGSCEK*
Ga0070714_1021164031 | 3300005435 | Agricultural Soil | KLAPHVGHTITVTGVVANAALHGTKEDVKGEAREHGVDKNSTEHGHMTVTNAKMVSDSCQN*
Ga0070713_1008364642 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | KSDAVKLAPHVGHTMTITGVVSNATLHGAKEDVKGEAREHGVDKNSTEHGHMTVTNAKMISDSCQN*
Ga0073909_106088912 | 3300005526 | Surface Soil | KLGDHVGHTVTITGVVANAKMHGMKEDAKAEAKEHGMKKDSTEHGHLTATDVTMVSDTCKK*
Ga0070731_103202513 | 3300005538 | Surface Soil | LAKHLGHTVTVTGAVSNAALHGAKEDTKAEAREHGVDKSSTEHGHMTVTNLKMVSGSCEK
Ga0066706_109475522 | 3300005598 | Soil | HVGHTVKITGVVANAAAHGMKEDAKSEMKEHGMEKHATEHGHMTVTYLAMVSDSCKK*
Ga0066789_100423723 | 3300005994 | Soil | SDAVKLGEHVGHSVRITGVVSNATMHGAKEDAKAEAKEHGVGENSAEHGHLTATGLKMVSESCSQ*
Ga0070717_115259021 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | KLDEHVGHSVAVTGVVSHHKEHAMKEDAKAEMKEHGMDKDAKEHGHMTATALTMVSDSCQK*
Ga0075017_1016639771 | 3300006059 | Watersheds | EIKSENVKLADHIGHTVTITGVVANAEMHGMKEDAKGEAKEHGMDKDSTEHGHMTVTNLKMVSESCKK*
Ga0075018_101971943 | 3300006172 | Watersheds | DSVNLGQHVGHTVKIVGVVSNAKAHGMKEDAKAEMKEHGMEKHTAERGHMTVTDLTMVSDTCQK*
Ga0079221_111246612 | 3300006804 | Agricultural Soil | VKMAPHVGHTVTITGVVSNATAHGLKEDTKTEAREHGIDKDSTEHGHMTVTSLKMVSDSCSR*
Ga0079220_100351621 | 3300006806 | Agricultural Soil | LHSDSLKLGDHVGHTVTITGVVSNATLHGAKEDAKAEAKEHGVDKDSTEHGHMTITYLKMVSDTCSK*
Ga0079220_117659212 | 3300006806 | Agricultural Soil | TLKLGDHVGHKVTITGVVSNAKMHGMKEDVKAEAKEHGMDKDSTEHGHMTVTYLKMVSDTCSK*
Ga0099830_105528891 | 3300009088 | Vadose Zone Soil | KSDTVKLGDHVAHTVTITGVVSNATLHGAKEDAKAEAKEHGMDKSSTEHGHMTVTHLKMVSDSCSK*
Ga0099830_113926222 | 3300009088 | Vadose Zone Soil | STWEIKSDSLKLGDHVGHKVAVTGVVTNAKMHGMKEDAKDEAKEHGVATHSTEHGHMTVTNLSMVSESCQK*
Ga0099828_114586082 | 3300009089 | Vadose Zone Soil | IKSDSVRLGPHVEHTMTITGVVSNATLHGVKEDVKAEAKEHGVAKNSTEHGHMTVTNAKMVSDSCQN*
Ga0116225_11684921 | 3300009524 | Peatlands Soil | DHVGHTVTITGVVSNATMHGMKEDAKAEAKEHGVDKGSTEHGHMAVTNLKMVSDSCKK*
Ga0116224_101046131 | 3300009683 | Peatlands Soil | LVDHVGHTVTITGVVSNATMHGMKEDAKAEAKEHGVDKGSTEHGHMAVTNLKMVSDSCKK
Ga0126370_119949832 | 3300010358 | Tropical Forest Soil | VGHTVRVTGVVRNAEAHGMKEDAKEKAAEHGVDKNETEHGHMEVTAVKHVSESCKK*
Ga0126378_103444132 | 3300010361 | Tropical Forest Soil | MASHVGHTVTVTGVVSNATLHGLKEDAKTEAREHGIDKNSTEHGHMTVTGLKMVSENCLRGRSG*
Ga0126378_124211282 | 3300010361 | Tropical Forest Soil | VKMAPHVGHTVTVTGVVPNATAHGVKEDAKSEAREHGVAKNSTEHGHMTVTSLKMVSKSCS*
Ga0150983_103242191 | 3300011120 | Forest Soil | WELHSDSLKLGDHVGHTVTITGVVSNAKMHGMKEDAKAEAKEHGVAKHSTEHGSITVTDLSMVSDSCKK*
Ga0150983_105230924 | 3300011120 | Forest Soil | DIVKLGDHVSHTVTITGVVSNATLHGAKEDVKDEAKEHGMGKDSTEHGHMTVTYLKMVSDSCSK*
Ga0150983_107097481 | 3300011120 | Forest Soil | HVSHTVTITGVVSNATLHGAKEDVKAEAKEHGVDKNSTEHGHMTVTYLKMVSDGCSK*
Ga0150983_116015991 | 3300011120 | Forest Soil | VKLAPHVGHTITVTGVVSNATLHGVKEDVKAEAKEHGVDKDSTEHGHMTVTSAKMVSNSCQK*
Ga0150983_135568441 | 3300011120 | Forest Soil | HVGHAVTITGVVSNATLHGLKEDVKGEAKEHGVGKASTEHGSVTVTNLTMVSNSCRK*
Ga0150983_144200231 | 3300011120 | Forest Soil | TWEIKSDTVKLGNHVSHTVTITGVVSNAMMHGAKEDVKDEAKEHGMDKNSTEHGHMTVTYLKMVSESCSN*
Ga0150983_154414632 | 3300011120 | Forest Soil | WEIKSDTVKLGNHVGHTVRITGVLPNATLHGVKEDAKAEAKEHGVDKNSTEHGHMTVTYLKMVSESCSK*
Ga0137393_112816992 | 3300011271 | Vadose Zone Soil | LGDHVGHTVKITGVVSNAAMHGVKEDAKDSAREHGMGKNSTEHGHLTATNLTMVSDSCTK
Ga0137389_104019962 | 3300012096 | Vadose Zone Soil | LELKRNGHRLSNHVGHTVTITGVVSNAELHGAKEDAKAEAKEHGMDKDSTEHGHMKVTNLKMVSDSCKP*
Ga0137388_102077952 | 3300012189 | Vadose Zone Soil | DLEIKRDTVNLGEHIGHTVTMTGVVYHAKIHAMKEDVKDEAREHGVDKNSAERGHITVTDLKMVSDSCQK*
Ga0137399_102961702 | 3300012203 | Vadose Zone Soil | LGDHVGHTVRITGVVSNAAMHGVKEDAKDAAKEHGMGKNSTEHGHLTVVNLTMVSDTCKK
Ga0137362_101334661 | 3300012205 | Vadose Zone Soil | DDHVGHTVAVTGVVSHHKAHAMKEDVKAEMKEHGMDKDAKEHGHMTVTALKMVSDTCQK*
Ga0137381_102198273 | 3300012207 | Vadose Zone Soil | GDHVGHTVKITGVVSNAKMHGMKEDTKEEMKEHGMNKHATESGHLTVTDLSMVSDSCQ*
Ga0137358_104722742 | 3300012582 | Vadose Zone Soil | EHVGHTVSVTGVVSHAKMHGMKEDAKDEAKEHGVDKDATEHGHMTVTNLSMVSESCHQ*
Ga0137413_102644952 | 3300012924 | Vadose Zone Soil | GEHVGHTVKVTGVVSNAAAHGAKEDAKEKASEHGVGDAAEHGHMTVTGLKMVSESCSK*
Ga0137419_113636642 | 3300012925 | Vadose Zone Soil | VGHTVRITGVVSNAAMHGVKEDAKDAAKEHGMGKNSTEHGHLTVVNLTMVSDTCKK*
Ga0137416_115784251 | 3300012927 | Vadose Zone Soil | IKSDAVKLGEHIGHTVRITGVVSNATAHGAKEDAKAEAKEHGVGENSTEHGHLTATGLKMVSESCSQ*
Ga0137410_113619281 | 3300012944 | Vadose Zone Soil | HVGHTVTVTGVVSNAALHGLKEDAKEEAKEHGMDKDAKEHGHMTVTDVKMVSESCKK*
Ga0164302_105279952 | 3300012961 | Soil | KSDSVDLAPHVGHTVTVTGVVANAEMHGMKEDAKEEAKEHGMDKKAEEHGHLTATAVKMVSESCKK*
Ga0157379_101345021 | 3300014968 | Switchgrass Rhizosphere | SVKLDEHVGHTVKITGVVANATAHGMKEDTKEEMKEHGMNKHDTEHGHMTVTDLTMVSDSCQK*
Ga0132256_1003165433 | 3300015372 | Arabidopsis Rhizosphere | LDEHVGHTVKIVGVVSNATTHGMKEDTKEEMKEHGMDKNSTEHGHMTVTDLTMVSGSCQK
Ga0187821_102225473 | 3300017936 | Freshwater Sediment | KSDSVDLAKHVGHTITVTGAVQNAALHGAKEDTKAEAKEHGVGKNSTEHGHMTVTNVKMVSNGCEK
Ga0187805_105884681 | 3300018007 | Freshwater Sediment | DSVDLAKHVGHTITVTGAVQNAALHGAKEDTKAEAKEHGVGKNSTEHGHMTVTNIKMVSGSCEK
Ga0210403_100070051 | 3300020580 | Soil | LDEHVGHTVTVTGVVAHHKAHAMKEDTKAEMKEHGMDKGAKEHGHLTVADLTMVSDTCPK
Ga0215015_100008661 | 3300021046 | Soil | VTITGVVSNAKMHGMKEDAKAEAKEHGMDKDSTEHGSITVTNLTMVSDTCKK
Ga0210406_109279722 | 3300021168 | Soil | AKGSTWELKSNGHKLSHHVGHTVTITGVVSNAEMHGAKEGVKAEAKEHDIDQDSTEHGHMKVTNLKMVSDSCKR
Ga0210406_112928072 | 3300021168 | Soil | NGHNLSHHVGHTVTVTGVVSNPEMHGAKEDAKAEAKEHDLDKDSTEHGHLKVTNLKMVSDSCNGSH
Ga0210400_100775521 | 3300021170 | Soil | HTITVTGVVSNAAMHGAKEDAKEKAKEHGVAKNSTERGHMTVTNAKMVSDSCSN
Ga0210405_101109181 | 3300021171 | Soil | DSVDLAKHVGHTVTVTGAVSNAAVHGAKEDAKDEAREHGVDKNSTEHGHMTVTNLKMVGSSCEK
Ga0210405_102894853 | 3300021171 | Soil | GEHVGHTVTITGVVSNATAHGMKEDTKEEMKEHGMDKHATEHGHLTATALTMVSDTCQK
Ga0210408_101163583 | 3300021178 | Soil | DSIDFAKHVGHTITVTGAVQNATLHGAKEDTKAEAKEHGVGKNSTEHGHMTATNLKMVSGSCEK
Ga0210408_102317431 | 3300021178 | Soil | IKSDSVKLAPHVGHTMTITGVVSNATLHGVKEDVKSEAKEHGVDKDSTEHGHMTVTNAKMVSDSCQN
Ga0210393_110080782 | 3300021401 | Soil | GSTWEVKSDSVALAPHVGHTITTTGAVSNAALHGAKEDAKEEAKEHGVAKNASEHGHMTVTNLKMISKSCEK
Ga0210389_109525272 | 3300021404 | Soil | DSVKLDEHVGHTVTVTGVVAHHKAHAMKEDTKAEMKEHGMDKGAKEHGHLTVSDLTMVSDTCPK
Ga0210394_106883991 | 3300021420 | Soil | GHTMTITGVVFNATLHGVKEDAKSEAKEHGFDKDSTEHGHMTVTNAKMVSDSCQN
Ga0210384_1000197930 | 3300021432 | Soil | TVRLGEHVGHTVRITGVVSNATLHGAKEDAKDEAKEHGVAKGSTEHGHLTVTNLSMVSEGCKP
Ga0210384_112343671 | 3300021432 | Soil | TVTGAVSNAALHGAKEDAKDEAREHGVDKNSTEHGHMTVTNLKMVGSSCEK
Ga0210402_106929041 | 3300021478 | Soil | HVGHTMTVTGVVSNATLHGVKEDAKSEAKEHGVDKDSTEHGHMTVTNAKMVSGSCQN
Ga0210402_117744442 | 3300021478 | Soil | VPLTEHVGHTVRVTGVVSNAKMHGAKEDAKAEAKEHGVGENSTEHGHMTVTGLKMVSESCSQ
Ga0247669_10259482 | 3300024182 | Soil | WEIKSDTVKLGDHVAHTVTITGVVANATLHGMKEDAKAEAKEHGVDKDSTEHGHMTVTHLKMVSDSCSK
Ga0207699_103304191 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | ALSEHVGHTVKVTGVVSNAAAHGAKEDAKEKAAEHGVGDSAEHGHMTVTGLKMVSESCSK
Ga0207684_106211871 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | SDALRLGDHVGHTVTITGVVSNAAMHGMKEDAKAEAKEHGVDKHSTEHGHMTVTNLTMVSDTCKK
Ga0207646_119381111 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | HTVTITGVVSNAKMHGMKEDAKAEAKEHGMDKDSTEHGHMTVTYLKMVSDSCSK
Ga0207674_108882872 | 3300026116 | Corn Rhizosphere | MTPSRLVTVTGVVSNAAMHGMKEDAKDEAKEHGMDKKATEHGHLTATAVKMVSDSCKK
Ga0207675_1017482031 | 3300026118 | Switchgrass Rhizosphere | AHTVTITGVVSNATLHGVKEDAKEEAKEHGIDKDSTEHGHMAVTHLKMVSESCSK
Ga0257171_10222931 | 3300026377 | Soil | DHVGHTVTITGVVSNATLHGMKEDAKAEAKEHGVDKHSTEHGHMTVTNLTMVSDSCKK
Ga0255350_11383052 | 3300026502 | Soil | EHVGHTVKITGVVANATAHGMKEDTKEEMKEHGMDTHAERGHMTVTNLSMVSDTCQK
Ga0179593_10690793 | 3300026555 | Vadose Zone Soil | KADSLNLGEHVGHTVSVTGVVSHAKMHGMKEDAKDEAKEHGVDKDATEHGHMTVTNLSMVSESCHQ
Ga0209625_10546441 | 3300027635 | Forest Soil | TWELTSNGHKLSNHVGHTVTITGVVANAEMHGAKEDAKAETKEHDMDKASTEHGHLKVTNLKMVSDSCKP
Ga0209011_11236491 | 3300027678 | Forest Soil | DSLKLGDHVGHTVTITGVVSNAAIHGVKEDVKSEAKEHGMGKNSTEHGHMTVTNLTMVSDSCKN
Ga0209011_12045282 | 3300027678 | Forest Soil | SDSAKLDDHVGHTVTVTGVVSHHKEHAMKEDAKAEMKEHGMDKGAKEHGHMTVTDVKMVSDSCQK
Ga0208991_12469371 | 3300027681 | Forest Soil | DSVKLGDHVGHSVTVDGVVSNAKLHGAKEDIKSEAKEHGLDKDATEHGHMTVTNLTMVSDSCKK
Ga0209073_100233994 | 3300027765 | Agricultural Soil | KHVGHTVTVTGAVSNAALHGTKEDAKTKAREHGIDKDSAERGHMTVTNLKMVSASCEK
Ga0209060_103260941 | 3300027826 | Surface Soil | EIKSDAVALSEHVGHTVKVTGVVSNAMAHGAKEDAKEKASEHGVGDSAEHGHVTVTGIKMVSESCSK
Ga0209517_101840194 | 3300027854 | Peatlands Soil | TVTITGVVSNATMHGMKEDAKAEAKEHGVDKGSTEHGHMAVTNLKMVSDSGKK
Ga0209166_101186433 | 3300027857 | Surface Soil | HVSHTVTITGVVSNATLHGAKEDAKAEAKEHGMDKNSAEHGHMTVTHLKMVSDTCSN
Ga0209590_100914701 | 3300027882 | Vadose Zone Soil | GGNWEIRSDTVKLGEHVGHTVTITGVVSNATMHGMKEDVKAEAKERGIDKDSTEHGHMTVTNLKMVSGSCKK
Ga0209583_103869451 | 3300027910 | Watersheds | GTWEVKSDSVKLAPHVGHTMTITGVVSNATLHGAKEDVKAEAKEHGVDKDSTEHGHLTATNLTMVSDTCKK
Ga0222749_100915043 | 3300029636 | Soil | DSVKLGEHVGHTVTITGVVSNATAHGMKEDTKEEMKEHGMDKNATEHGHLTATAVSMVSDSCQK
Ga0222749_102639111 | 3300029636 | Soil | LAPHVGHTMTITRVVSNATLHGVKEDVKSEAKEHGVDKDSTEHGHMTVTNAKMVSDSCQN
Ga0222749_102824923 | 3300029636 | Soil | GHTVTVTGAVSNAALHGAKEDTKAEAREHGIDKSSTEHGHMTVTNLKMVGSSCEK
Ga0311333_114487332 | 3300030114 | Fen | HVGHTVKIVGVVSNTMAHGMKEDTKEEMKEHGMDKHAKESGHMTVTNLTMVSDTCQK
Ga0170834_1140621541 | 3300031057 | Forest Soil | EIKSDAVALGEHVGHSVRVTGVVSNAAIHGAKEDAKEKASEHGVGDAAEHGHMTVTGLKMVSESCSK
Ga0307477_101949384 | 3300031753 | Hardwood Forest Soil | LDEHVGHTVKITGVVPNAMAHGMKEDTKEEMKEHGMDKHATEHGHLTVTDLTMVSDTCPK
Ga0307475_102316542 | 3300031754 | Hardwood Forest Soil | SVTITGVVSNAKMHGMKEDAKAEAKEHGMDKDSSEHGSMTVTNLAMVSDTCKK
Ga0307470_100618701 | 3300032174 | Hardwood Forest Soil | DGSKWDVKSDSVKLAPHVGHTVTVTGVVSNAGAHGAKEDIKDEAKEHGVGKNSTETGDLTVTTLKMVSKSCKE
Ga0307470_104150631 | 3300032174 | Hardwood Forest Soil | KLGDHVGHKVAVTGVVTNAKMHGMKEDAKDEAKEHGVATHSTEHGHMTVTNLSMVSESCQ
Ga0307471_1012073341 | 3300032180 | Hardwood Forest Soil | GGKWDVKSDSVKLAPHVGHTVTVTGVVSNAAAHGAKEDVKDEAKEHGIAKNSTETGNLTVTTVKMVSDSCQQ
Ga0307471_1029734461 | 3300032180 | Hardwood Forest Soil | VTVTGVVSNAKLHGMKEDAKAEAKEHGVDKDSTEHGHMTVTNLSMVSDTCKK
Ga0307471_1035824622 | 3300032180 | Hardwood Forest Soil | VNLAEHVGHTVKITGVVSNAKMHGMKEDTKEEMKEHGMDKNSTEHGHLTVTNVDMVSDTC
Ga0348332_103692841 | 3300032515 | Plant Litter | AKHVGHTVTVTSAVANAALHGAKEDTKAEAKEHGIDKSSTEHGHMTVTNLKMVSTSCEK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice