NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F072548

Overview

Basic Information
Family ID F072548
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 45 residues
Representative Sequence MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLVML
Number of Associated Samples 98
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.64

Hidden Markov Model
[Figure: HMM logo of the family, rendered with Skylign]

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (26.446 % of family members)
Environment Ontology (ENVO) Unclassified (27.273 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (64.463 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 52.17%, β-sheet: 0.00%, Coil/Unstructured: 47.83%

Predicted 3D Structure

[Figure: predicted 3D structure (PDBe Molstar viewer), coloured by per-residue confidence (pLDDT) in the bins 0-50, 51-70, 71-90 and 91-100]
pTM-score: 0.64

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF07786 | HGSNAT_cat | 69.42
PF02687 | FtsX | 9.09
PF12704 | MacB_PCD | 7.44
PF04977 | DivIC | 2.48
PF13602 | ADH_zinc_N_2 | 2.48
PF01757 | Acyl_transf_3 | 1.65
PF08281 | Sigma70_r4_2 | 0.83
PF02517 | Rce1-like | 0.83
PF01053 | Cys_Met_Meta_PP | 0.83
PF14534 | DUF4440 | 0.83
PF02879 | PGM_PMM_II | 0.83
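
The percentages in this table are relative to the 121 family scaffolds, so each one can be converted back to an approximate scaffold count. The Python sketch below illustrates that arithmetic. The function name `scaffold_count` and the dictionary `pfam_frequencies` are illustrative and not part of NMPFamsDB, and the counts are reconstructed by rounding, so they are estimates rather than values reported by the database.

```python
# Percent frequencies copied from the "Neighboring Pfam domains" table.
pfam_frequencies = {
    "PF07786": 69.42,
    "PF02687": 9.09,
    "PF12704": 7.44,
    "PF04977": 2.48,
    "PF13602": 2.48,
    "PF01757": 1.65,
    "PF08281": 0.83,
    "PF02517": 0.83,
    "PF01053": 0.83,
    "PF14534": 0.83,
    "PF02879": 0.83,
}

TOTAL_SCAFFOLDS = 121  # "Number of Associated Scaffolds" in the overview


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percent frequency corresponds to.

    Assumes the percentage was computed as count / total * 100 and then
    rounded to two decimals, so rounding recovers the nearest integer count.
    """
    return round(percent * total / 100)


counts = {pfam: scaffold_count(p) for pfam, p in pfam_frequencies.items()}

for pfam, n in counts.items():
    print(f"{pfam}: ~{n} of {TOTAL_SCAFFOLDS} scaffolds")
```

Under this assumption, PF07786 corresponds to about 84 of the 121 scaffolds, and each 0.83 % entry corresponds to a single scaffold.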

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG3503 | Uncharacterized membrane protein, DUF1624 family | Function unknown [S] | 69.42
COG2919 | Cell division protein FtsB | Cell cycle control, cell division, chromosome partitioning [D] | 2.48
COG4839 | Cell division protein FtsL | Cell cycle control, cell division, chromosome partitioning [D] | 2.48
COG0033 | Phosphoglucomutase/phosphomannomutase | Carbohydrate transport and metabolism [G] | 0.83
COG0075 | Archaeal aspartate aminotransferase or a related aminotransferase, includes purine catabolism protein PucG | Amino acid transport and metabolism [E] | 0.83
COG0156 | 7-keto-8-aminopelargonate synthetase or related enzyme | Coenzyme transport and metabolism [H] | 0.83
COG0399 | dTDP-4-amino-4,6-dideoxygalactose transaminase | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG0436 | Aspartate/methionine/tyrosine aminotransferase | Amino acid transport and metabolism [E] | 0.83
COG0520 | Selenocysteine lyase/Cysteine desulfurase | Amino acid transport and metabolism [E] | 0.83
COG0626 | Cystathionine beta-lyase/cystathionine gamma-synthase | Amino acid transport and metabolism [E] | 0.83
COG1109 | Phosphomannomutase | Carbohydrate transport and metabolism [G] | 0.83
COG1266 | Membrane protease YdiL, CAAX protease family | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG1921 | Seryl-tRNA(Sec) selenium transferase | Translation, ribosomal structure and biogenesis [J] | 0.83
COG1982 | Arginine/lysine/ornithine decarboxylase | Amino acid transport and metabolism [E] | 0.83
COG2008 | Threonine aldolase | Amino acid transport and metabolism [E] | 0.83
COG2873 | O-acetylhomoserine/O-acetylserine sulfhydrylase, pyridoxal phosphate-dependent | Amino acid transport and metabolism [E] | 0.83
COG4100 | Cystathionine beta-lyase family protein involved in aluminum resistance | Inorganic ion transport and metabolism [P] | 0.83
COG4449 | Predicted protease, Abi (CAAX) family | General function prediction only [R] | 0.83


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %

Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.45%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 14.05%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 14.05%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.61%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 4.96%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.96%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.13%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 4.13%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 3.31%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.48%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.65%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.65%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.65%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.65%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.83%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.83%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.83%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.83%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.83%

Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005184 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006052 | Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011444 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017961 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG | Environmental
3300018058 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MG | Environmental
3300018085 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_20_MG | Environmental
3300018086 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MG | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020150 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP10_20_MG | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300026277 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes) | Environmental
3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental
3300026316 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300027070 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF004 (SPAdes) | Environmental
3300027729 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 (SPAdes) | Environmental
3300027829 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1 (SPAdes) | Environmental
3300027842 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028146 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK23 | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300031719 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2) | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300032160 | Sb_50d combined assembly (MetaSPAdes) | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300032955 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5 | Environmental

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1006952851 | 3300002245 | Forest Soil | MLGLWAGLCLTGTFYAVWQGYGGRDFAATLTAFAFFFLVMLLFAARG
JGIcombinedJ26739_1012960251 | 3300002245 | Forest Soil | MLGLWAGLCLTGTLYATWHGYGGRGFAATLTAFAFFFL
JGIcombinedJ26739_1016440601 | 3300002245 | Forest Soil | MLGIGTALSFGGVLYASWLGYGGREFAATVTTFAFY
JGI25382J37095_101676682 | 3300002562 | Grasslands Soil | MLGLWAVLTLTGALYSAWQGYGGRQFAATLTAFAFFFLVTLLFAARGVEDRLAS
JGI25388J43891_10056593 | 3300002909 | Grasslands Soil | MLGLWAVLTLTGALYSAWQGYGGREFAATLTAFAFFFLVTLLFAARGVEDRL
JGI25388J43891_10188151 | 3300002909 | Grasslands Soil | MLGLWAVLTLTGALYSAWQGYGGRQFAATLTAFAFFFLVTLLFAARGVE
JGI25617J43924_100647292 | 3300002914 | Grasslands Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLVMLLLAARGVENSIASRFG
Ga0062595_1004652181 | 3300004479 | Soil | MLGLWAGMCLAGALYATWQGYGGRDFAATLTAFAFFFLVMLLFAARGVA
Ga0066672_103492821 | 3300005167 | Soil | MVGLWAVLTLTGALYAAWQGYGGREFAATLTAFAFFFLVTLLFAARGIEDRLA
Ga0066677_100391054 | 3300005171 | Soil | MLGLWALLTLIGVFFAVWKGYGGHEFAATLTSFAFLFLVMLLF
Ga0066679_102339591 | 3300005176 | Soil | MLGLWAILTLTGVLYSVWQGYGGHEFAATLTAFAFFFLVTLLFAARGVE
Ga0066690_103275571 | 3300005177 | Soil | MLGLWALLTLIGVLYAVWLGYGGHAFAATLTAFAFLFLLMLL
Ga0066671_108669481 | 3300005184 | Soil | MLGLWALLTLIGVFFAVWKGYGGHEFAATLTSFAFLFLVMLL
Ga0066681_107732871 | 3300005451 | Soil | MLGLWAILTLIGVLYAVWQGFGGPAFAATLTSFALLFLVMLLFAARGA
Ga0070733_106195472 | 3300005541 | Surface Soil | MLGLWAGLCLVATFYATWHGYGGTEFAATLTAFSFYTLVMLMFAARGFG
Ga0066704_106736231 | 3300005557 | Soil | MLGLWAVLTLTGALYSAWQGYGGREFAATLTAFAFFFLVTLLFAARGVEDRLASRFG
Ga0070763_103138832 | 3300005610 | Soil | MIGLWAALTLTGTLYATWLGYGGRAFAATLTAFAFFFLVTLLFAARGM
Ga0070763_104343111 | 3300005610 | Soil | MLGLWAGLCLVGALYSAWQGYGGRAFAATLTAFAFFFLM
Ga0066903_1068047952 | 3300005764 | Tropical Forest Soil | MLGLWATLTLIGVFYAVWKGYSGHEFAATLTAFAF
Ga0070766_108134792 | 3300005921 | Soil | MLGLWAGLCLTGTFYATWHGYGGRDFAATLTAFAFFLLV
Ga0075023_1001967962 | 3300006041 | Watersheds | MLGLWAVLCFAGGLYASWQGYGGRDFAATLTAFAFFLGVMLLFAARGVVDFFSL
Ga0075029_1002507572 | 3300006052 | Watersheds | MLGLWAALTLTGVLCSVWLGYGGRAFAATLTAFAFLFLVMLLFAARGVEDRL
Ga0070765_1007333102 | 3300006176 | Soil | MLGCWAVLCLTGALYAAWQGYGGRDFAATLTVFAFFFLVMLLFAARG
Ga0066658_101051753 | 3300006794 | Soil | MLGLWAVLTLTGALYSAWQGYGGREFAATLTAFAFFFL
Ga0066658_107684611 | 3300006794 | Soil | MLALWAVLTLTAALYSAWLGYGGRALAATLTAFALLFLVMLLFAARGV
Ga0066660_105042472 | 3300006800 | Soil | MLGLWAVLTLTGALYSAWQGYGGRQFAATLTAFAFFFLVTLLFA
Ga0075435_1005754792 | 3300007076 | Populus Rhizosphere | MLGLWAILTLIGVLYAVWLGFGGPPFAATLTSFALLFLVMLLFAARGAETV
Ga0099791_103495371 | 3300007255 | Vadose Zone Soil | MLGLWAALTLTGALYAAWLGYGGRGFAATLTAFAIFFLVMLLFAARGV
Ga0099793_100147573 | 3300007258 | Vadose Zone Soil | MLGLWAVLTLTGVLYSVWQGYGGRGFAATLTAFAFFLLVTLLFAARGVE
Ga0099830_101165044 | 3300009088 | Vadose Zone Soil | MLGLWAVLTLTGALYSTWLGYGGRAFAAILTAFAFFFLVMLLFAARGVD
Ga0099828_114408362 | 3300009089 | Vadose Zone Soil | MLGLCAVLTLTGALYSVWQGYGGRAFAATLTAFAFFLLVMLLFAARGVEDRLAS
Ga0126383_121413101 | 3300010398 | Tropical Forest Soil | MLGLWAILTLIGVLYAVWLGFGGHAFAATLTSFALLF
Ga0150983_119460072 | 3300011120 | Forest Soil | MIGLWAALTLTGTLYATWLGYGGRAFAAMLTAFAFFFLVTLLFAARGMDDRLAARFGSSS
Ga0137392_101868951 | 3300011269 | Vadose Zone Soil | LIGLWAVLTLAGALYSAWLGYGGRAFAATLTAFAFFFLVML
Ga0137392_110088142 | 3300011269 | Vadose Zone Soil | MLGFWAVLTLTGALYSAWLGYGGRAFAATLTAFAFF
Ga0137463_11781081 | 3300011444 | Soil | MLGLWATLTLAGALYASWLGYGGRGFAATLTAFAFFFLLMLLFAARGVA
Ga0137389_113613201 | 3300012096 | Vadose Zone Soil | MLGLCAVLTLTGALYSVWQGYGGRAFAATLTAFAFFLLVMLLFAARGVEDRLASRFGA
Ga0137388_112343191 | 3300012189 | Vadose Zone Soil | LIGLWAVLTLAGALYSTWLGYGGRAFAATLTAFAFFFLVML
Ga0137388_118584522 | 3300012189 | Vadose Zone Soil | MLGVWAVLTLAGALYSSWLGYGGRAFAATLTAFAFFFLVMLLFAARGVENGLTA
Ga0137383_100825792 | 3300012199 | Vadose Zone Soil | MLGLWAVLTLTGALFSAWLGYGGRAFAATLTAFAFFFLVMLLFAAR
Ga0137383_111154272 | 3300012199 | Vadose Zone Soil | VTRQLGPLAMLGLWAVRTLAGALYASWLGYGGLAFAATLTAFAFFFLVMLLFAARGVE
Ga0137363_102569611 | 3300012202 | Vadose Zone Soil | LIGLWAVLTLAGALYSTWLGYGGRAFAATLTAFAFFFLVMLLFAARGVEDNLTS
Ga0137363_103100411 | 3300012202 | Vadose Zone Soil | MLGLWAALTLTGALYALWLGYGGRGFAATVTAFAIFFLVML
Ga0137362_101468741 | 3300012205 | Vadose Zone Soil | MLALWAVLTLTAALYSAWLGYGGRAFAATLTAFAFLF
Ga0137378_117280162 | 3300012210 | Vadose Zone Soil | VTRQLGPLAMLGVWAVLTLAGALYSSWLGYGGLAFAATLTAFAFFFLVMLLFAARGVEN
Ga0137361_103838841 | 3300012362 | Vadose Zone Soil | VTRQLGPLAMLGVWAVLTLAGALYSSWLGYGGQAFAATLTAFAFFFLVMLLFAARGVEN
Ga0137390_100686191 | 3300012363 | Vadose Zone Soil | MLGLGAVLTLTGALFSAWLGYGGRAFAATLTAFAFFFLVMLLFAAR
Ga0137398_104853831 | 3300012683 | Vadose Zone Soil | MLGLWAVLTLAGVLYSVWLGYGGRAFAATLTAFAFFFLVMLL
Ga0137396_100939792 | 3300012918 | Vadose Zone Soil | MLGLWAALCFGGGLYASWQGYGGRGFAATLTVFSFSSA*
Ga0137359_116346941 | 3300012923 | Vadose Zone Soil | MLGLWAVLTLIGALFSAWLGYGGRAFAATLTAFAFFFLVMLL
Ga0137413_104408682 | 3300012924 | Vadose Zone Soil | MLGLCAVLTLTGALYSAWQGYGGRAFAATLTAFAFFLLVMLLFAARGVEDR
Ga0137419_107825132 | 3300012925 | Vadose Zone Soil | MLGVWAVLTLTGVLYSVWLGYGGRAFAATLTAFAFFFLVMLLFAARGVEDRLAAR
Ga0137404_109289871 | 3300012929 | Vadose Zone Soil | MLGVWAVLTLTGVLYSVWLGYGGRAFAATLTAFAFFFLVMLL
Ga0164304_109859292 | 3300012986 | Soil | MLGLWAALTLTGTLYAAWLGYGGRAFAATLTAFALLFLVMLLFAAR
Ga0134078_103865961 | 3300014157 | Grasslands Soil | LAPARQLGLFSMLGLWAILTLIGVIYAVWLGFGGPAFAATLTSFALLFLVMLLFAARGAETVL
Ga0137405_11263331 | 3300015053 | Vadose Zone Soil | MLGVWAVLTLTGVLYSVWLGYGGRAFAATLTAFAFFFLVMLLFAARGVE
Ga0134073_102263181 | 3300015356 | Grasslands Soil | MLGLWAVLTLTGALFSAWLGYAGRAFAATLTAFAFFFLVML
Ga0187817_107648172 | 3300017955 | Freshwater Sediment | MLGLWAVLTLAGALYSTWLGYGGREFAATLTAFSFFFLIML
Ga0187778_100110671 | 3300017961 | Tropical Peatland | MLGLWAVLTLTGVIYSVWHGYGGREFASTLTASAF
Ga0187766_114199241 | 3300018058 | Tropical Peatland | MFGLWAVLTLTGVIYSVWHGYGGREFASTLTASAFLFLVML
Ga0187772_100720404 | 3300018085 | Tropical Peatland | MFGFWAVLTLTGVIYSVWHGYGGRDFASTLTAFAFLF
Ga0187769_100380335 | 3300018086 | Tropical Peatland | MFGFWAVLTLTGVIYSVWHGYGGRDFASTLTAFAFLFLVMLLFAARGMDNG
Ga0187769_110325792 | 3300018086 | Tropical Peatland | MFGFWAVLTLTGVIYSVWHGYGGRDFASTLTAFAFLFLVMLLFAARGMD
Ga0066662_121602142 | 3300018468 | Grasslands Soil | MLGLWAVLTLTGALYSTWQGYGGREFAAALTAFAFFFFVTLLFAARGVE
Ga0187768_10489041 | 3300020150 | Tropical Peatland | MFGLWAVLTLTGVIYSVWHGYGGREFASTLTASAFLFLVMLLFAA
Ga0179594_103848142 | 3300020170 | Vadose Zone Soil | MLGLWAVLTLTGALYSTWQGYGGRAFAATLTAFAFFF
Ga0210407_110881801 | 3300020579 | Soil | MVGLWAGLCLTGALYAAWQGYGGRDFAATLTVFAYFFLV
Ga0210403_103106212 | 3300020580 | Soil | MLGLWAALCFGGGLYATWQGYGGRAFAATLTVFSFYFGVMLLFAAR
Ga0179596_101121961 | 3300021086 | Vadose Zone Soil | MLGVWAVLTLTGVLYSVWLGYGGRAFAATLTAFAFFFLV
Ga0179596_106199831 | 3300021086 | Vadose Zone Soil | LIGLWAVLTLAGALYSTWLGYGGRAFAATLTAFSFFFL
Ga0210404_103244631 | 3300021088 | Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLV
Ga0210404_106019872 | 3300021088 | Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAAALTAFAFFFLVMLLLAARGVEDRLAS
Ga0210406_107041831 | 3300021168 | Soil | MLGLWAVLCLVGSGYSLWNGYGGRDFAATLTAFAFYFAVMLLFAARGVPDFLSSRF
Ga0210406_110881601 | 3300021168 | Soil | MLGLWAGLCLTGTFYAVWQSYGGRDFAATLTAFAFFFLVMLLFAARGFAAGL
Ga0210400_103657202 | 3300021170 | Soil | MLGLWAALCFGGGLYATWQGYGGRAFAATLTVFSFYFGVMLLFAARGVPEFL
Ga0210400_110649572 | 3300021170 | Soil | MLGLWAVLCLVGSGYSLWNGYGGRDFAATLTAFAFYFAVMLL
Ga0210408_109299112 | 3300021178 | Soil | MLGAWAALCFAGGLYATWQGYGGRDFAATLTAFAFFLGGM
Ga0210398_105438742 | 3300021477 | Soil | MLGLWAGLCLTGTFYATWHGYGGREFAATLTAFAFFFLAMLL
Ga0210402_106011911 | 3300021478 | Soil | MLGLWAGLCLTGTLYATWHGYGGRGFAATLTAFAF
Ga0210410_103592922 | 3300021479 | Soil | MLGLWAALCFGGGLYATWQGYGGRAFAATLTVFSFYFGVMLL
Ga0210410_106591631 | 3300021479 | Soil | VLGWWAGLCLTGALYAAWQGYGGRDFAATLTVFAFFFLV
Ga0210410_111606262 | 3300021479 | Soil | MLGGWAGLCLTGALYAAWQGYGGRDFAATLTVFAFFFLVMLL
Ga0210409_114156512 | 3300021559 | Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAF
Ga0126371_101740691 | 3300021560 | Tropical Forest Soil | MLGLWAILTLIGVLYAVWLGSGGPAFAATLTSFALLFL
Ga0126371_123419772 | 3300021560 | Tropical Forest Soil | MLGLWAILTLIGVLYAVWLGFGGHAFAATLTTFAFLFLIMLLFAA
Ga0137417_10928722 | 3300024330 | Vadose Zone Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFSFFFLV
Ga0209350_10341821 | 3300026277 | Grasslands Soil | MLGLWAVLTLTGALYSAWQGYGGRQFAATLTAFAFFFLVTLLFAARGVEDRLA
Ga0209236_12001241 | 3300026298 | Grasslands Soil | MLGLWAVLTLTGALFSAWLGYGGRAFAATLTAFAFFLLVMLLFAARGMESS
Ga0209055_10116678 | 3300026309 | Soil | MLALWAVLTLTAALYSAWLGYGGRALAATLTAFALLFL
Ga0209686_10480623 | 3300026315 | Soil | MLGLWALLTLIGVFFAVWKGYGGHEFAATLTSFAFLFLVMLLFAA
Ga0209155_10427801 | 3300026316 | Soil | MLALWAVLTLTAALYAVWLGYGGRAFAATLTAFAFL
Ga0209155_12989691 | 3300026316 | Soil | MLGLWAILTLTGMLYSVWLGYSGRAYAATLTAFAFLFLIMLLFAARGVETSLAT
Ga0209154_13229572 | 3300026317 | Soil | MLALWAVLTLTAALYSAWLGYGGRALAATLTAFALLFLVMLLFA
Ga0209158_12781731 | 3300026333 | Soil | MLGLWAVLTLTGALYAVWQGYGGREFAASLTAFAFFFLVTLLFAARGVEDRLA
Ga0209377_12306221 | 3300026334 | Soil | MLGLWAALTLTGALYAAWLGYGGRGFAATLTAFAIFFLVMLLFAARGVPE
Ga0209057_11994201 | 3300026342 | Soil | MLGLWAILTLIGVLYAVWQGFGGPAFAATLTSFALLFLVMLLFAARGAETVLAARFGATTGH
Ga0209059_11723782 | 3300026527 | Soil | MLALWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLVMLLF
Ga0209648_101332524 | 3300026551 | Grasslands Soil | MLGLWAVLTLTGALYSAWQGYGGRAFAATLTAFAF
Ga0209577_101294464 | 3300026552 | Soil | MLGLWALLTLIGVLYAVWLGYGGHAFAATLTAFAFLFLLMLLFAARGAETSLAARFS
Ga0209577_105201771 | 3300026552 | Soil | MLGLWALLTLIGVFFAVWKGYGGHEFAATLTSFAFLFLVMLLFAARGAETILATR
Ga0208365_10300171 | 3300027070 | Forest Soil | MLGLWAVLTLTGALYSAWLGNGGRAFAATFTAFAFLFLVTLLFAARGFEDRLASRFGA
Ga0209248_102499042 | 3300027729 | Bog Forest Soil | MLGLWAGLCLTGTFYATWHGYGGREFAATLTAFAFFFLV
Ga0209773_102837391 | 3300027829 | Bog Forest Soil | MLGLWAGLCLTAALYAAWQGYGGRAFAATLTAFALFL
Ga0209580_104767542 | 3300027842 | Surface Soil | MIGLWAVLCFAGALYASWQGYGGREFAATLTVFAIYLGVMLLFAA
Ga0209180_102755031 | 3300027846 | Vadose Zone Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLVML
Ga0209693_101998281 | 3300027855 | Soil | MIGLWAALTLTGTLYATWLGYGGRAFAATLTAFAFFFLVTLLFAARGMDDRL
Ga0209701_100043199 | 3300027862 | Vadose Zone Soil | MLGLWAALTLTGALYSLWLGYGGRAFAATLTAFAFF
Ga0209701_101131143 | 3300027862 | Vadose Zone Soil | MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLVT
Ga0209701_105596321 | 3300027862 | Vadose Zone Soil | MFGLWAVLTLTGALYSVWLGYGGRDFAATLTAFAFYFLVM
Ga0209526_103203813 | 3300028047 | Forest Soil | MLGLWAGLCLTGTLYATWHGYGGRGFAATLTAFAFFFLVMLL
Ga0247682_10438582 | 3300028146 | Soil | MLGLWAGMCLAGALYATWQGYGGRDFAATLTAFAFF
Ga0307504_103432021 | 3300028792 | Soil | MVGLWAALTLTGALYALWLGYGGRSFAATLTAFAFFFLV
Ga0222749_102864371 | 3300029636 | Soil | MLGLWSALTLTGALYASWLGYGGRGFAATLTAFAFFFLVILLF
Ga0306917_100220951 | 3300031719 | Soil | MLGLWALLTLIGVFFALWKGYGGHAFAATLTAFAF
Ga0307469_104747201 | 3300031720 | Hardwood Forest Soil | MLGLWAALTLTGALYAAWLGYGGRGFAATLTAFAIFFLVMLLFAAR
Ga0307469_112181942 | 3300031720 | Hardwood Forest Soil | MLGLWAGMCLAGALYATWQGYGGRDFAATLTTFAFFFLVML
Ga0307475_103644582 | 3300031754 | Hardwood Forest Soil | MLGVWAVLTLTGVLYSAWLGYGGRAFAAALTAFAF
Ga0307475_105059612 | 3300031754 | Hardwood Forest Soil | VLGLWAVLTLTGVLYSAWLGYGGRAFAAALTAFAFFFLVM
Ga0311301_102293031 | 3300032160 | Peatlands Soil | MLGLWSALTLTGALYASWLGYGGRAFAATLQAVAV
Ga0307472_1020960312 | 3300032205 | Hardwood Forest Soil | MLGLWAILTLIGVLYAVWLGFGGPPFAATLTSFAL
Ga0335076_109989171 | 3300032955 | Soil | MLGLWAGLCLTGMFYSAWQGYGGRDFGATLTAFAFLFLVMLLFAARGVAEALASRFG
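
Rows of this table can be turned into FASTA records for use with alignment or search tools. The Python sketch below shows one way to do that for two of the family members listed above. The names `FamilyMember` and `to_fasta` and the header layout are illustrative choices, not an NMPFamsDB export format, and the sketch assumes that the sample taxon ID is the 10-digit number beginning with 3300 that follows each protein ID.

```python
from typing import Iterable, NamedTuple


class FamilyMember(NamedTuple):
    """One row of the Family Sequences table (hypothetical container)."""
    protein_id: str
    taxon_oid: str
    habitat: str
    sequence: str


def to_fasta(members: Iterable[FamilyMember], width: int = 60) -> str:
    """Render members as FASTA text, wrapping sequences at `width` residues."""
    lines = []
    for m in members:
        # Header layout is an assumption: ID first, then free-text annotations.
        lines.append(f">{m.protein_id} taxon_oid={m.taxon_oid} habitat={m.habitat}")
        for start in range(0, len(m.sequence), width):
            lines.append(m.sequence[start:start + width])
    return "\n".join(lines) + "\n"


# Two rows copied from the table; IDs split under the assumption stated above.
members = [
    FamilyMember(
        "JGI25382J37095_101676682", "3300002562", "Grasslands Soil",
        "MLGLWAVLTLTGALYSAWQGYGGRQFAATLTAFAFFFLVTLLFAARGVEDRLAS",
    ),
    FamilyMember(
        "JGI25617J43924_100647292", "3300002914", "Grasslands Soil",
        "MLGLWAVLTLTGALYSAWLGYGGRAFAATLTAFAFFFLVMLLLAARGVENSIASRFG",
    ),
]

fasta = to_fasta(members)
print(fasta, end="")
```

Keeping the protein ID as the first token of the header means most FASTA parsers will use it as the record identifier, while the taxon ID and habitat remain available in the description.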


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"