NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F103836

Metagenome Family F103836

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103836
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 102 residues
Representative Sequence YVDIDPARSDRMEYGIPANSSEPNMIGAPPTVAFANWQILEVHAKREELWIKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Number of Associated Samples 82
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.98 %
% of genes from short scaffolds (< 2000 bps) 1.98 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.36

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.020 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(16.832 % of family members)
Environment Ontology (ENVO) Unclassified
(22.772 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.495 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 46.03%    β-sheet: 0.00%    Coil/Unstructured: 53.97%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.36
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF01850PIN 14.85
PF12146Hydrolase_4 8.91
PF00805Pentapeptide 2.97
PF04326AlbA_2 1.98
PF00072Response_reg 0.99
PF03239FTR1 0.99
PF13505OMP_b-brl 0.99
PF01695IstB_IS21 0.99
PF13517FG-GAP_3 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG1357Uncharacterized conserved protein YjbI, contains pentapeptide repeatsFunction unknown [S] 2.97
COG2865Predicted transcriptional regulator, contains HTH domainTranscription [K] 1.98
COG1484DNA replication protein DnaCReplication, recombination and repair [L] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A98.02 %
All OrganismsrootAll Organisms1.98 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300027854|Ga0209517_10565269All Organisms → cellular organisms → Bacteria608Open in IMG/M
3300027903|Ga0209488_10277844All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1253Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil16.83%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds11.88%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment10.89%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil8.91%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.94%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa5.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.95%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil4.95%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil4.95%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil3.96%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.97%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland1.98%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.98%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.98%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog0.99%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.99%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.99%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.99%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.99%
PermafrostEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost0.99%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.99%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006059Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012EnvironmentalOpen in IMG/M
3300006086Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013EnvironmentalOpen in IMG/M
3300006162Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009521Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaGEnvironmentalOpen in IMG/M
3300009522Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_5_LS metaGEnvironmentalOpen in IMG/M
3300009523Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaGEnvironmentalOpen in IMG/M
3300009698Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_3_AS metaGEnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300014201Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_10_metaGEnvironmentalOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300017822Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017926Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_2EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017934Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300017996Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_21_40EnvironmentalOpen in IMG/M
3300018007Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5EnvironmentalOpen in IMG/M
3300018047Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_10_10EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025914Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027583Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027625Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_c_BC metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027641Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027812Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027854Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027911Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300029951III_Palsa_N1 coassemblyEnvironmentalOpen in IMG/M
3300029999I_Palsa_E3 coassemblyEnvironmentalOpen in IMG/M
3300030007I_Palsa_E1 coassemblyEnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031234Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032892Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5EnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300034163Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_04D_14EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1047279833300001593Forest SoilPTVAFANWQLLEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLRDAKFSVQAINYANWQRNPL*
JGIcombinedJ26739_10047517113300002245Forest SoilERADFGIPANSSEPNMIGTPPSVAFANWQILDAHARREELWVKRIQQHKFESGLVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL*
Ga0062387_10140366713300004091Bog Forest SoilVSEGVDFIFEEASGLGPTIAEKLALEQLAFGHYVDIDPARSDRPDFGIPANASEPNMIGTPPTVAFANWQILEVHAKREELWLKRMQQHEFQSALVICGLVHLLSFAFRLQNAKYSVQAINYANWQRNPL*
Ga0070731_1038407933300005538Surface SoilMIGAPPSVAFANWQVLDVHAKREELWLARMRQRRFQSALVICGLVHLLSFAFRLQAANFSVQAINYANWQRNPL*
Ga0070733_1105418213300005541Surface SoilIDFIFEEASGLGPTIAEKLALEQLPFGHYVDIDPAKAERPDLGIPSNSSEPNMIGSPPSVAFANWQLLDVHAKREELWVQRMQQHEFKSALVICGLVHLLSFAFRLQAANFSVQAINYANWQRNPL*
Ga0066903_10584727213300005764Tropical Forest SoilYVDMDPARGDRSEFGIPSSSSEPNMIGAPPTVSFANWQILAVHAKREELWLERIQQSEFKSALVICGLVHLLSFAFRLRDAKFSVQAINYANWQRPRRTSEVKERIFDWDD*
Ga0070766_1090566613300005921SoilPTVAFANWQILEVHEKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQEAKYSVQAINYANWQRNPL*
Ga0070766_1111740123300005921SoilFGHYVDIDPARGERMEYGIPANSSEPNMIGTPPTVAFANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL*
Ga0075028_10002670133300006050WatershedsVSKFGSEGVDFIFEEASGLGPTIAEKLATGTLAFGHYVDIDPSRTERPEFGNPPNSSEPHMIGDPPKVAFANWQVLDVHARRDELWLQRIRQRQFQSALVICGLVHLLSFAFRLQAAQYSVQGINYANWQRNPL*
Ga0075028_10078511413300006050WatershedsEASGLGPTIAEKLALERFPFGHYVDIDPSSNERMDFGIPTNSSEPNMIGTPPTVAFANWQILDVHTKREELWLQRIQQRKFESALVICGLVHLLSFAFRLRAAKFSVQAINYANWQPNPL
Ga0075029_10062563533300006052WatershedsDPARAARGDFGIPSNSSEPNMIGAPPTVAFANWQILDVHAKREELWAERIQQHEFNSALVIVGLVHLLSFAFRLQKAKFSVQAINYANWQRNPL*
Ga0075017_10108369323300006059WatershedsMDFGIPTNSSEPNMIGAPPSVAFANWQILDVHARREDLWLQRIQQRKFESALVICGLVHLLSFSFRLQSAKFSVQAINYANWLRNPL*
Ga0075019_1089541113300006086WatershedsARTDRPDFGIPTNSSEPNMIGALPTVTFANWQILEVHAKREELWVQRIQQREFQSALVICGLVHLLSFAFRLQHAKYSVQAINYANWQRNPL*
Ga0075030_10062887713300006162WatershedsPANSSEPLMIGSPPSAAYAQWQILDVHAKREELWLQRVQQRSFKSALVICGLVHLLSFSQRLQHAQFSVQAINYANWQRNPL*
Ga0075030_10073118413300006162WatershedsARSDRAEFGIPANSSEPHMIGVPPTVSFANWQILEAHEKREELWIKRMQQHEFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL*
Ga0075018_1008922113300006172WatershedsIFEEASGLGPTIAEKLALEKLAFGHYLDIDPARGERAELGIPANSSEPNMIGAPPTVSFANWQILDVHVKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQDSKFSVQAINYANWQRNPL*
Ga0075021_1008516113300006354WatershedsGIPANSSEPNMIGAPPTVSFANWQILDVHVKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL*
Ga0079221_1096799613300006804Agricultural SoilNWQILDVHAKREELWAERVQQHEFKSALVIVGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL*
Ga0105240_1196156523300009093Corn RhizospherePFGHYVDIDPARNERMDFGIPANSSEPNMIGAPPSAAFANWQILDVHAKREDLWLQRIKQRRFESALVICGLVHLLSFSFRLQSANFSVQAINYANWQRSQL*
Ga0105248_1161753723300009177Switchgrass RhizosphereGPTIAEKLALERLAFGHYVDLDPARGERAEFGIPTNSSEPNMIGAPPTVAFANWQVLEVHAKREELWLKRMQQRKFESALVICGLVHLLSFAFRLQAANLSVQAINYANWQRNPL*
Ga0116222_103802233300009521Peatlands SoilMIGEPPSVAFANWQLLDVHAKREDLWLKRMQQHEFKSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL*
Ga0116218_111150513300009522Peatlands SoilEGVDFIFEEASGLGPTIAEKLALERLAFGHYVDVDPARGERMEYGIPTNSSEPNMIGAPPTVAFANWQILEVHAKREELWIKRMQQHQFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL*
Ga0116221_133246513300009523Peatlands SoilMIGEPPSVAFANWQLLDVHAKREDLWLKRMQQHEFKSALVICGLVHLLSLAFRLQEAKFSVQAINYANWQRNPL*
Ga0116216_1092817513300009698Peatlands SoilPARAARPDFGIPSNSSEPNMIGEPPSVAFANWQLLDVHAKREDLWLKRMQQHEFKSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL*
Ga0126378_1154162513300010361Tropical Forest SoilIVSSEPNMIGSPPSVAFANWQILEAHAKREEVWLRRIQQHKFQSALAIVGLVHLLSFAFRLQSANFSVQAINYANWQRQPARRLDAGVFDENLRIFDWEE*
Ga0126378_1177102213300010361Tropical Forest SoilNMIGSPPSVAFANWQILEAHAKREELWLKRIGQHKFQSALVICGLVHLLSFAFRLQSAKYSVQAINYANWQRDPPFARRADADILGEKLRIFDWDE*
Ga0134121_1040715743300010401Terrestrial SoilEPNMIGAPPTVAFANWQVLEVHAKREELWLQRMQQRKFESALVICGLVHLLSFAFRLQAANLSVQAINYANWQRNPL*
Ga0137384_1081512013300012357Vadose Zone SoilTAAFANWQILSVHAKREELWIKRMQQHEFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNSP*
Ga0137416_1013967243300012927Vadose Zone SoilAEFGISVSSSEPNMIGAPPKVGFANWQLLEVHAKREELWMKRMRQREFESALVICGLVHLLSFAFRLQEAKFSVQAIDYANWQRNPL*
Ga0137410_1044458333300012944Vadose Zone SoilDFIFEEASGLGPTIAEKLALEKLAFGHYVDIDPARGERAEFGIPAISSEPNMIGAPPTVAFANWQILEAHAKREELWVKRMQLREFDSALVICGLVHLLSFAFRLQNAKFSVQAINYANWQRNLL*
Ga0164305_1161281323300012989SoilPTVAFANWQVLEVHAKREELWLQRMQQRKFESALVICGLVHLLSFAFRLQAANLSVQAINYANWQRNPL*
Ga0181537_1080140713300014201BogASGLGPTIAEKLALERLAFGHYVDLDPARGERAEFGIPANSSEPLMIGAPPSAAYAYWQILDAHAKREELWLRRMQQRNFESALVICGLVHLLSFSQRLQHAQFSVQAINYANWQRNPL*
Ga0182024_1101354113300014501PermafrostSGLGPTIAEHLALEQLPFGHYLDIDPARAERANFGIPANSSEPNVIGSPPTLAFANWQILEVHAKREELWLKRMRQHKFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL*
Ga0187802_1006713623300017822Freshwater SedimentGPTIAEKLALEQLAFGHYVDIDPARSERPDFGIPASSSEPNMLGEPPTVAFANWQILEAHAKREELWVKRMQQHEFKSALVIVGLVHLLSFAFRLQSAKYSVQAINYANWQRNPL
Ga0187818_1007615123300017823Freshwater SedimentAFANWQILEAHAKREELWVKRMQQHEFKSALVIVGLVHLLSFAFRLQNAKYSVQAINYANWQRNPL
Ga0187818_1008748423300017823Freshwater SedimentGLGPTIAEKLALEQLAFGHYVDIDPARGDRPDFGIPANSSEPQMIGTPPTVAFANWQILEVQAKREELWMKRMQQREFQSALVICGLVHLLSFAFRLQNAKYSVQAINYANWQRNPL
Ga0187818_1040687023300017823Freshwater SedimentGPTIAEKLALEQLAFGHYVDIDPARGERMDYGIPTNSSEPNMIGAPPTVAFANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQNANFSVQAINYANWQRNPL
Ga0187807_130530413300017926Freshwater SedimentAEKLALEKLPFGHYVDIDPARNDRPAFGIPVSSAEPNMIGNPPTVAFANWQILEAHAKREELWVKRMQQHEFKSALVIVGLVHLLSFAFRLQDAKYSVQAINYANWQRNPL
Ga0187825_1003748843300017930Freshwater SedimentISEGVDFIFEEASGLGPTIAEKLALEQLAFGHYVDIDPARSDRMEYGIPANSSEPNMIGAPPTVAFANWQILEVHAKREELWIKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0187803_1043881423300017934Freshwater SedimentSEPNMLGEPPTVAFANWQILEAHAKREELWVKRMLQHEFKSALVIVGLVHLLSFAFRLQNAKYSVQAINYANWQRNPL
Ga0187819_1057400813300017943Freshwater SedimentEQLAFGHYVDIDPARAERMEYGIPANSSEPNMIRAPPTVAFANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0187823_1007775933300017993Freshwater SedimentYVDIDPARSDRMEYGIPANSSEPNMIGAPPTVAFANWQILEVHAKREELWIKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0187891_130118113300017996PeatlandIPPNSSEPHMIGAPPSVAFANWQVLDVHAKREELWLARMRQRRFQSALVICGLVHLLSFAFRLQDANFSVQAINYANWQRNPL
Ga0187805_1000140683300018007Freshwater SedimentWQILDVHAKREELWLKRMQQREFQSALVICGLVHLLSFAFRLQNAKYSVQAINYANWQRNPL
Ga0187805_1014926013300018007Freshwater SedimentVDIDPARNERPAFGIPVSSAEPNMIGNPPTVAFANWQILDVHAKREELWLKRMQQREFQSALVICGLVHLLSFAFRLQNAKYSVQAINYANWQRNPL
Ga0187859_1087800913300018047PeatlandPARGDRAEFGIPVNSHEPHMIGAPPKVAFANWQILDVHARREKLWIERIQRQKFESALVICGLVHLLSFAFRLQAANYSVQAIDYANWQRNPL
Ga0210403_1149623513300020580SoilLGPTIAEKLAIEKLAFGHYVDIDPARTERPEFGIPPNSSEPHMIGAPPTVAFANWQILDTHLKREELWVRRIRQHAFQSALAIVGLVHLLSFAFRLQAANYSVQAINYANWQRQPQSARRTDANVFAENMRIFDWDE
Ga0210399_1060386413300020581SoilGHYVDIDPARTERPEFGIPPNSSEPHMIGAPPTVAFANWQILDTHLKREELWVRRIRQHTFQSALAIVGLVHLLSFAFRLQAANYSVQAINYANWQRQPQGGRRTDANVFAENMRIFDWD
Ga0210399_1135447823300020581SoilVLLPFGHYVDIDPARGERPEFGIAANTSEPNMIGAPPTVTFANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPQ
Ga0210395_1028293133300020582SoilLALETLAFGHYVDIDPARSERPDYGIPPNSSEPNMIGTPPTVAFANWQILDVHTKREELWLGRIQQHKFESALVIVGLVHLLSFSFRLQTAKFSVQAINYANWQRNPL
Ga0210401_1109417213300020583SoilISEGVDFIFEEASGLGPTIAEKLALERLAFGHYVDIDPARGERADFGIPANSSEPLMIGAPPSAAYASWQILDVHAKREELWLRRMQQRNFESALVICGLVHLLSFSQRLQHAQFSVKAINYANWQRNPL
Ga0215015_1024852323300021046SoilEMCIRDSPTVAFANWQLLDVHARREELWVKRMHQHEFESALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0210400_1041046113300021170SoilRDLIATERVEFIFEEASGFGPTIAEKLSLQELGSGRYLDIDPANADRLNLGIPPNSNEPHIIGSPPKVAFAHWQILEVHAKREILWMKRVQQHKFESALVICGLVHLLSFAFRLQSEQFSVQAIDYANWQRNST
Ga0210400_1162926413300021170SoilARGDRADFGIPSNSSEPHMIGAPPSVAFANWQILDAHAKREEVWLQRIRQHKFQSALAIVGLVHLLSFAFRLQAANFSVQAINYANWQRQPARRPDAGVFDENLRIFDWDE
Ga0210405_1116688313300021171SoilPTVAFANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0210408_1036341733300021178SoilRAARGDFGIPSNSSEPNMIGAPPTVAFANWQILDVHAKREELWAERIQQHEFNSALVIVGLVHLLSFAFRLQKAKFSVQAINYANWQRNPL
Ga0210393_1060569223300021401SoilRADFGIPANSSEPNMIGSPPTVAFANWQILDVHARREELWLERMRQRKFESALVICGLVHLLSFAFRLQKANFSVQAINYANWQRNPM
Ga0210397_1107373923300021403SoilANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0210386_1067841733300021406SoilEEASGLGPTIAEKLALERLAFGHYVDIDPARGERADFGIPANSSEPLMIGAPPSAAYASWQILDAHAKREELWMRRMQQRNFESALVICGLVHLLSFSQRLQHAQFSVKAINYANWQRNP
Ga0210394_1042279313300021420SoilLMISEGVDFIFEEASGLGPTIAEKLALQQLAFGHYVDIDPARGARMEYGIPANSSEPNMIGAPPTVAFANWQILEVHAKREVLWIKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNRL
Ga0210410_1033316433300021479SoilDIDPARGERPEFGIPVNSSEPNMIGAPPTVTFANWQILEVHAKREELWVKRMQQHKFQSALVVCGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL
Ga0210410_1053494123300021479SoilVAFAHWQLLDVHAKREELWIKRIQQSEFKSALVICGLVHLLSFASRLQSAKFSVQAINYANWNRNPL
Ga0210409_1002099283300021559SoilMASEGVDFIFEEASGLGPTIAEKLALERLGFLRYLDIDPARGDRADFGIPSNSSEPLMIGAPPSVAFANWQILDAHAKREEVWLQRIRQHKFQSALAIVGLVHLLSFAFRVQAANFSVQGINYANWQRQPARRLDTGVFDENLRIFDWDE
Ga0210409_1116717013300021559SoilASGLGPTIAEKLALEHLGAGRYLDVDPARSDRAEFGIATNSHEPNMIGSPPRVAFANWQILEVHARREELWLKRIQRQEFKSALVICGLVHLLSFAFRLQVANFSVQAIDYASWQRNPI
Ga0126371_1008051813300021560Tropical Forest SoilMDPARGDRSEFGIPSSSSEPNMIGAPPTVSFANWQILAVHAKREELWLERIQQSEFKSALVICGLVHLLSFAFRLRDAKFSVQAINYANWQRPRRTSEVKERIFDWDD
Ga0126371_1049833813300021560Tropical Forest SoilKLAFGHYVDIDPAKGDRSDFGIPIVSSEPNMIGSPPSVAFANWQILEAHAKREEVWLRRIQQHKFQSALAIVGLVHLLSFAFRLQSANFSVQAINYANWQRQPARRLDAGVFDENLRIFDWEE
Ga0207671_1045514123300025914Corn RhizosphereMDDSVIDKAVYAELQETAGAEFVAELVDTFFEEASGLGPTIAEKLALERLAFGHYVDLDPARGERAEFGIPTNSSEPNMIGAPPTVAFANWQVLEVHAKREELWLQRMQQRKFESALVICGLVHLLSFAFRLQAANLSVQAINYANWQRNPL
Ga0179587_1026453533300026557Vadose Zone SoilLGFGRYLDIDPARGDRPEFGISVSSSEPNMIGAPPKVGFANWQLLEVHAKREELWMKRMQQREFESALVICGLVHLLSFAFRLQEAKFSVQAIDYANWQRNPL
Ga0209527_111547413300027583Forest SoilARGDRPDFGLPLDSHEPHMIGAPPKVAFAHWQLLDVHTKREELWINRIQQSEFKSALVICGLVHLLSFASRLQSAKFSVQAINYANWQRNPL
Ga0208044_117413113300027625Peatlands SoilERLAFGHYVDVDPARGERMEYGIPTNSSEPNMIGAPPTVAFANWQILEVHAKREELWIKRMQQHQFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0208827_107114433300027641Peatlands SoilTVAFANWQILEVHAKREELWIKRMQQHQFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0209118_106483033300027674Forest SoilEPNMIGAPPTVAFANWQLLDVHAKREELWIKRLRQHEFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL
Ga0209656_1043860423300027812Bog Forest SoilSSEPNMIGNPPTVAFANWQILDVHAKRERLWVERIQQRRFESALVICGLVHLLSLAFRLQEANYSVQAINYANWQRNSL
Ga0209517_1056526913300027854Peatlands SoilNKLMLSEGVDFIFEEASGLGPTIAEKLALEQLPFGRYLDIDPRRNERPDFGIPPKSSEPHMIGAPPSVAFANWQVLDVHAKREELWLARMRQRRFQSALVICGLVHLLSFAFRLQQADFSVQAINYANWQRNPL
Ga0209579_1081746423300027869Surface SoilFGILPNSSEPHMIGAPPSVAFANWQVLDVHAKREELWLARMRQRRFQSALVICGLVHLLSFAFRLQAANFSVQAINYANWQRNPL
Ga0209068_1002353913300027894WatershedsVSKFGSEGVDFIFEEASGLGPTIAEKLATGTLAFGHYVDIDPSRTERPEFGNPPNSSEPHMIGDPPKVAFANWQVLDVHARRDELWLQRIRQRQFQSALVICGLVHLLSFAFRLQAAQYSVQGINYANWQRNPL
Ga0209488_1027784413300027903Vadose Zone SoilPERAAYSHSASVGRRYSFTPSMALSLLMISEGVDFIFEEASGLGPTIAEKLTLEQLGAGHYLDVDPARGDRADFGIATNSHEPNMIGSPPRVAFANWQILEVHAKREELWLKRIERQEFKSALVICGLVHLLSFAFRLQVANFSVQAIDYASWQRNSI
Ga0209006_1022392413300027908Forest SoilSGLGPTIAEKLALERLAFGHYVDIDPARGERADFGIPANSSEPNMIGTPPSVAFANWQILDAHARREELWVKRIQQHKFESGLVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL
Ga0209698_1124745713300027911WatershedsARGERAEFGIPANSSEPLMIGSPPSAAYAQWQILDVHAKREELWLQRVQQRSFKSALVICGLVHLLSFSQRLQHAQFSVQAINYANWQRNPL
Ga0209069_1065131623300027915WatershedsDIDPARGDRAELGIPANSSEPNMIGAPPTVAFANWQVLEVHAKREELWAKRMQQHEFQSALVICGLVHLLSFAFRLQDAKFSVQAINYANWQRNPL
Ga0311371_1172013613300029951PalsaSEGVDFIFEEASGLGPTIAEKLALEKLPFGHYVDIDPARNDRPAFGIPVSSAEPNMIGTPPTVAFANWQILDVHAKREELWVKRIQQRRFKSALAICGLVHLLSFAFRLQKAKYSVQAINYANWQRNSQ
Ga0311339_1125292413300029999PalsaGVDFVFEEASGLGPTIAEKLSVEELGLGRYLDIDPARGDRAKFGISTNSNEPNMIGTPPKVAFANWQVLAVHAKREELWVMRMRQQEFASALVICGLVHLLSFAFRLQAAGFFVQAIDYANWQRNPL
Ga0311338_1100047723300030007PalsaLDKLAFGHYVDIDPARNERPAFGIPISSAEPNMIGTPPTVAFANWQILDVHAKREELWVKRIQQRRFKSALAICGLVHLLSFAFRLQKAKYSVQAINYANWQRNSQ
Ga0170834_11288348923300031057Forest SoilALKQLPFGRYLDIDPARSERPDFGILPNSSEPHMIGEPPSVAFANWQVLEVHAKREELWLARMQQRRFESALVICGLVHLLSFAFRLQQADFSVQAINYANWQRNSV
Ga0302325_1014914213300031234PalsaGTPPTVAFANWQILDVHAKREELWVKRIQQRRFKSALAICGLVHLLSFAFRLQKAKYSVQAINYANWQRNSQ
Ga0302325_1199857213300031234PalsaFGHYVDIDPARNDRPAFGIPVSSAEPNMIGTPPTVAFANWQILDVHAKRERLWVERIQQRRFESALIICGLVHLLSLAFRLQDAKYSVQAINYANWQRNPL
Ga0302325_1291043223300031234PalsaASSNEPNMIGKPPKVAFANWQLLEVHTKREELWMKRMQQQRFQSALVICGLVHLLSFAFRLQAANFFVQAIDYANWQRSPL
Ga0310686_10779163623300031708SoilMEYGIPANSSEPNMIGAPPTVAFANWQILEVHAKREELWVKRMQQHEFQSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0310686_11690734323300031708SoilEYGIPANSSEPNMIGAPPTVAFANWQILEVHAKREELWIKRVQQHEFQSALVICGLVHLLSFAFRLQEAEFSVQAINYANWQRNPL
Ga0310686_11982335513300031708SoilDFIFEEASGLGPTIAEKLALEKLGLGRYLDIDPARGERTELGIPVNTHEPHMIGAPPKVAFANWQLLEVHARREELWMKRMQQQEFGSALVICGLVHLLSFAFRLQIANFSVQAIDYANWQRNSL
Ga0307476_1017392713300031715Hardwood Forest SoilEKLALDKLGFLRYLDIDPARGDRADFGIPSNSSEPHMIGAPPTVAFASWQVLDVHARREELWLKRMQEHRFESALVICGLVHLLSFAFRLQSAQFSVQAINYANWQRNSL
Ga0307476_1019402613300031715Hardwood Forest SoilPPTVAFANWQILDVHTKREELWLGRIQQHKFESALVIVGLVHLLSFSFRLQTAKFSVQAINYANWQRNPL
Ga0307477_1058381323300031753Hardwood Forest SoilSGLGPTIAEKLALEKLGFLRYLDIDPARGDRADFGIPSNSSEPHMIGAPPSVAFANWQILDAHAKREEVWLQRIQQHKFQSALAIVGLVHLLSLAFRLQSANFSVQAINYANWQRQPARRLDTGVFDENLRIFDWDE
Ga0307475_1134646213300031754Hardwood Forest SoilEVDFIFEEASGLGPTIAEKVALEKLAFCHYVDIDPARDERSEFGIPSSSSEPNMIGSPPSVSFANWQILDAHAKREELWLKRIQQSEFNSALVICGLVHLLSFAFRIRDAKFSVQAINYANWQRPRRTSEATMRIFDWDD
Ga0307475_1148647913300031754Hardwood Forest SoilKLMISESVDFIFEEASGLGPTIAEKLALEQLAFGHYVDVDPASRERPDYGIPANSSEPNMIGSPPTVAFANWQILDVHAKREELWLKRMRQHEFQSALVICGLVHLLSFAFRLQSEKFSVQAINYANWQRNPL
Ga0307478_1061109013300031823Hardwood Forest SoilGVDFVFEEASGLGPTIAERLALEKLGFGRYLDIDPARGDRAEHGIPGNSHEPHMIGTPPKVAFANWVVLEVHAKREELWIKRIQQQKFESALVICGLVHLLSLAFRLQTAKFSVQAIDYANWQRSPV
Ga0311301_1006690623300032160Peatlands SoilMIGEPPSVAFANWQLLDVHAKREDLWLKRMQQHEFKSALVICGLVHLLSFAFRLQEAKFSVQAINYANWQRNPL
Ga0311301_1184342033300032160Peatlands SoilPPKSSEPHMIGAPPSVAFANWQVLDVHAKREELWLARMRQRRFQSALVICGLVHLLSFAFRLQQADFSVQAINYANWQRNPL
Ga0335081_1184936123300032892SoilTVAFANWQILDVHARREKLWVERIQQRRFESALAICGLVHLLSFAFRLQEANYSVQAINYANWQRNS
Ga0316624_1142922813300033486SoilSEGVDFIFEEASGLGPTIAEKLAQEKLGSGHYLDIDPARHDRAEFAIPSISQEPHMIGTPPSVAFANWQILDVHAKRENLWIQRIKQHHFESALMICGLVHLLSMGFRLQEAKFSVQAIDYANWLRKPVF
Ga0370515_0340245_1_3543300034163Untreated Peat SoilGLGPTIAEKLALERLPFGHYVDIDPGSAERPDFGIPANSSEPNMIGSPPTVAFANWQLLDVHAKREELWLARMRPRRFQSALVICGLVHLLSFAFRLQAANFSVQAINYANWQRNPL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.