NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F073329

Metagenome / Metatranscriptome Family F073329

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F073329
Family Type Metagenome / Metatranscriptome
Number of Sequences 120
Average Sequence Length 73 residues
Representative Sequence ADSYREIFDAAAERAGDPPQSPTWWAVGDAATALAARRHLDPEQFALLVGPLGVALPWLKDAAGQG
Number of Associated Samples 100
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score0.68

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(41.667 % of family members)
Environment Ontology (ENVO) Unclassified
(62.500 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(78.333 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 56.38%    β-sheet: 0.00%    Coil/Unstructured: 43.62%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.68
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 120 Family Scaffolds
PF07722Peptidase_C26 27.50
PF02720DUF222 26.67
PF01523PmbA_TldD 5.00
PF00583Acetyltransf_1 4.17
PF14300DUF4375 3.33
PF05362Lon_C 1.67
PF13784Fic_N 1.67

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 120 Family Scaffolds
COG0312Zn-dependent protease PmbA/TldA or its inactivated homologGeneral function prediction only [R] 5.00
COG0466ATP-dependent Lon protease, bacterial typePosttranslational modification, protein turnover, chaperones [O] 1.67
COG1067Predicted ATP-dependent proteasePosttranslational modification, protein turnover, chaperones [O] 1.67
COG1750Predicted archaeal serine protease, S18 familyGeneral function prediction only [R] 1.67
COG3480Predicted secreted protein YlbL, contains PDZ domainSignal transduction mechanisms [T] 1.67


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil41.67%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil12.50%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil12.50%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil12.50%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.33%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.17%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.33%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.67%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.83%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.83%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil0.83%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001536Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A15-65cm-8A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002909Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010128Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012391Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300013766Permafrost microbial communities from Nunavut, Canada - A26_65cm_6MEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027591Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
A1565W1_1140607013300001536PermafrostMFDSAAERAGDPPQSPAWWAVGDAATALAARRRLTPDQFALLVGPLGVALPWLKDAANQT
JGI12635J15846_1021847923300001593Forest SoilFEAAAERAGDPPESPTWWAAGDAATGLASRRRLTSEQFLILVAPMSVALPWLKDAGNRT*
JGI12053J15887_1046796923300001661Forest SoilQPKVEEARRQALAALGPDSYREIFDAAAERAGDTPQTPTWWAVGDAATALAARRRLEADPFALLVGPLGVALPWLKDAANQS*
JGIcombinedJ26739_10078643323300002245Forest SoilLVALGAESYREIFDAAAERAGDPPQSPTWWAVGDAATGLAARRHLDSDQFAVLVGPLGVALPWLKDAASQS*
JGIcombinedJ26739_10153206013300002245Forest SoilADSYREIFDAAAERAGDPPQSPTWWAVGDAATALAARRHLDPEQFALLVGPLGVALPWLKDAAGQG*
JGI25388J43891_103192013300002909Grasslands SoilEIFDSAAERAGDPAESPTWWAAGDAATAIAARRRLAHEEFVLLVGPLGVALPWLKDAANQI*
JGI25390J43892_1003569913300002911Grasslands SoilAERAGDAPQSPTWWAAGDAAIALAARRRLXREQFAVLVGPLSVALPWLKDAGSPT*
JGI25386J43895_1013359423300002912Grasslands SoilGDSYREIFDSAAERAGDQPQGPTWWAAGDAATGLASRRRLTQEQFLTLVAPLSVALPWLKDAANQI*
JGI25386J43895_1015627923300002912Grasslands SoilGDQPQGPTWWAAGDAATAIAARRRLXPEQFLALVAPLSVALPWLKDAANQI*
JGI25389J43894_100773543300002916Grasslands SoilQASIEAARGDAQEALGADSYREVFGAAAERAGDPRSPTWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA*
Ga0066672_1007043943300005167SoilSYREIFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAVLVGPLGVALPWLKDAGSQ*
Ga0066672_1046691213300005167SoilERAREAARSALGVDSYREIFEAAAERAGDPPQSPTWWATGDAATGLATRRRLTPEQFLTLVAPLSVALPWLKDAAGQS*
Ga0066677_1041656723300005171SoilSDPARALVGKRQQQAWLSSKPIIEAARRDALTALGGDSYREIFESAAERAGDPPQGPTWWAAGDAATALAARRRLGPDEFAILVGPLGVALPWLKDAATQT*
Ga0066683_1048245023300005172SoilLIAKRQQQAWLQASDQVVQAREAARTALGADSFREIFEAAAQRAGDPPQSPTWWATGDAAIALAARRHLAPEQFAVLVGPLSVALPWLKDAGSPPASALSQRT*
Ga0066680_1020551823300005174SoilSYREIFDAAAERAGDPPQSPVWLAVGDAAIALAARRRLDADHFALLVGPLGVALPWLKDAASQG*
Ga0066690_1006664843300005177SoilARALIAKRQQQAWSQASELIEQAREAARDALGADSYREFFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAVLVGPLGVALPWLKDAGSQ*
Ga0066685_1041067923300005180SoilREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGKAG*
Ga0066685_1049101733300005180SoilTLGTDSYREVFEAAAERAGDSAQGPIWWAAGDAATALAARRRLASDQFAVLVGPLGVALPWLKDAGSQ*
Ga0066678_1029051223300005181SoilIGKRQQQAWLTSQTSIEAARRDAKAALGADSYREIFESAAERAGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT*
Ga0066671_1095062623300005184SoilASDQVERAREAARSALGVDSYREILEAAAERAGDPPQSPTWWATGDAATGLATRRRLTPEQFLTLVAPLSVALPWLKDAAGQS*
Ga0066681_1028603333300005451SoilSDPARALIGKRQQQAWLTSQPNIEAARREALATLGTDSYREVFEAAAERAGDSAQGPIWWAAGDAATALAARRRLASDQFAVLVGPLGVALPWLKDAGSQ*
Ga0066681_1077329213300005451SoilARRDAKAALGADSYREIFESAAERAGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT*
Ga0070706_10204608413300005467Corn, Switchgrass And Miscanthus RhizosphereKRQQQAWLTSQPAIEVARRDAEAALGADSYREIFEAAAERAGDPAQSPTWWAVGDAATGLASRRRLTPDQFLTLVAPLSVALPWLKDAASQA*
Ga0070707_10011405043300005468Corn, Switchgrass And Miscanthus RhizosphereQQQAWLTTQPNIEAARRQALAALGDDSYREIFDAAAERAGDPPQSPTWWAVGDAATGLAARRHLDSDQFALLVGALGVALPWLKDAASQG*
Ga0070697_10088655423300005536Corn, Switchgrass And Miscanthus RhizosphereLRAEDQIEHARQQARDALGGDAYREIFEAAADRIGEPSQTPAWWAAGDAATALATRRRLGAEQFALLVGPLGVALPWLKDAGRQT*
Ga0070697_10212118823300005536Corn, Switchgrass And Miscanthus RhizosphereEAARRDAQAALGIDSYREVFEAAAERAGDSPQSPTWWAAGDAATALAARRRLDEGQFALMVGPLSVALPWLKDAGNQA*
Ga0066697_1020654013300005540SoilREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGSAG*
Ga0066701_1027979813300005552SoilGPESYREIFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAALVGPLGVALPWLKDAGSQ*
Ga0066661_1011323033300005554SoilREAARSALGVDSYREIFEAAAERAGDPPQSPTWWATGDAATGLATRRRLTPEQFLTLVAPLSVALPWLKHAAGQS*
Ga0066707_1029010213300005556SoilRALVGKRQQQAWLSSKPIIEAARRDALTALGGDSYREIFESAAERAGDPPQGPTWWAAGDAATALAARRRLGPDEFAILVGPLGVALPWLKDAATQT*
Ga0066704_1016541413300005557SoilERAGDPRSPTWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA*
Ga0066704_1032883333300005557SoilAALGADSYREIFESAAERAGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT*
Ga0066698_1038945123300005558SoilLGAESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLRDARSAG*
Ga0066700_1096160513300005559SoilQQAWLSSKPIIEAARRDALTALGGDSYREIFESAAERAGDPPQGPTWWAAGDAATALAARRRLGPDEFAILVGPLGVALPWLKDAATQT*
Ga0066699_1018292313300005561SoilSQASELIEQAREAARDALGADSYREIFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAVLVGPLGVALPWLKDAGSQ*
Ga0066703_1040248013300005568SoilDAQAALGADAYREMFDSAAERAGDPQDATWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLKDASRQA*
Ga0066703_1082141113300005568SoilARALISKRQQQAWSASQPKVEAARRQALAALGGDSYRQIFDAAAERAGDPPQSPTWWAVGDAATAIAARRRLDAEQFGLLVGPLGVALPWLKEAASQS*
Ga0066705_1022479213300005569SoilRALVAKRQQAAALQAKDQIERAREMAREALGPESYREIFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAALVGPLGVALPWLKDAGSQ*
Ga0066708_1000968613300005576SoilLGAESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGKAG*
Ga0066691_1023258413300005586SoilAAERAGDSAQSPTWWAVGDAASAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA*
Ga0066654_1087030913300005587SoilEAAREALGAESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLAPDPFAVLVGPLGVALPWLRDAGSAG*
Ga0066706_1150863213300005598SoilGKRQQQAWLTSQASIQEARREAQAALGDDSYREVFSAAAERAGDSAQSPTWWAVGDAASAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA*
Ga0070717_1002797013300006028Corn, Switchgrass And Miscanthus RhizosphereGADSYREIFDGAAERAGDLPQSPVWWAVGDAATALAARRHLDPEQFALLVGPLGVALPWLKDAAGQA*
Ga0066696_1063093033300006032SoilGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT*
Ga0066656_1023557313300006034SoilQIEQARDAARQALGAESYREIFESAAERAGDPPQSPTWWAAGDAAIALAARRRLSREQFAVLVGPLSVALPWLKDAGSPT*
Ga0066665_1010467533300006796SoilGAAAERAGDPQAPTWWAAGDAATAIAARRRLTSEEFLLLVAPLAVALPWLKDAASQA*
Ga0066659_1056541913300006797SoilKRQQQAWLTSQASIEAARRDAQAALGADSYREIFESAAERAGDPLESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT*
Ga0066659_1064317713300006797SoilEAWLQASDQIEAVRQVARDALGDDSYREIFEAAAERSGDPPRSPTWWATGDAAIALAARRRLGPEQFAVLVGPLGVALPWLKEAASQ*
Ga0066710_10049906513300009012Grasslands SoilEALGADSYREVFGAAAERAGDPRSPTWWAAGDAATAIAARRRLTSEQFLLLVAPLSVVLPWLRDAASQA
Ga0099830_1001879373300009088Vadose Zone SoilSYREIFDAAAERAGDPPQSATWWAVGDAATGLAARRHLDAEQFALLVGPLAVALPWLKYAASQT*
Ga0099828_1006004853300009089Vadose Zone SoilRVLVGKRQQQAWLTSKASIEAARRDAQAALGADSYREIFESAAERAGDREGPVWWAAGDAATAIAARRRLTSEDFLILVAPLSVALPWLKDAANQT*
Ga0066709_10055532533300009137Grasslands SoilDSYREVFGAAAERAGDPRSPTWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA*
Ga0127486_102148723300010128Grasslands SoilLASGSSKPLQASDQIEQAREAAREALGAESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGSAG*
Ga0134067_1038566223300010321Grasslands SoilAGDSAQSPTWWAVGDAASAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA*
Ga0134064_1014413413300010325Grasslands SoilRRDAKAALGADSYREIFESAAERAGDPQDPTWWAAGDAATALATRRRLTSEQFLLLVAPLSVALPWLKDAANQT*
Ga0134065_1039537413300010326Grasslands SoilQAWLQASDQVVQAREAARTALGADSFREIFEAAAQRAGDPPQSPTWWATGDAAIALAARRHLAREQFAVLVGPLSVALPWLKDAGSPPASAFSQRT*
Ga0134080_1010022413300010333Grasslands SoilLIGKRQQQAWLQTSDQIEQAREAAREALGAESYREIFESAAQRVGDPPHSPTWWAAGDAATALAARRRLASDPFALLVGPLGVALPWLKDAASQT*
Ga0134063_1072969613300010335Grasslands SoilAAAERAGDSAQGPIWWAAGDAATALAARRRLASDQFAVLVGPLGVSLPWLKDAGSQ*
Ga0134062_1034123613300010337Grasslands SoilGTDSYREVFEAAAERAGDSAQGPIWWAAGDAATALAARRRLASDQFAVLVGPLGVALPWLKDAGSQ*
Ga0137393_1092614013300011271Vadose Zone SoilAESYREIFEAAAERAGDQPQSPTWWAAGDAAVGIAARRRLNSEHFLTLVAPLSVALPWLKDAARQS*
Ga0137389_1006674343300012096Vadose Zone SoilIFDAAAERAGDPPQSATWWAVGDAATGLAARRHLDAEQFALLVGPLAVALPWLKYAASQT
Ga0137364_1141688013300012198Vadose Zone SoilAERAGDSAQGPIWWAAGDAATALAARRRLASDQFAVLVGPLGVALPWLKDAGSQ*
Ga0137381_1144243713300012207Vadose Zone SoilKRQQQAWLTSQASIEAARRDAEATLGADSYREVFEAAAERVGDPLQSPTWWAAGDAATGLASRRRLTQEQFLTLVAPLSVALPWLRDAANQI*
Ga0137376_1146035813300012208Vadose Zone SoilATLGTDSYREVFEAAAERAGDSAQGPIWWAAGDAATALAARRRLASDQFAVLVGPLGVALPWLKDAGSQ*
Ga0137379_1118234313300012209Vadose Zone SoilSAAERAGDPPQSPTWWAAGDAAIALAARRRLGRQQFAVLVRPLSVALPWLKDAGSPT*
Ga0137377_1113334813300012211Vadose Zone SoilREIFESAAERAGDPLESPTWWAAGDAATAIAARRRLASEKFALLVGPLGVALPWLKDAANQT*
Ga0134035_125485113300012391Grasslands SoilLQASNQIEQARDAARQALGAESYREIFESAAERAGEPPQSPTWWAAGDAATALAARRRLASDPFALLVGPLGVALPWLKDAGSPT*
Ga0137396_1020773633300012918Vadose Zone SoilAERAGDPQGPTWWAAGDAATAIAARRRLSSEQFVLLVAPLSVALPWLRDAANQT*
Ga0137396_1037507923300012918Vadose Zone SoilLSDPARALISKRQQQAWITSQPNVEGARRQALAALGADSYREIFDAAAERAGDPPQSPTWWAVGDAATALAARRRLDAEQFGLLVGPLGVALPWLKDAASQS*
Ga0137416_1037918613300012927Vadose Zone SoilSYREIFEAAAERAGDPPQGPTWWAAGDAATAIAARRRLTPEQFLALVAPLSVALPWLKDAANQI*
Ga0134076_1051514923300012976Grasslands SoilESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGKAG*
Ga0120181_109418323300013766PermafrostAAERAGDPPQSPAWWAVGDAATALAARRRLTPDQFALLVGPLGVALPWLKDAANQT*
Ga0134089_1013670713300015358Grasslands SoilQIEQAREAAREALGAESYREIFESAAQRVGDPPHSPTWWAAGDAATALAARRRLASDPFALLVGPLGVALPWLKDAGKAG*
Ga0066655_1013917213300018431Grasslands SoilFEAAAQRAGDPPQSPTWWATGDAAIALAARRHLAPEQFAVLVGPLSVALPWLKDAGSPPASAFSQRT
Ga0066655_1019402813300018431Grasslands SoilSKRQQQAWLQASDQIEQAREAAREALGAESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGKAG
Ga0215015_1038000023300021046SoilARHDAQAALGVDSYREIFEAAAERAGDPPQSPTWWAVGDAAIGLASRRRLTSEQFLTLVSPLSVALPWLKDAASQS
Ga0207664_1031758833300025929Agricultural SoilGLGDPARALIGKRQQQAWLTSQPTIEAARREALAALGPDSYREIFDAAAERAGDPPQSPVWLAVGDAATALAARRHLKWDQFAVLVGPLGVALPWLKDAASQS
Ga0209237_103837243300026297Grasslands SoilSAIEAARRDAEATLGGDSYREIFDSAAERAGDQPQGPTWWAAGDAATGLASRRRLTQEQFLTLVAPLSVALPWLKDAANQI
Ga0209055_121214113300026309SoilYREMFDSAAERAGDPQDATWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLKEASRQA
Ga0209239_100746313300026310Grasslands SoilSAAERAGDPPQSPTWWAAGDAAIALAARRRLGREQFAVLVGPLSVALPWLKDAGSPT
Ga0209761_104402243300026313Grasslands SoilAAERAGDQPQGPTWWAAGDAATAIAARRRLTPEQFLALVAPLSVALPWLKDAANQI
Ga0209154_126920413300026317SoilRAREMAREALGPESYREIFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAVLVGPLGVALPWLKDAGSQ
Ga0209471_129318513300026318SoilAALGADSYREIFDAAAERAGDPPQSPTWWAVGDAATGLAARRRLDAEQFALLVGPLGIALPWLKDAASS
Ga0209802_118648613300026328SoilDPARALIGKRQQQAWLTSQTSIEAARRDAKAALGADSYREIFESAAERAGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT
Ga0209267_109267613300026331SoilPPQSPTWWATGDAATGLATRRRLTPEQFLTLVAPLSVALPWLKDAAGQS
Ga0209267_111740813300026331SoilAALGADSYREIFESAAERAGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT
Ga0209267_128180213300026331SoilELVEQAREAARDALGADSYREIFEAAAGRAGDPAQSPTWWAVGDAASALAARRRLDPEQYAALVGPLGVALPWLKDAGSQ
Ga0209158_102910213300026333SoilAAERAGDSAQSPTWWAVGDAASAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA
Ga0209804_104811543300026335SoilWLTSQASIQEARRDAQAALGDDSYREVFSAAAERAGDSAQSPTWWAVGDAASAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA
Ga0209159_114798313300026343SoilAREAAREALGAESYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGSMAR
Ga0209808_130634013300026523SoilSYREVFGAAAERAGDPRSPTWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA
Ga0209378_110627723300026528SoilERAGDPSESPTWWAAGDAATAIAARRRLASEEFALLVGPLGVALPWLKDAANQT
Ga0209806_102753353300026529SoilALGPESYREIFEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAALVGPLGVALPWLKDAGSQ
Ga0209805_100358483300026542SoilEAAAGRAGDPAQSPTWWAIGDAASALAARRRLDPEQYAALVGPLGVALPWLKDAGSQ
Ga0209156_1006052513300026547SoilSYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLKDAGKAG
Ga0209156_1010294613300026547SoilSYREIFESAAQRAGDPPHSPTWWAAGDAATALAARRRLASDPFAVLVGPLGVALPWLRDARSAG
Ga0209474_1045500213300026550SoilERAGDSAESATWWAAGDAAIAIAARRRLTSEQFLLLVAPLSVALPWLKDAANQT
Ga0209648_1010707913300026551Grasslands SoilAAERAGDPPQSPTWWAIGDAATALAARRRLDGDQFALLVGPLGVALPWLKDAARQT
Ga0209648_1035615213300026551Grasslands SoilLGADSYREIFEAAAERAGDAAQSPTWWAVGDAAIAIAARRRLNSEHFLTLVAPLSVALPWLKDAAAPVLTNPS
Ga0209648_1072083313300026551Grasslands SoilQPAIEAARLEARVALGADSYREIFEAAAERAGDLPQSPTWWATGDAATGLASRRRLTPEQFLMLVAPMSVALPWLKDAANQT
Ga0209577_1087211513300026552SoilAAAERAGDSAQSPTWWAVGDAASAIAARRRLTSEQFLLLVAPLSVALPWLRDAASQA
Ga0209220_103431923300027587Forest SoilYREIFGSAAERAADPQSPTWWAAGDAATAIAARRRLASEQFLLLVAPLSVALPWLKDAANQS
Ga0209220_108302213300027587Forest SoilARVLIAKRQQQAWLSSQPAIEAARRDAQAALGIDSYREVFEAAAERAGDPPESPTWWAAGDAATGLASRRRLTSEQFLILVAPMSVALPWLKDAGNRT
Ga0209220_119178323300027587Forest SoilDAAAERAGDPPQSPTWWAVGDAATGLAARRHLDSDQFAVLVGPLGVALPWLKDAASQS
Ga0209733_108279913300027591Forest SoilSYREIFEAAAERAGDPPQSPTWWATGDAATGLASRRRLTPEQFLALVAPLSVALPWLKDAASQT
Ga0209117_102792333300027645Forest SoilDSYREIFGSAAERAADPQSPTWWAAGDAATAIAARRRLASEQFLLLVAPLSVALPWLKDAANQS
Ga0208990_112815023300027663Forest SoilALGGDSYREIFDAAAERAGDPPQSPTWWAVGDAGTALAARRRLDAEQFGLLVGPLGVALPWLKDAASQS
Ga0209011_101066013300027678Forest SoilFDAAAERAGDPPQSPTWWAVGDAATGLAARRRLDPDQFALLVGPLGVALPWLKDAATQT
Ga0209011_103019323300027678Forest SoilAWLTSQSKVEAARRKALTSLGGDSYREIFDGAAERAGDLPQSPAWWAVGDAAIGLAARRHLDPDEFALLVGPLAVALPWLKDAANQS
Ga0209626_100534513300027684Forest SoilYREIFDAAAERAGDPPQSPTWWAVGDAATALAARRHLDPEQFALLVGPLGVALPWLKDAAGQG
Ga0208989_1007858433300027738Forest SoilSRPAIEAARRDAQAALGADSYREVFEAAAERAGDPPQGPTWWAAGDAATAIAARRRLGAEQFALLVGPLGVALPWLRDAANQS
Ga0209180_1079725223300027846Vadose Zone SoilAESYREIFEAAAERAGDQPQSPTWWAAGDAAVGIAARRRLNSEHFLTLVAPLSVALPWLKDAARLS
Ga0209283_1069987123300027875Vadose Zone SoilQQAWLTSKASIEAARRDAQAALGADSYREIFESAAERAGDREGPVWWAAGDAATAIAARRRLTSEDFLILVAPLSVALPWLKDAANQT
Ga0209069_1050772323300027915WatershedsVALGADSYREIFDAAAERAGDPPQSPTWWAVGDAATGLAARRRLDADQFALLVGPLGVALPWLKDAASQS
Ga0209526_1035693833300028047Forest SoilLDAGFVNELREAIVLVGKRQPQAWLTSKASIEAARRDAEGTLGADSYREIFEAAAERAGDPSQSPTWWAVGDAATAIGARRRLTSEQFLTLVGPLSVAMPWLKDAANQA
Ga0137415_1087392513300028536Vadose Zone SoilQQAWLTSKSSIEAARRDARAALGADSYREIFEAAAERAGDPPQGPTWWAAGDAATAIAARRRLTSEQFLLLVAPLSVALPWLKDAANQT
Ga0307475_1090288523300031754Hardwood Forest SoilARQQALVALGADSYREIFDAAADRAGDPPQTPAWWAVGDAATALAARRHLDVEQFALLVGPLAVALPWLKDAASQT
Ga0307479_1011447243300031962Hardwood Forest SoilVFEAAAERAGDSPQSPTWWAAGDAATALAARRRLDADQFALMVGPLSVALPWLKDAGNQA
Ga0307479_1031780443300031962Hardwood Forest SoilRREALAALGANSYREIFDAAAERAGDPPQSPTWWAVGDAATALAARRQLESDQFAVLVGPLGVALPRLKEAVSQG
Ga0307471_10044749813300032180Hardwood Forest SoilALGADSYREIFDAAAERAGDPQQSPTWWAVGDAATALAPRRHLKSDRFAVLVGPLGVALPWLKDAASQS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.