NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F091755

Metagenome Family F091755

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F091755
Family Type Metagenome
Number of Sequences 107
Average Sequence Length 54 residues
Representative Sequence MPTDGQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKPG
Number of Associated Samples 63
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 6.54 %
% of genes from short scaffolds (< 2000 bps) 5.61 %
Associated GOLD sequencing projects 53
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (90.654 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(68.224 % of family members)
Environment Ontology (ENVO) Unclassified
(61.682 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(68.224 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 17.28%    Coil/Unstructured: 82.72%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF07238PilZ 4.67
PF01381HTH_3 3.74
PF02687FtsX 2.80
PF00072Response_reg 1.87
PF12146Hydrolase_4 1.87
PF07730HisKA_3 1.87
PF02353CMAS 1.87
PF13432TPR_16 0.93
PF03551PadR 0.93
PF12327FtsZ_C 0.93
PF04226Transgly_assoc 0.93
PF13701DDE_Tnp_1_4 0.93
PF14559TPR_19 0.93
PF00563EAL 0.93
PF00528BPD_transp_1 0.93
PF08448PAS_4 0.93
PF13533Biotin_lipoyl_2 0.93
PF01255Prenyltransf 0.93
PF12697Abhydrolase_6 0.93
PF00561Abhydrolase_1 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 107 Family Scaffolds
COG2226Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenGCoenzyme transport and metabolism [H] 1.87
COG22272-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol methylaseCoenzyme transport and metabolism [H] 1.87
COG2230Cyclopropane fatty-acyl-phospholipid synthase and related methyltransferasesLipid transport and metabolism [I] 1.87
COG3850Signal transduction histidine kinase NarQ, nitrate/nitrite-specificSignal transduction mechanisms [T] 1.87
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 1.87
COG4564Signal transduction histidine kinaseSignal transduction mechanisms [T] 1.87
COG4585Signal transduction histidine kinase ComPSignal transduction mechanisms [T] 1.87
COG0020Undecaprenyl pyrophosphate synthaseLipid transport and metabolism [I] 0.93
COG1695DNA-binding transcriptional regulator, PadR familyTranscription [K] 0.93
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 0.93
COG1846DNA-binding transcriptional regulator, MarR familyTranscription [K] 0.93
COG2200EAL domain, c-di-GMP-specific phosphodiesterase class I (or its enzymatically inactive variant)Signal transduction mechanisms [T] 0.93
COG2261Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein familyGeneral function prediction only [R] 0.93
COG3434c-di-GMP phosphodiesterase YuxH/PdeH, contains EAL and HDOD domainsSignal transduction mechanisms [T] 0.93
COG4943Redox-sensing c-di-GMP phosphodiesterase, contains CSS-motif and EAL domainsSignal transduction mechanisms [T] 0.93
COG5001Cyclic di-GMP metabolism protein, combines GGDEF and EAL domains with a 6TM membrane domainSignal transduction mechanisms [T] 0.93


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A90.65 %
All OrganismsrootAll Organisms9.35 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005537|Ga0070730_10026904All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4385Open in IMG/M
3300007258|Ga0099793_10141243All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1137Open in IMG/M
3300012202|Ga0137363_11118434All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium670Open in IMG/M
3300012203|Ga0137399_10671721All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium872Open in IMG/M
3300020580|Ga0210403_10149194All Organisms → cellular organisms → Bacteria → Acidobacteria1912Open in IMG/M
3300020581|Ga0210399_10003270All Organisms → cellular organisms → Bacteria → Acidobacteria12623Open in IMG/M
3300026285|Ga0209438_1137507All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium651Open in IMG/M
3300026304|Ga0209240_1016716All Organisms → cellular organisms → Bacteria → Acidobacteria2798Open in IMG/M
3300026467|Ga0257154_1036827All Organisms → cellular organisms → Bacteria → Acidobacteria743Open in IMG/M
3300027857|Ga0209166_10032790All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3147Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil68.22%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil8.41%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil7.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil6.54%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.80%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.87%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.93%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.93%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.93%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.93%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2040502001Soil microbial communities from sample at FACE Site 2 North Carolina CO2+EnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015052Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026467Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027381Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FACENCE_27735402040502001SoilMSTDSQRIQKRTPATVALQISSANQPLVAELVFTENISLHGVRAVTARAWNP
JGI25390J43892_1008048113300002911Grasslands SoilMPTDGRRIQKRTPASVALQVSSAKQPFITELAFTENVSLHGVRVGW
JGI25616J43925_1007247123300002917Grasslands SoilMPTDRQRIQKRTHASVALQISSAEQPFITELAFSENVSLHGVRVVTERAWKPGERVIVKSYHGSIKSPAKVIYCYR
JGI25616J43925_1019109523300002917Grasslands SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPGE
Ga0066674_1013848653300005166SoilMPTDCKRIQKRTPATVALQVSSATQPFITELVFTENFSLHGVRVVTVRAWKPGERVI
Ga0066673_1069718623300005175SoilMPTDCKRIQKRTPATVALQVSSATQPFITELVFTENFSLHGVRVVTVRAWKPGERVIVKSYHGSIQSRARVIYCH
Ga0070730_1002690443300005537Surface SoilMPTDCKRIQKRTPATVALQITNANQPFISQLAFTENVSLHGVRVVTERRGNQERA*
Ga0066691_1045279113300005586SoilMPTDCKRIQKRTPATVALQITSVNQPFITELAFTENVSLHGVRALTERVWKPGERVIVKSYHGSIQS
Ga0066691_1084896323300005586SoilMPTDGQRFQKRTPATVSLQVSSAEQPFITELAFTENVSLH
Ga0066653_1068662923300006791SoilMPTDGQRFQKRTPATVSLQVSSAAQPFTTELAFTENVSLHGVRVVTGRAWKRAERLIVKSYNGSVQSR
Ga0066665_1141840313300006796SoilMPSDCKRIQKRTTASVALQISSARQPFITELAFTENVSL
Ga0099791_1004178313300007255Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKP
Ga0099793_1014124313300007258Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSANQPFITELAFTENVSLHGVRVVTERAWKPGERVIVKS
Ga0099793_1014370223300007258Vadose Zone SoilMPTDGQRIQKRNTATVALQVSSAKQPLITELVFTENISLHGVRVVTQRVWNPGERVIVKSYHGSIQS
Ga0099794_1007697133300007265Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPGERV
Ga0099794_1063910523300007265Vadose Zone SoilMPTDTQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKPGERVIVKS
Ga0066710_10261971713300009012Grasslands SoilMPTDAQRIQKRTPATVALQVSSANQPFITELAFTENVSLHGVRVVTERAWKPGER
Ga0099830_1118853923300009088Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPGERVIVKS
Ga0099828_1156150223300009089Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPGER
Ga0099828_1202196213300009089Vadose Zone SoilMPTDGQRIQKRTPATVALQISSAKQPFITELAFTENFSLHGVRVVTGRAWKPGERVIVKSYHGSIQSRARVIYCHHLAE
Ga0099827_1085797413300009090Vadose Zone SoilMPTDCKRIQKRTPTTVALQITSAGQPFITELAFTENVSLHGVRAVTERAWKP
Ga0134067_1043297813300010321Grasslands SoilMPSDRQRNQKRTPAAVALQVSSAAQPFITELAFTEN
Ga0134080_1009282053300010333Grasslands SoilMPTDCKRIQKRTPATVALQVSSATQPFITELAFTENVSLHGVRVVTERAWKPVERVIVKSYHG
Ga0137392_1005702943300011269Vadose Zone SoilMPTDRQRIQKRTHASVALQISSAEQPFITELAFSENVSLHGVRVVTERAWK
Ga0137393_1007774913300011271Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPGERVIVKSYHGSIQSRARVIYCHHL
Ga0137393_1079156323300011271Vadose Zone SoilVALQIISAKQPFITELAFAENVSLHGVRAVTERAWKPGERVIVKSYHGSIQSAAKVIYCHRLAEARYAVGLE
Ga0137363_1016913133300012202Vadose Zone SoilMPTDCKRIQKRTPASVALQITSARQPFITELAFTENVSLHGVRAVTERAWK
Ga0137363_1111843423300012202Vadose Zone SoilMSNEEKRIQKRTPATVALQISSAKQPFITELVFTENLSLHGVRVVTERAWQPGERVVVTS
Ga0137363_1171563123300012202Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSANQPFITELAFTENVSLHGVRVVTGRAWKPGERVIVKSYHGSIQSRA
Ga0137399_1018802913300012203Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSATQPFITELAFTENVSLHGVRV
Ga0137399_1023096143300012203Vadose Zone SoilMPADTQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVV
Ga0137399_1057998613300012203Vadose Zone SoilMPTDGQRIQKRSPATVALQISSEKQPFITELAFTENVSLHGVRVVTERAWKSGERVIVRS
Ga0137399_1067172123300012203Vadose Zone SoilMPTDGQRIQKRAPATVALQVSSAKQPFITELAFTENVSLHGVRVV
Ga0137399_1123650123300012203Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSATQPFITELAFTEN
Ga0137399_1124081623300012203Vadose Zone SoilMPTDCKRIQKRTPATVALQVSSATRPFITELVFTENFSLHGVRVVTVR
Ga0137399_1136042913300012203Vadose Zone SoilMPTDGQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKPG
Ga0137399_1171468023300012203Vadose Zone SoilVPTEQRTQKRTPATVALQVSSATRPFITELVFTENFSLHGVRVVTVR
Ga0137362_1071121313300012205Vadose Zone SoilMPNDGKRIQKRTPATVALQISSAKQPFITELAFTENFSLHGVRVVTER
Ga0137362_1104884923300012205Vadose Zone SoilMPIDCKRIQKRTPATVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPG
Ga0137362_1156337513300012205Vadose Zone SoilMPTDGQRIQKRTPASVALQVSSATQPFITELAFAENVSLHGVRVVTE
Ga0137360_1009317743300012361Vadose Zone SoilMPTDCKRIQKRTPATVALQITSAKQPFITELAFTEN
Ga0137360_1024859313300012361Vadose Zone SoilMPTDCKRIQKRTPATVALQITSANQPFITELAFTENVSLHGVRALTE
Ga0137360_1059773123300012361Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKPGERVIVKSYHG
Ga0137360_1073920113300012361Vadose Zone SoilMPTDGKRIQKRAPATVALQVSSAKQPFITELAFTENVSLHGVRVVTGRVWKPGERVIVKSYHG
Ga0137360_1117492113300012361Vadose Zone SoilMPPECQRIQKRTSATVALQISSANQPLIAELVFTENLSLHGVRAVTGRAW
Ga0137360_1186411223300012361Vadose Zone SoilMPTDGQRFQKRTPATVSLQVSSAEQPFITELAFAENVSLHGVRVVTERAWK
Ga0137361_1033983123300012362Vadose Zone SoilMLTDGQRIQKRTPATVALQVSSAKQSFITELAFTENV
Ga0137361_1080462113300012362Vadose Zone SoilMPTDSQRIQKRTPATVFLQISSANQPFIAELAFTENISLHGVRAVTARAWNPGDRVLLKSYHGSIQSWA
Ga0137361_1153777313300012362Vadose Zone SoilMSTDGQRIQKRTPATVALQISSAKQPFITELAFTENFSLHGV
Ga0137390_1007611513300012363Vadose Zone SoilMPTDGQRIQKRTPATVALQISSANQPFIAELAFTENISLHGVRVVTERA
Ga0137395_1000319313300012917Vadose Zone SoilMTTDCKRIQKRTPTTVALQVTSVRQPLITELAFTENVSLHGV
Ga0137395_1061093113300012917Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHGVR
Ga0137395_1072644923300012917Vadose Zone SoilMPTDCKRIQKRTPTTVALQVTSVRQPLITELAFTENVSLHGV
Ga0137396_1024198223300012918Vadose Zone SoilMPTDTQRIQKRTPANVALQVSSAKQPFITELAFTENVSLHGIRVVTERAWKPGERAIVKSYHGSIQSRAG*
Ga0137396_1068179513300012918Vadose Zone SoilMPSDTQRIQKRTPATVFLQISSANQPFIAELAFTENISLHGVRAVTARAWNPGERVLLKSYHGSI*
Ga0137394_1018272043300012922Vadose Zone SoilMPTDRQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVV
Ga0137394_1137922723300012922Vadose Zone SoilMPTDCKRIQKRTPATVALQITSAKQPFITELAFTENFSLHGVRVVTERAWKPGERVIIKSYHGSV
Ga0137419_1009349043300012925Vadose Zone SoilMPSDTQRIQKRTPATVFLQISSANQPFIAELAFTEN
Ga0137419_1012744133300012925Vadose Zone SoilMPTDRQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKPGERVTVK
Ga0137419_1023468423300012925Vadose Zone SoilMPTDCKRIQKRTPATVALQITSAKQPFITELAFTENFSLHGVRVVTERAWKPGE
Ga0137419_1043169213300012925Vadose Zone SoilMPADTQRIQKRTPATVALQVSSAKQPFITELAFTEN
Ga0137419_1048569313300012925Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENV
Ga0137419_1117584513300012925Vadose Zone SoilMPTDGQRIQKRTPATVALQVSSATQPFITELAFAENVSLHGVRVVTERV
Ga0137419_1149949713300012925Vadose Zone SoilMPTDGQRIQKRTPATVALQVSSAKQPFVTELAFTENVSLHGVR
Ga0137416_1012580413300012927Vadose Zone SoilMPTDGQRIQKRIPASVALQVSSATQPFITELAFAENVSLHG
Ga0137416_1019983033300012927Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSATQPFITELAFTENV
Ga0137416_1091637733300012927Vadose Zone SoilMPTDGQRIQKRTPATVALQVSNAKQPFITELAFTENVSLHGVRVVTE
Ga0137416_1121053013300012927Vadose Zone SoilMPTDGQRIQKRTPATVALQVSSATQPFITELAFAENVSLHGVRVVTERAWKPGERVI
Ga0137407_1108429613300012930Vadose Zone SoilMPTDGQRIQNRSPATVALQISSEKQPFITELAFTENVSLHGVRVVTERAWK
Ga0137407_1185223413300012930Vadose Zone SoilMPTDCKRIQKRTPATVALQITSAKQPFITELAFTENVSLHGVRVVTERAWKPGERVIIKS
Ga0137410_1189359033300012944Vadose Zone SoilMPTDGQRIQKRSPATVALQISSEKQPFITELAFTENVSLHGVRVVTERAWKSGERVIV
Ga0134087_1031925113300012977Grasslands SoilMPTDCKRIQKRTPATVALQVSSATQPFITELAFTENV
Ga0137411_121921913300015052Vadose Zone SoilMPTDCKRIQKRTAATVALQITSGKQPFITELAFTENVSLHLNNA*
Ga0137405_122987713300015053Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTERAWKPGERVIVKSYHGSVIVKSYHGSIQSRA
Ga0137420_146009423300015054Vadose Zone SoilMPTDARESEADPATVALQVSSATQPFITELAFTENVSLHGVRL*
Ga0137420_146239433300015054Vadose Zone SoilMPTDRQRIQKRTPATVALQVSSAKQPFITELAFTENVSLHGVRVVTQRAGNRESA*
Ga0137418_10014927103300015241Vadose Zone SoilMPTDSQRIQKRTPATVFLQISSANQPFIAELAFTENISLHGVRAVTARAWNPGDRVLLKSYHGSIQSWAKVIYCQEL
Ga0137412_1117226613300015242Vadose Zone SoilMPTDSQRIQKRTPATVFLQISSANQPFIAELAFTENISLHGVRAVTARAWNPGDR
Ga0137403_1007298813300015264Vadose Zone SoilMPTDAQRIQKRTPATVALQVSSANQPFITELAFTENVSLHGVRVVTERAWKPGERVIVK
Ga0137403_1012712913300015264Vadose Zone SoilMPTHGQRFQKRTPATVSLQVSSAEQPFITELAFTENVSLHGVRVVTERA
Ga0066667_1197197913300018433Grasslands SoilMPTEQRTQKRTPATVALQVSSAKQPLITELAFTENVSLRGVRVVTVRAWKPGEH
Ga0179592_1025093213300020199Vadose Zone SoilMPTDGQRIQKRTPATVALQVSSATQPFITELAFAENVSLHGVRIVTERAWKPGERVIVKSYHGSIQSR
Ga0210403_1014919453300020580SoilMPTDGTRIQKRTPATVALQISSAKQPFITELPFSENLS
Ga0210399_10003270143300020581SoilMPTERIQKRSPATVALQISSAKQPLITELAFTENLSLHGVRLVTGRVCGSQGSA
Ga0210405_1123401413300021171SoilMPTDCKRIQKRTPATVALQITSAKQPFISELAFTENVSLHGVRVVTERAWKPGERVIVKSYH
Ga0210396_1000090233300021180SoilMPTERIQKRSPVTVALQISSAKQPFITELAFTENLSLHGVRLVTGRPWEPGKRVVVKSYHGSIQSRANVICHTPHTHKDSRATRHIRY
Ga0137417_109016313300024330Vadose Zone SoilMPTDCKRIQKRTTATVALQITSAKQPFITELAFTENVSLHGVR
Ga0137417_121697913300024330Vadose Zone SoilMPTDCKRIQKRTPASVALQITSAQQPFITELAFTENVSLHGVRAVTERAWKPGE
Ga0209438_113750713300026285Grasslands SoilMPTDGQRIQKRTPTTVALQVSSANQPFITELAFTENVSLHGVRVVTERAWKPGERVIVKSYH
Ga0209240_101671653300026304Grasslands SoilMPTDGKRIQKRTPATVALQISSANQPFITELAFTENVSLH
Ga0209240_126862713300026304Grasslands SoilMPTDGQRIQKRTPATVALQISSAKQPFITELAFTENFSLHGVRVVTGRAWKPRE
Ga0209240_129740313300026304Grasslands SoilMPTDCKRIQKRTPATVALQVSSATQPFITELVFTENFSLHGVRVVTVRAWKPGERVIVKSYHGSIQSRARVIYCHH
Ga0209470_138013213300026324SoilMPTEQRTQKRTPATVALQVSSAKQPLITELAFTENVSLRGVRVVTVRAWKPGEHVIV
Ga0209267_116523513300026331SoilMPTDCKRIQKRTPATVALQVSSATQPFITELAFTENVSLHGVRVVTERAWKPGERVIVKSYHGSIQSR
Ga0257154_103682713300026467SoilMPTERIQKRSPATVALQISSAKQPLITELAFTENLSLHGVRLVTGRAWEPGERVVV
Ga0257157_103348313300026496SoilMPTDRQRIQKRTHASVALQISSAEQPFITELAFSENVSLHGVRVVTERAWKPG
Ga0257161_105705213300026508SoilMPTDRQRIQKRSPATVALQVSSAKQPFITELAFTENFSLHGVRVVTERAWKPGERVIIKSYH
Ga0209056_1028518123300026538SoilMPTDCKRIQKRTPATVALQVTSAKQPFIAELAFTENVSLHGVRSVRGNQESA
Ga0208983_107215813300027381Forest SoilMPTDGQRIQKRAPATVALQVSSAKQPFITELAFTENISLHGVRVVTE
Ga0209076_103633823300027643Vadose Zone SoilMPTDRQRIQKRTHASVALQISSAEQPFITELAFSENVSLHGVRVVTERTWKPGERVIVKSYYGSIQ
Ga0209076_116902813300027643Vadose Zone SoilMPTDTQRIQKRTPANVAVQVSSAKQPFITELAFTENV
Ga0209588_124414413300027671Vadose Zone SoilMPTDCKRIQKRTPATVALQVSSATRPFITELVFTENFSLH
Ga0209166_1003279043300027857Surface SoilMPTDCKRIQKRTPATVALQITNANQPFISQLAFTENVSLHGVRVVTERRGNQERA
Ga0209488_1058466613300027903Vadose Zone SoilMPTDGQRFQKRTPATVSLQVSSAEQPFITELAFTENVSLHGVRVVTE
Ga0137415_1034590923300028536Vadose Zone SoilMPTDGQRIQKRTPATVALQVSSATQPFITELAFAENVSLHGVRVVTE
Ga0170824_11910878723300031231Forest SoilMPPDCQRIQERSPAAVARQISSASQPLIAELVFTGNLSLHGVRAVTGRAWNP
Ga0307479_1153447423300031962Hardwood Forest SoilMSTDTQRIQKRTPATVALQISSANQLFIAELVFTENISLHGVRAVTARAWNPGERVLLKSYHGNIQSWAKVIYC


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.