NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104918



Overview

Basic Information
Family ID: F104918
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 100
Average Sequence Length: 97 residues
Representative Sequence: MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQS
Number of Associated Samples: 79
Number of Associated Scaffolds: 100

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 16.67 %
% of genes near scaffold ends (potentially truncated): 12.00 %
% of genes from short scaffolds (< 2000 bps): 11.00 %
Associated GOLD sequencing projects: 67
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.49


Most Common Taxonomy
Group: Unclassified (88.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (40.000 % of family members)
Environment Ontology (ENVO): Unclassified (75.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (77.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 65.00%, β-sheet: 0.00%, Coil/Unstructured: 35.00%

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
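
The note above applies a pTM cutoff of 0.7 to flag low confidence models. A minimal sketch of that check follows; the function name `is_low_confidence` is a hypothetical helper and not part of NMPFamsDB, and only the 0.7 threshold and the 0.49 score come from this page.

```python
# Cutoff taken from the note above; everything else is an illustrative assumption.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# Family F104918 reports a pTM-score of 0.49.
print(is_low_confidence(0.49))  # True
```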



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name           % Frequency in 100 Family Scaffolds
PF00224  PK             20.00
PF00215  OMPdecase      14.00
PF02887  PK_C           5.00
PF13714  PEP_mutase     2.00
PF17135  Ribosomal_L18  1.00
PF06745  ATPase         1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name             Functional Category                        % Frequency in 100 Family Scaffolds
COG0469  Pyruvate kinase  Carbohydrate transport and metabolism [G]  25.00



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
Unclassified   root  N/A            88.00 %
All Organisms  root  All Organisms  12.00 %


Associated Scaffolds


Scaffold                            Taxonomy                                      Length
3300002560|JGI25383J37093_10027818  All Organisms → cellular organisms → Archaea  1895
3300005166|Ga0066674_10188929       All Organisms → cellular organisms → Archaea  979
3300005557|Ga0066704_10173460       All Organisms → cellular organisms → Archaea  1452
3300012356|Ga0137371_10481937       All Organisms → cellular organisms → Archaea  958
3300017657|Ga0134074_1084208        All Organisms → cellular organisms → Archaea  1087
3300018431|Ga0066655_10352592       All Organisms → cellular organisms → Archaea  966
3300026297|Ga0209237_1111151        All Organisms → cellular organisms → Archaea  1178
3300026307|Ga0209469_1078906        All Organisms → cellular organisms → Archaea  986
3300026313|Ga0209761_1042170        All Organisms → cellular organisms → Archaea  2628
3300026329|Ga0209375_1123225        All Organisms → cellular organisms → Archaea  1136
3300026331|Ga0209267_1159696        All Organisms → cellular organisms → Archaea  920
3300026532|Ga0209160_1177632        All Organisms → cellular organisms → Archaea  897
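
Each scaffold identifier listed above joins two parts with a `|`. The sketch below splits them, assuming the numeric prefix is the IMG/M Taxon OID of the source sample (the same numbers appear in the Associated Samples list) and the remainder is the scaffold name. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced for illustration only.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # assumed: IMG/M Taxon OID of the sample
    scaffold: str   # assumed: scaffold name within that sample


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold>' into its two parts."""
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300002560|JGI25383J37093_10027818")
print(ref.taxon_oid)  # 3300002560
print(ref.scaffold)   # JGI25383J37093_10027818
```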




Environmental Properties

Associated Habitat Types

Habitat            Taxonomy                                                                                  Distribution
Soil               Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil                     40.00%
Vadose Zone Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil       23.00%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil        19.00%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil          16.00%
Agricultural Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  1.00%
Soil               Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil                    1.00%
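
Several habitat rows above share a level of the ecosystem path, so their percentages can be summed per level. The following is a rough sketch that totals the distribution by any position in the `→`-separated path; the `ROWS` list is copied from the rows above, while `totals_by_level` is a hypothetical helper, not a database function.

```python
from collections import defaultdict

# (ecosystem path, % of family members), copied from the habitat rows above.
ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 40.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 23.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 19.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 16.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil", 1.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 1.0),
]


def totals_by_level(rows, level):
    """Sum percentages grouped by the path element at index `level`."""
    totals = defaultdict(float)
    for path, percent in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += percent
    return dict(totals)


# Group by the last path element (the habitat name).
print(totals_by_level(ROWS, -1))
# {'Soil': 41.0, 'Vadose Zone Soil': 23.0, 'Grasslands Soil': 35.0, 'Agricultural Soil': 1.0}
```

Passing `4` instead of `-1` groups by the fifth path element, which separates the "Grasslands" entries from the "Unclassified" ones.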




Associated Samples

Taxon OID   Sample Name  Habitat Type
3300002560  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm  Environmental
3300002562  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm  Environmental
3300002908  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm  Environmental
3300002912  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm  Environmental
3300005166  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123  Environmental
3300005172  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132  Environmental
3300005174  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129  Environmental
3300005179  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133  Environmental
3300005180  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134  Environmental
3300005186  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125  Environmental
3300005446  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135  Environmental
3300005454  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136  Environmental
3300005552  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150  Environmental
3300005553  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144  Environmental
3300005557  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153  Environmental
3300005558  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147  Environmental
3300005568  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152  Environmental
3300005574  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143  Environmental
3300005587  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103  Environmental
3300005598  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155  Environmental
3300006032  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145  Environmental
3300006755  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter  Environmental
3300006800  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109  Environmental
3300007255  Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1  Environmental
3300007258  Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3  Environmental
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300010126  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_0_1 metaT (Metagenome Metatranscriptome)  Environmental
3300010304  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015  Environmental
3300010322  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG  Environmental
3300010325  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG  Environmental
3300010326  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG  Environmental
3300010329  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015  Environmental
3300010335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015  Environmental
3300010337  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015  Environmental
3300012198  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG  Environmental
3300012201  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG  Environmental
3300012203  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG  Environmental
3300012206  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG  Environmental
3300012211  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG  Environmental
3300012350  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG  Environmental
3300012354  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG  Environmental
3300012356  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG  Environmental
3300012357  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG  Environmental
3300012361  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG  Environmental
3300012362  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG  Environmental
3300012363  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG  Environmental
3300012532  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG  Environmental
3300012922  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG  Environmental
3300012927  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)  Environmental
3300012972  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG  Environmental
3300012976  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG  Environmental
3300014150  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG  Environmental
3300015357  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG  Environmental
3300015358  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG  Environmental
3300017654  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG  Environmental
3300017656  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015  Environmental
3300017657  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015  Environmental
3300018431  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104  Environmental
3300018482  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118  Environmental
3300021171  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M  Environmental
3300026297  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)  Environmental
3300026298  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)  Environmental
3300026307  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)  Environmental
3300026313  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)  Environmental
3300026324  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)  Environmental
3300026326  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)  Environmental
3300026329  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)  Environmental
3300026331  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)  Environmental
3300026332  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)  Environmental
3300026334  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)  Environmental
3300026343  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)  Environmental
3300026524  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)  Environmental
3300026532  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)  Environmental
3300026536  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)  Environmental
3300026538  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)  Environmental
3300026540  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)  Environmental
3300026550  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)  Environmental
3300027643  Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)  Environmental
3300028536  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)  Environmental


Family Sequences

Protein ID                Sample Taxon ID  Habitat            Sequence
JGI25383J37093_100278181  3300002560       Grasslands Soil    MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEK
JGI25382J37095_101254102  3300002562       Grasslands Soil    LSADDWPPIGMSKVQEKDATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHQEVRNKPQTHTTERATVASETVKTRPRKKERSKTTQKKVAELEDLKDPTSTEITAKPEGLRSHLHGDNSKTGRSPETR*
JGI25382J43887_100366071  3300002908       Grasslands Soil    MSKVQAKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVHNESQTHITGRATVPEEKAAARPRSKQRSKTKAEKVTTXDEPSS
JGI25382J43887_102999392  3300002908       Grasslands Soil    MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEKTTER
JGI25382J43887_104832471  3300002908       Grasslands Soil    MIRLFACDCPLIAMSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNRPQTHTTEWATVNDEKVETRPRKRDRAKTRGKEVAELDDPKDPSSTDITAKP
JGI25386J43895_100585882  3300002912       Grasslands Soil    MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEKTTERATVTEEKTETK
JGI25386J43895_100856713  3300002912       Grasslands Soil    MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNTTGRAIVADETRETRPRNKRQSKTRETKVTELNKPVSPSPTEISAKPE
Ga0066674_101889292       3300005166       Soil               MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTETRPRNKKQSKTRET
Ga0066683_102668972       3300005172       Soil               MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTETRPRN
Ga0066680_107536401       3300005174       Soil               MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKKPSKTRETEVTELNKPVSPSSTDISEKPEAHPR
Ga0066684_105583551       3300005179       Soil               MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKKQSKPRETEVTELNKPVSPSSTDISEKPEAHPRRVPKTT
Ga0066685_104604701       3300005180       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQSKTRETEVTEL
Ga0066676_104746471       3300005186       Soil               MSKAQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNSTGRASVADEKRETRPRNKKQSKTRETEVTELNKP
Ga0066686_109187371       3300005446       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQSKTRETEVTELNK
Ga0066686_109985461       3300005446       Soil               MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKKPSKTRETEVTELNKPVSPSSTDISEKPE
Ga0066687_109163251       3300005454       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSETSRKGPAIVADEKRETRPRNKKQSRITETEVTELNKTVSPS
Ga0066701_106587512       3300005552       Soil               MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTETRPRNKKQSKTRE
Ga0066695_106824491       3300005553       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLGDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQSKTRE
Ga0066704_101734601       3300005557       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPETNSTGRAIVTDENRETRPRNKKQSKPRETEGTELNKAGSPSSTDISAKPEVHPRRIKTTNRPKKE
Ga0066704_105605122       3300005557       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEKTTERATVTEEK
Ga0066704_105798102       3300005557       Soil               MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEKTTERATVTEEK
Ga0066698_100265411       3300005558       Soil               LLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNSTGRASVADEKRETRPRNKKQSKTRETEVTELNK
Ga0066703_104524342       3300005568       Soil               VQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTETRPRNKKQSKTRET
Ga0066703_107887081       3300005568       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSETSRTGPAIVPDEKRETRPRNKKQSKTKETEVTELNKPV
Ga0066694_100888121       3300005574       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNQPETNKTGRAIVADEKRETRPRNKKQSK
Ga0066694_101996172       3300005574       Soil               VSKVQDKESTLLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHET
Ga0066654_100538721       3300005587       Soil               VSKVQDKESTLLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHETDHPTPPDEKAEIRPRKKDHSKTRDKTVTEPDESATPSSAVIPIKPEG
Ga0066706_107415512       3300005598       Soil               VQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTETRPRNKKHGPGTRNNPRLER
Ga0066696_100644824       3300006032       Soil               VSKVQDKESTLLNNLSDREIANELDSRLRDLEKNIAVAREFIKLADESYSELRHRWNLLSEEVRNKPQTHETEHPTLPDGKAEIRPRKKDHSK
Ga0066696_105810573       3300006032       Soil               MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHETEHAAAQDVKAAIRPLKKEHSKARQKTVTEPDA
Ga0079222_119872881       3300006755       Agricultural Soil  MSKVQEKEATLLNNLSSSEIANDLDSRLRDLEKNIAVAREFIKLADESYSDLRHKWNLLHEEVHSKPQMHTTERATVASETVETRPRKRE
Ga0066660_104376002       3300006800       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSETSRTGPAIVPDEKRETRHRNRKQSKTKETEVTDLNKPVSPSSMRISAIPEGHARRM*
Ga0099791_102376581       3300007255       Vadose Zone Soil   MSKVQEKEATLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQTHAADQATVTGEKKESRPRNKERAKPRSKD
Ga0099793_100798111       3300007258       Vadose Zone Soil   MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLSKRQEKTTERAVVVDEKKETRPRSKKQS
Ga0066710_1005462771      3300009012       Grasslands Soil    MSKVTEKEATLLNNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLWHEEVRNKPQAHATEQSTLTLEKTETRTRARERSKTRAKDVAELNEPKSPSSADVSAKPESHAKR
Ga0066710_1019084351      3300009012       Grasslands Soil    LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVQNKPQTHETDRPSLPEGKA
Ga0127482_11728441        3300010126       Grasslands Soil    LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTYETEHPTLPDGKAEIRPRKKDHSKTRDKTVTEPDEPATPSSAVIP
Ga0134088_100501305       3300010304       Grasslands Soil    MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHETEHAAAQDVKAAIRPLKK
Ga0134084_100927321       3300010322       Grasslands Soil    LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHETEHPTLPDGKAEIRPRKKDHSKTRDKT
Ga0134064_103598971       3300010325       Grasslands Soil    MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKKQSK
Ga0134065_103205891       3300010326       Grasslands Soil    MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHEREHAAAQDVKAAIRPLKKEHSKARQKTVTEPDAPATPS
Ga0134111_100634383       3300010329       Grasslands Soil    LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHET
Ga0134063_104290451       3300010335       Grasslands Soil    MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQSKTRETEVTELNKPVSPSST
Ga0134063_106016441       3300010335       Grasslands Soil    MSKVTEKEATLLNNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLWHEEVRNKPQAHATEQSTLTLEKTETRTRARERSKTRAKDVTEL
Ga0134062_101745931       3300010337       Grasslands Soil    LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHETEHPTLPDGKAEIRPRKKDHSKTGDKTVTEPDEQATPSSAIIPMKP
Ga0137364_113777042       3300012198       Vadose Zone Soil   LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQT
Ga0137365_103958811       3300012201       Vadose Zone Soil   MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAFAREFIKLADESYSELRHNWNLLHEEVRNQPETNTTGRAIV
Ga0137399_109793801       3300012203       Vadose Zone Soil   MSKLQEKDATLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQTHTIEPATVTDKKTETRPRKKEHS
Ga0137399_113078901       3300012203       Vadose Zone Soil   MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKTQTHTTERATVNDEKVETRPRKKDRA
Ga0137380_102044013       3300012206       Vadose Zone Soil   MSKVQGKDTTLFNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADDSYSELRHKWNLLTEEVRNKPQTHETEHAAAQHV
Ga0137377_104612863       3300012211       Vadose Zone Soil   MSKVQGKETTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQT
Ga0137372_107550791       3300012350       Vadose Zone Soil   MSKVQEKESTLLNILSSKGVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHETEHAAAQDVKAA
Ga0137366_110977431       3300012354       Vadose Zone Soil   MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQVAERATVTAEKTETRPRNKKQSKTRET
Ga0137371_104819371       3300012356       Vadose Zone Soil   MSKVQEKENTLLNNLSGTEIANELHSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKT
Ga0137384_104712731       3300012357       Vadose Zone Soil   MSKVQEKEATFLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNQPETNTTGRAIVADEKRET
Ga0137384_105319491       3300012357       Vadose Zone Soil   MSKVQEKDATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKPETNTTGRAIVADEKRVIRARNKKQSKTRETEVTELNKPVSPGSTDISEKPEGHPRRIKTTNRPKKG
Ga0137360_108235122       3300012361       Vadose Zone Soil   MGNIMIRLLANNWPPIGMSKVQEKDATLLSNLSGSEIANEMDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPDTNTTGRAIVADEKRETRPRNKKQSKPGETELTTLDEAKNPSTRETSPKPETHVKRAKNANRPK
Ga0137360_117201651       3300012361       Vadose Zone Soil   MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEKTTERATVTE
Ga0137361_112803222       3300012362       Vadose Zone Soil   MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPEIQTAERATVTDEKTE
Ga0137361_118214101       3300012362       Vadose Zone Soil   MSKVQEKDATLLSNLSGSDIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPPTQTPERAAITKEKTQSGPGNKQHRKT
Ga0137390_109863661       3300012363       Vadose Zone Soil   MSKVQEKDSTLLSNLSDTGIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLQEEVRNKPQTRAEDGATVRDQKTESKPRNKEHSKTKVKKE
Ga0137373_102779321       3300012532       Vadose Zone Soil   MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHETEHAAAQDVKAAIRPLKKEHSKARQKTVTEPDAPATPSSADVSTESE
Ga0137394_116281801       3300012922       Vadose Zone Soil   MIRLLAHDCSLIAMSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKTQTHTTERATVNDEK
Ga0137416_115246711       3300012927       Vadose Zone Soil   MSKLQEKETTLFNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLQEEVRSKPGTQITERAPVTDKKTETLPRSKKHLRNTAKEVTVLEEPEQTSSTEIITKPEGHPKRVATKKGL
Ga0134077_102637111       3300012972       Grasslands Soil    MSKVQEKEATLLGNLSGTEIANEMDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNKTGRAIVADEKRETRPRNKKQSKPGETELTTLDEAKSPSSREISPKPETQDPVF*
Ga0134077_105206951       3300012972       Grasslands Soil    MRLLVDREMSKVQEKDATLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSPTYTTERAAVAEEKTEPKPRNKE
Ga0134076_100664334       3300012976       Grasslands Soil    MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTLE
Ga0134076_102688951       3300012976       Grasslands Soil    LLNNLSDREIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHETDHPTLPDGKAEIRPRKKDHSKTGDKTVTEPDEQAT
Ga0134081_101166742       3300014150       Grasslands Soil    MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSEPSRTGPAIVADEKRETRPRNKKPSKTRETEVTELNKPVSPSSTDISEKP
Ga0134072_100333273       3300015357       Grasslands Soil    LLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVRNKPQTHETDHPTP
Ga0134089_101716351       3300015358       Grasslands Soil    MSKVTEKEATLLNNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLWHEEVRNKPQAHATEQSTLTLEKTETRTRARERSKTRAKDVTE
Ga0134069_10213761        3300017654       Grasslands Soil    MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNQPETNKTG
Ga0134112_101018061       3300017656       Grasslands Soil    MSKVQEKEATLLGNLSGTEIANEMDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNKTGRAIVADEKRETRP
Ga0134074_10842081        3300017657       Grasslands Soil    MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNQPETNKTGRAI
Ga0066655_103525922       3300018431       Grasslands Soil    MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKK
Ga0066655_110862151       3300018431       Grasslands Soil    MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETSRTGP
Ga0066669_105088171       3300018482       Grasslands Soil    VSKVQDKESTLLNNLSDREIAHELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLSEEVQNKPQTHETDRPSLP
Ga0066669_108250911       3300018482       Grasslands Soil    MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVGREVTERAGESCSEQAYHGRLLPEEVRNHPAMSKTGRAIAADM
Ga0210405_108792601       3300021171       Soil               VSRVQAKEANFLSNLSGTEIASELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVHNKPQTHAAEQATVTGEKKESQPRNKERGKPR
Ga0209237_11111513        3300026297       Grasslands Soil    LLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPETNSTGRAIVTDE
Ga0209236_12217011        3300026298       Grasslands Soil    MSKVTEKEATLLNNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLWHEEVRNKPQAHATEQSTLTLEKTETRTRARERSKTRAKDVTELNEPKSPSSADVSAKPESHAKRAKSVE
Ga0209469_10789062        3300026307       Soil               VQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTETRPRNKKQSKTRETKV
Ga0209761_10421704        3300026313       Grasslands Soil    MSKVQEKDATLLSNLSGSDIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPPTQTPERAAITKEKTQSGPGN
Ga0209470_12603971        3300026324       Soil               LLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKKPSKTRETEVTELNKPVSPSSTDISEKPEAHPRRIKTT
Ga0209801_10929991        3300026326       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSETSRTGPAI
Ga0209375_11232251        3300026329       Soil               MSKVQEKESTLLNNLSSTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVGNKSETSRTGPAIVADEKRETRPRNKKQSKPRE
Ga0209267_11596961        3300026331       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQS
Ga0209803_13012221        3300026332       Soil               MSKVQEKEATLLGNLSGTEIANEMDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNKTGRAIVADEKRE
Ga0209377_10329181        3300026334       Soil               MSKVQEKEATLLGNLSGTEIANEMDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNKTGRAIVADEKRETRPRNKKQSKPGETELTTLDEAKSPSSREISPKPETHVKRAKNAN
Ga0209159_12073411        3300026343       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLGDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETSRTGPAIVADEKRETRPRNKKQSKTRETE
Ga0209159_12436671        3300026343       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNKSETNTTGRAIVAGEKR
Ga0209690_12261871        3300026524       Soil               MSKVQEKENTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQIQTAERATVTAEKTE
Ga0209160_11776321        3300026532       Soil               MSKVQEKESTLLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVLNKPQEKTTERATVTEEKTE
Ga0209058_10278621        3300026536       Soil               LLNNLSGTEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPETNSTGRASVADEKRETRPRNKKQSKTRETEVTELNKAGSPSSTDISEKPE
Ga0209056_102051012       3300026538       Soil               MSKVQAKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVHNESQTHITGRATVPEEKAAARPRSKQRS
Ga0209376_10437815        3300026540       Soil               MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHETEHAAAQDVKAAIRPLKKEHSKARQKTVTEPDAPATPSSADVSTESEGITQHKSS
Ga0209474_105318802       3300026550       Soil               MSKVQEKESTLLNTLSSKEVANELDSHLRDLEKNIAVAREFIKLADESYSELRHKWNLLTEEVRNKPQTHETEHAAAQDVKAAIRPLKKEHSKARQKTVTEPDAPATPSSADVSTKSEGITQ
Ga0209474_107056241       3300026550       Soil               MSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHNWNLLHEEVRNQPETNKTGRAIVADEKRETRPRNKKQSKTRETEVTELNKPVSPSSTEISAK
Ga0209076_10140812        3300027643       Vadose Zone Soil   LSADDWPPIGMSKVQEKDATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLHEEVRNKPQTHTTERATVASERVKTRPRKKERSKTTQKKVAELEDLKDPTSTEITAKPEGHQK
Ga0137415_102258061       3300028536       Vadose Zone Soil   MIRLCACDCSLIAMSKVQEKEATLLSNLSGSEIANELDSRLRDLEKNIAVAREFIKLADESYSELRHKWNLLQEEVRNKPQTRAEDGATVRDQK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice