NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102911

Metagenome Family F102911

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102911
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 91 residues
Representative Sequence MFEAEAVVHDGNGDRRETIYTQLDWQLVDQRKHRLEYYEKLFLEMKGIIDRFPGQPVDHKEVLTGLLGKPHLHLGSRIVSGQMKRFTR
Number of Associated Samples 76
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 16.67 %
% of genes near scaffold ends (potentially truncated) 1.98 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 64
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (94.059 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(31.683 % of family members)
Environment Ontology (ENVO) Unclassified
(62.376 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.337 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 59.09%    β-sheet: 0.00%    Coil/Unstructured: 40.91%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF01037AsnC_trans_reg 11.88
PF00848Ring_hydroxyl_A 7.92
PF10094DUF2332 3.96
PF02700PurS 3.96
PF13589HATPase_c_3 2.97
PF13507GATase_5 1.98
PF02844GARS_N 0.99
PF00583Acetyltransf_1 0.99
PF02769AIRS_C 0.99
PF12710HAD 0.99
PF13840ACT_7 0.99
PF00903Glyoxalase 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG4638Phenylpropionate dioxygenase or related ring-hydroxylating dioxygenase, large terminal subunitInorganic ion transport and metabolism [P] 15.84
COG1828Phosphoribosylformylglycinamidine (FGAM) synthase, PurS subunitNucleotide transport and metabolism [F] 3.96
COG0151Phosphoribosylamine-glycine ligaseNucleotide transport and metabolism [F] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A94.06 %
All OrganismsrootAll Organisms5.94 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002562|JGI25382J37095_10001587All Organisms → cellular organisms → Archaea7008Open in IMG/M
3300005536|Ga0070697_100052812All Organisms → cellular organisms → Archaea3303Open in IMG/M
3300009090|Ga0099827_10006725All Organisms → cellular organisms → Archaea7074Open in IMG/M
3300026297|Ga0209237_1000095All Organisms → cellular organisms → Archaea41702Open in IMG/M
3300027643|Ga0209076_1001542All Organisms → cellular organisms → Archaea4745Open in IMG/M
3300028536|Ga0137415_10005321All Organisms → cellular organisms → Archaea12739Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil31.68%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil30.69%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil19.80%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil10.89%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.98%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.98%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.99%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1000861733300002558Grasslands SoilMFEAEAVVHDGNGDRRETIYTQLDWQLVDQRKHRLEYYKKLFLEMKGIIDRFPGQPVDHKEVLTGLLGKPHLHLGSRIVSGQMKRFTR*
JGI25385J37094_1001942333300002558Grasslands SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPRQPDDHKKVLAGLMGKPNLTPALRIVKGQMKRFGK*
JGI25383J37093_1004561213300002560Grasslands SoilHDGNGDRRETMYTQLDWQLVDQRKHRPEYYEKLFLEMKRVINRFPGQPDDHKKVLTGLLGKPNLTLASRIVNGQMKRFGK*
JGI25383J37093_1009505713300002560Grasslands SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPRQPDDHKKVLVGLMGKPNLTPALRIVKGQMKRFGK*
JGI25383J37093_1013735513300002560Grasslands SoilRHESGYIAFSPPSRIPFALYLALREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
JGI25384J37096_1017970233300002561Grasslands SoilLALRGRNQDFMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADQKEVLTGLLGKPQLRLGSRIVSGQMKRFTR*
JGI25384J37096_1022279113300002561Grasslands SoilFALYLVLREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
JGI25382J37095_1000158793300002562Grasslands SoilMFEAEAVVHDGNGDRRETIYTQLDWQLVDQRKHRLEYYKKLFLEMKGIIDSFPGQPVDHREVLTGLLGKPHLHLGSRIVSGQMKRFTR*
JGI25382J43887_1002126223300002908Grasslands SoilMFEAEAVIHDGNGDRRETXYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPRQPDDHKKVLAGLMGKPNLTPALRIVKGQMKRFGK*
JGI25386J43895_1007963123300002912Grasslands SoilESGYIAFSPPSRIPFALYLALREENQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKARLGYYENLFSDIQDAIDTLPRRQDGHKDVLTGLLGKPHLPLGSKIVSGQMKRFTR*
JGI25389J43894_100117823300002916Grasslands SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPGQPDDHKKVLAGLMGKPNLTPALRIVKGQMKRFGK*
Ga0066673_1001424953300005175SoilDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066685_1082242413300005180SoilPSRIPFALYLALREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066686_1016962223300005446SoilMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066686_1017121523300005446SoilMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDNLPRRQDGHKEVLTGLLGKPHLPLGSKIVNGQMKRFTR*
Ga0066689_1040122013300005447SoilNGDRRETVYTQLDWQLVDQNTHRREYYEKLFVEMKRIIERFPGQPADHKEVLTGLLGKPHLRLGSRIVSGQMKRFTR*
Ga0066689_1089333513300005447SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPGQPDDHKKVLAGLMGKPNLTPALRIVKGQMK
Ga0070707_10013830913300005468Corn, Switchgrass And Miscanthus RhizosphereMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRLEYYEKLFVEMKRIIEGFPGQPAGHKEVLTGLLGKPHLRLGSRIVSGQMKRFTR*
Ga0070697_10005281213300005536Corn, Switchgrass And Miscanthus RhizosphereSGYIAFSPPSRVPFALYLALRRRNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLLDQRNPRLEYYEKLFTQIRNIIEHLPEQPADKKEVLTGLLGRPHLQLGSRIVSGQMKRFTR*
Ga0066697_1000290933300005540SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFTEMKRIIERFPGQPPDQKEVLTGLLGKPQLRLGSRIVSGQMKRFTR*
Ga0066697_1068408923300005540SoilAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066701_1062784613300005552SoilEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHRKVLTGLLGKPNLTLASRIVNGQMKRFGK*
Ga0066701_1085560423300005552SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFVEMKRIIERFPGQPADHKEVLTGLLGKPHLRLGSRIVSGQMKRFTR*
Ga0066692_1001483743300005555SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHSLAYYEKLFLEMKRVINRLPGQPDDHKKVLTGLLGKPNLTLASRIVNGQMKRFGT*
Ga0066704_1006695843300005557SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFVEMKRIIERFPGQPADHKEVLTGLLGKPQLRLGSRIVSGQMKRFTR*
Ga0066698_1105271923300005558SoilYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGHKDVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066703_1025716023300005568SoilGDRRETMYTQLDWQLVDQRKHRPEYYEKLFLEMKRVINRFPGQPDDHKKVLTGLLGKPNLTLASRIVNGQMKRFGK*
Ga0066694_1003071113300005574SoilLYLALREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066691_1052629023300005586SoilYLALRGRNHDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHSLAYYEKLFLEMKRVINRLPGQPDDHKKVLTGLLGKPNLTLASRIVNGQMKRFGT*
Ga0066706_1000075713300005598SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADHKEVLTGLLGKPHLRLGSRIVSGQMKRFTR*
Ga0066706_1110912513300005598SoilLYLALREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDVINSLAKRQDGHEEVLTGLLGKSHLRLGSKIVSGQMKRFTR*
Ga0066656_1078914213300006034SoilVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0066665_1114219223300006796SoilEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADHKEVLTGLLGKPHLRLGSRIVSGQMKRFTR*
Ga0066659_1030998113300006797SoilQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRPEYYEKLFLEMKRVINRFPGQPDDHKKVLTGLLGKPNLTLASRIVNGQMKRFGK*
Ga0099791_1021937113300007255Vadose Zone SoilDRHESGYIAFSPPSRIPFDLYLALRSRNRDFMFEAEAVIHDGNGDRRETVYTQLDWQLVDERKHLLEYYEKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVSGQMKRFAR*
Ga0099794_1029232223300007265Vadose Zone SoilFALYLALRRRNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRIINRFPGQPDDRKEVLTGLLGKSNLTLASRIVNGQMKRFGK*
Ga0066710_10399909023300009012Grasslands SoilEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSKIQDTIDSLPRRQDGYEEVLTGLLGKPHLPLGSKIVSGQMKRFTR
Ga0099828_1120937213300009089Vadose Zone SoilREKNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDRKEVLTGLLGKPNLTLASRIVKGQMKRFGR*
Ga0099827_10006725103300009090Vadose Zone SoilAEAVLHDGNGDRRETVYTQLDWQLVDERKHLLEYYEKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVNGQMKRFAR*
Ga0099827_1028008133300009090Vadose Zone SoilFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHKEVLTGLLGKPNLTLASRIVKGQMKRFGK*
Ga0099827_1143502313300009090Vadose Zone SoilMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHKEVLTGLLGKPNLSLASRIVNGQMRRFGK*
Ga0134070_1005472013300010301Grasslands SoilLYITLRERNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGHKDVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0134088_1057051713300010304Grasslands SoilGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGHKDVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0134064_1026117923300010325Grasslands SoilGYIAFSPPSRIPFALYLALRERNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0134063_1010258813300010335Grasslands SoilFSPPSRIPFALYLALRERNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGHKDVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0134071_10000526143300010336Grasslands SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFTEMKRIIERFPGQPADQKEVLTGLLGKPQLRLGSRIVSGQMKRFTR*
Ga0137391_1114117613300011270Vadose Zone SoilMFEAEAVLHDGNGDRRETVYTQLDWQLVDERKHLLEYYEKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVNGQMKRFAR*
Ga0137389_1127845413300012096Vadose Zone SoilFMFEAEAVIHDGNGDRRETMYTQLDWQLLDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHKEVLTGLLGKPNLTLASRIVKGQMKRFGR*
Ga0137363_1153897613300012202Vadose Zone SoilDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDERKHLLEYYEKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVNGQMKRFAR*
Ga0137399_1108063213300012203Vadose Zone SoilPPSRIPFSLYLALREKNQDFMFEAEAVIHDGNGDRRETVYTQLDWQLLDQRKPKLRYYHNLFSQMRHTIHGLPGRQDGHKEVLAGLLGKPHLSIGSKIVTDQMKRFTR*
Ga0137399_1142500823300012203Vadose Zone SoilAIYLELRNRNRDFMFEAEAVIHDGNGDRRETIYTQLDWQLVDQRKHRLEYYEKLFDEIRRIISLLPGQPVDHKDVLTGLLGKPRLSSGSRIVSGQMKRFTR*
Ga0137380_1003074763300012206Vadose Zone SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDERKHLLEYYKKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHLTIGSKIVSGQMKRFAR*
Ga0137380_1030825333300012206Vadose Zone SoilNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDRFPGQPDDHKKVLTGLLGKPNLRIALRIVNGQMKRFGK*
Ga0137380_1034504833300012206Vadose Zone SoilAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHRKVLTGLLGKPNLTLASRIVNGQMKRFGK*
Ga0137381_1105940123300012207Vadose Zone SoilDRHESGYIAFSPPSRIPFDLYLALRSRNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDERKHLLEYYKKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHLTIGSKIVSGQMKRFAR*
Ga0137381_1106176213300012207Vadose Zone SoilGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDRFPGQPDDHKKVLTGLLGKPNLRLALRIVNGQMKRFGK*
Ga0137387_1034956023300012349Vadose Zone SoilHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHRKVLTGLLGKPNLTLALRIVNGQMKRFGK*
Ga0137387_1070493813300012349Vadose Zone SoilHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDRFPGQPDDHKKVLTGLLGKPNLRLALRIVNGQMKRFGK*
Ga0137386_1023613333300012351Vadose Zone SoilHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDHRKVLTGLLGKPNLTLASRIVNGQMKRFGK*
Ga0137386_1090845523300012351Vadose Zone SoilSRIPFDLYLALRSRNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDERKHLLEYYKKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHLTIGSKIVSGQMKRFAR*
Ga0137384_1096081813300012357Vadose Zone SoilDFMFEAEAVIHDGNGDRRETVYTQLDWQLVDQRKPRREYYEKLFLQMTHIIQQLPGKPVDQKEVLTGLLGRKRLAQGSRIVNKQMNRFTR*
Ga0137385_1051611813300012359Vadose Zone SoilRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDRFPGQPDDHKKVLTGLLGKPNLRLALRIVNGQMKRFGK*
Ga0137385_1059594323300012359Vadose Zone SoilMFEAEAVIHDGNGDRRETFYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGYKDVLTGLLGKPHLRLGSKIVSGQMKRFTR*
Ga0137360_1067492013300012361Vadose Zone SoilRETVYTQLDWQLVDERKHLLEYYEKLFLEMKRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVNGQMKRFAR*
Ga0137396_1079336113300012918Vadose Zone SoilAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEVEHAIESLPRRQDGHKDVLTGLLGKPHLPLVSKIVSGQMKRFTR*
Ga0137396_1119331323300012918Vadose Zone SoilAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIEHTIESLPRRQDGHKDVLTGLLGKPHLALVSKIVSGQMKRFTR*
Ga0134077_1000287023300012972Grasslands SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADQNEVLTGLLGKPQLRLGSRIVSGQMKRFTR*
Ga0134075_1000104533300014154Grasslands SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADQKEVLTGLLGKPQLRLGSRIVSGQMKRFTR*
Ga0134089_1024132023300015358Grasslands SoilPSRIPFALYLALRGRNHDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPGQPDDHKKVLAGLMGKPNLTPALRIVKGQMKRFGK*
Ga0134069_134481313300017654Grasslands SoilMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGHKDVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0134112_10000362143300017656Grasslands SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPPDQKEVLTGLLGKPQLRLGSRIVSGQMKRFTR
Ga0134074_121181813300017657Grasslands SoilRIPFALYITLRERNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDAIDSLPRRQDGHKDVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0066655_1104395423300018431Grasslands SoilEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQVKRFTR
Ga0066655_1111680223300018431Grasslands SoilEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0066667_1050029723300018433Grasslands SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPGQPDDHKKVLAGLMGKPNLTPALRIVKGQMKRFGK
Ga0066662_1265264013300018468Grasslands SoilAEAVIHDGNGDRRETMYTQLDWQLLDQRKHRLEYYEKLFLEMKRIINRLPGQPADHKEVLTGLLGRPHLPLGSRIVNGQMKRFTR
Ga0215015_1021091233300021046SoilVYKRQLYLALNGRNQDFMFEAEAVVHDGNGDRRETFYTQLDWQLVDQRKHRLEYYEKLFREMQRIIDHLPEQPADHKEVMAGLLGKPHLQLGSRIVSGQMKRFTR
Ga0209237_100009543300026297Grasslands SoilMFEAEAVVHDGNGDRRETIYTQLDWQLVDQRKHRLEYYKKLFLEMKGIIDSFPGQPVDHREVLTGLLGKPHLHLGSRIVSGQMKRFTR
Ga0209237_112013113300026297Grasslands SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRIINQLPGQPGDHKEVLTGLLGKPHLPLGSRIVNGQMKRFSR
Ga0209236_102642123300026298Grasslands SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVIDLFPRQPDDHKKVLVGLMGKPNLTPALRIVKGQMKRFGK
Ga0209238_103518533300026301Grasslands SoilRIPFALYLALREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0209055_123113123300026309SoilDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADQKEFLTGLLGKPRLRLGSRIVSGQMKRFTR
Ga0209268_102362013300026314SoilLRRRFDRHESGYIAFSPPSRIPFALYLALREKNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0209155_107172733300026316SoilEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0209377_133040723300026334SoilGNGDRRETIYTQLDWQLLDQRKPRLRYYENLFSEIQDAIDSLPRRQDGHKEVLTGLLGKPHLPLGSKIVSGQMKRFTR
Ga0209057_119632123300026342SoilFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0209057_123775223300026342SoilFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSKIQDTIDSLPRRQDGYEEVLTGLLGKPHLPLGSKIVSGQMKRFTR
Ga0257181_100095233300026499SoilMNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDERKHLLEYYEKLFLEMRRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVSGQMKRFAH
Ga0209808_106601523300026523SoilMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYESLFSEIQDTIDNLPRRQDGHKEVLTGLLGKPHLPLGSKIVNGQMKRFTR
Ga0209806_107780313300026529SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADHKEVLTGLLGKSHLRLGSRIVSGQMKRFTR
Ga0209806_124104923300026529SoilERNQDFMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYESLFSEIQDTIDNLPRRQDGHKEVLTGLLGKPHLPLGSKIVNGQMKRFTR
Ga0209058_102118043300026536SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFTEMKRIIERFPGQPPDQKEVLTGLLGKPQLRLGSRIVSGQMKRFTR
Ga0209156_1029030013300026547SoilMFEAEAVIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0209161_1000630963300026548SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERFPGQPADHKEVLTGLLGKPHLRLGSRIVSGQMKRFTR
Ga0209474_1052940913300026550SoilIHDGNGDRRETIYTQLDWQLLDQRKPKLRYYENLFSEIQDTIDSLPRRQDGHEEVLTGLLGKPHLRLGSKIVSGQMKRFTR
Ga0209076_100154273300027643Vadose Zone SoilMFEAEAVVHDGNGDRRETIYTQLDWQLVDQRKHRLEYYEKLFLEMKGIIDRFPGQPVDHKEVLTGLLGKPHLHLGSRIVSGQMKRFTR
Ga0209283_1022418613300027875Vadose Zone SoilLALRGRNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLLDQRKHRLEYYEKLFLEMKRIINRLPGQPADHKEVLTGLLGRPHLPLGSRIVNGQMKRFTR
Ga0209283_1054932423300027875Vadose Zone SoilNQDFMFEAEAVIHDGNGDRRETMYTQLDWQLVDQRKHRLEYYEKLFLEMKRVINRFPGQPDDRKEVLTGLLGKPNLTLASRIVKGQMKRFGR
Ga0209283_1086903523300027875Vadose Zone SoilMFEAEAVIHDGNGDRRETVYTQLDWQLVDERKHLLEYYEKLFLEMKRIIERLPRRRADHKGVLTGLLGKPHVTIGSKIVSGQMKRFAR
Ga0137415_1000532123300028536Vadose Zone SoilMFEAEAVIHDGNGDRRETMYTQLDWQLVDERKHLLEYYEKLFLEMKRIIERLPRRRADHKEVLTGLLGKPHVTIGSKIVNGQMKRFAR
Ga0307471_10374002613300032180Hardwood Forest SoilFALYLALRGRNQDFMFEAEAVIHDGNGDRRETVYTQLDWQLVDQNTHRREYYEKLFIEMKRIIERVPGQPADHKDVLTGLLGKPHLRLGSKIVNGQMKRFTR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.