NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F022770

Metagenome / Metatranscriptome Family F022770

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F022770
Family Type Metagenome / Metatranscriptome
Number of Sequences 213
Average Sequence Length 80 residues
Representative Sequence MKTNWPALRHGYLILASMVLAASPVLAQTQSPAGATSTSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVL
Number of Associated Samples 173
Number of Associated Scaffolds 213

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 99.06 %
% of genes near scaffold ends (potentially truncated) 98.59 %
% of genes from short scaffolds (< 2000 bps) 90.14 %
Associated GOLD sequencing projects 163
AlphaFold2 3D model prediction Yes
3D model pTM-score0.26

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.531 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(14.085 % of family members)
Environment Ontology (ENVO) Unclassified
(38.028 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.258 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 18.69%    β-sheet: 0.00%    Coil/Unstructured: 81.31%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

10203040506070MKTNWPALRHGYLILASMVLAASPVLAQTQSPAGATSTSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.26
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Watersheds
Bog Forest Soil
Peatland
Freshwater Sediment
Soil
Watersheds
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Glacier Forefield Soil
Grasslands Soil
Surface Soil
Switchgrass Rhizosphere
Agricultural Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Bog Forest Soil
Soil
Soil
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Palsa
Miscanthus Rhizosphere
Ectomycorrhiza
Rhizosphere
Miscanthus Rhizosphere
Boreal Forest Soil
13.6%8.9%2.8%14.1%4.7%12.2%6.1%7.5%2.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AF_2010_repII_A100DRAFT_103204513300000655Forest SoilMKTNWPALKYGCLILASLTLPTSAPAQTVVSTGAGKSTAVISSVAITQAPAQSAVRVEGAGNLEVRTATMQHPDRLVLDFVGARLAVHKTVIPGVSAPVLGVRMG
JGI12032J12867_100965923300000912Forest SoilMKTNWPALRHGYLILASMALANSPVLAQSPAGTTVTAAVISSVAITQAPERAAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARLA
JGI12683J13190_102437013300001089Forest SoilMKTNWPALRHGYLILASMALANSPVLAQSPAGTTVTAAVISSVAITQAPERAAVRVEGEGRLDVHAGRMQNPDRLVLDFAGAR
JGIcombinedJ26739_10100634813300002245Forest SoilMKTIRPALRNGCLILANMALVTSPLLAQNAEVSGKTASATITNTTSGNAVASVISSVAITQAPERAAVRVEGEGRLEVKAARLQSPDRLVLDF
JGIcombinedJ26739_10181140023300002245Forest SoilMKTNWPAIKVGCLILASAGLTTVPTLAQTQESAGTSATASIISSVAITQASQHATVRVEGEGPLQVRASRIQDPDRLVLDFS
JGI25614J43888_1021026013300002906Grasslands SoilMKTNWPALRSSCLILASMTAATSPVLAQTQEPAGATAVSTVISSVAITQAPRRASVRVEGEGRLDVHAARMQNPDRLVLDFAGARLAVQRTA
JGI25382J43887_1015509713300002908Grasslands SoilMKTNWPALRNGCLILASLACSTAPLLAQXQESTGVTATAVISSVAVMQGAQRASVRVEGEGRLEARAVRMQNPDRLVLDFAGT
JGI25617J43924_1010032413300002914Grasslands SoilMKTNWPALRHGYLILASMALANSPVLAQSPAGTTVTAAVISSVAITQAPERAAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARLAVQK
JGI25617J43924_1010041423300002914Grasslands SoilMKTNWPAIRHGYLILASMALATSPVLAQTQPPAGATPTAAVISSVAVTQAPQRAAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARLAVQKTV
JGI26342J46808_100533033300003220Bog Forest SoilMKTNWPAFRCGFLILSSAVMGASPLLAQTQEPAGATATSAVISSVAITQAPQRASVRVDGAGRLDVRASRMQNP
JGI26344J46810_100763523300003224Bog Forest SoilMKTNWPAFRCGFLILSSAVMGASPLLAQTQEPAGATATSAVISSVAITQAPQRASVRVDGAGRLDVRASRMQNPDRLVLD
JGIcombinedJ51221_1022474323300003505Forest SoilMKTNWQALKNGCLTLASLALAASPLLAQTQEPAGANAAIISSVAITQAPQHAAVRVEGEGRLDVHAARMQNPDRLVLDFAGARL
Ga0058891_151319623300004104Forest SoilMKTNWPALRHGCLILASMALATSPVLAQMQEPSGAVANAAVISSVAITQAPLRAAVRVEGEGRLDVHAARMQNPDRLVLDFAGARLAVDKT
Ga0058893_134108713300004117Forest SoilMKTNWPAFRCGSLILSSAVLGATPMLAQTQEPAGATATSAVISSAAITQAPQRSSVRVEGEGRLDARVSRMQNP
Ga0058901_147868413300004120Forest SoilMKTNWPALRHGYLILLSMAAAASPLLGQTEGPTGATSTSAVISSVAITQAPERSAVRVEGVGHLDVHAGRMQKPDRLVLDFAGARLAVQKTVIPG
Ga0058906_132942223300004134Forest SoilMKTNWQALRNGCLTLASLVLVASPLLAQTQEPAGANGAVISSVAITQAPQHAAVRVEGEGRLDVRAARMQNPDRLV
Ga0066398_1013815823300004268Tropical Forest SoilMITNWPALRNGCLILASVAFGALPSLAQTQEPAGPAAAASVISSVAITQGAQHASVRVEGEGRLDAHALRMQ
Ga0066869_1006329013300005165SoilMKTNRPAVRNGCLILASLVWSAAPVLAQTQESAETLANAAVISSVAVAQASQRASVRVEGEGKLDARAVRMH
Ga0066677_1010235213300005171SoilMKTNWPALKYGCLILASLTLPTSVPAQTLVPSSASKSAAVISSVAITQAPEQSAVRVEGEGKLDVRPARMQHPDRLVLDFVGAKL
Ga0066680_1018514513300005174SoilMKTNWPALRHGCLILASVALVTSPMLAQTQETSGATTTAAMISSVAITQAPQRAAVRVEGEGRLDVRAARMQNPDR
Ga0066680_1058398913300005174SoilMKTNWPALRYGCLILAGFTLAAKTVPAQTAEPAGRKSAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHKTLIPGV
Ga0066688_1090867923300005178SoilMITNWPALRNGCLILASVALTSVPAIAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLD
Ga0066685_1047160313300005180SoilMKTNWPALRHGYVILASMALANSSVLAQTPAGRAAAVISSVAITQVPQRAAVRVEGQGRLDAHAARLQNPDRLVLDFAGARLAV
Ga0066685_1091775423300005180SoilMITNWPALRHGCLILASVALSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDARAARMANPDRLVLDFAGTRLA
Ga0065705_1058896813300005294Switchgrass RhizosphereMKTNWPALRNGCLILASLACCEAPLLAQSHETAAVAPGASVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLVLDFAGTRMAVQ
Ga0066388_10417135123300005332Tropical Forest SoilMKTTWPALRHGCLILAGVALGGSPVLAQTQEPAGATATASVISSVAITQAPQRAAVRVEGEGRLDARA
Ga0066388_10565600823300005332Tropical Forest SoilMKTNWPAVKYGCLILASLTLPTSAPAQTVVPTSAGKSNAIISSVAVTQAPEQSAVRVEGEGNLDVRPATMRHPDRL
Ga0068869_10139233213300005334Miscanthus RhizosphereMKTNWPALRNGCLILASLACCEAPLLAQSHETAAVAPGASVISSVAVTQASARSSVRVEGEGELYAR
Ga0070689_10143976523300005340Switchgrass RhizosphereMKTNWPALRNGCLIFTSLACGAAPLLAQSHDAAEVARGGAVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLVL
Ga0070711_10066129523300005439Corn, Switchgrass And Miscanthus RhizosphereMMTNWPALRNGCLILASLACCEAPLLAQSHETAAVAPGASVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLVLDFAGTRMAVQKTVI
Ga0070708_10012254913300005445Corn, Switchgrass And Miscanthus RhizosphereMMTNWPAMRIGCLILASAALTTSSSFAQTQEPAGTGATASVISSVAITQAPQRASVRVDGAGPLEVHASRVQDPDR
Ga0066682_1062468213300005450SoilMKTNWPALRNGCLILASLACSTAPLLAQSQESTGVTATAVISSVAVMQAAQRASVRVEGEGRLEARAVRMQNPDRLVLDFAGTRMAV
Ga0066687_1022853033300005454SoilMKTNWPALRNGCLILASLACSAAPLLAQSQESTGGVTATAVISSVAVMQAAQRARVRVEGQGRLEARAVRMQNPDRLVLDFAGTRMAV
Ga0070730_1016573513300005537Surface SoilMKTNWPALRTGCLILASVACGAAPVLAQSHASLELQPGAATISSVAVTQATQHASVRVEGEGHLDAHALRMQNPDRLVLDFVGTRMTVQKT
Ga0070686_10049172513300005544Switchgrass RhizosphereMMTNWPALRNGCLILASLACCEAPLLAQSHETAAVAPGASVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDR
Ga0070696_10197672013300005546Corn, Switchgrass And Miscanthus RhizosphereMKTNWPALRNGCLILASLASSAAPLLSQTHATSGGAVISSVAVTQAAARSSVRVEGEGELFAHATRMQNPERLVLDFADT
Ga0066661_1056893723300005554SoilMKTNRPALRRGYMILAGVALVSSPAFAQTPAGVTATAAVISSVAITQAPERSAVRVEGEGRLEVHAGRMHNPDRLVLDFAGARLAVQKTVIP
Ga0066661_1088398413300005554SoilMKTNWPALWHGYLMLLSMALASSPVLAQTPGGATATTAVISSVAITQAPQRSAVRVEGEGRLDVHAGRM
Ga0066698_1110325023300005558SoilMKTNWRALKYGCLVLAGLTLATKPAPAQTVESTGAKKSAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHKTVIP
Ga0066699_1120536713300005561SoilMMTNWPALRNGCLIFASLACGVAPVLAQSNESAGSTVTSAVISSVAVMQASQRASVRVEGEGRLDAQAVRMHNPDRLVLDFPATRMAVQRTIIPGV
Ga0066691_1051327023300005586SoilMKTNWPALRHGYLILAGMALATSPLLAQTQAPAAATSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLDF
Ga0066691_1089297823300005586SoilMITNWPALRHGCLILASVALSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHAARM
Ga0066654_1075829023300005587SoilMKTNWRALKYGCLVLAGLTLATKPAPAQTVESTGARKSAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPDRL
Ga0070763_1062724313300005610SoilMKTTRPALRYGCLILASMALAISPVLAQTPEPAGATAAPTVISSVAITQAPERAAVRVEGEGKLEVRAARMQNPERLVLDFAGARLQVDKT
Ga0070764_1068241513300005712SoilMKTNWPAFRCGFLILSSAALGANPMLAQTQEPAGATATSAVISSVAITQAPQRATVRVEGEG
Ga0066903_10297938023300005764Tropical Forest SoilMKTNWPALKYGCLILASLTLATKPAPAQSLQPTGAGKSPAVISSVAITQAPEHSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHKTAIPGVSAP
Ga0066903_10891201223300005764Tropical Forest SoilMKTNWPALRNGCLILASLACCEAPLLAQSHETAAVAPGASVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLVLDFSGTRMAVQK
Ga0070766_1090450623300005921SoilMKTNWPAFRCGCLILSSAVLGASPMLAQTQEPAGATAISAVISSVAITQAPQRASVRVEGEGRLDVHALRM
Ga0066651_1069622313300006031SoilMITNWPALRHGCLILASVALSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHAARMANPDRLVLDFA
Ga0066696_1048789513300006032SoilMITNWPALRHGCLILASVALSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRL
Ga0066656_1036753613300006034SoilMKTNWPALRHGCLILVSIAVAASPALAQTKEPSGTTTTGAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQDPDRLVLD
Ga0066652_10005868853300006046SoilMKTNWPALKYGCLILASLTLATKPVPAQTQVSTGASKSAAVISSVAITQAPEHSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHQTVIPG
Ga0075024_10060730423300006047WatershedsMKTNWPAFRHGCLILASIAVAASPAFAQTQEPSGTTAPGAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQDPD
Ga0075015_10038905713300006102WatershedsMKTNWPALRHGYLILASMAVAGSPMLAQTQAPAGATSTSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLDF
Ga0075015_10084952913300006102WatershedsMKTNWPAFRGGFLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLDARVSRMQNPERLV
Ga0070715_1023656713300006163Corn, Switchgrass And Miscanthus RhizosphereMITNWPALRNGCLILASVAFASVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHASRMANPDRVV
Ga0070715_1034163123300006163Corn, Switchgrass And Miscanthus RhizosphereMTTNWPALRNGCLILASMACGAAPMLAQPQESAGPKVTAAVISSVAVMQVAQRASVRVEGEGHLEAQAVRMHNPDRVVLDFAGTRMAVQKTIIPGVSAPVR
Ga0075018_1022430213300006172WatershedsMKTNGLALGIGCLILVGTMPGIRPAMGQTQAAAEAAVTPAVISSVAITQAPQRASVRVEGEGRLEVRAARMQSPDRLVLDFV
Ga0070765_10123654313300006176SoilMKTNWPALRHGCLILASMALATSPVLAQTQEPSGAVANAAVISSVAITQAPLRAAVRVEGEGRLDVHAARMQN
Ga0070765_10184011223300006176SoilMKTNWPAFRCGCLILSSAVLGATPMLAQTQEPAGATATSAVISSVAITQAPQRSSVRVEGEGRL
Ga0079222_1153749613300006755Agricultural SoilMKTNWPAIRIGCLGLASAAMSSSGLLAQTEQPNGASPVVSVISNVAIMQAPNQASVRVQGEGPLEVHTSRMQDPERLVLDFTATRLSVRKTVVPG
Ga0066653_1016883023300006791SoilMKTNWPALKYGCLILASLTLATKPVPAQTQVSTGASKSAAVISSVAITQAPEHSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLA
Ga0066665_1011179243300006796SoilMKTNWPALRHGYVILASMALANSSVLAQTPAGRAAAVISSVAITQAPQRAAVRVQGQGRLDAHA
Ga0099793_1005040313300007258Vadose Zone SoilMKTNWPALRHGCLILVSIAVAASPALAQTKEPSGTTTTGAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQDPDRLVLDFAGAR
Ga0099793_1011918613300007258Vadose Zone SoilMKTNWPALRHSYLILASLVLASSPVHAQSQVPAGVTATATVLSSVAITQAPQRSAVRVEGEGRLDVHAGRMQNPD
Ga0099828_1125159013300009089Vadose Zone SoilMKTNWPALRCGCLILASMGAAVCPALAQNQEPAGAPTKTAVISSVAILQAPEHASVRVEGEGRLEVHASRMQNPERLVLDFSRARLA
Ga0105245_1239403923300009098Miscanthus RhizosphereMKTNWPALRNGCLIFTSLACGAAPLLAQSHDAAEVARGGAVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLV
Ga0116105_120872013300009624PeatlandMALALSPVLAQTADPSGPSAVMGTTAAVISSVAITQAPERAAVRVEGEGHLDVHAARLQSPERLVLDFA
Ga0126380_1134297823300010043Tropical Forest SoilMKTNWPALRCGCLILAGVALGLSPVLAQTQEPAGATATASVISSVAITQAPQHAAVRVEGEGKLDARASRMQNPD
Ga0126380_1142934823300010043Tropical Forest SoilMITNWPALRNGCLILASVAFGTMPALAQTQEPAGATATATVISSVAITQGAQRASVRVEGEGRLE
Ga0126382_1241615523300010047Tropical Forest SoilMITNWPALRNGCLILASVAFGSLPTLAQTQEPAGPTATASVISSVAITQGAQRASVRVVGEGRLDARAARMQNPD
Ga0134088_1001684013300010304Grasslands SoilMKTNWPALRHGYLILASVALTTSPTLAQTPAGGTAAGAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPERLVLDF
Ga0134109_1009574023300010320Grasslands SoilMKTNWPALRHGYLILASVALATSPTLAQTPAGATAGGAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPERLVLDFAGARLAVQK
Ga0134084_1001253343300010322Grasslands SoilMKTNWPALRHGYLILASVALTTNPTLAQTPAGATAGGAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPERLVLDFAGARLAVQKPVIPGI
Ga0074044_1062571123300010343Bog Forest SoilMKTNWPAFRCGFLILSSAAMGASPMLAQTQEPAGATATSAVISSVAITQAPLRASVRVEGEGRLDAHV
Ga0074044_1105820013300010343Bog Forest SoilMKTTRPALRYGCLILASLALAASPVLAQTVEPGGTPTIGATASVISSVAITQAPERAAVRVEGEGRLDVHAARLQSPERLVLDFAGARLHV
Ga0126376_1138807323300010359Tropical Forest SoilMKTNWPALRCGFLILSGAALSFGPAMAQTQEPAGATATSAVISSVAITQAPERASVRVEGEG
Ga0126376_1209830423300010359Tropical Forest SoilMKTNWPALRCGCLILAGVALGLSPVLAQTQEPAGATATASVISSVAITQAPQHAAVRVEGEGKLDARA
Ga0126372_1130367823300010360Tropical Forest SoilMITNWPALRNGCLILASVAFASMPALAQTQEPAGATATASVISSVAITQGAQNASVRVEGDGRLDARAVRMQNPD
Ga0126372_1309363113300010360Tropical Forest SoilMKTNWPALRHGYLILASVALTTSPMLAQTQEAAGAKKSAAVISSVAITQSPERAAVRVEGEGRLDVRAAR
Ga0126377_1118754713300010362Tropical Forest SoilMITNWPALRNGCLILASVAFGSLPALAQTQEPAGPTASASVISSVAITQGAQRSSVRVEGEGRLEPHPVRMQNPDRLVLDFAGA
Ga0126379_1101133513300010366Tropical Forest SoilMKTNWPALRCGCLILAGVALGLSPVLAQTQEPAGATATASVISSVAITQAPQHAAVRVEGEGKLDARASRMQNPDRLVLDFAGTR
Ga0126379_1146046023300010366Tropical Forest SoilMKTNWPALRNGCLILATTAVTMSPAIAQTQESAGASKSAAVISSVAITQAPERSAVRVEGEGRLDVRA
Ga0126381_10501108013300010376Tropical Forest SoilMKINWPALRCGCLILAGAALGLSPVLAQTQEPAGATATASVISSVAITQAPQHASVRVEGEGKLDARASRMQNPDRL
Ga0126383_1161705323300010398Tropical Forest SoilMKINWPALRCGCLILAGVTLAGGTALGQAQQPAGATATASVISSVAITQAPQRAAVRVEGEGKL
Ga0126383_1173997413300010398Tropical Forest SoilMITNWPALRNGCLILASVAFGTMPALAQTQEPAGATATASVISSVAITQGAQRASVRVEGEGRLEPRAARMQNPDRLVLDFVGA
Ga0126383_1227868713300010398Tropical Forest SoilMKTNWPALKNGCLILASMALAASPALAQTQESAGTSRAAVISSVAITQSPERSAVRVEGEGRLEVRAARMQNPDRVVLDFVGARLAVHRTA
Ga0126358_113125813300010856Boreal Forest SoilMKTNWPALRHGCLILASIAVASVPAFAQTQEPSGTTATGAIISSVAITQALQRSAVRVEGEGRLEVHAARMQDPDRLVLDFAGARLAVQKTAIP
Ga0150983_1284876813300011120Forest SoilMKTNWPALKNGCLIFASLALATSPVLAQTQEPAGTNGSAAVISSVAITQAPQHAAVRVEGEGRLDVRAARMQNPDRLV
Ga0150983_1417327713300011120Forest SoilMKTNWPAMRSGCLILASAAVLTCPCVAQTQESAGANATASVISSVAITQAPERASVRVEGEGPLQV
Ga0137393_1029272213300011271Vadose Zone SoilMKTNWPALGNGCLILASLACGTAPVLAQSQEFAGATATVAVISSVAVMQASQRASVRVEGEGRLDAHAVRLHNPDRLVLDFADTRMAVQRMIIP
Ga0137463_126704523300011444SoilMKTNWPAFRHGCLILASIAVATSPVFAQTQEPSGATATAAVISSVAITQAPQHAAVRVKGEGRLDVRA
Ga0137389_1159863313300012096Vadose Zone SoilMKTNWPALRHGCLILASVALATSPMLAQTQEPSGARKSAAVISSVAITQSPERAAVRVEGEGRLDVRAARMHSPDRLV
Ga0137383_1082044613300012199Vadose Zone SoilMITNWPALRNGCLILASVALISVPAIAQTQEPAGPTATASVISSVAITQGAQRSSVRVEGEGRLDAHAARMSNPDRLVLDFAGTRLAVQRTMIPGVAAPV
Ga0137382_1041487713300012200Vadose Zone SoilMKTNWPALKYGCLILASLTLATKPVPAQTQVSTGASKSAAVISSVAITQAPEHSAVRVEGEGKLDVRAARMQHPDRRVLDF
Ga0137363_1160680413300012202Vadose Zone SoilMKTNWLALRNGCLILASLACNTAPLLAQSQAPNSETANAAVISSVAVTQGTQRASVRVEGEGRLEARAARLQNPDRLVL
Ga0137399_1156200213300012203Vadose Zone SoilMKTNWPALRHGYLILVSMALATCPVLAQTQAPAGATAAGAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARLAVQKTV
Ga0137381_1139814023300012207Vadose Zone SoilMKTNWPALKYGCLILAGFTLAAKTVPAQTAEPAGARKSAVISSVAITQAPEHSAVRVEGEGKLDVRAARMQHPD
Ga0137381_1140116023300012207Vadose Zone SoilMKTNWPALRHGYLILAGVALASSPLLAQTQATSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARL
Ga0137377_1031528013300012211Vadose Zone SoilMKTNWLALRNGCLILASLACNTAPLLAQSQAPNSEPANAAVISSVAVTQGSQRASVRVEGEGRLEARAVRMQNPDRLV
Ga0137387_1061025833300012349Vadose Zone SoilMITNWPALRNGCLILASAAFSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAH
Ga0137360_1095870623300012361Vadose Zone SoilMKTNWPALRHGCLILVSIAVAASPALAQTKEPSGTTTTWAVISSVAITQAPQRSAVRVEGEGRLDVHAARMQDP
Ga0134031_104124813300012388Grasslands SoilMKTNWPALRHGYLILASVALATSPTLAQTPAGATAGGAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPERLVLDFAGARLAVQKTVIP
Ga0137397_1043160923300012685Vadose Zone SoilMITNWPALRNGCLILASMALTSVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHAARMANP
Ga0137394_1005075463300012922Vadose Zone SoilMKTNWPAFRDGCLILASIAGAASPALAQTQEPSGTTPTGAVISSIAVTQAPQRAAVRVEGEGRL
Ga0137394_1118531413300012922Vadose Zone SoilMKTNWPAFRHGCLILASIAGAASPALAQTQEPSGTTPTGAVISSIAVTQAPQRAAVRVEGEGRL
Ga0137359_1074574423300012923Vadose Zone SoilMKTNWPALRNGCLILASLACSTAPLLAQSQESTGGVTATAVISSVAVMQAAQRASVRVEGEGRLEARAVRMQNPDRLVLDFAGTRMA
Ga0137359_1131091723300012923Vadose Zone SoilMKTNWPALRHGYLILASMALATCPVLAQTQAPAGATAAGAVISSVAITQVPERSAVRVEGEGRLDVHAGRMQNPDRLVL
Ga0137404_1042662033300012929Vadose Zone SoilMKTNWPALRNGCLILASLACSTPPLLAQSQEPTGGVTATAVISSVAVMQAAQRASVRVEGEGRLEARAVRM
Ga0137404_1152585113300012929Vadose Zone SoilMKTNWPAFRHGCLILASIAGAASPALAQTQEPSGRTPIGAVISSVAVTQAPQRAAVRVEGEGRLSVRAARMQDPERLVLDFAGARLAVQKTVIPGVSAPVRG
Ga0137407_1123092813300012930Vadose Zone SoilMILASVVLASSPALAQTPAGVTTTAAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARLAVQKTVIP
Ga0137407_1176167023300012930Vadose Zone SoilMKTNWPALRRGYLILASMALANSPAFAQTPAGVTSTAAVISSVAITHAPERSAVRVEGEGRLD
Ga0126369_1333514213300012971Tropical Forest SoilMKTNWPAFRCGCLILASVAFNSLPTLAQTQEPVGPTAAASVISSVAITQGAQRSSVRVEGEGRLDARMSRMQNPDRLVLDF
Ga0134076_1025708113300012976Grasslands SoilMKTNWPAIRCGCLLLGAVALTTPAVRAQAPETAGTPQGAAVISSVAITQAERRASVRVEGEGRLDVRAARMQDPDRLVLDFNGARLAVPK
Ga0137405_120282513300015053Vadose Zone SoilMITNWPALRNGCLILASVAFGSLPSLAQTREPAGPTATASVISSVAITQGAQRASVRVEGEGRLDARAARMQNPDRLVLDFAGTRLAVQKTMIPGVR*
Ga0137420_123204023300015054Vadose Zone SoilMKTNWPALRHGYLILASMALANSPVLAQSPAGTTVTAAVISSVAITQAPERAAVRVEGEGRLDVHAGRMQN
Ga0167661_100356653300015167Glacier Forefield SoilMTTNWPALRNGCLILASLACSAAPLLAQSQESAGIPATNAVISSVAVMQALQRASVRVEGEGRLDAQAVRLHNPDRL
Ga0137403_1086264413300015264Vadose Zone SoilMKTNWPAFRGGFLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLD
Ga0134072_1002528023300015357Grasslands SoilMKTNWPALKYGCLILASLTLPTSVPAQTLVPSSASKSAAVISSVAITQAPEQSAVRVEGEGKLDVRPARMQHPDRLVLDFVGA*
Ga0182035_1191553213300016341SoilMKTNWPALRHGCLILASMALATSPMLAQTQEPAGARNSAAVISSVAITQSPECEAVRVQGEGR
Ga0187824_1022084123300017927Freshwater SedimentMKTNWPALRPSGLILASMMAVTSPLLAQTQGESRATGVASVISGVALTQAPQHASVRVEGEGRLEVRAARMQNPDRLVLDFSG
Ga0187825_1026378123300017930Freshwater SedimentMKTNWPALRPSGLILASMMAVTSPLLAQTQGESRATGVASVISGVALTQAPQHASVRVEGEGRLEVRAARMQNPDRLVL
Ga0187779_1021304033300017959Tropical PeatlandMKTNWPALRCGCLILAGVALGLSPVLAQTQEPAGATATASIISSVAITQAPQHAAVRVEGEGKLDARASRMQNPDRLVLDFAG
Ga0187780_1088628023300017973Tropical PeatlandMKTNWPALRCGFLILTSAALGVGPVLAQTQEPAGATATAAVISSVAITQATQRASVRVEGEG
Ga0187782_1159658113300017975Tropical PeatlandMKTNWPAFRHGYLIVVGMALAATPALAQTQEPASATASTAVISSVAITQAPQHSVVRVDGEGRLEVHAARMQNPDRLVLDFAGAKLAVQ
Ga0187823_1039193923300017993Freshwater SedimentMKTNWPALRNGCLILASVACGAAPLLAQNHAPLELQPGAAIISSVAVTQATQHASVRVEGEGRLDAHALRMQNPDRLVLDFAGTR
Ga0066669_1065251023300018482Grasslands SoilMKTNWRALKYGCLVLAGLTLATKPAPAQTVESTGARKSAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPD
Ga0066669_1073167623300018482Grasslands SoilMITNWPALRHGCLILASVALSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLD
Ga0210407_1091032413300020579SoilMKTNWPAFRHGYLILAGMALTSSPMLAQTQAPAGVTASAAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVLDFAGAR
Ga0210407_1105026213300020579SoilMKMNWPALRSSGLILASMVLVTSPVLAQTQEPSGATAVASVISSVAITQAPQQASVRVEGQGRLDVRTARMQNPYRLVLDFTRTRL
Ga0210407_1135486813300020579SoilMKTNWPAYRFGCLILASMVVAAVPSLAQTQEPTGVTATASVISSVAITQASQRAAVRVEGEGRLDAHASRMQNPDRVVLDFSGTRLA
Ga0210403_1029263333300020580SoilMKTNWPAFRCGCLILSSAVLGASPLLAQTQQPAGATATSAVISSVAITQAPQRASVRVEGEGRLDVHALRMQNPDRLVLDFSGARMAVDK
Ga0210403_1075541613300020580SoilMKTTRPALRYGCLILASMALAISPVLAQTPEPAGATAAPTVISSVAITQAPERAAVRVEGEGKLEVRAARMQNP
Ga0210399_1050833523300020581SoilMKTNWPAFRGGFLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLDARVSRMQNPERLVLDFSGTR
Ga0210395_1007380153300020582SoilMKTNWPAYRFGCLILASMVVAAVPSLAQTQEPTGVTATASVISSVAITQASQRAAVRVEGEGRLDAHASRMQNPDRVVLDFSGTRLAVQKT
Ga0210383_1008424153300021407SoilMKTNWPAFRCGCLILSSAVLGVSPMLAQTQEPAGATAASAVISSVAIMQAPQRASVRVEGEGRLDAHVSRMQNPERLVLDFSGTRLAVDKTI
Ga0210409_1073009113300021559SoilMKTNWPALRHGFLILASITVATSPVFAQTQEPIGATAAPSVISSVAITQAPQRAAVRVEGEGRLDVHAARMQNPDRLVLDFAGARLAVQKTVIPGVSAPVL
Ga0126371_1046861413300021560Tropical Forest SoilMKINWPALRCGCLILAGVMLAGGTALGQAQQPAGATATASVISSVAITQAPQRAAVRVEGEGKLDAHASRMQNPDRLVLDFAGTRL
Ga0126371_1052956213300021560Tropical Forest SoilMKTNRRALKYGCLILAGLTLAAKPAPAQTAESAGARKSAVISSVAITQAPEHSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHKTMI
Ga0126371_1116506933300021560Tropical Forest SoilMKTNRPALRYGFLILTGLALAASPVFAQTQEPAGATATSAVISSVAITQAPERASVRVEGEGRLDVHAA
Ga0126371_1352153613300021560Tropical Forest SoilMKTNWPALRCGCLIVSSVVMGVGPLRAQTPEPAGATATSAVISSVAITQASEHASVRVEGEGRLDVHTSRMQNPLRLVLDFA
Ga0213853_1031346433300021861WatershedsMKTNWPALRHGYLILASMVLAASPVLAQTQSPAGATSTSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMQNPDRLVL
Ga0242662_1025669613300022533SoilMKTNWPAFRHGYLMLAGMALAISPVLAQTQEPLGATASAAVISSVAITQAPQRSAVRVAGEGRLDVHAARMQNPDRLVLD
Ga0242661_101346913300022717SoilMKTNWPAFRCGCLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQVPQRSSVRVEGEGRLDARVSRMQ
Ga0242654_1004951013300022726SoilMKTNWPAFRCGCLILSSAVLGVSPMLAQTQEPAGATAASAVISSVAIMQAPQRASVRVEGEGRLDAHVSRMQNP
Ga0228598_103326013300024227RhizosphereMKTNWPAFRCGCLILSSAVLGVSPMLAQTQEPAGATAASAVISSVAITQAPQRASVRVEGEGRL
Ga0247676_102230413300024249SoilMKTNWPALRNGCLILASLACSTAPLLAQSHETPAVAPGASVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDR
Ga0247666_111185313300024323SoilMKTNWPAFRCGCLILSSAVVGASPLPAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLNVH
Ga0137417_102668653300024330Vadose Zone SoilMKMNWPALRHSYLILASLVLASIPVHAQSQVPAGVTATAAVISSVAITQAPQRAAVRVEGEGPLDVHAGRMQNPDRLVLDFAGARVAVEKAVIPGISAPVR
Ga0137417_149192143300024330Vadose Zone SoilMKTNWPALRSSCLILASMALATSPVLAQTQEPAGATAVSTVISSVAIMQAPQRASVRVEGEG
Ga0207646_1078315223300025922Corn, Switchgrass And Miscanthus RhizosphereMMTNWPAMRIGCLILASAALTTSSSFAQTQEPAGTGATASVISSVAITQAPQRASVRVDGAGPLEVHASRVQDPDRLVLDFSATKLA
Ga0207709_1172144123300025935Miscanthus RhizosphereMKTNWPALRNGCLILASLACCEAPLLAQSHETAAVAPGASVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLVLDFAGTRMAV
Ga0207670_1120851913300025936Switchgrass RhizosphereMKTNWPALRNGCLIFTSLACGAAPLLAQSHDAAEVARGGAVISSVAVTQASARSSVRVEGEGELYARPTRMQNPDRLVLDF
Ga0209839_1006213913300026294SoilMKTTRPALRYGCLILASMALAINPALAQTPAPSGMTATAAVISSVAITQAPERAAVRVEGEGRLDVQAARLQSPERLVLDFAGAHLHVEKTLIL
Ga0209236_101312113300026298Grasslands SoilMKTNWPALKHGCLILASVALATSPMLAQTQEPAGGRKSAAVISSVAITQSPERAAVRVEGEGRLDVRTARMHSP
Ga0209153_126958223300026312SoilMKTNWQALRNGCLILASLTCSAAPLLAQSQESRAAMTSAVISSVAVVMQASQRASVRVEGEGRLEARAVRMHNPDRLVLDFI
Ga0209761_1008804113300026313Grasslands SoilMKTNWPALHGYLILAGMALATSPVLAQTPAGATATSAVISSVAITQAPERSAVRVEGEGRLDVHAGRMR
Ga0209647_121708813300026319Grasslands SoilMKTNWPALRSSCLILASMTAATSPVLAQTQEPAGATAVSTVISSVAITQAPRRASVRVEGEGRLDVHAARMQNPDRLVLDFAGARLA
Ga0209472_130232013300026323SoilMITNWPALRHGCLILASVALSTVPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHAARMANPDRLVLDFAGTRLAVQRTMI
Ga0209152_1006790213300026325SoilMKTNWPALRHSYLILASLVLASSPVHAQSQVPAGVTATAAVISSVAITQVPQRAAVRVEGDGRLDVHAGRMQNPDRLVLDFAGARVAVQKTVIPGISAP
Ga0209473_104590313300026330SoilMKTNWRALKYGCLVLAGLTLATKPAPAQTVESTGAKKSAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPDR
Ga0209158_101741613300026333SoilMKTNWPAIRCGCLLLGAVALTTPAVRAQAPETAGTPQGAAVISSVAITQAERRASVRVEGEGRLDVRAARMQDPDRLVLDFNGARLAVP
Ga0257150_105945213300026356SoilMKTNWPALRHGYLILASMALANSPVLAQSPAGTTVTAAVISSVAITQAPERAAVRVEGEGRLDVHAGRMQNPDRLVLDFAGARLAVQKTV
Ga0257146_103384123300026374SoilMKTNWPALRNGCLILASLACGTAPVLAQSQEFAGATATPAVISSVAVMQASQRASVRVEGEGRLDAHAM
Ga0257146_105727813300026374SoilMITNWPALRNGCLILASVAFGSMPALAQTQEPAGATATASVISSVAITQGAQRASVRVEGEGHLDARAA
Ga0257167_106774723300026376SoilMKTNWPALRSSCLILASMALATSPVLAQTQEPAGATAVSTVISSVAIMQAPQRASVRVEGEGRLDVHAARM
Ga0257171_100394243300026377SoilMKTNWPALRHGCLILVGIAVAASPALAQTKEPSGTTTTGAVISSVAITQAPQRAAVRVEGEGRLD
Ga0257178_104045723300026446SoilMKTNWPALRHGCLILVGIAVAASPALAQTKEPSGTTTTGAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQDPDRLVLDFAGA
Ga0257156_106378623300026498SoilMKTNWPALRNGCLILASLACGTAPVLAQSQGFAGATATPAVISSVAVMQASQRASVRVEGEGRL
Ga0257181_109505723300026499SoilMKTNWPALRHGYLILASMALANSPVLAQTPGGATANAAVISSVAITQAPERSAVRVEGEGRLEVHAGRMQN
Ga0209160_123183113300026532SoilMKTNWPALRNGCLILASLACSTAPLLAQSQESTGGVTATAVISSVAVMQAAQRASVRVEGEGRLEARAVRMQNPDRLVLDFAGTRMAVQKT
Ga0209157_132290813300026537SoilMKTNWPAFRHGCLILVGIAVAGSPALAQTQEPSGTSTTGAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQDPDRLVLDFAGARLAVQKTVI
Ga0209056_1044721813300026538SoilMITNWPALRNGCLILASVALSTVPSLAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDARAARMSNPDRLVLDFA
Ga0209376_134504113300026540SoilMKTNWPALRNGCLILASLACSTAPLLAQSQESTGVTATAVISSVAVMQAAQRASVRVEGEGRLEARAVRMQNPDRLVLDFAGTRMAVQKTII
Ga0209805_138986513300026542SoilMMTNWPALRNGCLIFASLACGVAPVLAQSNESAGSTVTSAVISSVAVMQASQRASVRVEGEGRLDAQAVRMHNPDRLVLDFPATRMAVQRTIIPGVS
Ga0209648_1030587613300026551Grasslands SoilMKTNWPAFRCGCLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLDVRALRMQNPERLVLDFS
Ga0208489_100286913300027065Forest SoilMKTNWPAMRMGCLILVGAVLVTSPAVAQTQEPAGANATASVISSVAITQAPERASVRVEGEGPLEVHAARVQDPDRLVLDFRGTRLAVRNTVIPGVSAP
Ga0208365_106072813300027070Forest SoilMKTNWQALKNGCLTLASLALAASPLLAQTQEPAGANAAIISSVAITQAPQHAAVRVEGEGRLDVHAARMQNPD
Ga0209733_100889513300027591Forest SoilMKTNWPAMRIGCLIVASAALSSSPCFAQTKESAGASATGSVISSVAITQAPQRASVRVEGQGRLEVHATRVQDPDRLVLDFSDARLAVHKTVIPGVSAPV
Ga0209217_110198213300027651Forest SoilMKTNWPAIKVGCLILASAGLTTVPTLAQTQESAGTSATASIISSVAITQASQHATVRVEGEGPLQVRASRIQDPDRLVL
Ga0208989_1025640923300027738Forest SoilMKTNWPAFRHGYLILASMALANSPVLAQSPAGATVTAAVISSVAITQAPERAAVRVEGEGRLDVHAGRMQNPD
Ga0209074_1055107423300027787Agricultural SoilMKTNWPALKYGCLILAGFTLGAKSAPAQTAEPAGARKSAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHKTVIPGVSAPVVD
Ga0209773_1004470743300027829Bog Forest SoilMKTNWPAFRSGCLILSSAALGASPMLAQTQEPAGATATSAVISSVAITQAPQSASVRVEGNGQLDARVSRMQNPERL
Ga0209580_1001752313300027842Surface SoilMKTNWPALKYGCLILASLTLATKPAPAQSLEPMGARKSPAVISSVAITQAPERSAVRVEGEGKLDVRAARMQHPDRLVLDFVGARLAVHKTVIPGV
Ga0209701_1038625523300027862Vadose Zone SoilMKTNWPALRSSCLILASMTAATSPVLAQTQEPAGATAVATVISSVAIMQAPQRASVRVEGEGRLDVHAARMQNPDRLVLDFTGARLAV
Ga0247684_105846923300028138SoilMITNWPALRNGCLILTSVAFSTMPALAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHASRMANPDRVVLDFAGTRLAVQRTMIP
Ga0247682_105179913300028146SoilMITNWPALRNGCLILASVAFSTVPAIAQTQEPAVPTATASVISSVAITQGAQRSSVRVEGEGRLDAHASRMANPDRVVLDFAGTRLAVQR
Ga0257175_108376623300028673SoilMKTNWPAMQIGCLILASAALATSPCVAQTQEPAGASATVSVISSVAITQAPERASVRVEGEGPLEVHAARVQDPDRLVLDFRGTRLAVRNTV
Ga0307312_1107041923300028828SoilMKTNWPALRSGFLILASVAVGTTVLPAQTQELAGATTTPAVISSVTILQSALQRAGVRVEGEGKLDARAARMQNPDRLVLDFSGARL
Ga0302306_1034087713300030043PalsaMKTTRPALRYGCLILASMALALSPALAQTPEPSGVTATAAVISSVAITQAPERAAVRVEGEGLLEVHAVRLQSPER
Ga0302311_1108041113300030739PalsaMKTTRPALRYGCLILASMALALSPALAQTPEPSGVTATAAVISSVAITQAPERAAVRVEGEGLLEVHAVR
Ga0307476_1048535913300031715Hardwood Forest SoilMKTNWPALRTGCLILASVACGAAPLLAQTHAPLELQPGAATISSVAVTQATQHASVRVEGEGRLDAHALRMQNPDRLVLDFAGTRMTVQ
Ga0307476_1129274713300031715Hardwood Forest SoilMKTNWPAFRGGFLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLDAHVSRMQNPERLVLDFSGTRL
Ga0307474_1124347523300031718Hardwood Forest SoilMKTNWPALRHGYLILASIAVASSPVLAQTPADATATASVISSVAITQAPQRSAVRVEGEGRL
Ga0307516_1044466013300031730EctomycorrhizaMKTNRPAVRNGCLILASLVWSAAPVLAQTQESAETLANAAVISSVAVAQASQRASVRVEGEGKLDARAVRMHNPDRLVLDFASTRMAVQKTVIPGVSA
Ga0307475_1008915013300031754Hardwood Forest SoilMKTNWPAMRSGCLILASAAVLTSPCVAQTQEPAGAKATASVISSVAITQAPERASVRVEGEGPLQVHAA
Ga0307475_1108647123300031754Hardwood Forest SoilMKTNWPAFRGGFLILSSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLDA
Ga0307475_1151046613300031754Hardwood Forest SoilMKTNWPAFRCGFLILSSAALGVSPVPAQTQEPAGATATSAVISSVAITQASQHASVRVEGEGRLDVRALRMQNPERLVLDFS
Ga0307478_1016504513300031823Hardwood Forest SoilMKTNWPALKHGCLILASLAVTASATPAQTQEAGGASKPAVISSVAVTQAPERSAVRVEGEGRLDVRAARMQNPDRLVLDFPGA
Ga0302315_1060892923300031837PalsaMKTTRPALRYGCLILASMALALSPALAQTPEPSGVTATAAVISSVAITQAPERAAVRVEGEG
Ga0306921_1278685513300031912SoilMKTNWRAVKYGCLILASLTLPTSAPAQTVVPTSAGKSTAVISSVAITQAPEQSAVRVEGEGNLDVRPATMRHPDRLVLDFVGARLAV
Ga0310913_1087491513300031945SoilMKTNWRAVKYGCLILASLTLPTSAPAQTVVPTSAGKSTAVISSVAITQAPEQSAVRVEGEGNLDVRPATMRHPDRLVLDFVGARLAVHKTVIPGVSAPVREV
Ga0307479_1022728843300031962Hardwood Forest SoilMKTNWPAFRCGCLILTSAVLGASPMLAQTQEPAGATATSAVISSVAITQAPQRASVRVEGEGRLDVHALRMQNPERLVLDFSGA
Ga0307479_1057429733300031962Hardwood Forest SoilMKTNWPALRHGYLILASVALTASPVLAQTQAPAGATSTSAVISSVAFTQAPERSAVRVEGEGRLDV
Ga0306924_1114921523300032076SoilMKTNWRAVKYGCLILASLTLPTSAPAQTVVPTSAGKSTAVISSVAITQAPEQSAVRVEGEGNLDVRPATMRHPDRLVLDF
Ga0316051_100022713300032119SoilMKTNWPAYRCGFLILSSAALGANPVLAQAQEPSGATATSAVIRSVAITEAPQHASVRVEGEGRLDVHASRMQNPERL
Ga0307471_10071888413300032180Hardwood Forest SoilMKTNWQALRNGCLTLASLALAASPLLAQTQEPAGANAAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQN
Ga0307471_10079297013300032180Hardwood Forest SoilMTTNWPALRNGCLILASMTCGAAPMLAQPQESAGPKVTAAVISSVAVMQMAQRASVRVEGEGPLEAQAVRMHNPDRVVLDFAGTRM
Ga0307471_10307672413300032180Hardwood Forest SoilMKTNWPAFRHGCLILVGIAVAASPALAQTQEPTGASTTGAVISSVAITQAPQRAAVRVEGEGRLDVHAARMQDPDRLVLDFAGAR
Ga0307472_10043569513300032205Hardwood Forest SoilMTTNWPALRNGCLILASLAWGAAPLLAQSQESAGPKVTAAVISSVAVMQTAQRASVRVEGEGRLDAQAVRMHNPDRVVLDF
Ga0335085_1116534823300032770SoilMITNWPALRNGCLILASVAFSSVPALAQTQEPAGATATASVISSVAITQGAQRSSVRVEGEGHLDA
Ga0335082_1056939133300032782SoilMITNWPALRNGCLILASLAFSSVPALAQTQEPAGATATASVISSVAITQGAQRSSVRVEGEGLLDARAARMQNPDRL
Ga0335081_1016192713300032892SoilMKTNWPALRNGCLILVSAAFVASPLLAQTQEPAGATATTAVISSVALTQGTERASVRVVGEGRLQPRAVRMQNPDRL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.