NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F047782

Metagenome Family F047782

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F047782
Family Type Metagenome
Number of Sequences 149
Average Sequence Length 87 residues
Representative Sequence MGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Number of Associated Samples 101
Number of Associated Scaffolds 149

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 77.85 %
% of genes near scaffold ends (potentially truncated) 32.89 %
% of genes from short scaffolds (< 2000 bps) 83.22 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score0.52

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.530 % of family members)
Environment Ontology (ENVO) Unclassified
(45.638 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(67.785 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 51.72%    β-sheet: 0.00%    Coil/Unstructured: 48.28%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

1020304050607080MGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRLExtracel.Cytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.52
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Arabidopsis Rhizosphere
Avena Fatua Rhizosphere
Corn Rhizosphere
Populus Rhizosphere
Rhizosphere
Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
3.4%18.1%6.0%8.1%29.5%8.7%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPgaii200_087895012228664022SoilMKIAQAASKLLAFIRRRRAARPRRSLADLIALTIARGLSFAVVGGMVYVTYVVEMTGFDDFRKCSKNSAITRIEWVLGFRPAEDCRPS
JGI12053J15887_1000658643300001661Forest SoilMGIAVAPLRVFAFVCRRRTARPKRSTADWVALIIARGLSLAVVGGLVYTTYVVEMAGYNDFLRCNPKSTMTKIEWVLGFRPVEDCRW*
Ga0062593_10044661523300004114SoilMGIAALPAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTLTQIEWVLGFRPVEDCRRL*
Ga0062595_10262388513300004479SoilAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTLTQIEWVLGFRPVEDCRRL*
Ga0066672_1011294043300005167SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0066672_1018770813300005167SoilACSGLRASSDRHLKRRMAQIATVLAYIRRRASRPKRSIADWLALATARGLSFVVGCLVYVAYIVEMTGFDDFRKCNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066673_1010771633300005175SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066673_1045520613300005175SoilMSVAAVPVKLFAVIRQRRAARPKRSLADWIALTIAHGLSIMVVSGLVYVTYFVQMTSYDDFRKCDKNSTISRFEWVLGIRPAEVCGR*
Ga0066679_1012780523300005176SoilMGIAAAPLRLFAFIHRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0066679_1018277613300005176SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFE
Ga0066671_1016611423300005184SoilMGIAAAPLKLFAAIRRRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0066671_1050244013300005184SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARALSFAVVGSLVYVTYIVEMTGFDDFRKCNKSSTITRFEWVLGFRPVEDCRRF*
Ga0066675_1037952923300005187SoilMAIVKALRKFLAFIRRRIAARPKRSLADWIALTIARGLSLAVVGGLVYVTYVVEMTGYNDFLRCNPKSSITRFEWVLGFRPAEACRRF*
Ga0066675_1127987313300005187SoilGERLMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARALSFAVVGSLVYVTYIVEMTGFDDFRKCNKSSTITRFEWVLGFRPVEDCRRF*
Ga0066388_10015470833300005332Tropical Forest SoilMQIVKALRKFLAFIRARRAARPRRSLADWIALTIARGLSLAVVAGLVYVTYVVEMTGYKDFLRCNPKSSITRFEWILGFRPAEDCRPQIH*
Ga0070714_10019155613300005435Agricultural SoilMGIAALPAKLFAVIRRRRATRPRRTLQDWIALTIARGLSFAVIGGLVYVTYIVETTGFDDFRKCNKNTTMTQLEWVLGFRPVEDCRRL*
Ga0070714_10060733533300005435Agricultural SoilMGIAATPLKLIALIPSKLFVFMHRRWAARPKRSLQDWIALTIARGLSLAVVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRLEWVLGSRPADDCRR*
Ga0070713_10076525313300005436Corn, Switchgrass And Miscanthus RhizosphereMGIAALPAKLFAVIRRRRATRPRRTLQDWIALTIARGLSFAVIGGFVYVTYIVETTGFDDFRKCNKNTTMTQLEWVLGFRPVEDCRRL*
Ga0070710_1032879413300005437Corn, Switchgrass And Miscanthus RhizosphereVIRRRRATRPRRTLQDWIALTIARGLSFAVIGGLVYVTYIVETTGFDDFRKCNKNTTMTQLEWVLGFRPVEDCRRL*
Ga0066682_1027217223300005450SoilMATVLALICRWRASRPKRSIADWIALSVARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066681_1011594423300005451SoilMGIAAAPLRLFAFIHRRRAIRPRRSLQDWIALTIARGLSFAIVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRIEWVLGFRPADDCRRF*
Ga0066681_1036060123300005451SoilMSVGAVPVKLFAAIRQRRAARPKRSLADWIALTIARGLSLAVVGCLVSVTYTVEMTGFDDFRKCNKTSTISRLEWVLGIRPAEGCGR*
Ga0066681_1048971123300005451SoilMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066687_1009776823300005454SoilMGIAAAPLRLFAFIHRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0066687_1062868613300005454SoilRASRPKRSIADWIALSVARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFEWALGFQPAYDCQRF*
Ga0070706_10054336023300005467Corn, Switchgrass And Miscanthus RhizosphereMLPRAGTEGENAMGTSQTLMRAFASIHHRRAARPKRSWEEWIALSIARLMTFAVVAGLAYVTYVVEMTSYDDFRKCNKATTTTRLEWVLGLRPAEDCRSY*
Ga0070698_10069941313300005471Corn, Switchgrass And Miscanthus RhizosphereMRTSQTLTRAFASIYHRRAARPKRSWEEWIALSIARLMTFAVVAGLAYVTYVVEMTSYDDFRKCNKATTTTRLEWVLGLRPAEDCRSY*
Ga0066692_1009447813300005555SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEW
Ga0066670_1007835653300005560SoilMGIAAAPLKLFAAIRRRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRP
Ga0066699_1000779453300005561SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARALSFAVVGSLVYVTYIVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0070664_10023780713300005564Corn RhizosphereMGIAALPAKLFAVIRRRRATRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTLTQIEWVLGFR
Ga0066693_1001132733300005566SoilMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066708_1067384323300005576SoilMSVAAVPVKLFAVIRQRRAARPKRSLADWIALTIARGLSIMVVSGLVYVTYFVQMTSYDDFRKCDKNSTISRFEWVLGIRPAEVCGR*
Ga0066706_1072113723300005598SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFE*
Ga0068856_10013780123300005614Corn RhizosphereMGIAALPAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTMTQIEWVLGFRPVEDCRRL*
Ga0066903_10016777673300005764Tropical Forest SoilMAIVKALRKFLASIRARRAARPRRSLADWIALTIARGLSLALVAGLVYVTYVVEMTGYKDFLRCNPKSSITRFEWVLGFRPADDCRPQIH*
Ga0066903_10481956613300005764Tropical Forest SoilMRVSAWGVVRAIARAARKGLAFIWARRAARPKRSIADWIALTIARGLSLAVVGGLVYVTYVVEMTSYRDFLRCSPKSTITRLEWVLGFRPAEDCRRF*
Ga0066903_10584988423300005764Tropical Forest SoilMRITAVPAKSLAFIRRRWAARPRRTLHDWIALTIARGLSFAIVGGLVYVTYIVEMTSYDDFRKCNKGSTVTRVEWVLGSRPADGCGR*
Ga0066651_1023542233300006031SoilLKRRMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066651_1070694713300006031SoilMSVAAVPVKLFAVIRQRRAARPKRSLADWIALTIAHGLSIMVVSGLVYVTYFVQMTSYDDFRKCDKNSTISRFEWVLGI
Ga0066696_1036514413300006032SoilMRIAALPLKLFTFIRRRWAVRPKRTLQDWIALTIARGLSLALVGGLVYVTYFVEMTSYDDFRKCNKASTFTRFEWVLGFRPAEDCRSY*
Ga0066696_1062031213300006032SoilMTQGNRLMGIAAAPLNLIALIPSKVFVFMRRRWAARPKRTLQDWIALTIARGLSFAVVGGLVYVTYIVEMTSYDDFRKCNKSST
Ga0066652_10063166513300006046SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFEWALGFQPAYDCQRF*
Ga0070765_10075951423300006176SoilMKIARAAARLLAFIPRRSAARPKRSFQDWISLTIARGLSLAVVGGLVYVTYIVEMTSFGDFRKCNKSSTITRFEWVLGFRPAEDCRHL*
Ga0066658_1004796133300006794SoilGAQTGSRRAGTSMLSHKQGDRPMGIAAAPLKLFAVIRRRRAMRPKRTLQDWIALTIARALSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0066660_1020773233300006800SoilMGIAAAPLKLFAAIRRRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCR
Ga0066660_1134744023300006800SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066660_1163011513300006800SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCR
Ga0079221_1119596023300006804Agricultural SoilLPAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTLTQIEWVLGFRPVEDCRRL*
Ga0075434_10034359113300006871Populus RhizosphereMGIAALPAKLFAVIRRRRATRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTLTQIEWVLGFRPVEDCRRL*
Ga0099795_1003079213300007788Vadose Zone SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGF
Ga0099795_1014677813300007788Vadose Zone SoilMGSGHTQARAFAFIHRRHRTAQPKRGLEAWIALTIARGLTFAIVGGLVYLTYVIQVASYNDFRRCSPTSTMTRIEWALGFRPAEDCRQY*
Ga0099795_1017681823300007788Vadose Zone SoilMLTRHRLQGDRLMGIAVAPLRVFAFVRRRRTARPKRSMADWVALTIARGLSLAVVGGLVYTTYVVEMAGYNDFLRCNPKSTMTKIEWVLGFRPVEDCRW*
Ga0066709_10005631363300009137Grasslands SoilMGIAAAPLRLFAFIHRRRAMRPRRSLQDWIALTIARGLSFAVVGSLIYVTYIVEMTGFDDFRRCNKSSTITRREWVLGFPTVEECRRPQVLAQSTAKR*
Ga0066709_10020780643300009137Grasslands SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0066709_10202553433300009137Grasslands SoilCAKGGSQMSVAAVPVKLFAVIRQRRAARPKRSLADWIALTIAHGLSIMVVSGLVYVTYFVQMTSYDDFRKCDKNSTISRFEWVLGIRPAEVCGR*
Ga0099792_1000282573300009143Vadose Zone SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0126380_1049541513300010043Tropical Forest SoilTKLFTFIRRRWAARPKRTLAEWIALTIARGLSFAIVGGLVYVTYIVEMTSYDDFRKCNKGSTVTRVEWVLGSRPADGCGR*
Ga0126384_1050783713300010046Tropical Forest SoilMAIVKALRKFLAFIRARRAARPRRSLADWIALTIARGLSLALVAGLVYVTYAVEMTGYKDFLRCNPKSSITRFEWVLGFRPAEDCRPQIH*
Ga0126384_1062901913300010046Tropical Forest SoilMLMRITAAAAKLFAFIRRRWAARPKRTLQDWIALTIARGFSFAIVGGLVFVTYIVEMTSYDDFRKCNKSSTITRLEWVLGVRPADECRP*
Ga0134082_1021091623300010303Grasslands SoilMGIAAAPLRLFAFIHRRRAIRPRRSLQDWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTITRLEWALGFPPVEDCRRL*
Ga0134109_1006827023300010320Grasslands SoilAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0134109_1016633513300010320Grasslands SoilLKRRMVQMATVLAFIRRWRASRPKRNIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0134067_1020430623300010321Grasslands SoilMVQMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDLRNNKSSTVTRFE*
Ga0134084_1006613523300010322Grasslands SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0134064_1000434113300010325Grasslands SoilPRRRALEGSARHKQGERLMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0134065_1002414513300010326Grasslands SoilMGIAAAPLRLFAFINRRRAIRPRRTLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0134065_1049078913300010326Grasslands SoilMATVLAFLRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFRPAYDCQRF*
Ga0134063_1073915613300010335Grasslands SoilMGIAAAPLRLFAFINRRRAIRPRRTLQDWIALTIARGLSFAVVGSLVYVTYVVEMTSFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0126372_1108086323300010360Tropical Forest SoilMHIADLPAKLLAFIRRRWAARPKRTLADWIALTIARGLSFAIVAGLVYVTYIVEMTGYDDFRKCNKSSTTTRLEWILGSRPVDECRR*
Ga0134128_1026893233300010373Terrestrial SoilMGIAALPAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRNCNKNTTLTQIEWVLGFRPVEDCRRL*
Ga0134128_1059158923300010373Terrestrial SoilMGIAATPLKLIALIPSKLFVSMHRRWAVRPKRSLQDWIALTIARGLSFAVVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRLEWVLGSRPADDCRR*
Ga0134126_1266412913300010396Terrestrial SoilPLKLIALIPSKLFVFMHRRWAARPKRSLQDWIAIAIACGLSFAVVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRLEWVLGSRPADDCRR*
Ga0134121_1100484213300010401Terrestrial SoilMEIAAAPLKLIALIPSRLFVFMHRRWAARPKRSLQDWIALTIARGLSFAVVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRLEWILGSRPADDCRRF*
Ga0137383_1010295133300012199Vadose Zone SoilMGIAAAPLRLFAFINRRLAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137382_1019729033300012200Vadose Zone SoilMDKTEARRTVFGILRRRRAARPRRNLQDWIALTIARGLSLALIGGLVYVTYVVEMTGYNDFRHCNRESTMTWLEWVLGFRPAEDEDCRR*
Ga0137382_1028308813300012200Vadose Zone SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTSFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137382_1048507423300012200Vadose Zone SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDLRNNKSSTVTRFEWALGFQPAYDCQRF*
Ga0137382_1058514613300012200Vadose Zone SoilMGIAAAPLRLFAFINGRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTSFEDFRKCNKSSTITRLE
Ga0137376_1012238413300012208Vadose Zone SoilRHKQGERLMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137376_1149877113300012208Vadose Zone SoilMGIAAAPLRLFAFINGRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137376_1175094813300012208Vadose Zone SoilMDKTEARGNVFAILRRRRAARPRRNLQDWIALTIARGLSLALIGGLVYVTYVVEMTGYNDFRHCNRESTVTWLEWVLGFRPAEDEDCRR*
Ga0150985_10939681523300012212Avena Fatua RhizosphereMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYIVEMTGFEDFRKCNKSSTVTRIECVLGFRTADDCRHF*
Ga0137370_1007184833300012285Vadose Zone SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVDEGMIYVTYIVEMTGFDDFRKCNKSSTVTRLGPRFSACG*
Ga0137398_1001135943300012683Vadose Zone SoilMGIAAAPLRLFAFINGRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137398_1007293123300012683Vadose Zone SoilMGIAVAPLRVFAFVRRRRTARPKRSMADWVALTIARGLSLAVVGGLVYTTYVVEMAGYNDFLRCNPKSTMTKIEWVLGFRPVEDCRW*
Ga0137395_1005424633300012917Vadose Zone SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWFALTIARGLSFAVVGSLVYVTYVVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137395_1008179733300012917Vadose Zone SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSMVTRFEWALGFRPAYDCQRF*
Ga0137395_1097536023300012917Vadose Zone SoilMGSGHTQARAFAFIHRRHRTAQPKRGLEAWIALTIARGLTFAIVGGLVYLTYVIQVASYNDFRRCSPTSTMTRIEWTLGFRPAEDCRQY*
Ga0137359_1070949223300012923Vadose Zone SoilMDKTEARGNVFAILRRCRAARPRRNLQDWIALTIARGLSLAAVGGLVYVTYVVEMTGYNDFRHCNRESTMTWLEWVLGFRPAEDEDCRR*
Ga0137359_1128167313300012923Vadose Zone SoilAFINRRRAIRPRRSLQDWFALTIARGLSFAVVGSLVYVTYIVEMTSFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137416_1107532323300012927Vadose Zone SoilMGIAAAPLRLFAFINRRRAIRPRRTLQDWIALTIARGLSFAVVGSLVYVTYIVEMTSFEDFRKCNKSSTITRLEWVLGFRPVE
Ga0164299_1054778123300012958SoilMRIAAAPLRLFALIHRRRAMRPRRSLQDWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCSKSSTITRLEWVLGFRPVEDCRRL*
Ga0164301_1058626223300012960SoilMGIAALPAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFRKCNKNTTMTQIEWVRGFRPVEDCRRL*
Ga0126369_1004180323300012971Tropical Forest SoilMAIVKAPRKFLASIRARRAARPRRSLADWIALTIARGLSLALVAGLVYVTYVVEMTGYKDFLRCNPKSSITRFEWVLGFRPADDCRPQIH*
Ga0126369_1205793513300012971Tropical Forest SoilMRIAALPLKVFSLIRRRWAAQPKRTLADWIALTIARGLSFVVVGGLVYVTYIVEMTSYEDFRKCHKNSTITRFEWVLGSRSPEDCRR*
Ga0126369_1332753713300012971Tropical Forest SoilMRVSAWGVVGAIARAARKGLAFIWARRAARPKRSIADWIALTIARGLSLAVVGGLVYVTYVVEMTSYREFLRCSPKSTITRLEW
Ga0134078_1000935233300014157Grasslands SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVED*
Ga0134078_1047083123300014157Grasslands SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRLEWVLGSRPADDCRRL*
Ga0134079_1000348133300014166Grasslands SoilMGIAAAPLRLFAFINHRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL*
Ga0137418_1029285643300015241Vadose Zone SoilRHRLQGDRLMGIAVAPLRVFAFVRRRRTARPKRSMADWVALTIARGLSLAVVGGLVYTTYVVEMAGYNDFLRCNPKSTMTKIEWVLGFRPVEDCRW*
Ga0137412_1023967323300015242Vadose Zone SoilVVGPRRRALEGSSRHKQGERLMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSLAVVGSLVYVTYIVEMTRFW*
Ga0182007_1009396013300015262RhizosphereMGIAALPAKLFAVIRRRRAKRPRRTLQDWIALTIARGLSFAVVGGLVYVTYIVEKTGFDDFGKCNKNTTMTQIEWVLGFRPVEDCRRL*
Ga0132256_10117106813300015372Arabidopsis RhizosphereMQIAQAAPSLLAAIRRRRAARPKRTPTDWIALTIARGLSFAVVGGLVYVTYVVEMAGYNDHLRCHRDSTITRIEWVLGTRPAEECVR*
Ga0132255_10481894413300015374Arabidopsis RhizosphereVNELGETIMKIAQAVPRLLASIRRRRAARPKRTLADWIALTIARGLSFAVVGGLVYVTYIVEVAGYNDHLRCYRDSTITRIEWFLGIRPAEECARW*
Ga0066655_1015261413300018431Grasslands SoilMGIAAAPLRLFAFIHRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGVRPVEDWRRLEGMA
Ga0066667_1008589533300018433Grasslands SoilMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFQPAYDCQRF
Ga0066667_1066517123300018433Grasslands SoilMGIAAAPLKLIALIPSKLFAFMRRRWAARPKRTLPDWIALTIARGLSFAVVGGLVYVTYIVEMTSYDDFRKCNKSSTISRVEWVLGFRPVEDCRHY
Ga0066667_1165413113300018433Grasslands SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRL
Ga0066667_1174931623300018433Grasslands SoilHRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYIVEMTGFDDFRKCNKSSTITRLEWVLGFRPVEDCRRP
Ga0066662_1093994023300018468Grasslands SoilMGIAAAPLKLFAFINRRRAMRPRRSLQDWIALTIARGLSFAVVGGLVYVTYVVEMTSFDDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0066669_1038601733300018482Grasslands SoilMRIAALPLKLFTFIRRRWAVRPKRTLQDWIALTIARGFSLALVGGLVYVTYFVEMTSYDDFRKCNKASTFTRFEWVLGFRPAEDCRSY
Ga0066669_1113973113300018482Grasslands SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0066669_1229889713300018482Grasslands SoilMVQMATVLAFICRWRASRPKRSIADWIALSVARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFEWALGFQPAYDCQRF
Ga0066669_1246071413300018482Grasslands SoilMSVAAVPVKLLAVIRQRRAARPKRSLADWIALTIAHGLSIMVVSGLVYVTYFVQMTSYDDFRKCDKNSTISRFEWVLGIRPAEVCGR
Ga0193751_111081513300019888SoilMKIASAAARLLAFIRRRRAARPKRSMGDWIALTIARGLSLAVVGGLVYVTYIVEMTSYDDFRKCDKTSTITRLEWVLRFRPAEDCRSY
Ga0193751_114962423300019888SoilMVGTKALRTVFAFLRRRKAQRAARPKRSFQDWIALTIARGLSLVVVGGLVYVTYVVEMAGFDDLRKCNKNSTITRLEWVLGFRPAEDCRRY
Ga0193733_117606623300020022SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTTTRLEWVLGFRPV
Ga0210400_10002911253300021170SoilMKIAQAAPKSLAFIRRRRAARPKRSLADWIALTIARGLLFAMVGGLAYVTYVVEMAGYNDFRKCSKNRTLTRLEWVLGVRPAEDL
Ga0210408_1110655413300021178SoilMKIAQAAPKLLAFIRRRRAARPKRSLADWIALTIARGLSFAVVGGLVYVTYVVEMTGYDDFRKCSKNSTMTRIEWVLGVRPAEGCRSY
Ga0213873_1007111113300021358RhizosphereMGIAAFPRKFVALIHGRWAARPKRTLADWIALTIARLMTFAVVGGLVYVTYIVEMAGYDDFRKCNKNTTMTRLEWVLGFRPAEDCRSF
Ga0126371_1048845113300021560Tropical Forest SoilKLVGFIRRRWAARPKQTLADWIALTIARGLSFPIVSGLVYATYIVQMTGYDDFRKCDKSSTTTPLEWVLGSRPVDECRR
Ga0126371_1109739913300021560Tropical Forest SoilMAIVKALRKFLASIRARRAARPRRSLADWIALTIARGLSLALVAGLVYVTYVVEMTGYKDFLRCNPKSSITRFEWVLGFRPADDCRPQIH
Ga0207684_1160194313300025910Corn, Switchgrass And Miscanthus RhizosphereMGTSQTLMRAFASIHHRRAARPKRSWEEWIALSIARLMTFAVVAGLAYVTYVVEMTSYDDFRKCNKATTTTRLEWVLGLRPAEDCRSY
Ga0207700_1059033033300025928Corn, Switchgrass And Miscanthus RhizospherePLKLIALIPSKLFVFMHRRWAARPKRSLQDWIALTIARGLSFAVVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRLEWVLGSRPADDCRR
Ga0207664_1124613233300025929Agricultural SoilLFVFMHRRWAARPKRSLQDWIALTIARGLSLAVVGGLVYVTYIVEMTSYDDFRKCNKSSTVTRLEWVLGSRPADDCRR
Ga0209688_109015013300026305SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF
Ga0209055_119931023300026309SoilAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209153_102878133300026312SoilMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARALSFAVVGSLVYVTYIVEMTGFDDFRKCNKSSTITRFEWVLGFRPVEDCRRL
Ga0209686_103048433300026315SoilKQGERLMGIAAAPLRLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209154_108465323300026317SoilMGIAAAPLRLFAFINRRRAIRPRRTLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209471_101180263300026318SoilINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209472_126331723300026323SoilMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFQPAYDCQRF
Ga0209473_106498423300026330SoilMGIAAAPLKLFAAIRRRRAMRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209473_121164323300026330SoilPLKLFTFIRRRWAVRPKRTLQDWIALTIARGLSLALVGGLVYVTYFVEMTSYDDFRKCNKASTFTRFEWVLGFRPAEDCRSY
Ga0257153_102223723300026490SoilMGISQTLMRAFASIHHRRAARPKRSSEEWTALSIARLMTFAVVAGLAYVTYVVEMTSYDDFRKCNKATTTTRLEWVLGLRPAEDCRSY
Ga0209808_112035623300026523SoilMGIAAAPLKLFAFINRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209808_126379213300026523SoilMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWEFVK
Ga0209807_109600623300026530SoilMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDDFRNNKSSTVTRFEWALGFQPAYDCQRF
Ga0209474_1006030733300026550SoilMSVAAVPVKLFAVIRQRRAARPKRSLADWIALTIAHGLSIMVVSGLVYVTYFVQMTSYDDFRKCNKNSTITRFEWVLGFRPAEVCGR
Ga0209474_1009594743300026550SoilLRKFLAFIRRRIAARPKRSLADWIALTIARGLSLAVVGGLVYVTYVVEMTGYNDFLRCNPKSSITRFEWVLGFRPAEACRRF
Ga0209474_1023939013300026550SoilRFANSCLDVACSGLRASSDRHLKRRMVQMATVLAFIRRWRASRPKRSIADWIALTIARGLSFAVVGGLVYVTYIVEMTGFDEFRNNKSSTVTRFEWALGFQPAYDCQRF
Ga0209179_103887723300027512Vadose Zone SoilMGIAVAPLRVFAFVRRRRTARPKRSMADWVALTIARGLSLAVVGGLVYTTYVVEMAGYNDFLRCNPKSTMTKIEWVLGFRPVEDCRW
Ga0209106_109803513300027616Forest SoilRHRLQGDRLMGIAVAPLRVFAFVRRRRTARPKRSMADWVALTIARGLSLAVVGGLVYTTYVVEMAGYNDFLRCNPKSTMTKIEWVLGFRPVEDCRW
Ga0208991_103891623300027681Forest SoilMGIAAAPLRLFAFIHRRRAIRPRRSLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWVLGFRPVEDCRRL
Ga0209488_1019329833300027903Vadose Zone SoilMATVLAFIRRWRASRPKRSIADWIALAIARGLSFAVVGGLVYVTYIVEMTGFDDFRKCNKSSTVTRFEWALGFQPAYDCQRF
Ga0209488_1040796313300027903Vadose Zone SoilMGSGHTQARAFAFIHRRHRTAQPKRGLEAWIALTIARGLTFAIVGGLVYLTYVIQVASYNDFRRCSPTSTMTRIEWALGFRPAEDCREY
Ga0137415_1086997423300028536Vadose Zone SoilMGIAAAPLRLFAFINRRRAIRPRRTLQDWIALTIARGLSFAVVGSLVYVTYVVEMTGFEDFRKCNKSSTITRLEWV
Ga0307475_1017359943300031754Hardwood Forest SoilMSIAATPLRVFAFIRRRRAARPKRSMEEWITLAIARGLSLAVVGGLVYVTYVVEMTSFDDFRKCNRNRTITRLEWVLGSRPAEDCRHL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.