NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F044601


Overview

Basic Information
Family ID F044601
Family Type Metagenome
Number of Sequences 154
Average Sequence Length 79 residues
Representative Sequence MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Number of Associated Samples 113
Number of Associated Scaffolds 154

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 81.58 %
% of genes near scaffold ends (potentially truncated) 29.22 %
% of genes from short scaffolds (< 2000 bps) 77.92 %
Associated GOLD sequencing projects 99
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.34


Most Common Taxonomy
Group Bacteria (98.052 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(31.169 % of family members)
Environment Ontology (ENVO) Unclassified
(67.532 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(75.974 % of family members)
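
The "% of family members" figures above can be turned back into member counts. The following is a minimal sketch, assuming each percentage is taken over the 154 sequences listed under "Number of Sequences" (the page does not state the denominator); the function name `members_from_percent` is hypothetical.

```python
FAMILY_SIZE = 154  # "Number of Sequences" from the Overview (assumed denominator)


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value to the nearest whole-member count."""
    return round(percent / 100 * family_size)


# Percentages as reported in the Overview section.
reported = {
    "NCBI taxonomy: Bacteria": 98.052,
    "GOLD: Grasslands soil": 31.169,
    "ENVO: Unclassified": 67.532,
    "EMPO: Soil (non-saline)": 75.974,
}

for label, pct in reported.items():
    print(f"{label}: {pct} % ~ {members_from_percent(pct)} of {FAMILY_SIZE}")
```

Under that assumption the four values correspond to 151, 48, 104 and 117 members respectively. Other percentages on the page (for example the RBS motif figure) may use a different denominator.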




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.
[Interactive alignment view (MSAViewer). Column ruler runs to position 138; rows 1 to 154 are labelled with the Protein IDs listed under Family Sequences.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 69.81 %, β-sheet 0.00 %, Coil/Unstructured 30.19 %

Predicted 3D Structure

pTM-score: 0.34

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Taxonomy chart (ApexCharts). Categories: All Organisms, Unclassified. Value shown: 98.1 %.]

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

[Habitat chart (ApexCharts). Legend labels, in page order: Soil; Vadose Zone Soil; Grasslands Soil; Soil; Grasslands Soil; Soil; Hardwood Forest Soil; Soil; Forest Soil; Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Sandy Soil; Populus Rhizosphere. Slice values shown: 11.7 %, 18.2 %, 31.2 %, 18.8 %, 3.2 %, 3.9 %, 5.2 %.]



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1027J12803_1041200053 | 3300000955 | Soil | MEENWYAVEQQVRDRISEARAAARIRTLTRKVAPTARRPNSVGITISRLASRVSTRAMQLSLGLSRALANVRAVTKATSRERTPTMHQEETPSQRQPFKPGA
JGI25381J37097_10282302 | 3300002557 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
JGI25385J37094_100472941 | 3300002558 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*
JGI25385J37094_100778562 | 3300002558 | Grasslands Soil | MEENWYAVEQQIRDRLTDARASARIRTLTQKLAPTARRQYSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
JGI25383J37093_100154064 | 3300002560 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIWTLTQKLALTARRPNSVGITIIRLANWVLARAMQLPLELARALANVQAATKRI*
JGI25382J37095_102732882 | 3300002562 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRSLTQKLAPTARRPNSVGITVIRLANWVLARAMLLPLELSRALANVQAATKRT*
JGI25388J43891_10051532 | 3300002909 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
JGI25390J43892_101607322 | 3300002911 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNXVGITIIRLANWVLARAMXLPLELSRALAKVQAATK*
Ga0066674_100391722 | 3300005166 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066674_103124212 | 3300005166 | Soil | NWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066677_102879221 | 3300005171 | Soil | MEGDWYTVEQQIRDRLTEARAAAQIRTLTQKLAPRARRPNSVGITIIRLANWVLARAMQLPLELS
Ga0066680_100602922 | 3300005174 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066673_101224491 | 3300005175 | Soil | MEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066690_103290812 | 3300005177 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
Ga0066685_102377612 | 3300005180 | Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT*
Ga0066685_103322092 | 3300005180 | Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0066685_109700901 | 3300005180 | Soil | NWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066676_100648603 | 3300005186 | Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066676_108812692 | 3300005186 | Soil | MEENWYAVEQQVRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT*
Ga0066675_100363962 | 3300005187 | Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0066675_100927261 | 3300005187 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENGALARAMQLFLGLSRALANVRAVTK*
Ga0066675_104958332 | 3300005187 | Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLVLSRALANVQAATKRS*
Ga0070708_1005815262 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENWYAVEQQVRDRLTEARAAARIRTLTQRLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSHALANVRTATKRCQSTSALLAGKESRPWR*
Ga0070708_1013326091 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANVRAATKRG*
Ga0066686_107660451 | 3300005446 | Soil | KPMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLADWVLARAMRLPLELSRALAKVQAATK*
Ga0066689_100049146 | 3300005447 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK*
Ga0066689_100238832 | 3300005447 | Soil | MEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRI*
Ga0066681_100084741 | 3300005451 | Soil | NWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPPELSRALAKVQAATK*
Ga0070706_1017929512 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENWYAVEQQVRDRLTEARAAARIRTLTQRLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSHALANVRTATKRCQSTSALLAGKESRP*
Ga0070699_1003743282 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENWYAVEQQVRDRLTEARAAARIRTLTQGLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSRALANVRTATKRCQSTSALLAGKESRP*
Ga0066697_103065862 | 3300005540 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066701_107141282 | 3300005552 | Soil | MEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066695_101053852 | 3300005553 | Soil | NWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
Ga0066692_101615222 | 3300005555 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLFLGLSRALANVRAVTK*
Ga0066704_102104032 | 3300005557 | Soil | MEENWYAAEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP*
Ga0066670_103198532 | 3300005560 | Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITITRLASWVLARAMQLPLEISRALANVQAATKRS*
Ga0066699_106544512 | 3300005561 | Soil | MDEDWYTVEQQIRDRLTEARTAAQIRALTEELAPTARRPTSVGIIRLASWVLARAMQLPLELSRALARVRAAMERRASAARERTPH
Ga0066694_104813251 | 3300005574 | Soil | YAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0066654_100823161 | 3300005587 | Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALAN
Ga0066651_100096932 | 3300006031 | Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPPELSRALAKVQAATK*
Ga0066696_105665012 | 3300006032 | Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITITRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0070765_1008859221 | 3300006176 | Soil | MEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTIVRLANWVLGRPMRSPLELSRPLAKVRAAMK*
Ga0066653_103542681 | 3300006791 | Soil | ENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066658_100223182 | 3300006794 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066665_101777653 | 3300006796 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPPELSRALANVQAATKRT*
Ga0066665_108246631 | 3300006796 | Soil | VEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0066665_108668761 | 3300006796 | Soil | LTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0066665_116088742 | 3300006796 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVHAVTK*
Ga0066659_103201513 | 3300006797 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLAPSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK*
Ga0066659_118691672 | 3300006797 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALAN
Ga0066660_108746832 | 3300006800 | Soil | MEENWYAGEQQVRDRLNEARAAARTGALNHGLAPSARRPNSVGITIIRLENWTLARAMQLFLGLSRALANVRAVTK*
Ga0075433_104741261 | 3300006852 | Populus Rhizosphere | MEENWYAVEQQIRDRLTEARAGARTWTMTQGPTPAARRPHTVGITIIRLGSWVWARAVQLPLELSRGFASVRAAMKDTASHRRDSS
Ga0075425_1002325383 | 3300006854 | Populus Rhizosphere | MEENWYAVEQQVRDRLNEARAAARTGALNHGLAPSARRPNSVGTTIIRLANWALARPMQLFLRFSRALANVRAVTK*
Ga0075424_1019293191 | 3300006904 | Populus Rhizosphere | RDRLTEARARARTWTMTQGPTPAARRPHAVWITIIRLGSWVWARAVQLPLELSRGFASVRAAMKDTASHRRDSSKVADSNT*
Ga0099794_107985261 | 3300007265 | Vadose Zone Soil | MEENWYAVEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKR
Ga0066710_1004794133 | 3300009012 | Grasslands Soil | MDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRRNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS
Ga0066710_1009407212 | 3300009012 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLALELSRALANVQAATKRT
Ga0066709_1004574842 | 3300009137 | Grasslands Soil | MDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0066709_1009751132 | 3300009137 | Grasslands Soil | MEENWYAVEQQVRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*
Ga0075423_107686052 | 3300009162 | Populus Rhizosphere | MEENWYAVEQQIRDRLTEARARARTWTMTQGPTPAARRPHAVWITIIRLGSWVWARAVQLPLELSRGFASVRAAMKDTASHRRDSSKVADGNT*
Ga0134109_100014812 | 3300010320 | Grasslands Soil | MEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0134109_104877572 | 3300010320 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMRLPLELSRALANVQAATKR
Ga0134067_104834042 | 3300010321 | Grasslands Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALANVQAATKRS*
Ga0134086_101402381 | 3300010323 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMRLPLELSRALAN
Ga0134086_103640151 | 3300010323 | Grasslands Soil | ENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLVNWVLARAMQLPLELSRALAKVQAATK*
Ga0134064_100014947 | 3300010325 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLVLSRALANVQAATKRS*
Ga0134065_101353752 | 3300010326 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELS
Ga0134080_100124345 | 3300010333 | Grasslands Soil | LSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0134080_100300212 | 3300010333 | Grasslands Soil | MEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALAKVQAATK*
Ga0134080_106512922 | 3300010333 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134063_101852342 | 3300010335 | Grasslands Soil | MDEDWYTVEQQIRDRLTEARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134071_106048882 | 3300010336 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALASVQAATK*
Ga0134062_101731732 | 3300010337 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS*
Ga0134066_102277672 | 3300010364 | Grasslands Soil | RDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK*
Ga0137391_100438204 | 3300011270 | Vadose Zone Soil | MDEAWYLVEQQIRDRLTEARAAARLRTPTQKPAQTGRRPNSVGITISRLASWVLARAMQLSLGLSRVLANVRAVTK*
Ga0137389_103093763 | 3300012096 | Vadose Zone Soil | MEEDWDLEQQIRDRLTEARAAARIRIPTQKLAPTPRRQNSVGITIIRLSNWVLARAMQLSLELSRALANARAATKRG*
Ga0137388_112222831 | 3300012189 | Vadose Zone Soil | MEENMYALEQQVRDRLTEARTAARARALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANARAATKRG*
Ga0137388_116603142 | 3300012189 | Vadose Zone Soil | MEEDWYTVEQQIRDRLTEARTAARIRSLSQGLAPAARRPHSVGTAFIRFASWVWARARELPPEGSGGVANVRTAREDTNHG*
Ga0137364_102181422 | 3300012198 | Vadose Zone Soil | VKAMEGDWYTVEQQIRDRLTEARAAVQIPTLTEKLAPRARRPNSVGITIIRLANWVLARAMQLRLEISRALANVQAATKRS*
Ga0137382_108524172 | 3300012200 | Vadose Zone Soil | MDEAWYIVEQQIRDRLTDARAAARMRPLTQKLALTARRRNSVGITIIRLANWVLARAMQLPLELSRALANVQATTKRS*
Ga0137379_108414912 | 3300012209 | Vadose Zone Soil | MEEKWYAVEQQVRDRLNEARAVARTGALNHGLAPSARRPNSVGITIIRLENWALARAMQLFLGLSRALANVRAVTK*
Ga0137371_104389312 | 3300012356 | Vadose Zone Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSR
Ga0137390_112523182 | 3300012363 | Vadose Zone Soil | MEENWYAVEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP*
Ga0137358_109943531 | 3300012582 | Vadose Zone Soil | MEENWYAVEQQIRDRLTEARAAARTWSLIHGLAPSARRPYSITVTFIPLASWVLARALGLPLKLSRALASVRAATKRYRSTNALFGGKESRS*
Ga0137396_107612251 | 3300012918 | Vadose Zone Soil | RTLRWREPQGVKRMEENWYAIEQQIRERLSEARAGARTWTLTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRHSTNALLPGKKSRP*
Ga0137416_103033222 | 3300012927 | Vadose Zone Soil | MEENWYAVEQQIRDRLTEARAGARTWALTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRQSTHALLPGKKSRP*
Ga0137404_100230876 | 3300012929 | Vadose Zone Soil | MEENWYAIEQQIRDRLTEARAGARTWILTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRQSTNALLPGKKSRP*
Ga0137407_122987521 | 3300012930 | Vadose Zone Soil | MPTVPGKECTPMEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRAVANVRAATKRG*
Ga0134110_100525962 | 3300012975 | Grasslands Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITITRLASWVLARAMQLPLVLSRALANVHAATKRS*
Ga0134110_100950881 | 3300012975 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0134110_102894483 | 3300012975 | Grasslands Soil | VEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK*
Ga0134087_101079851 | 3300012977 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPLELSRALANVQAATKRT*
Ga0134075_101505222 | 3300014154 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134075_105033121 | 3300014154 | Grasslands Soil | QGVKPMAENWYAVEQQVRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*
Ga0134079_103970521 | 3300014166 | Grasslands Soil | PMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPPELSRALAKVQAATK*
Ga0134072_103552362 | 3300015357 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT*
Ga0134089_102491021 | 3300015358 | Grasslands Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALASAMQLFLGLSRTLANVRAVTK*
Ga0134089_104865022 | 3300015358 | Grasslands Soil | LCRRAWKGLKPMEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT*
Ga0134069_11489162 | 3300017654 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRS
Ga0134112_101102272 | 3300017656 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK
Ga0134112_101429822 | 3300017656 | Grasslands Soil | QQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT
Ga0134083_100297233 | 3300017659 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVMARAMRLPLELSRALANVQAATKRT
Ga0066655_100149476 | 3300018431 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT
Ga0066655_100585983 | 3300018431 | Grasslands Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLAPSGRRPNSVGITIIRLENWALARAMQLFLGLSRTLANVRAVTK
Ga0066655_101203162 | 3300018431 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPTARRPNSVGITIIRQANWVLARAMQLALELSRALANVQAATKRT
Ga0066667_100659841 | 3300018433 | Grasslands Soil | ENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK
Ga0066667_101673631 | 3300018433 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSR
Ga0066667_117868181 | 3300018433 | Grasslands Soil | ENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT
Ga0066667_120100922 | 3300018433 | Grasslands Soil | MDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRS
Ga0066662_102943442 | 3300018468 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0066662_106118312 | 3300018468 | Grasslands Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK
Ga0066669_100605682 | 3300018482 | Grasslands Soil | MEDNWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRVTSWVLARAMRLPPELSRALAKVQAATK
Ga0066669_112200782 | 3300018482 | Grasslands Soil | MEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRS
Ga0210384_100170861 | 3300021432 | Soil | MEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTIVRLANWVLGRPMRSPWSSHARSPRCERR
Ga0207684_1000188318 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MEEDWYTVEQQVRDRLTEARAAARIWTLTPKPAATASHLNVVGITIIRLANWVLARAVRSPVDPSRALADVPVTTTRCESAAPERTSPCDWRSPDFRPAHRE
Ga0207646_101676703 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENWYAVEQQVRDRLTEARAAARIRTLTQRLAPAARRPHAVRVAFTRLTSWVWARTIGLPTALSHALANVRTATKRCQSTSALLAGKESRP
Ga0207646_102943562 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANVRAATKRG
Ga0207646_119424452 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MEENWYAVEQQIRDRVTEARAAARTRTLIHGLAPSARRPYSITIAFFRLASWVLARALGLPLKLSRALATSNVGGRNT
Ga0209350_10041052 | 3300026277 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNYVGITIIRLANWVLARAMRLPLELSRALAKVQAATK
Ga0209234_10580622 | 3300026295 | Grasslands Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMQLPLELSRALAKVQAATK
Ga0209234_11073012 | 3300026295 | Grasslands Soil | MDEAWYIVEQQIRDRLTEALAAARIRTLTQELAPTARRPNSVGITIIRLASWVLARAMQLPLELSRALANVQAATKRT
Ga0209235_10477483 | 3300026296 | Grasslands Soil | MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT
Ga0209237_10919712 | 3300026297 | Grasslands Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209027_11243502 | 3300026300 | Grasslands Soil | MEENWYAVEQQVRDRLNEARAAARTGALNHGLAPSARRPNSVGITIIRLENWALARAMQLCLGLSRALANVRAVTK
Ga0209468_100053712 | 3300026306 | Soil | MEENWYPVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVEITIIRLANWVLARAMRLPLELSRALANVQAATKRT
Ga0209761_12159591 | 3300026313 | Grasslands Soil | KPMEENWYAVEQQIRDRLTEARAAARIRSLTQKLAPTARRPNSVGITVIRLANWVLARAMLLPLELSRALANVQAATKRT
Ga0209152_100369152 | 3300026325 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209803_10267492 | 3300026332 | Soil | MEENWYAVEQQIRDRLTDARAAARIRSLTQKLAPTARRQYSVGITIVRLANWVLARAMQLPLELSRALANVQAATKRI
Ga0209158_10717682 | 3300026333 | Soil | MEENWYAAEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP
Ga0209377_11150992 | 3300026334 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWALARAMQLCLGLSRALAN
Ga0257170_10370782 | 3300026351 | Soil | MEEDWYAVEQQVRDRLNEARAAARTRTLIHGHALTARRPNSLGITIIRLENWVLACAMQLSLGLSRALANVRAVTK
Ga0257163_10326622 | 3300026359 | Soil | MEENWYAVEQQIRDRLTEARAGARTWTLTGGLVPAARRPHAVRVVFIRLRSWALARAMELPTELSRALAYVRTATKRRQSTDALLPGKESRP
Ga0257168_10476751 | 3300026514 | Soil | MEENWYAVEQQVRDRLSEARAAARIRTLTRGLVPAARRPLAVRIVFIGLTSWALARATKLSRALANVRTATKRWWQSTNAPLPGKESRR
Ga0257168_10564741 | 3300026514 | Soil | MEEDWYTVEQQVRDRLTEARAAGRIRTLPPKLAPTARRPNVVGITIIGLANWVLARAMRSLVDLSRALADVTVTTTRCESAAEKGRRH
Ga0209808_12382342 | 3300026523 | Soil | MEENWYAVEQQVRDRLSEARAAARIRALTQKLAPTARRPNSVGITIIRLANWVLARAMRLPLELSRALAKVQAATK
Ga0209690_10383633 | 3300026524 | Soil | MEENWYAVEQQVRDRLNEARAAARTGALNRGLALSGRRPNSVGITIIRLENWTLARAMQLFLGLSRALANVRAVTK
Ga0209056_102557243 | 3300026538 | Soil | LKPMEENWYAVEQQIRDRLTDARAAARIRTLTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209376_11172322 | 3300026540 | Soil | MEENWYAVEQQIRDRLTDARAAARIRTMTQKLAPTARRQNSVGITIIRLANWVLARAMQLPLELSRALANVQAATKRT
Ga0209156_100323084 | 3300026547 | Soil | MDEAWYIVEQQIRDRLTEALATARIRTLTQKLAPTARRPNSVGITIIRLASWVLARAMQLPLVLSRALANVQAATKRS
Ga0209161_103942091 | 3300026548 | Soil | MDEDWYTVEQQIRDRLTEARAAARIRALTEELAPTARRPTSVGIIRLASWVLARAMQLPPEL
Ga0209701_105578721 | 3300027862 | Vadose Zone Soil | MEENWYAVEQQIRDRLNEARAAARTRALIPKLAPSARRPYSIRLAVIRLAGRVLAQALELPLKLLRALDFTCAHANRR
Ga0209526_100413533 | 3300028047 | Forest Soil | MEENWYAVEQQIRDRLTEARARARTWTLIHGLAPSARRPYSITIALIPLASWVLARALWLSLKLSRALANARTATKRYQSTNALMGGKESRP
Ga0209526_101067301 | 3300028047 | Forest Soil | MEENWYAVEQQVRDRISEARAAARIRTLTRKVAPTARRPNSVGITISRLASWVSARAMQLSLGLSRALVNVAR
Ga0137415_101796473 | 3300028536 | Vadose Zone Soil | MEENWYAVEQQVRDRLSEARAAARIRTLTRGLAPVARRPHAIRVAFTHLASWVWARTIELPAALSHALANARTATKRCQSTNALLVGKEPRP
Ga0137415_101805643 | 3300028536 | Vadose Zone Soil | MEENWYAVEQQIRDRLTEARAGARTWALTGGLVPAARRPHAVRVVFIRLRSWALARAMEFTTELSRALANVRTATKRRQSTHALLPGKKSRP
Ga0307312_101634232 | 3300028828 | Soil | MEENWYAVEQQVRDRLSEARAAARIRTLTRGLVPAARRPHAVRIVFIGLTSWALARATKLSRALANVRTATKRWWQSANAPLPGKESRR
Ga0308309_100537201 | 3300028906 | Soil | MEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTIVRLANWVLGRPMRSPLELSRPLAKVRAAMK
(restricted) Ga0255311_11088571 | 3300031150 | Sandy Soil | MEENWYAVEQQVRDRLNEARAAARTRALVQNLAPTARRPNSVGFTIIRLANWILARAMQLPPELSRALAKMRGTRERRESA
Ga0307469_102476722 | 3300031720 | Hardwood Forest Soil | MEENWYALEQQIRDRLTEARAAARIRTLTAKLAPTARRPNAAGTTILRLANWVLGRPMRSPLELSRPLAKVRAAMK
Ga0307473_102103112 | 3300031820 | Hardwood Forest Soil | MEENWYELEQRIRDRLTEARAAARVRTLTQRVAPTARRPNSVGITIIRLANWVLACALQLPLELSRALVKVRAAPK
Ga0307471_1002738322 | 3300032180 | Hardwood Forest Soil | MEENWYAVEQQVRDRISEARAAARIRTLTRKVAPTARRPNSVGITISRLASWVSARAMQLSLGLSRALANVRAVTKVTSRERTPTMHQEETTSQRQPFKPGAW
Ga0307471_1010348262 | 3300032180 | Hardwood Forest Soil | SSGFTALPDAAVPGKECTPMEENMYALEQQVRDRLTEARTAARTRALVQQLAPTARRPYSVGIAFSDLASRVLARAREWPLELSRALANVRAATKRG
Ga0307471_1017916351 | 3300032180 | Hardwood Forest Soil | MEENWYAVEQQIRDRLTEARAGARTWAMTQGLAPAARRPHAVGITIIRLGNWVLARAMQLSLGLSRALANVRAVTKVTSREGTPTMHQEETT
Ga0307471_1036833341 | 3300032180 | Hardwood Forest Soil | MEENWYAVEQQVRDRLNEARAAARTRTLIHRPALTACGPNSVRIIITHLKNEVLARAMQLSLGL
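
Entries from the table above can be exported as FASTA for use in other tools. The following is a minimal sketch, assuming that a trailing `*` marks a translated stop codon and is not a residue (the page does not say so); the helper names `clean_sequence` and `to_fasta` are hypothetical, and the single record shown is copied from the table.

```python
from typing import Iterable, Tuple


def clean_sequence(seq: str) -> str:
    """Drop one trailing '*' (assumed here to be a stop-codon marker)."""
    return seq[:-1] if seq.endswith("*") else seq


def to_fasta(records: Iterable[Tuple[str, str]], width: int = 60) -> str:
    """Format (protein_id, sequence) pairs as FASTA text, wrapped at `width`."""
    lines = []
    for protein_id, seq in records:
        seq = clean_sequence(seq)
        # The residue count goes in the header so truncated members are easy to spot.
        lines.append(f">{protein_id} length={len(seq)}")
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


# One (Protein ID, Sequence) pair taken from the Family Sequences table.
records = [
    (
        "JGI25385J37094_100472941",
        "MEENWYAVEQQIRDRLTEARAAARIRTLTQKLAPKARRPNFVGITIIRLANWVLARAMQLALELSRALANVQAATKRT*",
    ),
]

print(to_fasta(records), end="")
```

Sequences without a trailing `*` pass through unchanged. Entries flagged as restricted remain subject to the JGI data usage policy noted above.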




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice