NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F036622

Metagenome Family F036622

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F036622
Family Type Metagenome
Number of Sequences 169
Average Sequence Length 69 residues
Representative Sequence MLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG
Number of Associated Samples 103
Number of Associated Scaffolds 169

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 22.22 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score0.42

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (94.675 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil
(31.361 % of family members)
Environment Ontology (ENVO) Unclassified
(63.314 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.089 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 19.39%    β-sheet: 11.22%    Coil/Unstructured: 69.39%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.42
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 169 Family Scaffolds
PF01850PIN 28.40
PF05977MFS_3 26.04
PF00923TAL_FSA 15.98
PF09948DUF2182 6.51
PF13559DUF4129 2.37
PF00496SBP_bac_5 1.78
PF00071Ras 1.78
PF07040DUF1326 1.78
PF12840HTH_20 1.78
PF07690MFS_1 1.18
PF02780Transketolase_C 1.18
PF02518HATPase_c 1.18
PF03060NMO 0.59
PF01061ABC2_membrane 0.59
PF07726AAA_3 0.59
PF00834Ribul_P_3_epim 0.59
PF14117DUF4287 0.59

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 169 Family Scaffolds
COG2814Predicted arabinose efflux permease AraJ, MFS familyCarbohydrate transport and metabolism [G] 26.04
COG0176Transaldolase/fructose-6-phosphate aldolaseCarbohydrate transport and metabolism [G] 15.98
COG5588Uncharacterized conserved protein, DUF1326 domainFunction unknown [S] 1.78
COG0036Pentose-5-phosphate-3-epimeraseCarbohydrate transport and metabolism [G] 0.59
COG0516IMP dehydrogenase/GMP reductaseNucleotide transport and metabolism [F] 0.59
COG2070NAD(P)H-dependent flavin oxidoreductase YrpB, nitropropane dioxygenase familyGeneral function prediction only [R] 0.59


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A94.67 %
All OrganismsrootAll Organisms5.33 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10000651All Organisms → cellular organisms → Bacteria9858Open in IMG/M
3300002560|JGI25383J37093_10000303All Organisms → cellular organisms → Bacteria10887Open in IMG/M
3300012206|Ga0137380_10013682All Organisms → cellular organisms → Archaea → Euryarchaeota7401Open in IMG/M
3300026296|Ga0209235_1000387All Organisms → cellular organisms → Archaea → Euryarchaeota21813Open in IMG/M
3300026313|Ga0209761_1006841All Organisms → cellular organisms → Bacteria7499Open in IMG/M
3300026326|Ga0209801_1004232All Organisms → cellular organisms → Archaea → Euryarchaeota8070Open in IMG/M
3300026328|Ga0209802_1001987All Organisms → cellular organisms → Archaea → Euryarchaeota14587Open in IMG/M
3300027490|Ga0209899_1000297All Organisms → cellular organisms → Archaea → Euryarchaeota9827Open in IMG/M
3300027882|Ga0209590_10000453All Organisms → cellular organisms → Archaea → Euryarchaeota13438Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil31.36%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil27.22%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil23.67%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.28%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand4.14%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil2.37%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.18%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.59%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.59%
Fracking WaterEnvironmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Fracking Water0.59%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002120Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2EnvironmentalOpen in IMG/M
3300002503Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3EnvironmentalOpen in IMG/M
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009813Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300025327Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027068Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027169Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027181Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027490Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300034073Fracking water microbial communities from deep shales in Oklahoma, United States - MC-6-XLEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
C687J26616_1001033223300002120SoilMIAKVRRWGNGLALRVRKRDLERAGVSEGDVVQVDVKPVPKGGTLDLDRLPTFEDADPRASVRHDRYLYG*
C687J35164_1013490913300002503SoilMIAKVRRWGNGLALRVRKRDLERAGVSEGDVVQVDVKPVPKGGTLDLDRLPTFEDADPRASV
JGI25381J37097_100655843300002557Grasslands SoilMLAKVRRWGNGLALRVHKDDLASVGIAEGDVVQIELVRRPDRRLDLKGLPTFEDDDPRASLRHDRYLYR*
JGI25381J37097_101671723300002557Grasslands SoilMLAKVRRWGNGLALRVHKEDLESAGVAEGDVVQIELTRRPDRGLDLKGLPTFEDEDPRASLRHDRYLYG*
JGI25385J37094_1000065193300002558Grasslands SoilMLAKVRRWGNGLALRVHKRDLESVGVAEGDVVQVELIRSSKRGQLDLKSLPTFEDADKRASERHDRYLYX*
JGI25385J37094_1000641663300002558Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
JGI25385J37094_1000656753300002558Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG*
JGI25385J37094_1001406233300002558Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRPRESGRLDLQSLPTFEDKDPRASVRHDRYLYG*
JGI25385J37094_1014296623300002558Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDXDKRASERHDRYLYR*
JGI25383J37093_1000030353300002560Grasslands SoilMLAKVRRWGNGLALRVHKRDLESVGVAEGDVVXVELIRSSKRGQLDLKSLPTFEDADKRASERHDRYLYX*
JGI25383J37093_1000683833300002560Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRLDLASLPTFEDNDKRASERHDRYLYG*
JGI25383J37093_1000841933300002560Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDEDKRASERHDRYLYR*
JGI25383J37093_1003615413300002560Grasslands SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG
JGI25384J37096_10001135113300002561Grasslands SoilMLAKVRRWGNGLALRVHKRDLESVGVAEGDVVQVELIRSSKRGQLDLKSLPTFEDADKRASERHDRYLYG*
JGI25384J37096_1002015023300002561Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDGDKRASERHDRYLYR*
JGI25384J37096_1002890923300002561Grasslands SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG*
JGI25382J37095_1023134813300002562Grasslands SoilWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDGDKRASERHDRYLYR*
JGI25382J43887_1002720533300002908Grasslands SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
JGI25382J43887_1002933543300002908Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIQPRESGRLDLNSLPTFEDEDPRASVRHDRFLYG*
JGI25382J43887_1003154133300002908Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRLDLTSLPTFEDDDKRASERHDRYLYG*
JGI25390J43892_1000099753300002911Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDXDKRASERHDRXLYR*
JGI25386J43895_1006767713300002912Grasslands SoilRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
JGI25386J43895_1016515423300002912Grasslands SoilVYTLMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDEDKRASERHDRYLYR*
JGI25389J43894_103254723300002916Grasslands SoilMLAKVRRWGNGLALRVHKDDLASVGIAEGDVVQIELVRRPDRRLDLKGLPTFEDDDP
Ga0066674_1000644153300005166SoilMYAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG*
Ga0066672_1004995913300005167SoilMLAKVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG*
Ga0066677_1027380433300005171SoilVSTLCVYMYAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG*
Ga0066683_1037910223300005172SoilMFAKVRRWGNGLALRVHKKDLESAGISEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
Ga0066680_1019792913300005174SoilGGVYTLCRYMLAKVRRWGNGLALRVHKEDLERAGVAEGDVVQIELTRPPDRGLDLDGLPTFEDDDPKASLRHDRYLYG*
Ga0066680_1060406913300005174SoilRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
Ga0066679_1066857023300005176SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG*
Ga0066688_1088422713300005178SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG*
Ga0066685_1011091623300005180SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESRRLDLKSLPTFEDEDPRASVRHDRYLYG*
Ga0066676_1085819123300005186SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPDRGLLDLKSLPTFEDEDKRASQRHDRYLYG*
Ga0066686_1024735133300005446SoilMFAKVRRWGNGLALRVHKKDLESAGIREGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0066686_1061819923300005446SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGLLDLKSLPTFEDEDKRASQRHDRYLYG*
Ga0066682_1003855643300005450SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEWGQLDLKSLPTFEDGDKRASERHDRYLYR*
Ga0066701_1009099633300005552SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYR*
Ga0066701_1049387723300005552SoilMLAKVRRWGNGLALRVLKEDLETAGIAEGDTVQIEVIGRPGRGLDLGSLPTFEDDDSRASQRHDRYLYG*
Ga0066692_1004728123300005555SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0066692_1038358723300005555SoilMLAKVRRWGNGLALRVLKEDLEMAGIAEGDTVQIEVIRRPGRGLDLGSLPTFEDDD
Ga0066707_1005671323300005556SoilMLAKVRRWGNGLALRVHKEDLERAGVAEGDVVQIELTRPPDRGLDLDGLPTFEDDDPKASLRHDRYLYG*
Ga0066698_1039328333300005558SoilLALRVHKDDLEAAGVAEGDVVQIELTRRPDGGLDLEGLPTFEDEDPRASLRHDRYLYG*
Ga0066700_1003817343300005559SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRLDLASLPTFEDDDKRASERHDRYLYG*
Ga0066691_1033517913300005586SoilMLAKVRRWGNGLALRVLKEDLEMAEIAEGDTVQIEVIRRPGRGLDLGSLPTFEDDDPRASQRHDRYLYG*
Ga0066706_1048186213300005598SoilGGVYTLCRYMLAKVRRWGNGLALRVHKEDLERAGVAEGDVVQIELTRRPDRGLDLDGLPTFEDDDPKASLRHDRYLYG*
Ga0066696_1068727213300006032SoilSTLCVYMYAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG*
Ga0066656_1030789733300006034SoilMFAKVRRWGNGLALRVHKKDLELAGIREGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0066665_1004658433300006796SoilMLAKVRRWGNGLALRVHKEDLERAGVAEGDVVQIELTRRPDRGLDLDGLPTFEDDDPKASLRHDRYLYG*
Ga0099793_1000419253300007258Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLESAGVAEGDVVQIELARRPDRGLDLKALPTFEDEDPRASLRHDRYLYR*
Ga0099793_1002990523300007258Vadose Zone SoilMLAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
Ga0099793_1024714713300007258Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGFDLEGLPTFEDEDPRASLRHDRYLYG*
Ga0099793_1034654723300007258Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLESVGVAEGDVVQIELARRPERGLDLKSLPTFMDDDPKASLRHDRYLYR*
Ga0099793_1059892823300007258Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPKRGQLDLKSLPTFEDEDKRASERHDRYLYG*
Ga0099794_1023182413300007265Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPKRGQLDLKSLPTFEDADKRASERHDRYLYG*
Ga0099794_1041961913300007265Vadose Zone SoilMIAKVRRWGNGLALRVLRKDLESQGVSEGDVVQLELTRVSVLGRIDLTTLPTFEDKDPRTSLRHDRYLYG*
Ga0066710_10032810523300009012Grasslands SoilMLAKVRRWGNGLALRVHKEDLERAGVAEGDVVQIELTRRPDRGLDLDGLPTFEDDDPKASLRHDRYLYG
Ga0066710_10075130233300009012Grasslands SoilMFAKVRRWGNGLALRVHKKDLELAGIREGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG
Ga0066710_10408032623300009012Grasslands SoilVYMFAKVRRWGNGLALRVHTKDLESAGIGEGDVVQVELVRPAESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0099829_1045426223300009038Vadose Zone SoilMLAKVRRWGNGLALRVHKQDLESAGVAEGDIVQVELIRSPKRGQLDLKSLPTFEDADKRASERHDRYLYG*
Ga0099830_1004996223300009088Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDIVQVELIRSPKRGQLDLKSLPTFEDADKRASERHDRYLYG*
Ga0099827_1002041423300009090Vadose Zone SoilMFAKVRRWGNGLALRVHKKDLESAGIREGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDSYLYG*
Ga0099827_1005892933300009090Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLESAGIAEGDLVQIELTRRPDRGLDLKSLPTFEDDDPKASLRHDRYLYR*
Ga0099827_1079187813300009090Vadose Zone SoilMLAKVRRWGNGLALRVHKEDLESAGVADGDVVQIELTRRPDRGLDLKGLPTFEDEDPKTSLRHDRYLYG*
Ga0066709_10286968023300009137Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIREGDVVQVELIGPRESGRLDLNSLPTFEDEDPRAGVRHDRYPHG*
Ga0105057_100177413300009813Groundwater SandMLAKVRRWGNGLALRVHKNDLESAGVREGDVVQVELIRSPERGRLDLTSLPTFEDKDPRTSVRHDRYLYG*
Ga0105057_103360523300009813Groundwater SandMIAKVRRWGNGLALRVHKKDLESAGLSEGDVVQVELIRAPGRVTLGLGELPTFIDDDPKASLHHDRYLYG*
Ga0134088_1006513623300010304Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGLLDLKSLPTFEDEDKQASQRHDRYLYG*
Ga0134088_1009762423300010304Grasslands SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDGGLDLEGLPTFEDEDPRASLRHDRYLYG*
Ga0134088_1035059323300010304Grasslands SoilMFAKVRRWGNGLALRVHKKDLELAGIREGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRH
Ga0134088_1051427413300010304Grasslands SoilMLAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDPKSLPTFEDEDPRASVRHDRYLYG*
Ga0134071_1002185113300010336Grasslands SoilWGNGLALRVHKKDLESAGVGEGEEVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG*
Ga0137389_1073302913300012096Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDIVQVELIRSPKRGQLDLKSLPMFEDADKRASERHDRYLYG*
Ga0137364_1049763613300012198Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGLDLEGLPTFEDKDPRASLRHDRYLYG*
Ga0137399_1007946313300012203Vadose Zone SoilMLAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRDSGRLDLKSLPTFEDEDPRTSVRHDRYLYG*
Ga0137399_1010386023300012203Vadose Zone SoilMLAKVRRWGNGVALRVHKDDLAAAGISEGDVVQIEMTHRPERGLDLKTLPTFQDDDPKTSLRHDRYLYR*
Ga0137399_1027976423300012203Vadose Zone SoilMLAKVRRWGNGLALRVHKEDLESAGVAEGDVVQIELTRRPERGLDLKGLPTFEDKDPKTSLRHDRYLYG*
Ga0137374_1014762123300012204Vadose Zone SoilMLAKVRRWGNGLALRVHKNDLESAGIAEGDTVQIEVTRPPGRGIDLGSLPTFEDEDPRASQRHDRYLYG*
Ga0137380_1000999593300012206Vadose Zone SoilMLAKVRRWGNGLALRVLKEDLETAGIAEGDTVQIEVIRRPGRGLDLGSLPTFEDDDSRASQRHDRYLYG*
Ga0137380_1001368283300012206Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRFDLTSLPTFEDDDKRASERHDRYLYG*
Ga0137380_1001589453300012206Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLKAAGVAEGDVVQIELTRRPDRGLDLESLPTFEDEDPRASLRHDRYLYR*
Ga0137380_1002450163300012206Vadose Zone SoilMFAKVRRWGNGLALRVHRKDLESAGVGEGDVVQVELIRPRERGRLDLESLPTFEDEDPRASVRHDRYLYG*
Ga0137380_1053313113300012206Vadose Zone SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRSRESGRMDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0137381_1062399933300012207Vadose Zone SoilKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG*
Ga0137381_1068740923300012207Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAAGDVVQIELTRRPDRGLDLESLPTFEDEDPRASLRHDRYLYR*
Ga0137381_1093825523300012207Vadose Zone SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRSGESGRMDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0137381_1122010713300012207Vadose Zone SoilMFAKVRRWGTGLALRVNKKDLESAGIGEGDVVQVELIRARESGRMDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0137376_1011091523300012208Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLDGLPTFEDEDPRASLRHDRYLYG*
Ga0137376_1046930923300012208Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYR*
Ga0137377_1052279613300012211Vadose Zone SoilRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG*
Ga0137387_1001718323300012349Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGLDLESLPTFEDEDPRASLRHDRYLYG*
Ga0137386_1006476423300012351Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELIRRPDRGLDLESLPTFEDEDPRASLRHDRYLYG*
Ga0137371_1128735723300012356Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGIAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG*
Ga0137385_1024064423300012359Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELIRRPDRGLDLESLPTFEDEDPRASLRHDRYLYR*
Ga0137419_1003117623300012925Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLESAGVSEGDVGQIELARRPDRGLDLKVLPTFEDEDPRASLRHDRYLYR*
Ga0137416_1001625533300012927Vadose Zone SoilMLAKVRRWGNGLALRVHKKDLESAGIDEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG*
Ga0134077_1023562513300012972Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRY
Ga0134110_1009039413300012975Grasslands SoilMYAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPR
Ga0134076_1027057813300012976Grasslands SoilRWGNGLALRVHKKDLELAGIREGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG*
Ga0134089_1029806413300015358Grasslands SoilALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG*
Ga0134085_1000873553300015359Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDGDKRASERHDRDLYR*
Ga0134112_1015745623300017656Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRARASGRLDLNSLPTFEDEDPRASVRHDRYLYG
Ga0134112_1017900413300017656Grasslands SoilALRVHKKDLESAGISEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0134083_1009238933300017659Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEVDVVQVELIRARASGRLDLNSLPTFEDEDPRASVRHDRYLYG
Ga0134083_1017947713300017659Grasslands SoilNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGLLDLKSLPTFEDEDKRASQRHDRYLY
Ga0066655_1003390133300018431Grasslands SoilVSTLCVYMYAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG
Ga0066655_1013978023300018431Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDIVQVELIRSPEGGQLDLKSLPTFEDEDKRASERHDRYLYR
Ga0066655_1057747233300018431Grasslands SoilLAKVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG
Ga0066655_1068094923300018431Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIREGDVVQVELIRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG
Ga0066655_1092001513300018431Grasslands SoilWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESRRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0066655_1125445623300018431Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG
Ga0066667_1007388233300018433Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDGDKRASERHDRYLYR
Ga0066667_1011200523300018433Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGMAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG
Ga0066662_1002097043300018468Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDEDKRASERHDRYLYR
Ga0066662_1047993723300018468Grasslands SoilMLAKVRRWGNGLALRVHKEDLERAGVVEGDVVQIELTRRPDRGLDLDGLPTFEDDDPKASLRHDRYLYG
Ga0066662_1091217833300018468Grasslands SoilALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0066662_1091459423300018468Grasslands SoilMLAKVRRWGNGLALRVLKEDLETTGIAEGDTVQIEIIGRPGRGLDLGSLPTFEDDDPRASQRHDRYLYG
Ga0066662_1283994013300018468Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLNSLPTFEDEDPRASVRHDRYLYG
Ga0137417_105103423300024330Vadose Zone SoilMLAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0137417_114078353300024330Vadose Zone SoilVYTLCVHMLAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0137417_114078413300024330Vadose Zone SoilVYTLCVHMLAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVDLVRPRESGRLDLKSLPTFEDEDPRASV
Ga0209431_1000302853300025313SoilMIAKVRRWGNGLALRVRKRDLERAGVSEGDVVQVDVKPVPKGGTLDLDRLPTFEDADPRASVRHDRYLYG
Ga0209519_1065152623300025318SoilVRRWGNGLALRVRKRDLERAGVSEGDVVQVDVKPVPKGGTLDLDRLPTFEDADPRASVRHDRYLYG
Ga0209751_1099430313300025327SoilMLAKVRRWGNGLALRVHTKDLASAGISDGDLVEVELTRVPGLGGLTTKALPTFEDDDPRASQRHDQY
Ga0209234_100191983300026295Grasslands SoilMLAKVRRWGNGLALRVHKEDLESAGVAEGDVVQIELTRRPDRGLDLKGLPTFEDEDPRASLRHDRYLYG
Ga0209234_100487243300026295Grasslands SoilMLAKVRRWGNGLALRVHKDDLASVGIAEGDVVQIELVRRPDRRLDLKGLPTFEDDDPRASLRHDRYLYR
Ga0209234_101491723300026295Grasslands SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGLDLEGLPTFEDDDPRASLRHDRYLYG
Ga0209235_1000387183300026296Grasslands SoilMLAKVRRWGNGLALRVHKRDLESVGVAEGDVVQVELIRSSKRGQLDLKSLPTFEDADKRASERHDRYLYG
Ga0209235_100883533300026296Grasslands SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRPRESGRLDLQSLPTFEDKDPRASVRHDRYLYG
Ga0209235_101958433300026296Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRLDLASLPTFEDDDKRASERHDRYLYG
Ga0209235_112699323300026296Grasslands SoilMLAKVRRWGNGLALRVHKRDLEAAGVAEGDVVQVELIRSPERGLLDLKSLPTFEDEDKRASQRHDRYLYG
Ga0209237_108206633300026297Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGLLDLKSLPTFEDEDKQASQRHDRYLYG
Ga0209237_110463013300026297Grasslands SoilDVYTLMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDEDKRASERHDRYLYR
Ga0209236_1000787243300026298Grasslands SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0209027_125842123300026300Grasslands SoilMLAKVRRWGNGLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG
Ga0209238_110432523300026301Grasslands SoilLALRVHKDDLEAAGVAEGDVVQIELTRRPDGGLDLEGLPTFEDEDPRASLRHDRYLYG
Ga0209055_113715623300026309SoilMLAKVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLDGLPTFEDEDPRASLRHDRYLYG
Ga0209761_100038693300026313Grasslands SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDGDKRASERHDRDLYR
Ga0209761_100684133300026313Grasslands SoilMFAKVRRWGNGLALRVHKKDLESAGISEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0209686_121333413300026315SoilVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG
Ga0209470_101509063300026324SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGLLDLKSLPTFEDEDKRASQRHDRYLYG
Ga0209152_1001336223300026325SoilMLAKVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLAGLPTFEDEDPRASLRHDRYLYG
Ga0209801_100423283300026326SoilMGEWPLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG
Ga0209802_1001987183300026328SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRLDLASLPTFEDNDKRASERHDRYLYG
Ga0209802_125536423300026328SoilMLAKVRRWGNGLALRVHKEDLERAGVVEGDVVQIELTRPPDRGLDLDGLPTFEDDDPKASLRHDRYLYG
Ga0209158_106366043300026333SoilYTLCRYMLAKVRRWGNGLALRVHKEDLESAGVAEGDVVQIELTRRPDRGLDLKGLPTFEDEDPRASLRHDRYLYG
Ga0209158_108736133300026333SoilAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGPLDLKSLPTFEDEDKRASQRHDRYLYG
Ga0209158_111697723300026333SoilMLAKVRRWGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLYG
Ga0209057_100991713300026342SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPEGGQLDLKSLPTFEDEDKRASERHDRDLYR
Ga0209690_105562723300026524SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGISEGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0209690_121849323300026524SoilGNGLALRVHKDDLETAGIAEGDVVQIELTRRPDRGLDLEGLPTFEDEDPRASLRHDRYLY
Ga0209378_1001176143300026528SoilMYAKVRRWGNGLALRVHKKDLESAGVGEGDVVQVELIRLRESGRLDLRSLPVFEDEDPRASVRHDRYLYG
Ga0209058_107867713300026536SoilVDVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESRRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0209157_119844333300026537SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPERGRLDLTSLPTFEDDDKRASERHDRYLYG
Ga0209376_128755923300026540SoilVYTLCVYMFAKVRRWGNGLALRVHKKDLESAGIGEGDVVQVELIRPRESRRLDLKSLPTFEDEDPRASVRHDRYLYG
Ga0209898_101030023300027068Groundwater SandMLAKVRRWGNGLALRVHKKDLESAGVREGDVVQVELIRSPERGRLDLTSLPTFEDKDPRTSVRHDRYLYG
Ga0209897_100543433300027169Groundwater SandMLAKVRRWGNGLALRVHKNDLESAGVREGDVVQVELIRSPERGRLDLTSLPTFEDKDPRTSVRHDRYLYG
Ga0208997_103907623300027181Forest SoilMLAKVRRWGNGVALRVHKDDLAAAGISEGDVVQIEMTHRPERGLDLKTLPTFQDDDPKTSLRHDRYLYR
Ga0209899_100029733300027490Groundwater SandVVYTLRVYMLAKVRRWGNGLALRVHKKDLESAGVREGDVIQVELIRSPERGRLDLTSLPTFEDKDPRTSVRHDRYLYG
Ga0209076_115127523300027643Vadose Zone SoilLALRVHKDDLEAAGVAEGDVVQIELTRRPDRGFDLEGLPTFEDEDPRASLRHDRYLYG
Ga0209588_112949613300027671Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDVVQVELIRSPKRGQLDLKSLPTFEDADKRASERHDRYLYG
Ga0209180_1003495833300027846Vadose Zone SoilMLAKVRRWGNGLALRVHKQDLESAGVAEGDIVQVELIRSPKRGQLDLKSLPTFEDADKRASERHDRYLYG
Ga0209283_1001095323300027875Vadose Zone SoilMLAKVRRWGNGLALRVHKRDLESAGVAEGDIVQVELIRSPKRGQLDLKSLPTFEDADKRASERHDRYLYG
Ga0209590_1000045323300027882Vadose Zone SoilVYTLCVHMFAKVRRWGNGLALRVHKKDLESAGIREGDVVQVELVRPRESGRLDLKSLPTFEDEDPRASVRHDSYLYG
Ga0209590_1003844033300027882Vadose Zone SoilMLAKVRRWGNGLALRVHKDDLESAGIAEGDLVQIELTRRPDRGLDLKSLPTFEDDDPKASLRHDRYLYR
Ga0209868_101578523300027947Groundwater SandMIAKIRRWGNGLALRVHKKDLESAGLSEGDVVQVELIRAPGRVTLGLGELPTFIDDDPKASLHHDRYLYG
Ga0209853_100677433300027961Groundwater SandVVYTLRVYMLAKVRRWGNGLALRVHKNDLESAGVREGDVVQVELIRSPERGRLDLTSLPTFEDKDPRTSVRHDRYLYG
Ga0137415_1085050623300028536Vadose Zone SoilMLAKVRRWGNGLALRVHKEDLESAGVAEGDVVQIELTRRPDRGLDLKGLPTFEDEDPKTSLRHDRYLYG
Ga0310130_0133495_167_3793300034073Fracking WaterMFAKVRRWGNGLALRVHKKDLESAGVGEGDVVEVELIRSPDRGRLDLRSLPTFEDKDPRASVRHDRYLYG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.