NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F093626

Metagenome Family F093626

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F093626
Family Type Metagenome
Number of Sequences 106
Average Sequence Length 261 residues
Representative Sequence MQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEALYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVAEAGTGGVA
Number of Associated Samples 82
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 15.09 %
% of genes near scaffold ends (potentially truncated) 31.13 %
% of genes from short scaffolds (< 2000 bps) 44.34 %
Associated GOLD sequencing projects 59
AlphaFold2 3D model prediction Yes
3D model pTM-score0.26

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (100.000 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(38.679 % of family members)
Environment Ontology (ENVO) Unclassified
(71.698 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(72.642 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 71.96%    β-sheet: 0.00%    Coil/Unstructured: 28.04%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.26
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 106 Family Scaffolds
PF14018DUF4234 15.09
PF01944SpoIIM 14.15
PF08669GCV_T_C 6.60
PF13456RVT_3 5.66
PF14470bPH_3 2.83
PF00248Aldo_ket_red 0.94
PF04032Rpr2 0.94
PF05168HEPN 0.94
PF00075RNase_H 0.94
PF06736TMEM175 0.94
PF00832Ribosomal_L39 0.94
PF00583Acetyltransf_1 0.94
PF03587EMG1 0.94
PF01571GCV_T 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 106 Family Scaffolds
COG1300Stage II sporulation protein SpoIIM, component of the engulfment complexCell cycle control, cell division, chromosome partitioning [D] 14.15
COG1756rRNA pseudouridine-1189 N-methylase Emg1, Nep1/Mra1 familyTranslation, ribosomal structure and biogenesis [J] 0.94
COG1895HEPN domain protein, predicted toxin of MNT-HEPN systemDefense mechanisms [V] 0.94
COG2023Ribonuclease P protein subunit RPR2Translation, ribosomal structure and biogenesis [J] 0.94
COG2167Ribosomal protein L39ETranslation, ribosomal structure and biogenesis [J] 0.94
COG2250HEPN domain protein, predicted toxin of MNT-HEPN systemDefense mechanisms [V] 0.94
COG3548Uncharacterized membrane proteinFunction unknown [S] 0.94


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10000991All Organisms → cellular organisms → Archaea8620Open in IMG/M
3300002558|JGI25385J37094_10016828All Organisms → cellular organisms → Archaea2617Open in IMG/M
3300002558|JGI25385J37094_10069125All Organisms → cellular organisms → Archaea1132Open in IMG/M
3300002560|JGI25383J37093_10000317All Organisms → cellular organisms → Archaea10766Open in IMG/M
3300002561|JGI25384J37096_10120296All Organisms → cellular organisms → Archaea882Open in IMG/M
3300002908|JGI25382J43887_10000064All Organisms → cellular organisms → Archaea22479Open in IMG/M
3300002908|JGI25382J43887_10003973All Organisms → cellular organisms → Archaea6829Open in IMG/M
3300002908|JGI25382J43887_10013516All Organisms → cellular organisms → Archaea4225Open in IMG/M
3300002912|JGI25386J43895_10010237All Organisms → cellular organisms → Archaea2668Open in IMG/M
3300002912|JGI25386J43895_10044388All Organisms → cellular organisms → Archaea1291Open in IMG/M
3300002916|JGI25389J43894_1053807All Organisms → cellular organisms → Archaea680Open in IMG/M
3300005167|Ga0066672_10306205All Organisms → cellular organisms → Archaea1033Open in IMG/M
3300005174|Ga0066680_10054822All Organisms → cellular organisms → Archaea2349Open in IMG/M
3300005174|Ga0066680_10093440All Organisms → cellular organisms → Archaea1830Open in IMG/M
3300005176|Ga0066679_10039082All Organisms → cellular organisms → Archaea2669Open in IMG/M
3300005177|Ga0066690_10058281All Organisms → cellular organisms → Archaea2378Open in IMG/M
3300005180|Ga0066685_10926644All Organisms → cellular organisms → Archaea580Open in IMG/M
3300005186|Ga0066676_10127923All Organisms → cellular organisms → Archaea1575Open in IMG/M
3300005187|Ga0066675_10502203All Organisms → cellular organisms → Archaea907Open in IMG/M
3300005446|Ga0066686_10130129All Organisms → cellular organisms → Archaea1649Open in IMG/M
3300005447|Ga0066689_10077339All Organisms → cellular organisms → Archaea1858Open in IMG/M
3300005540|Ga0066697_10038796All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → Archaeoglobaceae → Archaeoglobus → Archaeoglobus veneficus2666Open in IMG/M
3300005552|Ga0066701_10156795All Organisms → cellular organisms → Archaea1374Open in IMG/M
3300005555|Ga0066692_10027888All Organisms → cellular organisms → Archaea2958Open in IMG/M
3300005555|Ga0066692_10154718All Organisms → cellular organisms → Archaea1410Open in IMG/M
3300005556|Ga0066707_10017077All Organisms → cellular organisms → Archaea3762Open in IMG/M
3300005559|Ga0066700_10054077All Organisms → cellular organisms → Archaea2480Open in IMG/M
3300005568|Ga0066703_10027343All Organisms → cellular organisms → Archaea3027Open in IMG/M
3300005568|Ga0066703_10206480All Organisms → cellular organisms → Archaea1193Open in IMG/M
3300005586|Ga0066691_10034453All Organisms → cellular organisms → Archaea2638Open in IMG/M
3300005586|Ga0066691_10154932All Organisms → cellular organisms → Archaea1320Open in IMG/M
3300006794|Ga0066658_10000471All Organisms → cellular organisms → Archaea14644Open in IMG/M
3300006796|Ga0066665_10036262All Organisms → cellular organisms → Archaea3290Open in IMG/M
3300006797|Ga0066659_10047096All Organisms → cellular organisms → Archaea2700Open in IMG/M
3300006797|Ga0066659_10428987All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → Archaeoglobaceae → Archaeoglobus → Archaeoglobus veneficus1046Open in IMG/M
3300009012|Ga0066710_100460081All Organisms → cellular organisms → Archaea1910Open in IMG/M
3300009088|Ga0099830_10023950All Organisms → cellular organisms → Archaea4025Open in IMG/M
3300009088|Ga0099830_10721062All Organisms → cellular organisms → Archaea821Open in IMG/M
3300009089|Ga0099828_10011645All Organisms → cellular organisms → Archaea6586Open in IMG/M
3300009090|Ga0099827_10028278All Organisms → cellular organisms → Archaea4011Open in IMG/M
3300009090|Ga0099827_10297177All Organisms → cellular organisms → Archaea1366Open in IMG/M
3300009137|Ga0066709_100270345All Organisms → cellular organisms → Archaea2289Open in IMG/M
3300010303|Ga0134082_10051657All Organisms → cellular organisms → Archaea1577Open in IMG/M
3300010335|Ga0134063_10166781All Organisms → cellular organisms → Archaea1027Open in IMG/M
3300010335|Ga0134063_10360647All Organisms → cellular organisms → Archaea707Open in IMG/M
3300011271|Ga0137393_10435685All Organisms → cellular organisms → Archaea1123Open in IMG/M
3300012189|Ga0137388_10234373All Organisms → cellular organisms → Archaea1666Open in IMG/M
3300012199|Ga0137383_10005171All Organisms → cellular organisms → Archaea8364Open in IMG/M
3300012201|Ga0137365_10062920All Organisms → cellular organisms → Archaea2801Open in IMG/M
3300012204|Ga0137374_10058775All Organisms → cellular organisms → Archaea3881Open in IMG/M
3300012206|Ga0137380_10014818All Organisms → cellular organisms → Archaea7112Open in IMG/M
3300012206|Ga0137380_10200929All Organisms → cellular organisms → Archaea1808Open in IMG/M
3300012206|Ga0137380_10825760All Organisms → cellular organisms → Archaea799Open in IMG/M
3300012207|Ga0137381_10167946All Organisms → cellular organisms → Archaea1893Open in IMG/M
3300012209|Ga0137379_10028953All Organisms → cellular organisms → Archaea5332Open in IMG/M
3300012209|Ga0137379_10064163All Organisms → cellular organisms → Archaea3535Open in IMG/M
3300012350|Ga0137372_10034918All Organisms → cellular organisms → Archaea4582Open in IMG/M
3300012351|Ga0137386_10303295All Organisms → cellular organisms → Archaea1149Open in IMG/M
3300012353|Ga0137367_10061572All Organisms → cellular organisms → Archaea2801Open in IMG/M
3300012358|Ga0137368_10127520All Organisms → cellular organisms → Archaea1916Open in IMG/M
3300012363|Ga0137390_10176490All Organisms → cellular organisms → Archaea2123Open in IMG/M
3300012532|Ga0137373_10273557All Organisms → cellular organisms → Archaea1354Open in IMG/M
3300012685|Ga0137397_10442347All Organisms → cellular organisms → Archaea969Open in IMG/M
3300012918|Ga0137396_10006979All Organisms → cellular organisms → Archaea6712Open in IMG/M
3300012918|Ga0137396_10119399All Organisms → cellular organisms → Archaea1899Open in IMG/M
3300012977|Ga0134087_10063187All Organisms → cellular organisms → Archaea1486Open in IMG/M
3300014150|Ga0134081_10020267All Organisms → cellular organisms → Archaea1866Open in IMG/M
3300014154|Ga0134075_10234624All Organisms → cellular organisms → Archaea792Open in IMG/M
3300015358|Ga0134089_10176146All Organisms → cellular organisms → Archaea853Open in IMG/M
3300015359|Ga0134085_10070849All Organisms → cellular organisms → Archaea1418Open in IMG/M
3300017656|Ga0134112_10164006All Organisms → cellular organisms → Archaea859Open in IMG/M
3300017657|Ga0134074_1096359All Organisms → cellular organisms → Archaea1015Open in IMG/M
3300018468|Ga0066662_10030298All Organisms → cellular organisms → Archaea3200Open in IMG/M
3300021046|Ga0215015_10415576All Organisms → cellular organisms → Archaea3182Open in IMG/M
3300026295|Ga0209234_1006254All Organisms → cellular organisms → Archaea4515Open in IMG/M
3300026296|Ga0209235_1000249All Organisms → cellular organisms → Archaea26458Open in IMG/M
3300026296|Ga0209235_1076706All Organisms → cellular organisms → Archaea1505Open in IMG/M
3300026296|Ga0209235_1191712All Organisms → cellular organisms → Archaea736Open in IMG/M
3300026297|Ga0209237_1003295All Organisms → cellular organisms → Archaea9808Open in IMG/M
3300026297|Ga0209237_1030218All Organisms → cellular organisms → Archaea2967Open in IMG/M
3300026297|Ga0209237_1043248All Organisms → cellular organisms → Archaea2335Open in IMG/M
3300026298|Ga0209236_1004024All Organisms → cellular organisms → Archaea8973Open in IMG/M
3300026298|Ga0209236_1013311All Organisms → cellular organisms → Archaea4779Open in IMG/M
3300026301|Ga0209238_1000247All Organisms → cellular organisms → Archaea17856Open in IMG/M
3300026313|Ga0209761_1017217All Organisms → cellular organisms → Archaea4583Open in IMG/M
3300026317|Ga0209154_1116863All Organisms → cellular organisms → Archaea1129Open in IMG/M
3300026318|Ga0209471_1021046All Organisms → cellular organisms → Archaea3287Open in IMG/M
3300026325|Ga0209152_10000840All Organisms → cellular organisms → Archaea12999Open in IMG/M
3300026328|Ga0209802_1000465All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon31280Open in IMG/M
3300026328|Ga0209802_1032938All Organisms → cellular organisms → Archaea2703Open in IMG/M
3300026332|Ga0209803_1039098All Organisms → cellular organisms → Archaea2173Open in IMG/M
3300026333|Ga0209158_1005438All Organisms → cellular organisms → Archaea6911Open in IMG/M
3300026335|Ga0209804_1054930All Organisms → cellular organisms → Archaea1930Open in IMG/M
3300026342|Ga0209057_1012312All Organisms → cellular organisms → Archaea5230Open in IMG/M
3300026524|Ga0209690_1205407All Organisms → cellular organisms → Archaea625Open in IMG/M
3300026528|Ga0209378_1001135All Organisms → cellular organisms → Archaea19155Open in IMG/M
3300026529|Ga0209806_1019319All Organisms → cellular organisms → Archaea3516Open in IMG/M
3300026537|Ga0209157_1026381All Organisms → cellular organisms → Archaea3453Open in IMG/M
3300026537|Ga0209157_1056257All Organisms → cellular organisms → Archaea2052Open in IMG/M
3300026538|Ga0209056_10014608All Organisms → cellular organisms → Archaea7827Open in IMG/M
3300026540|Ga0209376_1017523All Organisms → cellular organisms → Archaea4885Open in IMG/M
3300027748|Ga0209689_1048711All Organisms → cellular organisms → Archaea2389Open in IMG/M
3300027862|Ga0209701_10113258All Organisms → cellular organisms → Archaea1685Open in IMG/M
3300027875|Ga0209283_10037791All Organisms → cellular organisms → Archaea3035Open in IMG/M
3300027882|Ga0209590_10100110All Organisms → cellular organisms → Archaea1730Open in IMG/M
3300028536|Ga0137415_10418805All Organisms → cellular organisms → Archaea1145Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil38.68%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil27.36%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil23.58%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil9.43%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.94%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_10000991113300002558Grasslands SoilMRARTDFSVPASYLLLPLASYLSWALFMVAWWGAGAGLGTGDLTLAVSELGIVGLVASAAASYVVYLVMSRANNHSSRTRALLWKAVGELQSRTGATGQXAMLPXSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVTALWFLSRELAKHARLEELVLEDVDRTLKATGLQGASVRGAPVASRDILGVSVAIVSTIELLSSFLLGPAGGLVLIYLTVGAFSLVWLDLAIRDPTVHFSFHSQFEPDILRSLPDTFAGISNVGAG*
JGI25385J37094_1001682833300002558Grasslands SoilGLAASAAASYVVYTLMNRANKHFSRTRALLCRAIDELHSRIGTAGHGALLPLSSADESLYKLSRGEHERSAVMWALLASIPVIGGMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLVWLDLSIRDPTVHFSFHSQFESDILRSLPDAVGEASNVGAG*
JGI25385J37094_1006912513300002558Grasslands SoilMRPRTDLAVPVSWLLLPLTSYLLWAIFAVAWWAAGAGLGTSDLALVVSGLGIVGLAASAAASYVVYTLMNRANEHSSRTRAVLWSALSELESRIGTTRQEALLPLTSAEEGFYXLSRGEHERSAVLWALLASIPVIGWIFLVAALWFLSRDFAKHSRLEELVLEDLDRTMRGAGLQGVSVRHAPIGARDVLGIAVVTVLLVELLSVFLLGLAGCLVLIYLTVGAFSLVWLDLSMRDPAPHFTFHSQFEPEILRALPGATAKAGTVGGA*
JGI25383J37093_1000031763300002560Grasslands SoilMLAQMRARTDFSVPASYLLLPLASYLSWALFMVAWWGAGAGLGTGDLTLAVSELGIVGLVASAAASYVVYLVMSRANNHSSRTRALLWKAVGELQSRTGATGQEAMLPLSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVTALWFLSRELAKHARLEELVLEDVDRTLKATGLQGASVRGAPVASRDILGVSVAIVSTIELLSSFLLGPAGGLVLIYLTVGAFSLVWLDLAIRDPTVHFSFHSQFEPDILRSLPDTFAGISNVGAG*
JGI25384J37096_1012029623300002561Grasslands SoilLLLPLASYLLWAVFVVAWWGAGAGLRTGDLALVVSVLGIVGLAASAAASYVVYTLMNRANKHFSRTRALLCRAIDELHSRIGTAGHGALLPLSSADESLYKLSRGEHERSAVMWALLASIPVIGGMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLVWLDLSIRDPTVHFSFHSQFESDILRSLPDAVGEASNVGAG*
JGI25382J43887_10000064183300002908Grasslands SoilVTLSDTSRPEIGVSSLVQMRARTDVAVPASYLLLPLASYLLWAVFVVAWWGAGAGLRTGDLALVVSVLGIVGLAASAAASYVVYTLMNRANKHFSRTRALLCRAIDELHSRIGTAGHGALLPLSSADESLYKLSRGEHERSAVMWALLASIPVIGGMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLVWLDLSXRDPTVHFSFHSQFESDILRSLPDAVGEASNVGAG*
JGI25382J43887_1000397323300002908Grasslands SoilMRARTDFSVPASYLLLPLASYLSWALFMVAWWGAGAGLGTGDLTLAVSELGIVGLVASAAASYVVYLVMSRANNHSSRTRALLWKAVGELQSRTGATGQGAMLPLSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVTALWFLSRELAKHARLEELVLEDVDRTLKATGLQGASVRGAPVASRDILGVSVAIVSTIELLSSFLLGPAGGLVLIYLTVGAFSLVWLDLAIRDPTVHFSFHSQFEPDILRSLPDTFAGISNVGAG*
JGI25382J43887_1001351623300002908Grasslands SoilMSLSMLVQMRPRTDXXVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSVLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLGNAISGLESRIGTAGGGALLPLTSAEEDFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAMKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADAGTGGVA*
JGI25386J43895_1001023723300002912Grasslands SoilMRARTDFSVPASYLLLPLASYLSWALFMVAWWGAGAGLGTGDLTLAVSELGIVGLVASAAASYVVYLVMSRANNHSSRTRALLWKAVGELQSRTGATGQEAMLPLSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVTALWFLSRELAKHARLEELVLEDVDRTLKATGLQGASVRGAPVASRDILGVSVAIVSTIELLSSFLLGPAGGLVLIYLTVGAFSLVWLDLAIRDPTVHFSFHSQFEPDILRSLPDTFAGISNVGAG*
JGI25386J43895_1004438823300002912Grasslands SoilMRARTDVAVPASYLLLPLASYLLWAVFVVAWWGAGAGLRTGDLALVVSVLGIVGLAASAAASYVVYTLMNRANKHFSRTRALLCRAIDELHSRIGTAGHGALLPLSSADESLYKLSRGEHERSAVMWALLASIPVIGGMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLVWLDLSIRDPTVHFSFHSQFESDILRSLPDAVGEASNVGAG*
JGI25389J43894_105380713300002916Grasslands SoilYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASAPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQ
Ga0066672_1030620523300005167SoilMRPRTDIVIPVSWLLLPLASYLLWAVFAVAWWVAGAGVETSNLTLVVSGLGIVGLAASAAASYVVFRLLNRANLHSSRSRALLWNAISGLESRVGTAGQGALLPLSSAEEDLYRLFHGDRESSAVLWALLASIPAIGWIFLVAALWFLSRHLAKHNRLEGLVLEDVDRTMRGAGLQGITVKHSPVGARDVLGIAVVVVSLVELLSVFLLGLAGGLVLIYLTVGASSLIWLDLSIRD
Ga0066680_1005482213300005174SoilPSREVRFSTLVQMRTTTDLAVSVSWLLMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASYVVFTLVNRLNMHSSRTRALFWNTISELESRIGTVGQSALLPLSSAQEGFHRLSRGEHERSAVLWALLASVPIIGWIFLLAALWYLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVRVE*
Ga0066680_1009344013300005174SoilVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSVLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLGNAISGLESRIGTAGGGALLPLTSAEEDFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAMKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADARTGGVA*
Ga0066679_1003908223300005176SoilVAFSDSSSGEMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVGVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA*
Ga0066690_1005828123300005177SoilMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVSVLGIVGLAASVAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQEALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0066685_1092664413300005180SoilYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTLAEEGFYRLSRGEHDRSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHSRLEELVLEDVDRTMKGAGLQGVSVRSTPVGSHDILGAAVAIASVIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSL
Ga0066676_1012792313300005186SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVAWWAAGAGLGNGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLD
Ga0066675_1050220323300005187SoilLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA*
Ga0066686_1013012923300005446SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVIFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA*
Ga0066689_1007733923300005447SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0066697_1003879623300005540SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVAWWAAGAGLGNGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDATATADNVGGA*
Ga0066701_1015679523300005552SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSLLGIVGLAASAAASFVVYTLVNRVNLHSGRIRALLWSAISGLESRIGTAGGGALLPLTSAEEGFYKLSRGEHERSAVLWALVVLVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAMKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADAGTGGVA*
Ga0066692_1002788833300005555SoilMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASYVVFTLVNRLNMHSSRTRALFWNTISELESRIGTVGQSALLPLSSAQEGFHRLSRGEHERSAVLWALLASVPIIGWIFLLAALWYLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVRVE*
Ga0066692_1015471823300005555SoilMRTRTDVAVPASYLLLPLASYLLWAVFVVAWWGAGAGLRTGDLALVVSVLGIVGLAASAAASYVVYTLMNRANKHFSRTRALLCRAIDELHSRIGTAGHGALLPLSSADESLYKLSRGEHERSAVMWALLASIPVIGGMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLVWLDLSIRDPTVHFSFHSQFESDILRSLPDAVGGASNVGAG*
Ga0066707_1001707733300005556SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEALYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0066700_1005407713300005559SoilRTRSWPSFLQPFRLLRQFRLRRRARLNYSNLGVRVAFPDSSSREMSLLTLVRMQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0066703_1002734323300005568SoilMGLLTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGVGTEDLALVFSVLGMAGLAASAAASHLVYALVNRGNLHSSRTRALLWNVISGLESRIGTAGQGALLSLNSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLFAALWFLSRDFAKHSRLEGPVLEDMDRTMKGAGSQGVSVRSTPVGSHDVLGAAVAIASVIELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSSHSQFEPEILRSLPDSVADSGTGGGA*
Ga0066703_1020648023300005568SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0066691_1003445323300005586SoilMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVSVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDMTMKGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA*
Ga0066691_1015493223300005586SoilVTLSDTSRPEIGVSSLVQMRTRTDVAVPASYLLLPLASYLLWAVFVVAWWGAGAGLGTDDLALVVNVLGIVGLAASAAASYVVYTLMNRANKHFSRTQALLGKAIDELQSRIGTAGHGALLPLSSADEGLYKLSRGEHERSAVLWALLASIPVIGWMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLV
Ga0066658_1000047173300006794SoilLVQMRPRTDIVIPVSWLLLPLASYLLWAVFAVAWWVAGAGVETSNLTLVVSGLGIVGLAASAAASYVVFRLVNRANLHSSRSRALLWNAISGLESRVGTAGQGALLPLSSAEENLYRLFHGDRESSAVLWALLASIPAIGWIFLVAALWFLSRHLAKHNRLEGLVLEDVDRTMRGAGLQGITVKHSPVGARDVLGIAVVVVSLVELLSVFLLGLAGGLVLIYLTVGASSLIWLDLSIRDPTFHFSSHSQFEPEILRSLPDATVGGGSIGAA*
Ga0066665_1003626223300006796SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0066659_1004709623300006797SoilMQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEALYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVAEAGTGGVA*
Ga0066659_1042898713300006797SoilWPRFLQPLRLRRRFRPRQRARLNHSNCGVRVAFSDSSSSEMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVSVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA*
Ga0066710_10046008123300009012Grasslands SoilMGLLTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGVGTEDLALVFSVLGMAGLAASAAASHLVYALVNRGNLHSSRTRALLWNVISGLESRIGTAGQGALLSLNSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLFAALWFLSRDFAKHSRLEGPVLEDMDRTMKGAGSQGVSVRSTPVGSHDVLGAAVAIASVIELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADSGTGGGA
Ga0099830_1002395033300009088Vadose Zone SoilMLPLTSYLLWAVFLVAWWAAGAGLGTSNLTFVFSGLGIVGLAASAAASYMVYTLVNRVNKHSSRTRALLSTAISELELRIGTARREALLPLNSAEDDLYRLSRGEHERSAVLWALLASIPIIGWIFLVAALWFLSRDFAKHTRLEELVLEDVDRAMKGAGLQGISVKHAPVDSRDFLGIAVVVASLVELLSVLPLGPTGSFVLIYLTVGAFSLVWLDLSMRDPTPHFAFHSQFEPEILRSLPDAAGARNVGAS*
Ga0099830_1072106213300009088Vadose Zone SoilSVSWLLMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASFVVFTLVNRENMHSSRTRALFWNTISELESRIGTAGQSALLPLSSAEEGFHRLSRGEHERSAVLWALLASVPIIGWTFLLAALWFLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDTSAGADGVGAV*
Ga0099828_1001164523300009089Vadose Zone SoilMRPRTDIAVPVSLLMLPLTSYLLWAVFLVAWWAAGAGLGTSNLTFVFSGLGIVGLAASAAASYMVYTLVNRVNKHSSRTSALLSTAISELELRIGTARREALLPLNSAEDDLYRLSRGEHERSAVLWALLASIPIIGWIFLVAALWFLSRDFAKHTRLEELVLEDVDRAMKGAGLQGISVKHAPVDSRDFLGIAVVVASLVELLSVLPLGPTGSFVLIYLTVGAFSLVWLDLSMRDPTPHFAFHSQFEPEILRSLPDAAGARNVGAS*
Ga0099827_1002827823300009090Vadose Zone SoilMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASFVVFTLVNRENMHSSRTRALFWNTISELESRIGTAGKSALLPLSSAEEGFHRLSRGEHERSAVLWALLASVPIIGWIFLLAALWFLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVGVE*
Ga0099827_1029717713300009090Vadose Zone SoilMRPRTDLAVPVSWLLLPLASYLLWAVFAVAWWAAGAGLGTSDLALVVSGLGIVGLAASAAASYVVYTLVNRANKHSTRTRILLWTAMSELESRIGTTSQEALLPLASAEEGFYRLSRGGHERSAVLWALLASIPVIGWVFLVAGLCLVSRDLAKHSRLEELVLEDVARTMRGVGVQGIAVRRAPIGSHDILGLGLVIVSLIEFFSVFLLGLAGSVVLIYLTVGAFSLLWLDLS
Ga0066709_10027034523300009137Grasslands SoilMDLLTLVQMRPRTDLVVPVSWLLLPLASYLLWAVLVVSWWAAGAGMGTEDLALVFSVLGMAGLAASAAASHLVYALVNRGNLHSSRTRALLWNVISGLESRIGTAGQGALLSLNSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLFAALWFLSRDFAKHSRLEGPVLEDMDRTMKGAGSQGVSVRSTPVGSHDVLGAAVAIASVIELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSSHSQFEPEILRSLPDSVADSGTGGGA*
Ga0134082_1005165723300010303Grasslands SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPGGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA*
Ga0134063_1016678113300010335Grasslands SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGHGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIGLDLSIRDPISHFSYHSQFEPDILRSLPDSVADAGTGGGA*
Ga0134063_1036064713300010335Grasslands SoilVLGIVGLAASAAASYLVYALLNRASSHFGRTRALLWNAIGGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMRGSGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLVWLDLSIRDPISHFSFHSQLEPDILRSLPDSVSGAATGGAA*
Ga0137393_1043568513300011271Vadose Zone SoilMRPRTDLAVPVSWLLLPLASYLLWAIFAVAWWAAGAGLGTGDLALVVSGLGIVGLAASAAASYVVYTLMNRATEHSSRTRALLWNALSELESRIGTTRQEALLPLTSAQEGFYRLSRGEHERSAVLWALLASVPIIGWIFLLAALWFLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVGVE*
Ga0137388_1023437313300012189Vadose Zone SoilRPRTDIAVPVSLLMLPLTSYLLWAVFLVAWWAAGAGLGTSNLTFVFSGLGIVGLAASAAASYMVYTLVDRVNKHSSRTRALLSTAISELELRIGTARREALLPLNSAEDDLYRLSRGEHERSAVLWALLASIPIIGWVCLVAALWFLSRDIAKHTRLEGLVLEDMDRAMKGAGLQGISVRYAQVGSRDFLGIAVVIASLVELLSVFPLGPAGSLVLIYLTVGALSLVWLDLSMRDPAPHFALHSQFEPEILRSLPDATGAGNVGGS*
Ga0137383_1000517133300012199Vadose Zone SoilMRTTTDLAVSVSWLLMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASHVVFTLVNRLNMHSTRTRALFRKTISELESRIGTAGQSALLPLSSAQEGFHRLSRGERERSAVLWALLASVPIIGWIFLLAALWFLSQDMAKHNRLEELVLEDVDRTMKGAGLQGVSAKHAPIGSRDVLGIAVVVVSLVELLSVFLLGIAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVRGE*
Ga0137365_1006292033300012201Vadose Zone SoilMNLSTLVQTRPRTDLVVPVSWLLLPLASYLLWTVFVVSWWAAGAGLGTGDLALVVSVLGIVGLGASAAGSYLVYALLNRANLHSSRTRELLWNAISGLESRIGTAGQGVLLPLTMAEEGFYRLSRGEHERSAVLWALLASVPIIGWIFLVAALWFLSRDFAKHGRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPTSHFSSHSQFEPDILRSLPDAVAGAGTGGGA*
Ga0137374_1005877533300012204Vadose Zone SoilMNLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTDDLALVVSVLGIVGLGASAAGSYLVYALLNRANLHSSRTRELLWNAISGLESRIGTAGQGVLLPLTTAEEGFYRLSRGEHERSAVLWALLASVPIIGWIFLVAALWFLSRDFAKHGRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPTSHFSSHSQFEPDILRSLPDAVAGAGTGGGA*
Ga0137380_1001481853300012206Vadose Zone SoilMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASHVVFTLVNRLNMHSTRTRALFRKTISELESRIGTAGQSALLPLSSAQEGFHRLSRGERERSAVLWALLASVPIIGWIFLLAALWFLSQDMAKHNRLEELVLEDVDRTMKGAGLQGVSAKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVRGE*
Ga0137380_1020092923300012206Vadose Zone SoilMRPRTDLAVPVSWLLLPLTSYLLWAIFAVAWWADGAGLGTSDLALVVSGLGIVGLAASAAASYVVYTLMNRANEHSSRTRALLWSALSELESRIGTTRQEALLPLTSAEEGFYRLSRGEHARSAVLWALLASIPVIGWIFLVAALWFLSRDFAKHSRLEELVLEDLDRTMKGAGLQGVSVRHAPIGARDVLGIAVVTVLLVELLSVFLLGLAGCLVLIYLTVGAFSLVWLDLSMRDPTPHFTFHSQFEPEILRALPGATAKAGAVGGA*
Ga0137380_1082576023300012206Vadose Zone SoilSFVVYTLVNRANLHSSRIRALLWNAISGLESRIGTAGGGALLPLTSAEESFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAIKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVAEAGNGGVA*
Ga0137381_1016794613300012207Vadose Zone SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSLLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLWNAISGLESRIGTAGGGALLPLTSAEESFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAIKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLWPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADAGTGGVA*
Ga0137379_1002895373300012209Vadose Zone SoilMRARTDVSVPASYLLLPLGSYLLWAIFVVAWWGAGAGLGTGDLTLVVSGLGIVGLVASTAASYVIYLVMSRANRHSSRTRALLWTTVGELQSSPGTGGQAALLPLSSAEDGLYRLSRGEHERSAMLWSLLALIPVIGWIFLVVALWLISRDFTRHARLEELVLEDIDRSMKGTGLQGTSIGHGSVASRDILGVSVVIVSLIELLSAFLLGPAGCLVLVDLTVGAFSLVWLDLSIRDPTVHFSFHSQFESDILRSLPDAVGGASNVGAG*
Ga0137379_1006416323300012209Vadose Zone SoilMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSLLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLWNAISGLESRIGTAGGGALLPLTSAEESFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAIKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVAEAGNGGVA*
Ga0137372_1003491853300012350Vadose Zone SoilMNLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLGASAAGSYLVYALLNRANLHSSRTRELLWNAISGLESRIGTAGQGVLLPLTMAEEGFYRLSRGEHERSAVLWALLASVPIIGWIFLVAALWFLSRDFAKHGRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPTSHFSSHSQFEPDILRSLPDAVAGAGTGGGA*
Ga0137386_1030329513300012351Vadose Zone SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSLLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLWNAISGLESRIGTAGGGALLPLTSAEESFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAIKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVAEAGNGGVA*
Ga0137367_1006157223300012353Vadose Zone SoilMNLSTLVQTRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTDDLALVVSVLGIVGLGASAAGSYLVYALLNRANLHSSRTRELLWNAISGLESRIGTAGQGVLLPLTTAEEGFYRLSRGEHERSAVLWALLASVPIIGWIFLVAALWFLSRDFAKHGRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPTSHFSSHSQFEPDILRSLPDAVAGAGTGGGA*
Ga0137368_1012752013300012358Vadose Zone SoilLVYALLNRANLHSSRTRELLWNAISGLESRIGTAGQGVLLPLTTAEEGFYRLSRGEHERSAVLWALLASVPIIGWIFLVAALWFLSRDFAKHGRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPTSHFSSHSQFEPDILRSLPDAVAGAGTGGGA*
Ga0137390_1017649023300012363Vadose Zone SoilSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASFVVFTLVNRENMHSSRTRALFWNTISELESRIGTAGQSALLPLSSAEEGFHMLSRGEHERSAVLWALLASVPIIGWIFLLAALWFLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLVLIYLTVGAFSLVWLDLSMRDPTPHFAFHSLFEPEILRSLLSATARAGIVGGA*
Ga0137373_1027355713300012532Vadose Zone SoilMSLSTLVQMRSRTDLVVPVSWLLLPLASYLLWAVFVVAWWVAGTGVGMSNLTLVVRGLGIVGLAASAAASYVVFTLVNRVNRHSSRTRALLWSAISELESKIGTAGHGAMLPLSSAEEGFHRLSHEEHERSAVLWALLASVPIVGWIFLVAALWFLSRDVSKHIRLEELVLEDVDRAMKGTGLQGTSVRHAPVEASDVLGVAVVFVSLVELLSVFLLGPAGCLVLIYLTVGAFSLVLLDLSMRDPIPHFAFHSQFEP
Ga0137397_1044234723300012685Vadose Zone SoilMRSRTDFDIPVSWLLLPLASYLLWAIFVVAWWAAGAGIGTSSLTLVVNGLGILGLATSAAASRIVYTLVNRANNHSGRTRALLSTAISELESKLGTTSRGPLLSLNSAEDGLYKLSRGEHERSAVLWALLASIPVTGWIFLIAELWFLSRDFAKHSRLEESVLEDVDRAMKGSGLPGVSVKHAPVASHGLLGIVAVVAVLIELFSMFVLGPVGCLVLIYLTVGAFALVWIDLSVRDPTAHFAFHSQFEPDVLRSLPDAAVGAASVGGA*
Ga0137396_1000697993300012918Vadose Zone SoilMRPRTDLVVPVSWLLLPLASYLLWAIFVVAWWATGAGIETSNLTLVVSGLGIVGLATSAAASYVVFTLVNRENMHSSRTRALLWNTISDLQSRIGTAGQSALLPLSSAEEGFHKLSRGEHERSAVLWALLASIPIIGWIFLVAALWFLSRDMARHSRLEELVLEDVDRTLRGAGLQGISVRPAPVGSRDVLGIAAVVVSLVELLSVFLLGPVGCLVLIYLTVGAFSLVWLDLSMSDPTSHFAFHSQFEQEILRSLPDSTIKAGTIGGG*
Ga0137396_1011939923300012918Vadose Zone SoilMRSRTDFDIPVSWLLLPLASYLLWAIFVVAWWAAGAGIGTSSLTLVVNGLGILGLATSAAASRIVYTLVNRANNHSGRTRALLSTAISELESKIGTTSQGPLLSLNSAEDGLYKLSRGEHERSAVLWALLASIPVIGWIFLIAELWFLSRDFAKHSRLEESVLEDVDRAMKGSGLPGVSVKHAPVASHGVLGIVAVVAVLIELFSMFVLGPVGCLVLIYLTVGAFALVWIDLSVRDPTAHFAFHSQFEPDVLRSLPDAAVGAASVGGA*
Ga0134087_1006318713300012977Grasslands SoilNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT*
Ga0134081_1002026723300014150Grasslands SoilMRPGTDLVVPVSWLLLPLASYLLWAVFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLTELLSVFLLGPAGGLVLIYLTVGAFSLIGLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA*
Ga0134075_1023462413300014154Grasslands SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVNRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSL
Ga0134089_1017614613300015358Grasslands SoilASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA*
Ga0134085_1007084923300015359Grasslands SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAIFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWIAISGLESRIGTAGQGALLPLSSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA*
Ga0134112_1016400613300017656Grasslands SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIGLDLSIRDPISHFSYHSQ
Ga0134074_109635913300017657Grasslands SoilVIFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGV
Ga0066662_1003029833300018468Grasslands SoilMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASYVVFTLVNRLNMHSSRTRALFWNTISELESRIGTVGQSALLPLSSAQEGFHRLSRGEHERSAVLWALLASVPIIGWIFLLAALWYLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVRVE
Ga0215015_1041557623300021046SoilMRSRTDLAVSVSWLVLPLASYLLWAAFVVAWWAAGAGLGTGTLTFVVGGFGIVGLATSAAASYVVYALVDRANKHSSRTRALLWKGVSELESRSRMTGQEALLPLSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVVALWLLSRDFARHSRLEEFVLEDIDRTMKGAGIQGISVKHARVGSHDMLGTIVVIAAVIELLSVFLLGPVGCFVLVYLTVGAFSLVWIDLSIRDPLAHFSFHSQFEPDLLRSLPDATAGAGNLGGA
Ga0209234_100625463300026295Grasslands SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASAPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFCYHSQLEPDILRSLPDSVAEAGTGGVA
Ga0209235_1000249363300026296Grasslands SoilMLAQMRARTDFSVPASYLLLPLASYLSWALFMVAWWGAGAGLGTGDLTLAVSELGIVGLVASAAASYVVYLVMSRANNHSSRTRALLWKAVGELQSRTGATGQEAMLPLSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVTALWFLSRELAKHARLEELVLEDVDRTLKATGLQGASVRGAPVASRDILGVSVAIVSTIELLSSFLLGPAGGLVLIYLTVGAFSLVWLDLAIRDPTVHFSFHSQFEPDILRSLPDTFAGISNVGAG
Ga0209235_107670623300026296Grasslands SoilMRPRTDLAVPVSWLLLPLTSYLLWAIFAVAWWAAGAGLGTSDLALVVSGLGIVGLAASAAASYVVYTLMNRANEHSSRTRAVLWSALSELESRIGTTRQEALLPLTSAEEGFYKLSRGEHERSAVLWALLASIPVIGWIFLVAALWFLSRDFAKHSRLEELVLEDLDRTMRGAGLQGVSVRHAPIGARDVLGIAVVTVLLVELLSVFLLGLAGCLVLIYLTVGAFSLVWLDLSMRDPAPHFTFHSQFEPEILRALPGATAKAGTVGGA
Ga0209235_119171213300026296Grasslands SoilNLGVRVAFPDSSSREMSLLTLVRMQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPAGGLVLIYLTV
Ga0209237_1003295113300026297Grasslands SoilMRARTDVAVPASYLLLPLASYLLWAVFVVAWWGAGAGLRTGDLALVVSVLGIVGLAASAAASYVVYTLMNRANKHFSRTRALLCRAIDELHSRIGTAGHGALLPLSSADESLYKLSRGEHERSAVMWALLASIPVIGGMFLVAALWLVSRDFTKHARLEELVLEDVDRSMKGNGLQGISVRHASVASRDILGVLVSIVSLIELLSAFLLGPAGCLVLIYLTVGAFSLVWLDLSIRDPTVHFSFHSQFESDILRSLPDAVGEASNVGAG
Ga0209237_103021823300026297Grasslands SoilMRARTDFSVPASYLLLPLASYLSWALFMVAWWGAGAGLGTGDLTLAVSELGIVGLVASAAASYVVYLVMSRANNHSSRTRALLWKAVGELQSRTGATGQEAMLPLSSAEEGLYRLSRGEHERSAVLWALLASIPVVGWIFLVTALWFLSRELAKHARLEELVLEDVDRTLKATGLQGASVRGAPVASRDILGVSVAIVSTIELLSSFLLGPAGGLVLIYLTVGAFSLVWLDLAIRDPTVHFSFHSQFEPDILRSLPDTFAGISNVGAG
Ga0209237_104324833300026297Grasslands SoilMLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSVLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLGNAISGLESRIGTAGGGALLPLTSAEEDFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAMKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADARTGGVA
Ga0209236_100402473300026298Grasslands SoilVALSDSSGREMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIGLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA
Ga0209236_101331123300026298Grasslands SoilMRPRTDLAVPVSWLLLPLTSYLLWAIFAVAWWAAGAGLGTSDLALVVSGLGIVGLAASAAASYVVYTLMNRANEHSSRTRALLWSALSELESRIGTTRQEALLPLTSAEEGFYRLSRGEHERSAVLWALLASIPVIGWIFLVAALWFLSRDFAKHSRLEELVLEDLDRTMRGAGLQGVSVRHAPIGARDVLGIAVVTVLLVELLSVFLLGLAGCLVLIYLTVGAFSLVWLDLSMRDPAPHFTFHSQFEPEILRALPGATAKAGTVGGA
Ga0209238_100024743300026301Grasslands SoilLVRLQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASAPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFCYHSQLEPDILRSLPDSVAEAGTGGVA
Ga0209761_101721773300026313Grasslands SoilVALSDSSGREMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVAEA
Ga0209154_111686323300026317SoilVRVAFSDSSSGEMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVGVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMRGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA
Ga0209471_102104623300026318SoilMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVGVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMRGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA
Ga0209152_1000084033300026325SoilMRPRTDIVIPVSWLLLPLASYLLWAVFAVAWWVAGAGVETSNLTLVVSGLGIVGLAASAAASYVVFRLVNRANLHSSRSRALLWNAISGLESRVGTAGQGALLPLSSAEENLYRLFHGDRESSAVLWALLASIPAIGWIFLVAALWFLSRHLAKHNRLEGLVLEDVDRTMRGAGLQGITVKHSPVGARDVLGIAVVVVSLVELLSVFLLGLAGGLVLIYLTVGASSLIWLDLSIRDPTFHFSSHSQFEPEILRSLPDATVGGGSIGAA
Ga0209802_1000465463300026328SoilVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASYVVFTLVNRLNMHSSRTRALFWNTISELESRIGTVGQSALLPLSSAQEGFHRLSRGEHERSAVLWALLASVPIIGWIFLLAALWYLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVRV
Ga0209802_103293813300026328SoilQMRPRTDIAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLAIVVSVLGIVGLAASAAASFVVYTLVNRANLHSSRIRALLGNAISGLESRIGTAGGGALLPLTSAEEDFYKLSRGEHERSAVLWALVASVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAMKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADARTGGVA
Ga0209803_103909833300026332SoilMQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLS
Ga0209158_100543873300026333SoilMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVGVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDMTMKGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA
Ga0209804_105493023300026335SoilVRVAFSDSSSGEMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVSVLGIVGLAASVAASYLVYALLNTASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVDRTMKGAGLQGVSVRNVPVGSRDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPNSHFSFHSQLEPDILRSLPDSVSGAGTEGVA
Ga0209057_101231243300026342SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVVAWWAAGAGLGNGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDATATADNVGGA
Ga0209690_120540713300026524SoilNRVNLHSGRIRALLWSAISGLESRIGTAGGGALLPLTSAEEGFYKLSRGEHERSAVLWALVVLVPIIGWTFLIAALWFLSRDFAKHSRLEELVLEDVDRAMKEAGLQGVSVRNTPVGSHDVLGAAVVIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADAGTGGVA
Ga0209378_1001135173300026528SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEALYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVAEAGTGGVT
Ga0209806_101931953300026529SoilMSLSTLVQIRPRTDLAVPVSWLLLPLASYLLWAVFVVSWWAAGAGLGTGDLALAVGVLGIVGLAASAAASYLVYALLNRASLHFGRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT
Ga0209157_102638123300026537SoilMSLSTLVQMRPRTDLVVPVSWLLLPLASYLLWAVFVIFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDMDRTMKGAGLQGVSVRTTPVGSRDVLVAAVAIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVADVGTGGVA
Ga0209157_105625723300026537SoilPLASYLLWAVFVVAWWAAGAGLGNGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTSAEEGFYRLSRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLEDVNRTMKGAGLQGVSVRNTPIGSRDVLGAAVIIASLTELLSVFLLGPAGGLVLIYLTVGASSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDATATADNVGGA
Ga0209056_1001460833300026538SoilMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATVAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSVAEAGTGGVT
Ga0209376_101752343300026540SoilMILSTLVQMRPRTDLVIPVSWLLLPLASYLLWAVFVVFWWAGGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALVNRANLHSSRTRALLWNAISGLESRIGTAGQGALLPLTLAEEGFYRLSRGEHDRSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHSRLEELVLEDVDRTMKGAGLQGVSVRSTPVGSHDILGAAVAIASVIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSSHSQFEPDILRSLPDSVADAGTGGGA
Ga0209689_104871123300027748SoilGVRVAFPDSSSREMSLLTLVRMQPRTDLVVPVSWLLLPLASYLMWAVFVVSWWAAGAGLGTGDLALVVSVLGIVGLAASAAASYLVYALLNRANLHSSRTRALLWNLISGLESRIGTAGQGALLPLTSAEEGLYKLFRGEHERSAVLWALLASVPIVGWIFLVAALWFLSRDFAKHCRLEELVLDDVDRTMKGAGLQGVSVRTTPVGSRDVLVATIAIASLIELLSVFLLGPAGGLVLIYLTVGAFSLIWLDLSIRDPISHFSYHSQFEPDILRSLPDSLAEAGTGGVT
Ga0209701_1011325813300027862Vadose Zone SoilMLPLTSYLLWAVFLVAWWAAGAGLGTSNLTFVFSGLGIVGLAASAAASYMVYTLVNRVNKHSSRTRALLSTAISELELRIGTARREALLPLNSAEDDLYRLSRGEHERSAVLWALLASIPIIGWIFLVAALWFLSRDFAKHTRLEELVLEDVDRAMKGAGLQGISVKHAPVDSRDFLGIAVVVASLVELLSVLPLGPTGSFVLIYLTVGAFSLVWLDLSMRDPTPHFAFHAQFEPEILRSLPDTSAGADGVGAV
Ga0209283_1003779133300027875Vadose Zone SoilMLPLTSYLLWAVFLVAWWAAGAGLGTSNLTFVFSGLGIVGLAASAAASYMVYTLVNRVNKHSSRTSALLSTAISELELRIGTARREALLPLNSAEDDLYRLSRGEHERSAVLWALLASIPIIGWIFLVAALWFLSRDFAKHTRLEELVLEDVDRAMKGAGLQGISVKHAPVDSRDFLGIAVVVASLVELLSVLPLGPTGSFVLIYLTVGAFSLVWLDLSMRDPTPHFAFHSQFEPEILRSLPDAAGARNVGAS
Ga0209590_1010011023300027882Vadose Zone SoilMRTRTDLAVSVSWLLMPLMSYLLWAGFVVAWWAAGAGVGTINLTLVVSGLGIVGLAASAAASFVVFTLVNRENMHSSRTRALFWNTISELESRIGTAGQSALLPLSSAEEGFHRLSRGEHERSAVLWALLASVPIIGWIFLLAALWFLSRDMAKHNRLEELVLEDVDRTMKGAGLQGVSVKHAPIGSRDVLGIAVVVVSLVELLSVFLLGLAGCLTLIYLTVGAFSLVWLDLSMRDPIPHFVFHSQFEPEILRSLPDASAKAGAVGVE
Ga0137415_1041880523300028536Vadose Zone SoilPPTSVSRFSSVVYVPVEKSKYAIVRLLLPLASYLLWAIFVVAWWAAGAGIGTSSLTLVVNGLGILGLATSAAASRIVYTLVNRANNHSGRTRALLSTAISELESKIGTTSQGPLLSLNSAEDGLYKLSRGEHERSAVLWALLASIPVTGWIFLIAELWFLSRDFAKHSRLEESVLEDVDRAMKGSGLPGVSVKHAPVASHGLLGIVAVVAVLIELFSMFVLGPVGCLVLIYLTVGAFSLVWLDLSMRDPTPHFAFHSQFEPEILRSLPDTAAKAGAVGVG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.