NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F099102

Metagenome / Metatranscriptome Family F099102

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F099102
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 243 residues
Representative Sequence MFASEVICVTAIVLEVCMLLLLLRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIDLNRWTQDWNRTISTARTIIRP
Number of Associated Samples 74
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 6.80 %
% of genes near scaffold ends (potentially truncated) 36.89 %
% of genes from short scaffolds (< 2000 bps) 65.05 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score0.55

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(28.155 % of family members)
Environment Ontology (ENVO) Unclassified
(25.243 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(80.583 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 72.10%    β-sheet: 0.72%    Coil/Unstructured: 27.17%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.55
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF00158Sigma54_activat 16.50
PF10771DUF2582 6.80
PF02954HTH_8 5.83
PF12844HTH_19 1.94
PF00108Thiolase_N 1.94
PF02863Arg_repressor_C 0.97
PF02803Thiolase_C 0.97
PF13620CarboxypepD_reg 0.97
PF03848TehB 0.97
PF13442Cytochrome_CBB3 0.97
PF13560HTH_31 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG0183Acetyl-CoA acetyltransferaseLipid transport and metabolism [I] 2.91
COG1438Arginine repressorTranscription [K] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001167|JGI12673J13574_1005539All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium739Open in IMG/M
3300002245|JGIcombinedJ26739_100107458All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2618Open in IMG/M
3300002558|JGI25385J37094_10126018All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium719Open in IMG/M
3300004099|Ga0058900_1419662All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2115Open in IMG/M
3300004101|Ga0058896_1007388All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2054Open in IMG/M
3300004102|Ga0058888_1414748All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium922Open in IMG/M
3300004103|Ga0058903_1503179All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1088Open in IMG/M
3300004119|Ga0058887_1013485All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1052Open in IMG/M
3300004139|Ga0058897_11164250All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1107Open in IMG/M
3300004631|Ga0058899_10006272All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1024Open in IMG/M
3300004631|Ga0058899_11972737All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium810Open in IMG/M
3300004631|Ga0058899_12268499All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium883Open in IMG/M
3300005174|Ga0066680_10012819All Organisms → cellular organisms → Bacteria4371Open in IMG/M
3300005174|Ga0066680_10108974All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1702Open in IMG/M
3300005174|Ga0066680_10165218All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1390Open in IMG/M
3300005176|Ga0066679_10755064All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium625Open in IMG/M
3300005454|Ga0066687_10226290All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1033Open in IMG/M
3300005531|Ga0070738_10049035All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2665Open in IMG/M
3300005555|Ga0066692_10025726All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3052Open in IMG/M
3300005555|Ga0066692_10086744All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1829Open in IMG/M
3300005557|Ga0066704_10100306All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1900Open in IMG/M
3300005557|Ga0066704_10190088All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1386Open in IMG/M
3300005557|Ga0066704_10306438All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1072Open in IMG/M
3300005568|Ga0066703_10134295All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1478Open in IMG/M
3300005568|Ga0066703_10192292All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1236Open in IMG/M
3300006755|Ga0079222_10106996All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1486Open in IMG/M
3300006797|Ga0066659_10246198All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1336Open in IMG/M
3300006806|Ga0079220_10090063All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1564Open in IMG/M
3300011120|Ga0150983_11337172All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium886Open in IMG/M
3300011120|Ga0150983_14798902All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2076Open in IMG/M
3300011271|Ga0137393_10065821All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2876Open in IMG/M
3300012205|Ga0137362_10095868All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2492Open in IMG/M
3300012207|Ga0137381_10398431All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1201Open in IMG/M
3300012351|Ga0137386_10631497All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium771Open in IMG/M
3300012359|Ga0137385_10011510All Organisms → cellular organisms → Bacteria7820Open in IMG/M
3300012927|Ga0137416_10030160All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3601Open in IMG/M
3300017927|Ga0187824_10000097All Organisms → cellular organisms → Bacteria13875Open in IMG/M
3300017955|Ga0187817_10226599All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1191Open in IMG/M
3300017955|Ga0187817_10234802All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1168Open in IMG/M
3300017970|Ga0187783_10856011All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium655Open in IMG/M
3300017995|Ga0187816_10256126All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium764Open in IMG/M
3300018062|Ga0187784_10364294All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1170Open in IMG/M
3300018468|Ga0066662_11616381All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium676Open in IMG/M
3300020579|Ga0210407_10084924All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2389Open in IMG/M
3300020579|Ga0210407_10169950All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1686Open in IMG/M
3300020579|Ga0210407_10232864All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1434Open in IMG/M
3300020583|Ga0210401_10092465All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2842Open in IMG/M
3300021046|Ga0215015_10211779All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2862Open in IMG/M
3300021046|Ga0215015_10501624All Organisms → cellular organisms → Bacteria → Acidobacteria13066Open in IMG/M
3300021088|Ga0210404_10010981All Organisms → cellular organisms → Bacteria3716Open in IMG/M
3300021170|Ga0210400_10069700All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2752Open in IMG/M
3300021171|Ga0210405_10024419All Organisms → cellular organisms → Bacteria4874Open in IMG/M
3300021171|Ga0210405_10077871All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2612Open in IMG/M
3300021171|Ga0210405_10289028All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1296Open in IMG/M
3300021406|Ga0210386_10405627All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1175Open in IMG/M
3300021420|Ga0210394_10012440All Organisms → cellular organisms → Bacteria8280Open in IMG/M
3300021420|Ga0210394_10124901All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2229Open in IMG/M
3300021432|Ga0210384_10229586All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1670Open in IMG/M
3300021474|Ga0210390_10166032All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1863Open in IMG/M
3300021479|Ga0210410_10244937All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1607Open in IMG/M
3300021559|Ga0210409_10178764All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1938Open in IMG/M
3300021559|Ga0210409_10651099All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium923Open in IMG/M
3300022504|Ga0242642_1033348All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium753Open in IMG/M
3300022506|Ga0242648_1006573All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1204Open in IMG/M
3300022507|Ga0222729_1004559All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1258Open in IMG/M
3300022525|Ga0242656_1001902All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2147Open in IMG/M
3300022525|Ga0242656_1028068All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium884Open in IMG/M
3300022532|Ga0242655_10055583All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium988Open in IMG/M
3300022532|Ga0242655_10085152All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium844Open in IMG/M
3300022717|Ga0242661_1016723All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1137Open in IMG/M
3300022722|Ga0242657_1008031All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1703Open in IMG/M
3300022724|Ga0242665_10050088All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1108Open in IMG/M
3300022724|Ga0242665_10051206All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1099Open in IMG/M
3300022726|Ga0242654_10046766All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1207Open in IMG/M
3300026298|Ga0209236_1005591All Organisms → cellular organisms → Bacteria7646Open in IMG/M
3300026328|Ga0209802_1089269All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1412Open in IMG/M
3300026328|Ga0209802_1176576All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium863Open in IMG/M
3300026328|Ga0209802_1246548All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium618Open in IMG/M
3300026334|Ga0209377_1023155All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3105Open in IMG/M
3300026334|Ga0209377_1164443All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium818Open in IMG/M
3300026529|Ga0209806_1064645All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1666Open in IMG/M
3300026532|Ga0209160_1016522All Organisms → cellular organisms → Bacteria5116Open in IMG/M
3300026532|Ga0209160_1096533All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1509Open in IMG/M
3300026532|Ga0209160_1115546All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1330Open in IMG/M
3300026532|Ga0209160_1148329All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1082Open in IMG/M
3300027635|Ga0209625_1001615All Organisms → cellular organisms → Bacteria5243Open in IMG/M
3300027725|Ga0209178_1001215All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium8047Open in IMG/M
3300027884|Ga0209275_10364130All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium812Open in IMG/M
3300027889|Ga0209380_10036158All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2793Open in IMG/M
3300028536|Ga0137415_10030528All Organisms → cellular organisms → Bacteria5306Open in IMG/M
3300031057|Ga0170834_110073455All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2647Open in IMG/M
3300031231|Ga0170824_114552883All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1083Open in IMG/M
3300031708|Ga0310686_104939454All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3193Open in IMG/M
3300031708|Ga0310686_114837412All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium881Open in IMG/M
3300031754|Ga0307475_10826165All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium734Open in IMG/M
3300032180|Ga0307471_100027694All Organisms → cellular organisms → Bacteria4301Open in IMG/M
3300032515|Ga0348332_11985520All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium881Open in IMG/M
3300032515|Ga0348332_13360164All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2026Open in IMG/M
3300032770|Ga0335085_10000021All Organisms → cellular organisms → Bacteria588913Open in IMG/M
3300032770|Ga0335085_10430756All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1523Open in IMG/M
3300032783|Ga0335079_10247883All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1960Open in IMG/M
3300032805|Ga0335078_10099934All Organisms → cellular organisms → Bacteria4191Open in IMG/M
3300032954|Ga0335083_10345441All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1290Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil28.16%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil22.33%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil13.59%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil6.80%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil4.85%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment3.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil3.88%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.91%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.91%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.94%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.94%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil1.94%
Plant LitterEnvironmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter1.94%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001167Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300004099Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF236 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004101Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF228 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004102Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF212 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004103Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF242 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004119Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF210 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005531Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017955Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2EnvironmentalOpen in IMG/M
3300017970Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MGEnvironmentalOpen in IMG/M
3300017995Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1EnvironmentalOpen in IMG/M
3300018062Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022504Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022506Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-26-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022507Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022525Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022532Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022717Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300027635Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032515FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300032954Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12673J13574_100553913300001167Forest SoilTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESDAVELNRWTQDWNRTISAARTIIRP*
JGIcombinedJ26739_10010745843300002245Forest SoilFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNXTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESDAVELNRWTQDWNRTISAARTIIRP*
JGI25385J37094_1012601813300002558Grasslands SoilRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIDLNRWTQDWNRTISTARTIIRP*
Ga0058900_141966213300004099Forest SoilMFLSDAICVTAVALELATVLLLLRKRVWRSYTLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLLMARYYRVSCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP*
Ga0058896_100738813300004101Forest SoilMLLSGAICVIAEALGLATLFLLLRRSLWRTYAFFSVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTLSAARTIIRP*
Ga0058888_141474813300004102Forest SoilMVISLLICVAAVVLEVCTLLLLSRKLLWRAYPFLLAYVVWLVIGNSAILVSFISYLRMAPEVRVHSLYPSLYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGIAAFLLLVFACATFWGYQNYSSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAVVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPILESGTAELNQWTQDWNRTISAARTIIRP*
Ga0058903_150317913300004103Forest SoilMFLSDAICVTAVALELATVLLLLRKRVWRSYTLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLLMARYYGVSCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP*
Ga0058887_101348513300004119Forest SoilMLLSGAICVIAEALGLATLFLLLRKSLWRTYAFFFVYALWLLIGNSVLLATSFYFPRHHPSWYWNIDSVDVALRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMICVWIWALWVYEPNPPISESEPVELNQWTEEWNRTLSAARTIIRP*
Ga0058897_1116425023300004139Forest SoilMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFVLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVKCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYDPNPPIKESESVELRQWTEDWNRAISTTRTIIRP*
Ga0058899_1000627213300004631Forest SoilMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFGLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTISAARTII
Ga0058899_1197273713300004631Forest SoilMLLSGAICVIAEALGLATLFLLLRKSLWRTYAFFFVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVALRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMICVWIWALWVYEPNPPISESEPVEL
Ga0058899_1226849913300004631Forest SoilMVISLLICVAAVVLEVCTLSLLSRKSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPELRAHSLYPSIYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAVVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPILESGTAELNQWTQDWNRTI
Ga0066680_1001281933300005174SoilVDIFVVICVAGPVFELGALLLLVRNGLWRSYTSFFVYLTWLLVGNSAILIASVYFPGIYPTLYWHIDSIDVVLRFLVIWEVFHQIFPKTSGLNRSLSKGFGLIAFGLLAFGCATFLIYQNYTGPRSIHLALDRSFAFVQALMILGTLVAARYYGVRCGRNIRGIALAFGGWMSISTATNAMADLTTSFITYWYYLRPLSFVVMIAVWIWALWIYDPNPPIVESEPVELGQWTEDWNRTISAARTIIRP*
Ga0066680_1010897413300005174SoilMLLSGAICVVAEALGLATLVLLLRKSLWRTYAFFFVYALWLLSGNSVLLITSFYFPGHHPSWYWNIDSIDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMVLGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVSMIAAWIWALWIYEPNPPIRESEAVDLDRWTQDWNRTISTARTIIRL*
Ga0066680_1016521813300005174SoilMFLSGAICVAAVSLELATLLLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVGCGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLASPFLAYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIELDQWTQDWNRTISTARTIIRP*
Ga0066679_1075506413300005176SoilREYPFFLLYAIWLLTANSAMLIADLYFRSIYPVVYWNIDSIDIVLRLLVVWEVFRQTFPKNSGVGRTLSKGLGIVALGLLIFACASFWDYQNYTSLRSTHLALDRSFGFVQAIMVLGTLVMARYYGLSCGRNIRGIALGFGAWVSVSTANNAMADLTSSFLPYFYRLRPLSFVFMLLVWIWALWVYEPNPPIIESDEVELSRWTQDWN
Ga0066687_1022629013300005454SoilPFFLLYAIWLLTANSAMLITDLYFRSIYPVVYWNIDSIDIVLRLLVVWEVFRQTFPKNSGVGRTLSRGLGIIALGLLIFACASFWDYQNYTSLRSAHLALDRSFGFVQAIMVLGTLVMARYYGLSCGRNIRGIALGFGAWVSVSTANNAMADLTSSFLPYFYRLRPLSFVFMLLVWMWALWVYEPNPPIIESDEVELSRWTQDWNRTITAARTIIRP*
Ga0070738_1004903523300005531Surface SoilMLLSDVICVTAVVLELITLLLLRRKGLWRIYPLFSIYTAWLLAGNSVILFTFIHWPSIYAAMYWTIDSVDVALRFIIVWEVFHQIFPKGSALNKSLSKGLGAVASGLLVFACAMFLAYQRDTGPRAIHLALDRSFGVVQAVMILGILLMARYYGVKCGRNVRGIAIAFGAWVSISTATNAMADLTSSFLPYWYYLRPLSFVVMIVVWIWAVSVYEPNPPILESEAPQLEQWTEDWNRTISAARTIIRP*
Ga0066692_1002572623300005555SoilMFASEVICVTAIVLEVCMLLLLLRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIDLNRWTQDWNRTISTARTIIRP*
Ga0066692_1008674413300005555SoilMFLSGAICVAAVSLELATLLLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVGCGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLASPFLAYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIELDRWTQDWNRTISTARTIIRP*
Ga0066704_1010030633300005557SoilVFELGALLLLVRNGLWRSYTSFFVYLTWLLVGNSAILIASVYFPGIYPTLYWHIDSIDVVLRFLVIWEVFHQIFPKTSGLNRSLSKGFGLIAFGLLAFGCATFLIYQNYTGPRSVHLALDRSFAFVQALMILGTLVAARYYGVRCGRNIRGIALAFGGWMSISTATNAMADLTTSFITYWYYLRPLSFVVMIAVWIWALWIYDPNPPIVESEPVELGQWTEDWNRTISAARTIIRP*
Ga0066704_1019008813300005557SoilMFASEVICVTAIVLEVCMLLLLLRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVSMIAAWIWALWIYEPNPPIRESEAVDLDRWTQDWNRTISTARTIIR
Ga0066704_1030643823300005557SoilMLLSQMIRAAAVALEACALVLMLRQARWRAYPFLCLYTIWLLVGNSVQEITSAYKPAIYASLYWRDDTIDVIVRFLVIWEVFRQTFPRSSRLNKSLSRGLGIIAFGLLLFGCALFWGYQNYSGIRSLHLALDRTFGFVQAVMILGTLLVARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLANSFLPYWYYLRPLSFVVMMAVWIWALWVYEPNPPIVESEPVE
Ga0066703_1013429533300005568SoilWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIDLNRWTQDWNRTISTARTIIRP*
Ga0066703_1019229223300005568SoilVDIFVVICVAGPVFELGALLLLVRNGLWRSYTSFFVYLTWLLVGNSAILIASVYFPGIYPTLYWHIDSIDVVLRFLVIWEVFHQIFPKTSGLNRSLSKGFGLIAFGLLAFGCATFLIYQNYTGPRSIHLALDRSFAFVQALMILGTLVAARYYGVRCGRNIRGIALAFGGWMSISTATNAMADLTTSFITYWYYLRPLSFVVMIAVWIWALWIYDPNPPIVESEPVE
Ga0079222_1010699623300006755Agricultural SoilMLLSEAIGLAGPALELAIILLLLRKKLWRVYTFLFVYAFWLLIGNSVILGTFLYFSKIYPDFPNVYPALYWKIYWNIDSIDVVLRFVVIWEIFRQTFPKRSGLNKSLSKGLGIGALALLAAACSIFLIYQDYAGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRYGRNIRGIALAFGAWVSVSTATNAMVDLTNSFLPYWYYLRPLSFVGMIGVWIWALWVYEPNPPIMESGELELSQWNQEWNRTISATRTIMRP*
Ga0066659_1024619813300006797SoilVDIFVVICVAGPVFELGALLLLVRNGLWRSYTSFFVYLTWLLVGNSAILIASVYFPGIYPTLYWHIDSIDVVLRFLVILEVFHQIFPKTSGLNRSLSKGFGLIAFGLLAFGCATFLIYQNYTGPRSIHLALDRSFAFVQALMILGTLVAARYYGVRCGRNIRGIALAFGGWMSISTATNAMADLTTSFITYWYYLRPLSFVVMIAVWIWALWIYDPNPPIVESEPVEL
Ga0079220_1009006313300006806Agricultural SoilMLIFVAGVVLELWAILLLLRNNLWRVYARLFTYITWLFIGNSTILIAFLYFPRVYPSLYWHSDSVDVVLRFFVVWEVFHQTFPKSSGLNRSLSKGLAGIAFGLLVFASASFWVYQNYAELRPLHLALDRSFGFVQALMILGTLVTARYYGVRCGRNVRGIALAFGAWISISTATNAMADLTNSFLPYWYYLRPLSFVVMMVVWIWALWVYEPNPPIVEGDAVELSQWTQDWNRTISAARTIIRP*
Ga0150983_1133717213300011120Forest SoilMLLSGAICVIAEALGLATLFLLLRRSLWRTYAFFSVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVALRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTISAARTIIRP*
Ga0150983_1479890223300011120Forest SoilMLLSDVISVTALALEACAVLLLLRKSLWRTYTFIFIYAVWLLVGNSLQSLAVVHFPAKFPAIYWYNDTIDVVLRFLVVWEVFRQTFPKGSGLNRSLSKGLGIVAFVLVVFACAAFWGYQNYTNLRSIHLALDRIFDFVQAIMILGTLLMARYYGVRYGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFVLMMVVWIWALWVYEPNPPIVESGTIELNQWTEDWNRTVSAARTIIRP*
Ga0137393_1006582133300011271Vadose Zone SoilMFLSEVIGLAGPALEVATILLLLRKRLWRVYTFLFVYALWLLIGNSTILATFLYFSKVYPDFPNAYPALYWNIYWNIDSIDVVLRFVVVWEVFRQTFPKRSGLNKSLSKGLGVIAFSLLAFACGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLASPFLPYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEPVELGQWTEDWNRTISATRTIIRP*
Ga0137362_1009586823300012205Vadose Zone SoilLVGNSVQEITSAYKPAVYASLYWRDDTIDVIVRFLVIWEVFRQTFPRSSRLNKSLSRGLGIIAFGLLLFGCALFWGYQNYSGIRSLHLALDRTFGFVQAVMILGTLLVARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLANSFLPYWYYLRPLSFVVMMAVWIWALWVYEPNPPIVESEPVELGQWTEDWNRTISATRTIIRP*
Ga0137381_1039843113300012207Vadose Zone SoilLVGNSVQEITSAYKPAVYASLYWRDDTIDVIVRFLVIWEVFRQTFPRSSRLNKSLSRGLGIIAFGLLLFGCALFWGYQNYSGIRSLHLALDRTFGFVQAVMILGTLLVARYYGVRCGRNVRGIALAVGGCVSISTATNAMADLANSFLPYWYYLRPLSFVVMMAVWIWALWVYEPNPPIVESEPVELGQWTEDWNRTISATRTIIRP*
Ga0137386_1063149723300012351Vadose Zone SoilLLVGNSVQEITSAYKPAVYASLYWRDDTIDVIVRFLVIWEVFRQTFPRSSRLNKSLSRGLGIIAFGLLLFGCALFWGYQNYSGIRSLHLALDRTFGFVQAVMILGTLLVARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLANSFLPYWYYLRPLSFVVMMAVWIWALWVYEPNPPIVESEPVELGQWTEDWNRTISATRTIIRP*
Ga0137385_1001151033300012359Vadose Zone SoilMFLSGAICVAAVSLELATLLLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVGCGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVKCGRNVRGIALAFGGWVSISTATNAMADLASPFLAYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIELDRWTQDWNRTISTARTIIRP*
Ga0137416_1003016013300012927Vadose Zone SoilYATWLFLANSAILVTFLYLPADSHTVNWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVACGTFLIYQNYTGPRSIHLALDRSFGLVQALMILGTMLMARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLAGPFLPYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEPVELGQWTEDWNRTISATRTIIRP*
Ga0187824_1000009783300017927Freshwater SedimentMMLASDVICISAVILEGGLLLLFLGKGLWRAYSFLFAYTIWLFIGNSAILASFLYLPAIYPALYWNIDSIDVVLRFVVVWEVFHQTFPRKSGLNKTLSKGLGIIAATLLAFACGTFLVYQNFTGPRSIHLALDRSFAFIQALMILGTLVTARYYGVKCGRNVRGIALAFGAWVSISTVTNAMADLPGSFLAYWYYLRPLSFVVMMAVWIWALWVYEPNPPIMESDAAELSQWTEDWNRTISAARTIIRP
Ga0187817_1022659913300017955Freshwater SedimentMVISLLICIAAVVLELCTLVLLSRKALWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPEVRLHSLYPTLYWQSDTLDVVLRFLLVWEIFHQTFPKGSGLNRSLSKGLGIAAFLLLVLACATFWGYQNYTSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAMVDMTNSFLPYWYYLRPLSFLVMMVAWIWAIWVYDPNPPILESGTAELNQWTEDWNRTVSAARTIIRP
Ga0187817_1023480223300017955Freshwater SedimentMVISLLICIAAVALEVCTLLLLSRKSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPEVRVYSLYPSLYWHSDTLDVVLRFLLVWEIFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSIHLALDRSFGFVQAVMILGTLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVVWIWALWVYEPNPPILESGRAELNQWTEEWNRTVSAARTIIRP
Ga0187783_1085601113300017970Tropical PeatlandMLLSDVIGVSGPLLELVTLCLFLRKSLWRAYPLLFTYTVWLLIANSVLLTTSLYFPRVHPSYYWNIDSIDIVLRFIVVWEIFHQIFPKNSGLNKTLSKGLGIVALGLIVFACATFLIYQNYTGPRSVHLALDRSFGFIQAIMILGTLLMARYYGVQCGRNVRGIALAFGGWVSISTATNAMADLTVSFIPYWKYLRPLSFVVMMAVWIWALWVYEPNP
Ga0187816_1025612623300017995Freshwater SedimentKALWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPEVRLHSLYPTLYWQSDTLDVVLRFLLVWEIFHQTFPKGSGLNRSLSKGLGIAAFLLLVLACATFWGYQNYTSLRSIHLALDRSFDLVQAVMILGPLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAMVDMTNSFLPYWYYLRPLSFLVMMVAWIWAIWVYDPNPPILESGTAELNQWTEDWNRTVSAARTIIRP
Ga0187784_1036429423300018062Tropical PeatlandCLFLRKGLWRAYPLLFTYAVWLLVANSVLLTTSLYFPRVHPSYYWNIDSIDVVLRFVVVWEIFHQIFPKNSGLNKTLSKGLAMVALGLIVFACATFLIYQNYTGPRSVHLALDRSFGFIQAIMILGTLLMARYYGVQCGRNVRGIALAFGGWVSISTATNAMADLAVSFIPYWYYLRPLSFVVMMAVWIWALWVYEPNPPIPEGESVELNQWTEDWNRTISTARTIIRS
Ga0066662_1161638113300018468Grasslands SoilEPWLVYTATVMFASEVICVTAIVLEVCMLLLLLRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYE
Ga0210407_1008492423300020579SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWARRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNALADLNGSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0210407_1016995013300020579SoilVDTFFAIGVAGPILELGVLLLLLHNGLWRRYKFLLTYDLWLLLGNSAILFTFLYFHRQIDSHPIYGTLYWDIDSIDVVLRFLLIWEVFHHTFPRGSGLNRSLSKGLGIVAFGLLVFACATFWGYQNYASVRSVHLALDRSFGFVQAIMILGTLMMARYYGVNYGRNVRGIALAFGGWVSLSTANNAMADLTNSFLPYWYYLRPLSFVFMMAVWIWALWVDEPNPPIVESEAAELNQWTEEWNRTISTARTIIRP
Ga0210407_1023286423300020579SoilMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFVLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVKCGRNIRGIALAFGAWVSISTATNAMADLDSSFLNYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTLSAARTIIRP
Ga0210401_1009246533300020583SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWARRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTVWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWIYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0215015_1021177913300021046SoilMFLSDAICVAAVALELATVLLLLRKRAWRSYALFFVYATWLFLANLAILVTFLYLPAEGQAINWASRLYPALYWNIDSIDVVLRFVVVWEVFRQTFPKGSGLNKSLSKGLAIVAFALLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISTARTIIRP
Ga0215015_10501624133300021046SoilMFLSEVIGLAGPALEVAAILLLLRKRLWRVYTFLFIYSVWLLIGNSTILATFLYFSKVYPDFPNAYPALYWNIYWNIDSIDVVLRFVVVWEVFRQTFPKRSGLNKSLSKGLGVIAFSLMAFACGTFLIYQDYTGPRSIHLALDRSFGVVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIRESEAIDLNRWTQDWNRTISTARTIIRP
Ga0210404_1001098133300021088SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWARRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTVWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0210400_1006970023300021170SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWARRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWIYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0210405_1002441953300021171SoilMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFGLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYDPNPPIKESESVELRQWTEDWNRAISTARTIMRP
Ga0210405_1007787133300021171SoilMLLSDAICVTAVALELATVLLLLRQRVWRSYTLFFAYATWLFLANSAILITSLYLPAAGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLVTARYYGVRCGRNIRGIALAFGGWVSISTATNALADLNGSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0210405_1028902813300021171SoilMLLSGAICVIAEALGLATLFLLLRKSLWRTYAFFFVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTLSAARTIIRP
Ga0210386_1040562723300021406SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWARRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLVTARYYGVRCGRNIRGIALAFGGWVSISTATNALADLNGSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0210394_1001244083300021420SoilMFLSDAICVTAVALELATVLLLLRKRVWRSYTLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLLMARYYGVSCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0210394_1012490133300021420SoilMVISLLICVAAVVLEVCTLSLLSRKSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPELRAHSLYPSIYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRYGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPIVESG
Ga0210384_1022958613300021432SoilMLLSGAICVIAEALGLATLFLLLRKSLWRTYAFFFVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVALRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVAARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWALWVYEPNPP
Ga0210390_1016603223300021474SoilMVISLLICVAAVVLEVCTLSLLSRKSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPELRAHSLYPSIYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPILESGTAELNQWTEDWNRTISAARTIIRP
Ga0210410_1024493723300021479SoilMVTSAVISVAAVALEVGSLALLLRRSLWRVYILLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFVLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVKCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYDPNPPIKESESVELRQWTEDWNRAISTARTIIRP
Ga0210409_1017876413300021559SoilMVISLLICVAAVVLEVCTLLLLSRKLLWRAYPFLLAYVVWLVIGNSAILVSFISYLRMAPEVRVHSLYPSLYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGIAAFLLLVFACATFWGYQNYSSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRFGRNVRGIALAFGGWVSISTVNSAVVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPILESGTAELNQWTQDWNRTISAARTIIRP
Ga0210409_1065109923300021559SoilMLLSGAICVIAEALGLATLFLLLRRSLWRTYAFFSVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWA
Ga0242642_103334823300022504SoilANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0242648_100657323300022506SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWARRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0222729_100455913300022507SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYTLFFVYATWLFLANSAILVTFLYLPAEGQAINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNALADLNGSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0242656_100190233300022525SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYSLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0242656_102806823300022525SoilVMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFVLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYDPNPPIKESESVELRQWTEDWNRAISTARTIMRP
Ga0242655_1005558323300022532SoilMLLSDAICVTAVALELATVLLLLRQRVWRSYTLFFAYATWLFLANSAILITSLYLPAAGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLVTARYYGVRCGRNIRGIALAFGGWVSISTATNALADLNGSFVAYWYYLRPLTFVFMIAVWIWALWVYE
Ga0242655_1008515213300022532SoilMLLSDVISVTALALEACAVLLLLRKSLWRTYTFIFIYAVWLLVGNSLQSLAVVHFPAKFPAIYWYNDTIDVVLRFLVVWEVFRQTFPKGSGLNRSLSKGLGIVAFVLVVFACAAFWGYQNYTDLRSIHLALDRIFDFVQAIMILGTLLMARYYGVRYGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFVLMMVVWIWALWVYEPNPPIVES
Ga0242661_101672313300022717SoilMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFGLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVKCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYDPNPPIKESESVELRQWTEDWNRAISTARTIMRP
Ga0242657_100803123300022722SoilVDTFFAIGFAGPILELGVLLLLLHNGLWRRYKFLLTYDLWLLLGNSAILFTFLYFHRQIDSHPIYGTLYWDIDSIDVVLRFLLIWEVFHHTFPRGSGLNRSLSKGLGIVAFGLLVFACATFWGYQNYASVRSVHLALDRSFGFVQAVMILGTLLMARYYGVNYGRNVRGIALAFGGWVSLSTANNAMADLTNSFLPYWYYLRPLSFVFMMAVWIWALWVDEPNPPIVESEAAELNQWTEEWNRTISTARTIIRP
Ga0242665_1005008823300022724SoilMLLSGAICVIAEALGLATLFLLLRRSLWRTYAFFSVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTLSAARTIIRP
Ga0242665_1005120623300022724SoilMVTSVVISVAAVALEVGSLALLLRRSLWRVYTLLFIYVLWLLIGNSTLFFSFLYFPAVYSNLYWQSDTIDVILRFLVVWEVFRQTFPKSSRLNRSLSKGLGIIAFGLLLFACATFWSYQNYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLDSSFLSYWYYLRPLSFVVMIGVWIWALWVYDPNPPIKESESVELRQWTEDWNRAISTARTIIRP
Ga0242654_1004676613300022726SoilMLLSGAICVIAEALGLATLFLLLRKSLWRTYAFFFVYALWLLIGNTVLLATSFYFPRHHPSWYWNIDSVDVALRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMILGTLVTARYYGVLCGRNIRGIALAFGGWVSISTATNAMADLDSSFLTYWYYLRPLSFVVMIGVWIWALWVYEPNPPISESEPVELNQWTEEWNRTLSAARTIIRP
Ga0209236_100559133300026298Grasslands SoilMFASEVICVTAIVLEVCMLLLLLRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIDLNRWTQDWNRTISTARTIIRP
Ga0209802_108926913300026328SoilFVYALWLLSGNSVLLITSFYFPGHHPSWYWNIDSIDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMVLGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVSMIAAWIWALWIYEPNPPIRESEAVDLDRWTQDWNRTISTARTIIRL
Ga0209802_117657613300026328SoilVDIFVVICVAGPVFELGALLLLVRNGLWRSYTSFFVYLTWLLVGNSAILIASVYFPGIYPTLYWHIDSIDVVLRFLVIWEVFHQIFPKTSGLNRSLSKGFGLIAFGLLAFGCATFLIYQNYTGPRSIHLALDRSFAFVQALMILGTLVAARYYGVRCGRNIRGIALAFGGWMSISTATNAMADLTTSFITYWYYLRPLSFVVMIAVWIWALWIYDPNPPIVESEPVELGQWTEDWNRTISAARTIIRP
Ga0209802_124654813300026328SoilLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVGCGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLASPFLAYWYYLRPLSFVVMIAVWIWALWVYE
Ga0209377_102315553300026334SoilVYTATVMFASEVICVTAIVLEVCMLLLLLRRGLWRTYTFLFVYAIWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIDLNRWTQDWNRTISTARTIIRP
Ga0209377_116444313300026334SoilGAICVAAVSLELATLLLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVGCGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLASPFLAYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIELDRWTQDWNRTISTARTIIRP
Ga0209806_106464533300026529SoilWLLIGNSAIFATFLYFPTVYPSLYWNIDSIDVVLRFVVIWEVFHQTFPKRSGLNKSLSKGLGIIALSLLIFACGTFLIYQNYTGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVVMMAVWIWALWVYEPNPPIVESEPVELGQWTEDWNRTISATRTIIRP
Ga0209160_101652233300026532SoilMFLSGAICVAAVSLELATLLLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTINWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVGCGTFLIYQNYTGPRSIHLALDRSFGVVQAFMILGTLVTARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLASPFLAYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEAIELDQWTQDWNRTISTARTIIRP
Ga0209160_109653323300026532SoilMLLSQMIRAAAVALEACALVLMLRQARWRAYPFLCLYTIWLLVGNSVQEITSAYKPAIYASLYWRDDTIDVIVRFLVIWEVFRQTFPRSSRLNKSLSRGLGIIAFGLLLFGCALFWGYQNYSGIRSLHLALDRTFGFVQAVMILGTLLVARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLANSFLPYWYYLRPLSFVVMMAVWIWALWVYEPNPPIVESEPVELGQWTEDWNRTISATRTIIRP
Ga0209160_111554613300026532SoilMLLSGAICVVAEALGLATLVLLLRKSLWRTYAFFFVYALWLLSGNSVLLITSFYFPGHHPSWYWNIDSIDVVLRFLVVWEVFRQTFPKGSGLNKSLSKGLSIGAFGLLVVACATFWGYQTYTGLRSLHLALDRSFGFVQALMVLGTLVTARYYGVRCGRNIRGIALAFGAWVSISTATNAMADLSSSFISYWYYLRPLSFVSMIAAWIWALWIYEPNPPIRESEAVDLDRWTQDWNRTISTARTIIRL
Ga0209160_114832923300026532SoilVFLTWLLVGNSAILIASVYFPGIYPTLYWHIDSIDVVLRFLVIWEVFHQIFPKTSGLNRSLSKGFGLIAFGLLAFGCATFLIYQNYTGPRSVHLALDRSFAFVQALMILGTLVAARYYGVRCGRNIRGIALAFGGWMSISTATNAMADLTTSFITYWYYLRPLSFVVMIAVWIWALWIYDPNPPIVESEPVELGQWTEDWNRTISAARTIIRP
Ga0209625_100161543300027635Forest SoilMFLSDAICVAAVGLELATVLLLLRKRVWRSYSLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLAVACVTFWGYQNYTGLNSIHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNAMADLNSSFLAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESDAVELNRWTQDWNRTISAARTIIRP
Ga0209178_100121563300027725Agricultural SoilMLLSEAIGLAGPALELAIILLLLRKKLWRVYTFLFVYAFWLLIGNSVILGTFLYFSKIYPDFPNVYPALYWKIYWNIDSIDVVLRFVVIWEIFRQTFPKRSGLNKSLSKGLGIGALALLAAACSIFLIYQDYAGPRSIHLALDRSFGFVQALMILGTLVTARYYGVRYGRNIRGIALAFGAWVSVSTATNAMVDLTNSFLPYWYYLRPLSFVGMIGVWIWALWVYEPNPPIMESGELELSQWNQEWNRTISATRTIMRP
Ga0209275_1036413013300027884SoilLLLRKRVWRSYTLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLLMARYYRVSCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0209380_1003615833300027889SoilMFLSDAICVTAVALELATVLLLLRKRVWRSYTLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASATFWGYQTNTGLRSLHLALDRSFGFIQALMILGTLLMARYYRVSCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISAARTIIRP
Ga0137415_1003052843300028536Vadose Zone SoilMFLSGAICVAAVSLELATLLLLVSKGVWRSYRLFFVYATWLFLANSAILVTFLYLPADSHTVNWASRVYPALYWNIDSIDVVLRFVVVWEVFHQIFPKSSGLNKSLSKGLAIVAFALLVVACGTFLIYQNYTGPRSIHLALDRSFGLVQALMILGTMLMARYYGVRCGRNVRGIALAFGGWVSISTATNAMADLAGPFLPYWYYLRPLSFVVMIAVWIWALWVYEPNPPIMESEPVELGQWTEDWNRTISATRTIIRP
Ga0170834_11007345533300031057Forest SoilMLVFVAGVLLELWALLLLLRNSLWRVYTRLFTYVIWLLLGNSAILIAFLYFPSAYPSLYWHSDSVDVVLRFFVVWEVFHQTFPKSSGLNRSLSKGLGIIALGLLIFACSTFWVYQNYTGLRSLHLALDRSFGFVQALMVLGTLVTARYYGVTCGRNIRGIALAFGGWVSISTATNAMADLTNSFLPYWYYLRPVSFVLMIAVWIWALWVYEPNPPIRESEAVELHRWTEDWNRTISTARTIIRP
Ga0170824_11455288323300031231Forest SoilVDTFFAIGVTGPILELGVLLLLLRNGLWRRYKSLLTYSLWLLIGNSAILFTFLHFHQQVDNHPIYATLYWDIDSIDVVLRFLLIWEVFHHTFPRGSGLNRSFSKGLGIVAFGLLVFACATFWGYQNYASDRSVHLALDRSFGFVQAVMILGTLLMARYYGVNYGRNVRGIALAFGGWVSLSTANNAMADLTNSFLPYWYYLRPLSFVFMMAVWIWAVWVDEPNPPIVESEAAELDQWTEEWNRTISTARTIIRP
Ga0310686_10493945423300031708SoilMVISLLICVAAVVLEVCTLSLLSRKSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPELRAHSLYPSLYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRYGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPIVESGTTELNQWTEDWNRTISAARTIIRP
Ga0310686_11483741213300031708SoilLLLRRSLWRVYTLLFVYAIWLLIGNSTILVTFLYFPAIYSNLYWQSDTIDVVLRFLIVWEVFRQIFPKSSRLNRSLSKGLATIAFGLLLFACATFLSYQHYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVQCGRNIRGIALAFGGWVSISTANNAVADLTNSFLPYWFYLRPLTFVLMMAVWVWALWVYEPNPPIMESGEVELGRWTEDWNRTISTARTLIQP
Ga0307475_1082616513300031754Hardwood Forest SoilLATVLLLLRKRVWRSYTLFFAYTTWLFVANSTILIASLYLPAEGQAINWAGRVYPALYWNIDSIDVVLRFVVVWEVFHQVFPKSSGLNKSLSKGLALVAFALLVVASATFWDYQTDTGLRSLHLALDRSFGFIQALMILGTLLTARYYGVKCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWVWALWVYEPNPPIRESETVELNRWTQDWNRTISAARTIIRP
Ga0307471_10002769453300032180Hardwood Forest SoilMFLSDAICVTAVALELATVLLLLRKRVWRSYTLFFAYTTWLFLANSAILITSLYLPAEGQAINWAGRVYPALYWNIDSIDVVFRFVVVWEVFHQVFPKSSGLNKSLSKGLAIVAFALLVVASAMFWSYQTDTGLRSLHLALDRSFGFIQALMILGTLLMARYYGVRCGRNIRGIALAFGGWVSISTATNALADLNSSFVAYWYYLRPLTFVFMIAVWIWALWVYEPNPPIRESEAVELNRWTQDWNRTISATRTIIRQ
Ga0348332_1198552013300032515Plant LitterMVTSVVICVAAVALEVGALILLLRRSLWRVYTLLFVYAIWLLIGNSTILVTFLYFPAIYSNLYWQSDTIDVVLRFLIVWEVFRQIFPKSSRLNRSLSKGLATIAFGLLLFACATFWSYQHYTAFRSIHLALDRSFGFVQALMILGTLVTARYYGVQCGRNIRGIALAFGGWVSISTANNAVADLTNSFLPYWFYLRPLTFVLMMAVWVWALWVYEPNPPIMESGEVELGRWTEDWNRTISTARTLI
Ga0348332_1336016433300032515Plant LitterMVISLLICVAAVVLEVCTLSLLSRKSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPELRAHSLYPSIYWHSDTLDVVLRFLLVWEVFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSIHLALDRSFDFVQAVMILGTLLMARYYGVRYGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPIVESGTTELNQWTEDWNRTISAARTIIRP
Ga0335085_100000212523300032770SoilMVISLLICIAAVVLEVCTLLLLSRRSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPEVRAHSLYPSLYWHSDTLDVVLRFLLVWEIFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSVHLALDRSFDFVQAVMILGTLVMARYYGVRFGRNVRGVALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPILESGTAELNQWTEDWNRTVSAARTIIRP
Ga0335085_1043075623300032770SoilMALSLVIFVAGAVLQVVVLSLLARNSLWRAYPYFFVYVLWICLGNFFLLAFYLVLPSARTQGTQAAAMYANLYWQSDAIDIVLRLLVVWEVFHQTFPKGSGLNRSLSKGLGIIAFCLLVFAGATFWGYQNYSGLRSVHLALDRSFGFVQALMILGTLLMARYYGVNYGRNVRGIAIAFGGWVSISTANSAMVDLTNSFLPYWYYLRPLSFVAMMAGWIWALWVYEPNPPIMEGEAPELSQWTEDWNRTISAARTIIRS
Ga0335079_1024788333300032783SoilMVMSLVIWIAALTLATGTLVLFLRRALWRTYPFLLAYVIWLIIGNSALLIGFRFFNSVYTSLYWHSDTVDVVLRFLVVWEVFHQTFPKGSGLNRSLSKGLGIAAFLLLVFACATFWGYQNYTNLRSIHMALDRSFDFVQAVMILGTLLMARYYGVLFGRNVRGIALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEPNPPILESGATELNQWTEDWNRTVSAARTIIRP
Ga0335078_1009993453300032805SoilMVISLLICIAAVVLELSTLLLLSRRSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPEVRVHSLYPTIYWHSDTLDVVLRFLLVWEIFHQTFPRGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYSSLRSIHLALDRSFDFVQAVMILGTLVMARYYGVRFGRNVRGIALAFGGWVSISTVNSAMVDMTNSFLPYWYYLRPLSFLVMMVVWIWALWVYEPNPPILESGTAELDQWTEDWNRTVSAARTIIRP
Ga0335083_1034544113300032954SoilMVISLLICIAAVVLEVCTLLLLSRRSLWRAYPFLLAYVIWLVVGNSAILVSFISYLRMAPEVRAHSLYPSLYWHSDTLDVVLRFLLVWEIFHQTFPKGSGLNRSLSKGLGITAFLLLVFAGATFWGYQNYTSLRSVHLALDRSFDFVQAVMILGTLVMARYYGVRFGRNVRGVALAFGGWVSISTVNSAMVDLTNSFLPYWYYLRPLSFLVMMVAWIWALWVYEP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.