NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F091422



Overview

Basic Information
Family ID F091422
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 192 residues
Representative Sequence MGDSATSLLRSLGGERLHTKYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Number of Associated Samples 91
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 45.79 %
% of genes near scaffold ends (potentially truncated) 41.12 %
% of genes from short scaffolds (< 2000 bps) 53.27 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.87

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
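The percentages in the Quality Assessment block can be turned back into approximate gene counts. The sketch below assumes, without confirmation from the page, that each percentage is computed over the 107 family sequences; the function and variable names are illustrative only.

```python
# Minimal sketch: convert the reported quality percentages into gene counts.
# Assumption (not stated on the page): the denominator is the 107 family sequences.
FAMILY_SIZE = 107  # "Number of Sequences" from Basic Information

reported_percentages = {
    "valid RBS motif": 45.79,
    "near scaffold end": 41.12,
    "short scaffold (< 2000 bp)": 53.27,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of genes for a percentage of `total`."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, n in counts.items():
    # Round-trip check: the count should reproduce the reported percentage.
    print(f"{label}: {n} of {FAMILY_SIZE} ({n / FAMILY_SIZE * 100:.2f} %)")
```

Under that assumption the three figures correspond to whole-number counts that reproduce the reported percentages to two decimals, which suggests the denominator is indeed the family size.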

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge (12.149 % of family members)
Environment Ontology (ENVO) Unclassified (32.710 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline) (46.729 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 38.33%, β-sheet: 17.62%, Coil/Unstructured: 44.05%

Predicted 3D Structure

pTM-score: 0.87

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.108.1.1: N-acetyl transferase, NAT | d1cjwa_ | 1cjw | 0.6163
d.108.1.1: N-acetyl transferase, NAT | d1yk3a1 | 1yk3 | 0.61129
d.108.1.0: automated matches | d5icva_ | 5icv | 0.60617
d.108.1.1: N-acetyl transferase, NAT | d1yrea1 | 1yre | 0.58886
d.108.1.0: automated matches | d6th0a_ | 6th0 | 0.58518



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 107 Family Scaffolds
PF03466 | LysR_substrate | 15.89
PF00126 | HTH_1 | 13.08
PF13365 | Trypsin_2 | 5.61
PF00583 | Acetyltransf_1 | 4.67
PF02627 | CMD | 2.80
PF13508 | Acetyltransf_7 | 2.80
PF14403 | CP_ATPgrasp_2 | 1.87
PF12902 | Ferritin-like | 1.87
PF07690 | MFS_1 | 1.87
PF01979 | Amidohydro_1 | 0.93
PF01569 | PAP2 | 0.93
PF12680 | SnoaL_2 | 0.93
PF04168 | Alpha-E | 0.93
PF07715 | Plug | 0.93
PF06808 | DctM | 0.93
PF04391 | DUF533 | 0.93
PF00656 | Peptidase_C14 | 0.93
PF01850 | PIN | 0.93
PF03484 | B5 | 0.93
PF00265 | TK | 0.93
PF09140 | MipZ | 0.93
PF05099 | TerB | 0.93
PF00690 | Cation_ATPase_N | 0.93
PF02518 | HATPase_c | 0.93
PF13188 | PAS_8 | 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 107 Family Scaffolds
COG0599 | Uncharacterized conserved protein YurZ, alkylhydroperoxidase/carboxymuconolactone decarboxylase family | General function prediction only [R] | 2.80
COG2128 | Alkylhydroperoxidase family enzyme, contains CxxC motif | Inorganic ion transport and metabolism [P] | 2.80
COG0072 | Phenylalanyl-tRNA synthetase beta subunit | Translation, ribosomal structure and biogenesis [J] | 0.93
COG0474 | Magnesium-transporting ATPase (P-type) | Inorganic ion transport and metabolism [P] | 0.93
COG1192 | ParA-like ATPase involved in chromosome/plasmid partitioning or cellulose biosynthesis protein BcsQ | Cell cycle control, cell division, chromosome partitioning [D] | 0.93
COG1435 | Thymidine kinase | Nucleotide transport and metabolism [F] | 0.93
COG2307 | Uncharacterized conserved protein, Alpha-E superfamily | Function unknown [S] | 0.93
COG2979 | Uncharacterized membrane protein YebE, DUF533 family | Function unknown [S] | 0.93
COG3793 | Tellurite resistance protein TerB | Inorganic ion transport and metabolism [P] | 0.93
COG4249 | Uncharacterized conserved protein, contains caspase domain | General function prediction only [R] | 0.93



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000271|PBR_1026391 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3589
3300000883|EsDRAFT_10030599 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 737
3300000883|EsDRAFT_10073702 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1737
3300001800|JGI24115J20150_1001550 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 7310
3300003278|U2draft_1001582 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 18944
3300004123|Ga0066181_10048251 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 1112
3300004128|Ga0066180_10072353 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 1228
3300004481|Ga0069718_14201893 | All Organisms → cellular organisms → Bacteria | 5858
3300004794|Ga0007751_10868320 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae → Hyphomonas | 796
3300005341|Ga0070691_10426873 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae → Henriciella → Henriciella marina | 752
3300005525|Ga0068877_10032728 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales | 3549
3300005662|Ga0078894_10051438 | All Organisms → cellular organisms → Bacteria | 3493
3300005758|Ga0078117_1015232 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 7162
3300005982|Ga0075156_10482332 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 632
3300005986|Ga0075152_10000093 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 68565
3300005988|Ga0075160_10335564 | All Organisms → cellular organisms → Bacteria | 824
3300006039|Ga0073915_10049395 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 741
3300006056|Ga0075163_10000041 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 127235
3300006056|Ga0075163_10001669 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 29953
3300006056|Ga0075163_10071093 | All Organisms → cellular organisms → Bacteria | 4234
3300006056|Ga0075163_10191294 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2392
3300006056|Ga0075163_11352662 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae → Hyphomonas → Hyphomonas hirschiana | 701
3300007216|Ga0103961_1293171 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3739
3300007232|Ga0075183_10663277 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae → Henriciella → Henriciella marina | 563
3300007232|Ga0075183_11251292 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1559
3300009032|Ga0105048_10001053 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 44326
3300009078|Ga0105106_10010693 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 6646
3300009079|Ga0102814_10247884 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 969
3300009086|Ga0102812_10102108 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1575
3300009086|Ga0102812_10211169 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1058
3300009087|Ga0105107_10009140 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 6803
3300009100|Ga0075418_10754645 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales | 1050
3300009120|Ga0117941_1023180 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1633
3300009147|Ga0114129_12345259 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 640
3300009148|Ga0105243_11715139 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 657
3300009152|Ga0114980_10005445 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 8449
3300009669|Ga0116148_1328379 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 619
3300009688|Ga0116176_10001080 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 21624
3300009696|Ga0116177_10053933 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2460
3300009779|Ga0116152_10047730 | All Organisms → cellular organisms → Bacteria | 2713
3300009780|Ga0116156_10001895 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 19560
3300010346|Ga0116239_10175993 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1621
3300010357|Ga0116249_10234475 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1702
3300010400|Ga0134122_10069689 | All Organisms → cellular organisms → Bacteria | 2731
3300010401|Ga0134121_10270200 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1495
3300010401|Ga0134121_10519978 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1099
3300010885|Ga0133913_10216191 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5103
3300011406|Ga0137454_1045906 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 685
3300012046|Ga0136634_10381027 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 586
3300012943|Ga0164241_10185201 | All Organisms → cellular organisms → Bacteria | 1489
(restricted) 3300013126|Ga0172367_10000790 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 47875
(restricted) 3300013131|Ga0172373_10004229 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 20367
3300013315|Ga0173609_10377574 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1359
(restricted) 3300014720|Ga0172376_10001031 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 36192
3300015360|Ga0163144_10002881 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 39331
3300015360|Ga0163144_10291143 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2096
3300017788|Ga0169931_10006501 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 17599
3300020027|Ga0193752_1091797 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1253
3300020172|Ga0211729_11440508 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1507
3300021090|Ga0210377_10016659 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5479
3300023100|Ga0247738_10013753 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2753
3300024343|Ga0244777_10799592 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 557
3300024545|Ga0256347_1029360 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1311
3300024858|Ga0255286_1028160 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1203
3300025526|Ga0208492_1009287 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2862
3300025772|Ga0208939_1000119 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 68877
3300025772|Ga0208939_1000386 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 36941
3300025772|Ga0208939_1039971 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2002
3300025866|Ga0208822_1017815 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3380
3300025871|Ga0209311_1001440 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 19076
3300025871|Ga0209311_1004428 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomonadales → Hyphomonadaceae | 9214
3300027091|Ga0209873_1030919 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 642
3300027503|Ga0255182_1012501 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2757
3300027642|Ga0209135_1080415 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1113
3300027697|Ga0209033_1227398 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 547
(restricted) 3300027728|Ga0247836_1000050 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 175190
3300027806|Ga0209985_10034629 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2951
3300027878|Ga0209181_10139244 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2286
3300027892|Ga0209550_10048359 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3462
3300027973|Ga0209298_10051292 | All Organisms → cellular organisms → Bacteria | 1915
3300027974|Ga0209299_1037705 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2062
3300027979|Ga0209705_10029099 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3054
3300028648|Ga0268299_1185418 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 514
3300029990|Ga0311336_10920481 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 757
3300030294|Ga0311349_11014652 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 778
3300031232|Ga0302323_100306184 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1650
3300031455|Ga0307505_10050489 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1852
3300031726|Ga0302321_102939583 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 556
3300031758|Ga0315907_10917741 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 641
3300031786|Ga0315908_10350702 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1235
3300031902|Ga0302322_100374548 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1626
3300031951|Ga0315904_10167593 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2201
3300031951|Ga0315904_10248774 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1704
3300032092|Ga0315905_10363280 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1370
3300032144|Ga0315910_10112893 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2001
3300032144|Ga0315910_10963983 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 665
3300032144|Ga0315910_11050131 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 636
3300032157|Ga0315912_10936869 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 688
3300032205|Ga0307472_101203456 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 724
3300032205|Ga0307472_101638272 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 634
3300033433|Ga0326726_10940097 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 839
3300033978|Ga0334977_0004999 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 7563
3300033981|Ga0334982_0010704 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5450
3300034021|Ga0335004_0222781 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1161
3300034066|Ga0335019_0082722 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2165
3300034118|Ga0335053_0305984 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 996
3300034150|Ga0364933_147247 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 608

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
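Each scaffold identifier listed above appears to combine a 10-digit prefix with a scaffold name, separated by `|`; the prefixes match the Taxon OIDs in the Associated Samples list. The following is a minimal sketch of splitting such identifiers and counting scaffolds per sample. The function name, the handling of the "(restricted)" marker, and the interpretation of the prefix as a Taxon OID are assumptions made for illustration, not part of NMPFamsDB.

```python
from collections import Counter
from typing import Tuple

# Prefix shown in front of restricted datasets in the scaffold list.
RESTRICTED_MARKER = "(restricted)"


def split_scaffold_id(entry: str) -> Tuple[str, str, bool]:
    """Split a scaffold entry into (taxon_oid, scaffold_name, is_restricted).

    Assumes the 'TAXON_OID|SCAFFOLD_NAME' layout seen in the list above.
    """
    entry = entry.strip()
    restricted = entry.startswith(RESTRICTED_MARKER)
    if restricted:
        entry = entry[len(RESTRICTED_MARKER):].strip()
    taxon_oid, separator, scaffold_name = entry.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {entry!r}")
    return taxon_oid, scaffold_name, restricted


# Three identifiers copied from the scaffold list.
example_entries = [
    "3300000883|EsDRAFT_10030599",
    "3300000883|EsDRAFT_10073702",
    "(restricted) 3300013126|Ga0172367_10000790",
]

parsed = [split_scaffold_id(e) for e in example_entries]
scaffolds_per_sample = Counter(taxon_oid for taxon_oid, _, _ in parsed)
print(scaffolds_per_sample)
```

Grouping by the prefix in this way would explain why the family has 107 scaffolds but only 91 associated samples: several samples contribute more than one scaffold.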




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 12.15%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 9.35%
Wastewater Effluent | Engineered → Wastewater → Nutrient Removal → Unclassified → Unclassified → Wastewater Effluent | 9.35%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 8.41%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.54%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 4.67%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 4.67%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 3.74%
Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 3.74%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.80%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 2.80%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.80%
Freshwater Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Microbial Mat | 1.87%
Freshwater | Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater | 1.87%
Freshwater And Marine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Freshwater And Marine | 1.87%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.87%
Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Sand | 1.87%
Serpentinite Rock And Fluid | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Serpentinite Rock And Fluid | 1.87%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.87%
Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Sediment | 0.93%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.93%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.93%
Lake Water | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake Water | 0.93%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 0.93%
Polar Desert Sand | Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand | 0.93%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.93%
Lake Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Lake Sediment | 0.93%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.93%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.93%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.93%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.93%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.93%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.93%
Sediment | Engineered → Wastewater → Industrial Wastewater → Mine Water → Unclassified → Sediment | 0.93%
Down-Flow Hanging Sponge Reactor | Engineered → Bioreactor → Unclassified → Unclassified → Unclassified → Down-Flow Hanging Sponge Reactor | 0.93%
Photobioreactor Incubated | Engineered → Biotransformation → Microbial Enhanced Oil Recovery → Unclassified → Unclassified → Photobioreactor Incubated | 0.93%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000271 | Photobioreactor incubated microbial communities from Hamburg, Germany - Sample 1 | Engineered
3300000883 | Estuary microbial communities from the Columbia River - 5 PSU | Environmental
3300001800 | Serpentinite rock and fluid subsurface biosphere microbial communities from McLaughlin Reserve, California, USA - CR12Mar_CSW14AB | Environmental
3300003278 | Down-flow hanging sponge reactor microbial communities from the University of Illinois at Urbana-Champaign, USA - U2-648F-DHS | Engineered
3300004123 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM110.SD (version 2) | Environmental
3300004128 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM110.SN (version 2) | Environmental
3300004481 | Combined Assembly of Gp0112041, Gp0112042, Gp0112043 | Environmental
3300004794 | Metatranscriptome of freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM110.SN (Metagenome Metatranscriptome) | Environmental
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005525 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel6S_1000h metaG | Environmental
3300005662 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MLB.SD (version 4) | Environmental
3300005758 | Cyanobacteria communities in tropical freswater systems - freshwater lake in Singapore | Environmental
3300005982 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 8/11/14 A brown DNA | Engineered
3300005986 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 6/11/14 C2 DNA | Engineered
3300005988 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 9/18/14 C2 DNA | Engineered
3300006039 | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T2_30-Apr-14 | Environmental
3300006056 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 10/23/14 1A DNA | Engineered
3300007216 | Combined Assembly of cyanobacterial bloom in Punggol water reservoir, Singapore (Diel cycle-Surface and Bottom layer) 16 sequencing projects | Environmental
3300007232 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 10/23/14 A2 RNA (Eukaryote Community Metatranscriptome) | Engineered
3300009032 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-05 | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009079 | Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.741 | Environmental
3300009086 | Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.713 | Environmental
3300009087 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015 | Environmental
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009120 | Lake sediment microbial communities from Tanners Lake, St. Paul, MN | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009152 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG | Environmental
3300009669 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC055_MetaG | Engineered
3300009688 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC08_MetaG | Engineered
3300009696 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC10_MetaG | Engineered
3300009779 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from Hong Kong - AD_UKC119_MetaG | Engineered
3300009780 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC045_MetaG | Engineered
3300010346 | AD_USMOca | Engineered
3300010357 | AD_USSTca | Engineered
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010885 | northern Canada Lakes Co-assembly | Environmental
3300011406 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT539_2 | Environmental
3300012046 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ833 (21.06) | Environmental
3300012943 | Backyard soil microbial communities from Emeryville, California, USA - Original compost - Back yard soil (BY) | Environmental
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental
3300013131 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_10m | Environmental
3300013315 | Sediment microbial communities from Acid Mine Drainage holding pond in Pittsburgh, PA, USA - 1B | Engineered
3300014720 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_35m | Environmental
3300015360 | Freshwater microbial mat bacterial communities from Lake Vanda, McMurdo Dry Valleys, Antarctica - Oligotrophic Lake LV.19.BULKMAT1 | Environmental
3300017788 | Freshwater microbial communities from Lake Kivu, Western Province, Rwanda to study Microbial Dark Matter (Phase II) - Kivu_15m_20L | Environmental
3300020027 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c1 | Environmental
3300020172 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_102 megahit1 | Environmental
3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental
3300023100 | Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L013-104B-2 | Environmental
3300024343 | Combined assembly of estuarine microbial communities from Columbia River, Washington, USA >3um size fraction | Environmental
3300024545 | Metatranscriptome of freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Colum_RepB_8d (Metagenome Metatranscriptome) | Environmental
3300024858 | Metatranscriptome of freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Colum_RepA_8d (Metagenome Metatranscriptome) | Environmental
3300025526 | Serpentinite rock and fluid subsurface biosphere microbial communities from McLaughlin Reserve, California, USA - CR12Mar_CSW14AB (SPAdes) | Environmental
3300025772 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC12_MetaG (SPAdes) | Engineered
3300025866 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_STIC08_MetaG (SPAdes) | Engineered
3300025871 | Active sludge microbial communities of municipal wastewater-treating anaerobic digesters from USA - AD_UKC045_MetaG (SPAdes) | Engineered
3300027091 | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T2_30-Apr-14 (SPAdes) | Environmental
3300027503 | Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_UVDOM_RepA_8d | Environmental
3300027642 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM110.SD (SPAdes) | Environmental
3300027697 | Freshwater lake microbial communities from Lake Michigan, USA - Fa13.BD.MLB.DN (SPAdes) | Environmental
3300027728 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_14m | Environmental
3300027806 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel6S_1000h metaG (SPAdes) | Environmental
3300027878 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-05 (SPAdes) | Environmental
3300027892 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM15.SN (SPAdes) | Environmental
3300027973 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG (SPAdes) | Environmental
3300027974 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_MF_MetaG (SPAdes) | Environmental
3300027979 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 (SPAdes) | Environmental
3300028648 | Activated sludge microbial communities from bioreactor in Nijmegen, Gelderland, Netherland - NOB reactor | Engineered
3300029990 | I_Fen_N2 coassembly | Environmental
3300030294II_Fen_E3 coassemblyEnvironmentalOpen in IMG/M
3300031232Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_3EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031726Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_1EnvironmentalOpen in IMG/M
3300031758Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123EnvironmentalOpen in IMG/M
3300031786Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 4 MA124EnvironmentalOpen in IMG/M
3300031902Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_2EnvironmentalOpen in IMG/M
3300031951Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA120EnvironmentalOpen in IMG/M
3300032092Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 4 MA121EnvironmentalOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033978Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME28Sep2014-rr0002EnvironmentalOpen in IMG/M
3300033981Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME24Aug2014-rr0011EnvironmentalOpen in IMG/M
3300034021Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME01Oct2014-rr0057EnvironmentalOpen in IMG/M
3300034066Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME11Jul2017-rr0087EnvironmentalOpen in IMG/M
3300034118Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME05Aug2017-rr0165EnvironmentalOpen in IMG/M
3300034150Sediment microbial communities from East River floodplain, Colorado, United States - 25_j17EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
PBR_102639123300000271Photobioreactor IncubatedMAALMWNSRTQLRSLGGERLHAKYMRGASEEMMASVRLTHAVSDDDYVAVHALATAELGPGLASLEEIKRVDSLTGASIWVIRRNDMVTGFLAPLALTEAGVAALADNTFDAANIDQKWVARLGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDKHFTDLPFFGRDSTEAGARIMRHLGFFPFDDTPHLFWRCCSLMEPVAC*
EsDRAFT_1003059913300000883Freshwater And MarineMVSKRLTTVLGMTVMTENAHTLLASLGAEKVHPKYIPGASEQMMASVRLTHATSDQDFEDVHALATKELGPGLASLVEIKRVDALTGASIWVXXXXXXXXXXXXXALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGAL
EsDRAFT_1007370223300000883Freshwater And MarineMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
JGI24115J20150_100155033300001800Serpentinite Rock And FluidMLMSFDDARTRLQSLGGERLHPKYMSGASEEMMASVRLEHAVSEDDYAAVHALATAELGPGLASLEEIKRVDQLTGASIWVIRRKGEVTGFLAPLALSAAGVKALVDNTFDAANIRAEWVVRVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFETLPFFGRDSTEAGARIMAHLGFFPFDSTPHLYWRCCSLMQEIAA*
U2draft_1001582173300003278Down-Flow Hanging Sponge ReactorMVMVHEDARALLRSLGGERVHSHYVQGASEAMMASVRLEHAVSEQDYADVHRLATNELGPGLASLDEIKRVDSLTGASIWVIRRKGEVTGFLAPLALTAAGVAALVDNTFDAANIDEKWVARIGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDKHFTDLPFFGRDSTEAGARIMRHLGFFPFADTPHLFWRCASVMEMAA*
Ga0066181_1004825123300004123Freshwater LakeHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0066180_1007235323300004128Freshwater LakeMGDSATSLLRSLGGERLHTKYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0069718_1420189343300004481SedimentMAALMWTSRTQLRSLGGDRLHPKYMQGASEEMMASVRLTHAVSDADYEAVHALATAELGPGLASLDEIKRVDALTGASIWVIRRNDVVTGFLAPLALSSEGVAALVDNTFDAANIDAKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDTHFTDLPFFGRDSTEAGARIMRHLGFFPFDDTPHLFWRCCSLMEPIS*
Ga0007751_1086832013300004794Freshwater LakeQNGRGVRGVVMGDSATSLLRSLGGERLHTKYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0070691_1042687313300005341Corn, Switchgrass And Miscanthus RhizosphereHPESAEDYAQVHALATAELGPDLASLEEIKRVDRLTGASIWVVRRHGAVSGFLAPLALTLAGRNALIEGKFDSAHIQQEWVARMGEPLAGFYCWCYAGRDQVTRGALVLALRTLIDTHFRDLPFFGRDSTEAGARIMRHLGFFPFDGTPHLFWRCCAVMGGKAA*
Ga0068877_1003272833300005525Freshwater LakeMGDSATALLRSLGGERLHTQYMPGAAEAMMASVRLEHARTDQDFADVHALATAELGPGLASLEEIKRVDGLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAARIDQKWVARMGEPLAGFYCWCYAGNDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA*
Ga0078894_1005143833300005662Freshwater LakeMGNSATSLLRSLGGERLHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0078117_101523263300005758Lake WaterMLMSFDDARTQLQSLGGERLHPKYMRGASEEMMASVRLEHAVTDEDYVAVHALATAELGPGLASLEEIKRVDDLTGASIWVIRRKGEVTGFLAPLALTSAGVKALVDSTFDAANIRQEWVVRVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWRCCSLMQEIAA*
Ga0075156_1048233223300005982Wastewater EffluentSDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNNEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0075152_10000093343300005986Wastewater EffluentMAALMWNSRTQLRSLGGERLHLKYMAGASEEMMASVRLTHAVSDEDYAAVHALATAELGPGLASLQEIKRVDSLTGASIWVIRRNGAVTGFLAPLALTEAGVAALADNTFDAANIDQKWVARLGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDKHFTDLPFFGRDSTEAGARIMHHLGFFPFDDTPHLFWRCCSLMEPVAC*
Ga0075160_1033556423300005988Wastewater EffluentAMMASVKLHHAQSEQDFADVHALAVAELGPGLASLAEIKRVDSLTNAAIWVIRRNDSVTGFLAPLALTAAGVAALTDGTFNAARIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0073915_1004939513300006039SandEQMMASVRLTHAASDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0075163_10000041493300006056Wastewater EffluentMMVGARVFDAPCDAYDAVEALGGNRLHPQYVSGASEEMMASVKLFHPDDADYAEVHALASDTIGGSLASEDEIRRVDALTGASIWVNRRHGKVAGFLAPLALTLAGRDALIDGSFDPFAIDAAWVARVGQPLAGFYCWSYAGRDQFTRGVLVLALRTLIDRHFPDLPFFGRDTTAAGARIMAHLGFSPFDTTPHLFWRCRSVMDAKS*
Ga0075163_1000166913300006056Wastewater EffluentEAPPSSLWSKVSKRLKKNGGSVMVMVLDDARSALRSLGGERLHPRYMPGASEAMMASVKLHHAQSEQDFADVHALAVAELGPGLASLAEIKRVDSLTNAAIWVIRRNDTVTGFLAPLALTAAGVAALTDGTFNAARIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0075163_1007109333300006056Wastewater EffluentMTENAHTLLASLGAERVHPKYIPGASEQMMASVRLTHAASDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0075163_1019129443300006056Wastewater EffluentMVMVLDDARSALRSLGGERLHPRYMSGASEAMMASVKLHHATSEQDFADVHALAVAELGPGLASLAEIKRVDALTNAAIWVVRRNGAVTGFLAPLALTSAGVAALVDGSFNAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0075163_1135266213300006056Wastewater EffluentEAPPSSLWSKVSKRLKKNGGWEMVMVLEDARSALRSLGGERSHPRYMPGASEAMMASVKLHHAESDQDFADVHALAVAELGPGLASLAEIKRVDALTDAAIWVVRRNGAVTGFLAPLALTSAGVAALVDGSFNAAKIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0103961_129317123300007216Freshwater LakeMTAQTVSARAQLKSLGGERRHPCYRPGASESMMADIRLEHAVNDADYAAVHQLACDQLGPGLASLEEIRRVDQLTGASIWLLRRHDQVTGFLAPLALTSAGVNALTEGRFTAANIEDAWVARLGEPLAGFYCWCYAGGDQVSRGALVLGLRTLIDRRFPGMPFFGRDSTEAGARIMRHLGFFPFDDAPHLFWRCCQLMEPVS*
Ga0075183_1066327713300007232Wastewater EffluentGMMVGARVFDAPCDAYDAVEALGGNRLHPQYVSGASEEMMASVKLFHPDDADYAEVHALASDTIGGSLASEDEIRRVDALTGASIWVNRRHGKVAGFLAPLALTLAGRDALIDGSFDPFAIDAAWVARVGQPLAGFYCWSYAGRDQFTRGVLVLALRTLIDRHFPDLPFFGRDTTAAGARIMAHLGF
Ga0075183_1125129223300007232Wastewater EffluentMAEAPPSSLWSKVSKRLKKNGGSVMVMVLDDARSALRSLGGERLHPRYMPGASEAMMASVKLHHAQSEQDFADVHALAVAELGPGLASLAEIKRVDSLTNAAIWVIRRNDTVTGFLAPLALTAAGVAALTDGTFNAARIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0105048_10001053363300009032FreshwaterMAALIWNARTQLKSLGGERLHAKYMPGASEDMMASVRLTHAVSDEDYLAVHALATAELGPGLASLEEIKRVDNLTGASIWVIRRNDAVTGFLAPLALSSVGVAALADNTFDAANIDAKWVARLGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDKHFTDLPFFGRDSTEAGARIMRHLGFFPFDDTPHLFWRCCTLMEPIS*
Ga0105106_1001069353300009078Freshwater SedimentMKWGSGTFDAQFALQSLGGDRLHAKYRPGASEEMMASVRLTRPESEQDFADVHSLAVNELGSGLASLEEIKRVDALTGAAIWTVKRHGAVTGFLAPLALTKAGRDALVEDTFDAARIEQSWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDKHFPDLPFFGRDATEAGARIMRHLGFAPFDGTPHLFWRCCSVMDEAA*
Ga0102814_1024788423300009079EstuarineASEQMMASVRLTHAASDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0102812_1010210813300009086EstuarineMTVMTENAHTLLASLGAEKVHPKYIPGASEQMMASVRLTHATSDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0102812_1021116923300009086EstuarineMGDSATSLLRSLGGERLHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0105107_1000914083300009087Freshwater SedimentMKWGSGTFDAQFALQSLGGDRLHAKYRPGASEEMMASVRLTRPESEQDFADVHSLAVNELGAGLASLEEIKRVDALTGAAIWTVKRHGAVTGFLAPLALTKAGRDALVEDTFDAARIEQSWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDKHFPDLPFFGRDSTEAGARIMRHLGFAPFDGTPHLFWRCCSVMDEAA*
Ga0075418_1075464523300009100Populus RhizosphereMKWGSGTFDAHFALQSLGGDRLHAKYQPVASEAMMASVRLLRPEGEQDFAAVHSLATAELGPGLASLEEIQRVDGLTGAAIWVVKRHGAVTGFLAPLALTQAGRDALVEDEFDASNIHQSWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDRHFPDLPFFGRDATEAGARIMRHLGFAPFDGTPHLFWRCCSVMDEAT*
Ga0117941_102318013300009120Lake SedimentSHAMMASVKLFRPESEQDYADIHGLALAEIGPGLASLEEIKRVDSLTDAALWAVKRGARITGFLAPLALTLAGRDALVSGEFDAAHIEAKWVARLGQPLAGFYCWAYAGCDQVSRGALVLALRKLIDAHFPDLPFFGRDTTEAGARIMRHLGFPPFDASPHLYWRCCSVMDENAA*
Ga0114129_1234525913300009147Populus RhizosphereMMASVRLLRPEGEQDFAAVHSLATAELGPGLASLEEIQRVDGLTGAAIWVVKRHGAVTGFLAPLALTQAGRDALVEDEFDASNIHQSWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDRHFPDLPFFGRDATEAGARIMRHLGFAPFDGTP
Ga0105243_1171513923300009148Miscanthus RhizosphereLHPESEEDYAQVHALATAELGPSLASLEEIKRVDRLTGASIWVVRRHGAVSGFLAPLALTLAGRNALIEGKFDSAHIQQEWVARMGEPLAGFYCWCYAGRDQVTRGALVLALRTLIDTHFRDLPFFGRDSTEAGARIMRHLGFFPFDGAPHLFWRCCAVMGGKAA*
Ga0114980_1000544553300009152Freshwater LakeMTENARTLLASLGAERVHPKYTPGASEQMMATVRLTHAASDQDFEDVHALATRELGPGLASLAEIKRVDALTGASIWVIRRNHEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFQDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0116148_132837913300009669Anaerobic Digestor SludgeLMMASVKLHHAVTEQDFADVHALATAELGPSLASLAEIKRVDGLTDAAIWVVRRNDAVTGFLAPLALTSAGVAAMTDGTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDTTEAGAKIMRHLGFFPFDGVDHLFWRCRTVMDAAA*
Ga0116176_1000108023300009688Anaerobic Digestor SludgeMVMVLDDARSALRSLGGERLHPRYMPGASEAMMASVKLHHAQSEQDFADVHALAVAELGPGLASLAEIKRVDSLTNAAIWVIRRNDTVTGFLAPLALTAAGVAALTDGTFNAARIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0116177_1005393323300009696Anaerobic Digestor SludgeMVEAPPSSLWSKVSKRLKNNGGWEMVMVLDDARSALRSLGGERLHPRYMSGASEAMMASVKLHHATSEQDFADVHALAVAELGPGLASLAEIKRVDALTNAAIWVVRRNGAVTGFLAPLALTSAGVAALVDGSFNAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA*
Ga0116152_1004773023300009779Anaerobic Digestor SludgeMAALMWNSRAQLKSLGGERRHPQYARGASEAMMASVRLTHAENDADYEAVHELATSELGPGLASLEEIKRVDAMTGASIWVVRRKGAVTGFLAPLALTSAGVAALVDNTFDAAKIDPAWVARLGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDTHFSDLPFFGRDSTEAGAKIMRHLGFFPFDDTPHLFWRCCSLMEPIA*
Ga0116156_1000189523300009780Anaerobic Digestor SludgeMVMVLEDARAALRRLGGNRLHPRYTHGASELMMASVKLHHAVTDQDFADVHALATAELGPDLASLAEIKRVDRLTDAAIWVVRRNETVTGFLAPLALTAEGVAALTDGAFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDTTEAGAKIMRHLGFFPFDGVDHLYWRCRTVMDDAA*
Ga0116239_1017599333300010346Anaerobic Digestor SludgeMMASVKLHHAVTEQDFADVHALATAELGPSLASLAEIKRVDGLTDAAIWVVRRNDAVTGFLAPLALTSAGVAAMTDGTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDTTEAGAKIMRHLGFFPFDGVDHLFWRCRTVMDAAA*
Ga0116249_1023447523300010357Anaerobic Digestor SludgeMMVGARVFDAPCDAYDAVEALGGNRLHPQYVSGASEEMMASVKLFHPDDADYAEVHALASDTIGGSLASEDEIRRVDALTGASIWVNRRHGKVAGFLAPLALTLAGRDALIDGSFDPFAIDAAWVARVGEPLAGFYCWSYAGRDQFTRGVLVLALRTLIDRHFPDLPFFGR
Ga0134122_1006968923300010400Terrestrial SoilMLGAGTFDADFALQSLGGNRLHSKYSPGASEAMMASVKLLHPESAEDYAQVHALATAELGPDLASLEEIKRVDRLTGASIWVVRRHGAVSGFLAPLALTLAGRNALIEGKFDSAHIQQEWVARMGEPLAGFYCWCYAGRDQVTRGALVLALRTLIDTHFRDLPFFGRDSTEAGARIMRHLGFFPFDGAPHLFWRCCAVMGGKAA*
Ga0134121_1027020023300010401Terrestrial SoilMMAAVKLFHPETDEEYARVHELATYELGKGLASLDEIKRIDALTHAAIWVVRRKAQVSGFLAPLALTLAGRDALINDTFDAANIRQEWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRNLIDRYFPDLPFFGRDSTEAGARIMQHLGFYPFGATPHLFWRCCSVMDVAA*
Ga0134121_1051997813300010401Terrestrial SoilMCGAGTFDANFALQSLGGERLHPKYFPVTSEAMMASVRLVRAETHEDYSQVHDLATAELGSGLASLEEIKRVDGLTNAAIWVVRRHGAVSGFLAPLALTLAGRDALVEDKFDAANIQQQWVARMGEPLAGFYCWCYAGRDQVTRGALVLALRTLIDTHFPDLPFFGRDSTEAGARIMRHLGFFPFDRVPHLFWRCCAVMGENAA*
Ga0133913_1021619113300010885Freshwater LakeMTVMTENARTLLASLGAERVHPKYVPGASEQMMATVRLTHAVSHQDFEDVHALATKELGPGLASLEEIKRVDGLTGASIWVIRRTGEVTGFLAPLALTAAGVSALVDNTFDAANIDQRWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
Ga0137454_104590613300011406SoilPENTQDFTDVHTLAVAELGEGLASLEEIQRVDGLTSAAIWVVKRHGAVTGFLAPLALTKPGRDALIDDTFDAAHIEQCWVARAGQPLAGFYCWCYAGKDQVTRGLLVLALRTLIDKHFPDLPFFGRDSTEAGARIMRHLGFAPFDGTPHLFWRCCSVMDEAA*
Ga0136634_1038102713300012046Polar Desert SandDFADVHALATQELGPGLASLDEIKRVDSLTGASIWVIRRRGDVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPIAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0164241_1018520123300012943SoilMIAQQVKSDARFALDRLGGNRLHARYAPVASEAMMESVRLQRPTAAEFADVHAIASEQIGGALATLEEIRRVDALTDAAIWVVRRKGAVTGFLAPLALTALGRDALVDGTFRAGAIKQEWVARLGQPLAGFYCWSYAGRDQVTRGALVLALRTLIDTHFPDLPFFGRDTTAAGGRIMAHLGFSPFDDVEHLYWRCRSVMMTEQEERPASPMLSPISSSITAPAQDAAQ*
(restricted) Ga0172367_10000790383300013126FreshwaterMGDSATALLRSLGGERLHTQYVPGASEAMMASVRLEHARTEQDFADVHDLATAELGPGLASLEEIKRVDGLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAAHIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA*
(restricted) Ga0172373_10004229253300013131FreshwaterMPGASEAMMASVRLEHARTEQDFADVHDLATAELGPGLASLEEIKRVDRLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAAHIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA*
Ga0173609_1037757413300013315SedimentIPGASEQMMASVRLTHAASDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNNEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA*
(restricted) Ga0172376_10001031103300014720FreshwaterMPGASEAMMASVRLEHARTEQDFADVHDLATAELGPGLASLEEIKRVDRLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAAHIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA*
Ga0163144_10002881283300015360Freshwater Microbial MatMLMGDNATSLLRSLGGERLHAQYMRGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLEEIKRVDALTGASIWVVRRRGEVTGFLAPLALTAAGVAALLDNTFDAASIDQKWVARMGEPIAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFADLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAT*
Ga0163144_1029114333300015360Freshwater Microbial MatMGDNATSLLLSLGGERLHPQYIRGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEITGFLAPLALTAAGVAALVDNTFDAAHIDQTWVARMGEPIAGFYCWCYAGKDQVSRGALVIGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA*
Ga0169931_1000650153300017788FreshwaterMTAQTGSARAQLKSLGGERRHPCYRPGASESMMADIRLEHAVNEADYAAVHQLACDQLGPGLASLEEIRRVDELTGASIWVLRRHDQVTGFLAPLALTSAGVSALTEGRFTAANIEDAWVARLGEPLAGFYCWCYAGGDQVSRGALVLGLRTLIDRRFPDMPFFGRDSTEAGARIMRHLGFFPFDDAPHLFWRCCQLMEPVS
Ga0193752_109179723300020027SoilMWGSGSFDAHVALQRLGGDRQHQRYLPGAAQAMMASVKLHHAESDQDFADVHALATAELGPGLASLEEIKRVDALTDAAIWVVRRHGAISGFLAPLALTKAGRDALVSGEFDAANIEERWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDHHFTDLPFFGKDSTEAGARIMRHLGFFPFASTPHLFWRCASVMEEKAA
Ga0211729_1144050823300020172FreshwaterMGNSATSLLRSLGGERLHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0210377_1001665943300021090Groundwater SedimentMKWGSGTFDAHFALQSLGGDRLHAKYQPTASEAMMASVRLLRPEGEQDFADVHALATAELGPGLASLEEIERVDGLTGAAIWVVKRHGAITGFLAPLALTKAGRDALVEGEFDASNIHQGWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDRHFPDLPFFGRDATEAGARIMRHLGFAPFDGTPHLFWRCCSVMDEAT
Ga0247738_1001375323300023100Plant LitterMLMVLDDARDALRRLGGERLHPRYMPGASELMMTSVKLHHATSEQDFEDVHALATSELGSGLASLAEIKRVDGLTDAAIWVVRRNDQVTGFLAPLALTSAGVAALTDGTFDAANIDQKWVARMGQPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDTTEAGAKIMRHLGFFPFDGVQHLFWRCRSVMEAA
Ga0244777_1079959213300024343EstuarineMGNSATSLLRSLGGERLHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTE
Ga0256347_102936023300024545FreshwaterVMLMSFDDARTQLQSLGGERLHPKYMRGASEEMMASVRLEHAVTDEDYVAVHALATAELGPGLASLEEIKRVDDLTGASIWVIRRKGEVTGFLAPLALTSAGVKALVDSTFDAANIRQEWVVRVGEPLAGFYCWCYAGKGQVSRGALVLGLRTLIDRHFPDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWSCCSLMPEIAA
Ga0255286_102816023300024858FreshwaterSFDDARTQLQSLGGERLHPKYMRGASEEMMASVRLEHAVTDEDYVAVHALATAELGPRLASLEEIKRVDDLTGASIWVIRRKGEVTGFLAPLALTSTGVKALVDNTFDAANIRQEWVVRVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWRCCSLMQEIAA
Ga0208492_100928723300025526Serpentinite Rock And FluidMLMSFDDARTRLQSLGGERLHPKYMSGASEEMMASVRLEHAVSEDDYAAVHALATAELGPGLASLEEIKRVDQLTGASIWVIRRKGEVTGFLAPLALSAAGVKALVDNTFDAANIRAEWVVRVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFETLPFFGRDSTEAGARIMAHLGFFPFDSTPHLYWRCCSLMQEIAA
Ga0208939_1000119413300025772Anaerobic Digestor SludgeMAALMWNSRTQLRSLGGERLHLKYMAGASEEMMASVRLTHAVSDEDYAAVHALATAELGPGLASLQEIKRVDSLTGASIWVIRRNGAVTGFLAPLALTEAGVAALADNTFDAANIDQKWVARLGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDKHFTDLPFFGRDSTEAGARIMHHLGFFPFDDTPHLFWRCCSLMEPVAC
Ga0208939_1000386343300025772Anaerobic Digestor SludgeMVMVLDDARSALRSLGGERLHPRYMPGASEAMMASVKLHHAQSEQDFADVHALAVAELGPGLASLAEIKRVDSLTNAAIWVIRRNDTVTGFLAPLALTAAGVAALTDGTFNAARIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA
Ga0208939_103997133300025772Anaerobic Digestor SludgeMVMVLDDARSALRSLGGERLHPRYMSGASEAMMASVKLHHATSEQDFADVHALAVAELGPGLASLAEIKRVDALTNAAIWVVRRNGAVTGFLAPLALTSAGVAALVDGSFNAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA
Ga0208822_101781533300025866Anaerobic Digestor SludgeMVLDDARSALRSLGGERLHPRYMSGASEAMMASVKLHHATSEQDFADVHALAVAELGPGLASLAEIKRVDALTNAAIWVVRRNGAVTGFLAPLALTSAGVAALVDGSFNAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA
Ga0209311_100144023300025871Anaerobic Digestor SludgeMVMVLEDARAALRRLGGNRLHPRYTHGASELMMASVKLHHAVTDQDFADVHALATAELGPDLASLAEIKRVDRLTDAAIWVVRRNETVTGFLAPLALTAEGVAALTDGAFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDTTEAGAKIMRHLGFFPFDGVDHLYWRCRTVMDDAA
Ga0209311_100442893300025871Anaerobic Digestor SludgeMAEAPPSSLWSKVSKRLKKNGGSVMVMVLDDARSALRSLGGERLHPRYMPGASEAMMASVKLHHAQSEQDFADVHALAVAELGPGLASLAEIKRVDSLTNAAIWVIRRNDTVTGFLAPLALTAAGVAALTDGTFNAARIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLALRTLIDKHFPDLPFFGKDSTAAGAKIMRHLGFFPFDGVEHLFWRCRTVMDAA
Ga0209873_103091913300027091SandEQMMASVRLTHAASDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA
Ga0255182_101250113300027503FreshwaterMLMSFDDARTQLQSLGGERLHPKYMRGASEEMMASVRLEHAVTDEDYVAVHALATAELGPGLASLEEIKRVDDLTGASIWVIRRKGEVTGFLAPLALTSAGVKALVDSTFDAANIRQEWVVRVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWRCCSLMQEIAA
Ga0209135_108041523300027642Freshwater LakeHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0209033_122739813300027697Freshwater LakeHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEM
(restricted) Ga0247836_10000501593300027728FreshwaterMGDNATSLLRSLGGARLHAQYMRGASEAMMAAVRLEHAASNQDFAEVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEITGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGTDQVSRGALVLGLRSLIDHYFPDLPFFGRDSTEAGARIMRHLGFVPFDNTPHLFWRCASVMEMAA
Ga0209985_1003462923300027806Freshwater LakeMGDSATALLRSLGGERLHTQYMPGAAEAMMASVRLEHARTDQDFADVHALATAELGPGLASLEEIKRVDGLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAARIDQKWVARMGEPLAGFYCWCYAGNDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA
Ga0209181_1013924433300027878FreshwaterMAALIWNARTQLKSLGGERLHAKYMPGASEDMMASVRLTHAVSDEDYLAVHALATAELGPGLASLEEIKRVDNLTGASIWVIRRNDAVTGFLAPLALSSVGVAALADNTFDAANIDAKWVARLGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDKHFTDLPFFGRDSTEAGARIMRHLGFFPFDDTPHLFWRCCTLMEPIS
Ga0209550_1004835933300027892Freshwater LakeMGDSATSLLRSLGGERLHTKYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0209298_1005129223300027973Freshwater LakeMTENARTLLASLGAERVHPKYTPGASEQMMATVRLTHAASDQDFEDVHALATRELGPGLASLAEIKRVDALTGASIWVIRRNHEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFQDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA
Ga0209299_103770513300027974Freshwater LakeMTENARTLLASLGAERVHPKYTPGASEQMMATVRLTHAASDQDFEDVHALATRELGPGLASLAEIKRVDALTGASIWVIRRNHEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFQDLPFFGRDSTEAGARIMRHLGFFPFDSTPHLFW
Ga0209705_1002909923300027979Freshwater SedimentMKWGSGTFDAQFALQSLGGDRLHAKYRPGASEEMMASVRLTRPESEQDFADVHSLAVNELGSGLASLEEIKRVDALTGAAIWTVKRHGAVTGFLAPLALTKAGRDALVEDTFDAARIEQSWVARMGQPLAGFYCWCYAGKDQVTRGALVLALRTLIDKHFPDLPFFGRDATEAGARIMRHLGFAPFDGTPHLFWRCCSVMDEAA
Ga0268299_118541813300028648Activated SludgeHLATEQLGPGLATAAEIQRVDAMTGAAIWVVRRHGEVAGFLAPLALTLAGRDAIIDGTFDATSIQRKWVAAMGAPLAGFYCWCYGGRDQVTRGALVLGLRTLIDRCFPDLPFFGKDTTEAGARIMRHLGFRNYGNQPHLFWRCAQMMETGKMESGQ
Ga0311336_1092048123300029990FenMIAQLVKSEGRFALDRLGGDRMHKLYAPVSSLAMMASVRLHRPSPSEFADVHAIASEQIGGALATLDEIRRVDALTDAAIWVVRRKGEVTGFLAPLALSEKGRDALVDGSFRAGAIEQAWVARMGEALAGFYCWSYAGRDQVTRGALVLALRTLIDRHFPDLPFFGRDTTAAGGRIMAHLGFAPFDDVAHLYWRCCSLMA
Ga0311349_1101465223300030294FenMIAQLVKSEGRFALDRLGGDRMHKLYAPVSSLAMMASVRLHRPSPVEFADVHAIASEQIGGALATLDEIRRVDALTDAAIWVVRRKGEVTGFLAPLALSEKGRDALVDGSFRAGAIEQAWVARMGEALAGFYCWSYAGRDQVTRGALVLALRTLIDRHFPDLPFFGRDTTAAGGRIMAHLGFAPFDDVAHLYWRCCSLMAPAAHAP
Ga0302323_10030618423300031232FenMIAQLVKSEGRFALDRLGGDRMHKLYAPVSSLAMMASVRLHRPSPSEFADVHAIASEQIGGALASLDEIRRVDALTDAAIWVVRRKGEVTGFLAPLALSEKGRDALVDGSFRAGAIEQAWVARMGEALAGFYCWSYAGRDQVTRGALVLALRTLIDRHFPDLPFFGRDTTAAGGRIMAHLGFAPFDDVAHLYWRCCSLMASAAHAPSPQVLDAAP
Ga0307505_1005048933300031455SoilMAALMWNSRTQLRSLGGERLHAQYMPGASEAMMASVRLTHAVSDEDYEAVHALASAELGPGLASLDEIKRVDGLTGASIWVIRRNAAVTGFLAPLALTAAGVEALADNTFDAANIDQAWVARIGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDTHFTDLPFFGRDSTEAGARIMRHLGFFPFDDTPHLFWRCCTLMEPISC
Ga0302321_10293958313300031726FenMMASVRLHRPSPAEFADVHAIASEQIGGALATLDEIRRVDALTDAAIWVVRRKGEVTGFLAPLALSETGRDALVDGSFRAGAIEQAWVARMGEALAGFYCWSYAGRDQVTRGALVLALRTLIDRHFPDLPFFGRDTTAAGGRIMAHLGFAPFDDVAHLYWRCCSLMAPAAHAPSPQVLDAAP
Ga0315907_1091774123300031758FreshwaterGGERLHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0315908_1035070223300031786FreshwaterMGNSATSLLRSLGGERLHTQYMHGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMA
Ga0302322_10037454833300031902FenLHRPTPGEFADVHAIASEQIGGALATLDEIRRVDALTDAAIWVVRRKGDVTGFLAPLALSEKGRDALVDGSFRAGAIEQAWVARMGEALAGFYCWSYAGRDQVTRGALVLALRTLIDRHFPDLPFFGRDTTAAGGRIMAHLGFAPFDDVAHLYWRCCSLMAPAAHAPSPQVLDAAP
Ga0315904_1016759333300031951FreshwaterVINGLNGRGAWGVVMRDNATSLLRSLGGERLHAQYMRGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRSLIDHYFLDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAS
Ga0315904_1024877433300031951FreshwaterASVRLEHARTDQDFADVHALATAELGPGLASLEEIKRVDGLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAARIDQKWVARMGEPLAGFYCWCYAGNDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA
Ga0315905_1036328023300032092FreshwaterMGDSATALLRSLGGERLHTQYMPGAAEAMMASVRLEHARTDQDFADVHALAIAELGPGLASLEEIKRVDGLTNASIWVIRRRGEITGFLAPLALTSAGVAALVDNTFNAARIDQKWVARMGEPLAGFYCWCYAGNDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFYPFDTTPHLFWRCASVMEMAA
Ga0315910_1011289323300032144SoilMRGTAAFDAGAALKGIGGERLHPKYAQGASHAMMASVKLVRPDSEQDYADIHSLAVAEIGPGLASLEEIKRVDGLTDAALWAVKRNGQVTGFLAPLALTIAGRDALVSGEFDAAHIQAKWVARLGQPLAGFYCWAYGGRDQVSRGALVLALRKLIDDHFPDLPFFGRDTTEAGARIMRHLGFPPFGATPHLYWRCCSVMDENAA
Ga0315910_1096398313300032144SoilSAGHQFVLKGYTDINERIRVMKWGTAAFDAHSALSAVGGERLHPKYAPGASHAMMASVKLVRPESEQDYADIHSLAVAELGPGLATLEEIKRVDALTDAALWAVKRHGQVTGFLAPLALTLAGRDALTSGTFVGAHIQAKWVARLGEPLAGFYCWCYAGKDQVTRGALVLALRKLIDTHFPDLPFFGRDTTEAGARIMRHLGFPPFDATPHLFWRCCSVMD
Ga0315910_1105013123300032144SoilAMMAGVRLEHAVTHEDYVAVHQLATSELGPTLASLEEIKRVDALTGASIWVIRRKGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRYFPDLPFFGRDSTEAGARIMRHLGFYPFDNTPHLFWRCASVMEMAA
Ga0315912_1093686923300032157SoilMMASVKLVRPESEQDYTDIHALAVAELGSGLATLPEIKRVDALTDAALWAVKRHGQITGFLAPLALTLAGRDALISGTFDAAHIQARWVARLGEPLAGFYCWCYAGKDQVTRGALVLALRKLIDIHFPDLPFFGRDTTEAGARIMRHLGFPPFDATPHLFWRCCSVMDENAA
Ga0307472_10120345623300032205Hardwood Forest SoilETDEEYARVHELATHELGKGLASLDEIKRIDALTHAAIWVVRRKGEVSGFLAPLALTLAGRDALIDDTFDAANIRQEWVARMGQPLAGFYCWCYAGKDQVSRGALVLGLRNLIDRYFPDLPFFGRDSTEAGARIMQHLGFYPFGATPHLFWRCCSVMDVAA
Ga0307472_10163827213300032205Hardwood Forest SoilGASEAMMASVKLLHPENDDEFAQVHAIASTELGPDLASLEEIQRVDRMTNAAIWVVRRHGAVSGFLAPLALTLAGRNALVEDEFDASNIERQWVARLGEPLAGFYCWCYAGRDQVTRGALVLALRTLIDTHFPDLPFFGRDSTEAGARIMRHLGFFPFDRVPHLFWRCCAVMGGKAA
Ga0326726_1094009713300033433Peat SoilMAAVKLFHPETDEEYARVHELATHELGKGLASLDEIKRVDALTNAAIWVVRRKGDVTGFLAPLALTLAGRDALIANTFDAANIRQEWVARMGEPIAGFYCWCYAGKDQVSRGALVLGLRNLIDRYFPDLPFFGRDSTEAGARIMQHLGFYPFGATPHLFWRCCSVMDVAA
Ga0334977_0004999_4215_48143300033978FreshwaterMGDNATSLLCSLGGERLHTQYMRGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRSLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0334982_0010704_3_4763300033981FreshwaterDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRSLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0335004_0222781_2_5143300034021FreshwaterMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRSLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0335019_0082722_1627_21633300034066FreshwaterMRGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRSLIDRYFPDLPFFGRDSTEAGARIMRHLGFFPFDNTPHLFWRCASVMEMAA
Ga0335053_0305984_556_9963300034118FreshwaterMRGASEAMMAAVRLEHAASDQDFADVHALATQELGPGLASLDEIKRVDALTGASIWVIRRRGEVTGFLAPLALTAAGVAALVDNTFDAANIDQKWVARMGEPLAGFYCWCYAGKDQVSRGALVLGLRSLIDRYFPDLPYFGRDSTEA
Ga0364933_147247_22_5403300034150SedimentMMASVKLFRPESEQDYADIHGLALAEIGPGLASLEEIKRVDALTDAALWAFKRGDQITGFLAPLALTIAGRDALVSGEFDAAHIQAKWVARLGQPLAGFYCWAYAGCDQVSRGALVLALRKLIDTHFPDLPFFGRDTTEAGARIMRHLGFPPFDATPHLYWRCCSVMDENAA
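
Each row of the listing above runs the protein ID, sample taxon ID, habitat and sequence together without separators. The following is a minimal sketch of one way to split such a row back into its four fields. It is not part of NMPFamsDB; the names `FamilyRow`, `ROW_RE` and `parse_family_row` are hypothetical, and the pattern rests on three assumptions drawn only from the rows shown here: every taxon ID is ten digits starting with `3300`, every habitat starts with an uppercase letter and ends with a lowercase one, and every sequence is a run of uppercase letters with an optional trailing `*`.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    """One row of the family sequence listing (hypothetical container)."""
    restricted: bool
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Assumptions (from the rows on this page only, not from any NMPFamsDB spec):
#   - an optional "(restricted)" prefix marks restricted sequences
#   - the taxon ID is "3300" followed by six digits
#   - the habitat starts uppercase and ends with a lowercase letter
#   - the sequence is uppercase letters, optionally ending in "*"
ROW_RE = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"
    r"(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z].*[a-z])"
    r"(?P<sequence>[A-Z]+\*?)$"
)


def parse_family_row(line: str) -> Optional[FamilyRow]:
    """Split one fused row into fields; return None if it does not match."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    return FamilyRow(
        restricted=match.group("restricted") is not None,
        protein_id=match.group("protein_id"),
        taxon_id=match.group("taxon_id"),
        habitat=match.group("habitat"),
        sequence=match.group("sequence"),
    )
```

Rows that break any of these assumptions return `None` rather than a guessed split, so unmatched lines can be reviewed by hand.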




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice