NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F041239


Overview

Basic Information
Family ID: F041239
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 160
Average Sequence Length: 94 residues
Representative Sequence: MSEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEITDVQTRMGRA
Number of Associated Samples: 148
Number of Associated Scaffolds: 160
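
As a quick sanity check, the length of the representative sequence can be compared with the reported family average. The following is a minimal Python sketch; the variable names are illustrative and are not part of NMPFamsDB.

```python
# Representative sequence copied from the Basic Information entry above.
representative = (
    "MSEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEITDVQTRMGRA"
)
average_length = 94  # reported average sequence length, in residues

rep_length = len(representative)
# Difference between the representative and the family average, in residues.
length_delta = rep_length - average_length
print(rep_length, length_delta)
```

The representative is one residue longer than the family average, which is unsurprising for a family in which a share of the genes may be truncated.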

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 46.88 %
% of genes near scaffold ends (potentially truncated): 31.25 %
% of genes from short scaffolds (< 2000 bp): 74.38 %
Associated GOLD sequencing projects: 142
AlphaFold2 3D model prediction: No
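
The percentages above can be turned back into approximate gene counts. This sketch assumes that each percentage is taken over all 160 family members; the page does not state the denominator, so treat the counts as an estimate. The function name `percent_to_count` is hypothetical.

```python
FAMILY_SIZE = 160  # "Number of Sequences" from Basic Information

# Percentages as reported in the Quality Assessment block.
reported_percentages = {
    "valid RBS motif": 46.88,
    "near scaffold end": 31.25,
    "short scaffold (< 2000 bp)": 74.38,
}


def percent_to_count(percent, total=FAMILY_SIZE):
    """Convert a rounded percentage back to the nearest whole count."""
    return round(percent * total / 100)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}
for label, count in counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly three quarters of the genes sit on scaffolds shorter than 2000 bp, which helps explain why so many scaffolds below lack a taxonomic assignment.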

Hidden Markov Model
[HMM logo rendered with Skylign; not reproduced here]

Most Common Taxonomy
Group: Bacteria (56.875 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (13.125 % of family members)
Environment Ontology (ENVO): Unclassified (33.750 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (31.875 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 47.37%, β-sheet: 0.00%, Coil/Unstructured: 52.63%
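
The reported fractions can be converted into residue counts. This sketch assumes the distribution was computed over the 95-residue representative sequence; the page does not say which sequence was used, but the percentages are consistent with that length.

```python
SEQUENCE_LENGTH = 95  # length of the representative sequence (assumed basis)

# Secondary structure distribution as reported, in percent.
reported = {"alpha_helix": 47.37, "beta_sheet": 0.00, "coil": 52.63}

# Nearest whole number of residues for each state.
residues = {
    state: round(pct * SEQUENCE_LENGTH / 100) for state, pct in reported.items()
}

# Recompute the percentages from the counts to confirm they round-trip.
recomputed = {state: f"{n / SEQUENCE_LENGTH:.2%}" for state, n in residues.items()}

print(residues)
print(recomputed)
```

Under this assumption the prediction corresponds to 45 helical residues and 50 coil or unstructured residues, with no predicted β-sheet.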



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 160 Family Scaffolds
PF02900 | LigB | 9.38
PF04454 | Linocin_M18 | 8.75
PF00313 | CSD | 6.88
PF13559 | DUF4129 | 5.00
PF07355 | GRDB | 3.12
PF01882 | DUF58 | 1.25
PF07726 | AAA_3 | 1.25
PF00487 | FA_desaturase | 1.25
PF02515 | CoA_transf_3 | 1.25
PF01979 | Amidohydro_1 | 0.62
PF00528 | BPD_transp_1 | 0.62
PF00462 | Glutaredoxin | 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 160 Family Scaffolds
COG1398 | Fatty-acid desaturase | Lipid transport and metabolism [I] | 1.25
COG1721 | Uncharacterized conserved protein, DUF58 family, contains vWF domain | Function unknown [S] | 1.25
COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 1.25
COG3239 | Fatty acid desaturase | Lipid transport and metabolism [I] | 1.25



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 56.88 %
Unclassified | root | N/A | 43.12 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002914|JGI25617J43924_10210632 | Not Available | 647
3300003994|Ga0055435_10055974 | Not Available | 963
3300003995|Ga0055438_10144084 | Not Available | 701
3300004009|Ga0055437_10005685 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2390
3300004052|Ga0055490_10155922 | Not Available | 673
3300004058|Ga0055498_10114026 | Not Available | 560
3300004062|Ga0055500_10012207 | Not Available | 1426
3300004114|Ga0062593_101078178 | Not Available | 831
3300004156|Ga0062589_100044236 | All Organisms → cellular organisms → Bacteria | 2427
3300004463|Ga0063356_100134861 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2775
3300004463|Ga0063356_102853894 | Not Available | 744
3300005205|Ga0068999_10112496 | Not Available | 551
3300005206|Ga0068995_10086930 | Not Available | 619
3300005294|Ga0065705_10527168 | Not Available | 757
3300005295|Ga0065707_10227578 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1201
3300005341|Ga0070691_10101373 | All Organisms → cellular organisms → Bacteria | 1430
3300005406|Ga0070703_10606496 | Not Available | 506
3300005440|Ga0070705_100260928 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1222
3300005444|Ga0070694_100015147 | All Organisms → cellular organisms → Bacteria | 4835
3300005445|Ga0070708_100105854 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2581
3300005445|Ga0070708_100691448 | Not Available | 960
3300005468|Ga0070707_100499208 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 1178
3300005546|Ga0070696_100128175 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1843
3300005549|Ga0070704_100021760 | All Organisms → cellular organisms → Bacteria | 4163
3300005549|Ga0070704_100210741 | All Organisms → cellular organisms → Bacteria | 1574
3300005876|Ga0075300_1000636 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2300
3300005878|Ga0075297_1009482 | All Organisms → cellular organisms → Bacteria | 926
3300005886|Ga0075286_1069160 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 515
3300006049|Ga0075417_10711412 | Not Available | 516
3300006354|Ga0075021_10002028 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 10518
3300006755|Ga0079222_10478442 | Not Available | 902
3300006845|Ga0075421_100661881 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1219
3300007255|Ga0099791_10001254 | All Organisms → cellular organisms → Bacteria | 9901
3300007265|Ga0099794_10138806 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1230
3300007265|Ga0099794_10394434 | Not Available | 723
3300009053|Ga0105095_10698241 | Not Available | 566
3300009078|Ga0105106_10928607 | Not Available | 619
3300009087|Ga0105107_10763776 | Not Available | 672
3300009088|Ga0099830_10014551 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 4941
3300009157|Ga0105092_10193062 | Not Available | 1137
3300010391|Ga0136847_13037600 | Not Available | 598
3300010400|Ga0134122_10056317 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3029
3300010401|Ga0134121_10215316 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1667
3300011119|Ga0105246_12599181 | Not Available | 500
3300011270|Ga0137391_11404075 | Not Available | 544
3300011395|Ga0137315_1060221 | Not Available | 546
3300011406|Ga0137454_1066914 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 598
3300011443|Ga0137457_1277622 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 573
3300012096|Ga0137389_10436900 | All Organisms → cellular organisms → Bacteria | 1120
3300012134|Ga0137330_1026595 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 726
3300012199|Ga0137383_10590278 | Not Available | 813
3300012200|Ga0137382_10849728 | Not Available | 658
3300012203|Ga0137399_11644229 | Not Available | 530
3300012208|Ga0137376_10628333 | Not Available | 928
3300012225|Ga0137434_1058057 | Not Available | 598
3300012226|Ga0137447_1059035 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 709
3300012360|Ga0137375_10410388 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1180
3300012685|Ga0137397_10010882 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 6334
3300012685|Ga0137397_10107860 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2045
3300012923|Ga0137359_10234874 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1636
3300012925|Ga0137419_10991784 | Not Available | 695
3300012929|Ga0137404_10164416 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1856
3300012931|Ga0153915_12331793 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 626
3300012931|Ga0153915_12451374 | Not Available | 610
3300013306|Ga0163162_10949939 | Not Available | 971
3300014318|Ga0075351_1084794 | Not Available | 656
3300014873|Ga0180066_1009010 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1677
3300014884|Ga0180104_1003037 | All Organisms → cellular organisms → Bacteria | 3527
3300014885|Ga0180063_1050047 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1202
3300015254|Ga0180089_1037266 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 933
3300015371|Ga0132258_10871779 | All Organisms → cellular organisms → Bacteria | 2272
3300017927|Ga0187824_10040671 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1418
3300017930|Ga0187825_10386743 | Not Available | 535
3300017997|Ga0184610_1005781 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2943
3300018000|Ga0184604_10322150 | Not Available | 547
3300018028|Ga0184608_10077324 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1358
3300018031|Ga0184634_10011004 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3220
3300018052|Ga0184638_1149832 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 842
3300018054|Ga0184621_10347135 | Not Available | 521
3300018061|Ga0184619_10071820 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1530
3300018066|Ga0184617_1098922 | Not Available | 813
3300018071|Ga0184618_10001406 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 5925
3300018074|Ga0184640_10025263 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2311
3300018077|Ga0184633_10125892 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1327
3300018084|Ga0184629_10043639 | All Organisms → cellular organisms → Bacteria | 2012
3300018422|Ga0190265_10010192 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 7011
3300018422|Ga0190265_10016686 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 5730
3300018422|Ga0190265_10281544 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1725
3300018429|Ga0190272_10330439 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1201
3300019360|Ga0187894_10102182 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1531
3300019458|Ga0187892_10013838 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 9191
3300019789|Ga0137408_1262244 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1260
3300019879|Ga0193723_1017148 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2239
3300019889|Ga0193743_1032846 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2461
3300019997|Ga0193711_1021280 | Not Available | 822
3300019998|Ga0193710_1007085 | Not Available | 1127
3300020002|Ga0193730_1042725 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1312
3300020060|Ga0193717_1133426 | Not Available | 749
3300020067|Ga0180109_1258571 | Not Available | 765
3300020068|Ga0184649_1458084 | Not Available | 764
3300020199|Ga0179592_10212055 | Not Available | 876
3300021086|Ga0179596_10102416 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1303
3300021432|Ga0210384_10961564 | Not Available | 755
3300022534|Ga0224452_1113679 | Not Available | 830
3300022534|Ga0224452_1144517 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 732
3300025324|Ga0209640_10038297 | All Organisms → cellular organisms → Bacteria | 4180
3300025538|Ga0210132_1023769 | Not Available | 860
3300025885|Ga0207653_10236221 | Not Available | 697
3300025907|Ga0207645_10296733 | Not Available | 1076
3300025910|Ga0207684_10102574 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2446
3300025917|Ga0207660_10213667 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1511
3300025922|Ga0207646_10291638 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1474
3300025942|Ga0207689_10011713 | All Organisms → cellular organisms → Bacteria | 7515
3300025957|Ga0210089_1018921 | Not Available | 797
3300025962|Ga0210070_1032886 | Not Available | 621
3300025971|Ga0210102_1145276 | Not Available | 508
3300026005|Ga0208285_1003321 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1030
3300026048|Ga0208915_1022431 | Not Available | 578
3300026285|Ga0209438_1008923 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 3386
3300026320|Ga0209131_1036668 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2840
3300026333|Ga0209158_1210991 | Not Available | 674
3300026340|Ga0257162_1015248 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 912
3300026351|Ga0257170_1000061 | All Organisms → cellular organisms → Bacteria | 5579
3300026360|Ga0257173_1005684 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1288
3300026376|Ga0257167_1021706 | All Organisms → cellular organisms → Bacteria | 924
3300026377|Ga0257171_1002746 | All Organisms → cellular organisms → Bacteria | 2597
3300026507|Ga0257165_1062876 | Not Available | 674
3300026508|Ga0257161_1005568 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 2177
3300026515|Ga0257158_1005016 | All Organisms → cellular organisms → Bacteria | 1830
3300027384|Ga0209854_1041633 | Not Available | 782
3300027815|Ga0209726_10444098 | Not Available | 594
3300027846|Ga0209180_10001870 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 10540
3300027909|Ga0209382_10566238 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1242
3300028536|Ga0137415_10161437 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2075
3300028792|Ga0307504_10413943 | Not Available | 533
3300028812|Ga0247825_11261309 | Not Available | 540
3300028885|Ga0307304_10139785 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1001
3300030006|Ga0299907_10301627 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1306
3300030620|Ga0302046_10122172 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2130
(restricted) 3300031150|Ga0255311_1003555 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2989
(restricted) 3300031150|Ga0255311_1028535 | Not Available | 1161
(restricted) 3300031197|Ga0255310_10241200 | Not Available | 511
(restricted) 3300031248|Ga0255312_1065305 | Not Available | 876
3300031716|Ga0310813_10530666 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1032
3300031720|Ga0307469_10129361 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1845
3300031720|Ga0307469_10172315 | All Organisms → cellular organisms → Bacteria | 1652
3300031740|Ga0307468_100317618 | Not Available | 1142
3300031820|Ga0307473_10267696 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1057
3300033233|Ga0334722_10343607 | Not Available | 1083
3300033412|Ga0310810_10004580 | All Organisms → cellular organisms → Bacteria | 15324
3300033432|Ga0326729_1002454 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3823
3300033432|Ga0326729_1044836 | Not Available | 685
3300033433|Ga0326726_10346373 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1401
3300033486|Ga0316624_11586964 | Not Available | 603
3300033502|Ga0326731_1101742 | Not Available | 661
3300033513|Ga0316628_100101113 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3232
3300033550|Ga0247829_11555152 | Not Available | 546
3300034817|Ga0373948_0001947 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2971
3300034820|Ga0373959_0108412 | Not Available | 668
3300034894|Ga0373916_0069824 | Not Available | 537

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
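
Each scaffold identifier in the table pairs a numeric prefix with a scaffold name, separated by `|`. The prefix matches the Taxon OID values listed under Associated Samples (for example `3300002914`), so identifiers can be grouped by sample. A minimal sketch, assuming that reading of the identifier; the helper name `split_scaffold_id` is hypothetical and only three identifiers from the table are used.

```python
from collections import Counter


def split_scaffold_id(identifier):
    """Split 'TaxonOID|ScaffoldName' into its two parts."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# A few identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300002914|JGI25617J43924_10210632",
    "3300004463|Ga0063356_100134861",
    "3300004463|Ga0063356_102853894",
]

# Count how many family scaffolds come from each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
for taxon_oid, n in per_sample.items():
    print(taxon_oid, n)
```

Applied to the full table, the same grouping would show which samples contribute more than one scaffold, which is why the family has 160 scaffolds but only 148 associated samples.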




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 13.12%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 10.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.12%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 7.50%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 7.50%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 6.25%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.62%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.50%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.50%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 2.50%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 2.50%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 2.50%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.88%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.88%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.25%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 1.25%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.25%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.25%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.25%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.25%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.25%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.25%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.25%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 1.25%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.25%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 1.25%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.62%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.62%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.62%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.62%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.62%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.62%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.62%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.62%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.62%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.62%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.62%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.62%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.62%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.62%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.62%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.62%
Sediment Slurry | Engineered → Bioremediation → Metal → Unclassified → Unclassified → Sediment Slurry | 0.62%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300003994Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300003995Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2EnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004052Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004058Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005205Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2EnvironmentalOpen in IMG/M
3300005206Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005876Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401EnvironmentalOpen in IMG/M
3300005878Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104EnvironmentalOpen in IMG/M
3300005886Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009053Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009078Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015EnvironmentalOpen in IMG/M
3300009087Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011395Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT200_2EnvironmentalOpen in IMG/M
3300011406Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT539_2EnvironmentalOpen in IMG/M
3300011443Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012134Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT142_2EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012226Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2EnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014318Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rdEnvironmentalOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015254Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10DEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2EnvironmentalOpen in IMG/M
3300019997Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2EnvironmentalOpen in IMG/M
3300020067Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT47_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020068Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025538Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025957Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025962Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025971Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026005Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 (SPAdes)EnvironmentalOpen in IMG/M
3300026048Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300027384Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033502Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fractionEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M
3300034820Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2Host-AssociatedOpen in IMG/M
3300034894Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - A1A0.2EngineeredOpen in IMG/M

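The sample rows above run their columns together with no delimiter. As a minimal sketch, assuming each row is a 10-digit taxon ID, an optional " (restricted)" marker, the sample name, one of three ecosystem categories, and the link text "Open in IMG/M", a row can be split as follows. The function name `parse_sample_row` and the field names are hypothetical and not part of NMPFamsDB.

```python
import re

# Assumed row shape (inferred from the listing, not from any NMPFamsDB spec):
# <10-digit taxon ID>[ (restricted)]<sample name><category>Open in IMG/M
SAMPLE_ROW = re.compile(
    r"^(?P<taxon_id>\d{10})"
    r"(?P<restricted> \(restricted\))?"
    r"(?P<name>.+?)"
    r"(?P<category>Environmental|Host-Associated|Engineered)"
    r"Open in IMG/M$"
)


def parse_sample_row(line):
    """Split one fused sample row into fields; return None if it does not match."""
    match = SAMPLE_ROW.match(line.strip())
    if match is None:
        return None
    return {
        "taxon_id": match.group("taxon_id"),
        "restricted": match.group("restricted") is not None,
        "name": match.group("name").strip(),
        "category": match.group("category"),
    }


row = parse_sample_row(
    "3300004463Combined assembly of Arabidopsis thaliana microbial "
    "communitiesHost-AssociatedOpen in IMG/M"
)
print(row)
```

Because the category must sit directly before the trailing link text, a sample name that happens to contain the word "Environmental" would still be parsed correctly. Lines that do not follow the assumed shape return None rather than a partial result.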
Geographical Distribution
[Interactive map, powered by OpenStreetMap]


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
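
The rows of the table that follows fuse four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) into one string. The sketch below shows one way to separate them. It assumes that the taxon ID is a 10-digit number starting with "3300", that the habitat is title-case text ending in a lowercase letter, and that the sequence is an uppercase residue string. It also assumes a trailing "*" marks a translated stop codon, which the page itself does not state. The function name `parse_sequence_row` and the field names are hypothetical.

```python
import re

# Assumed row shape (inferred from the listing, not documented by NMPFamsDB):
# <protein ID><taxon ID: 3300 + 6 digits><Habitat In Title Case><SEQUENCE>[*]
SEQUENCE_ROW = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z][A-Za-z ,\-]*[a-z])"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<stop>\*)?$"
)


def parse_sequence_row(line):
    """Split one fused sequence row into fields; return None if it does not match."""
    match = SEQUENCE_ROW.match(line.strip())
    if match is None:
        return None
    return {
        "protein_id": match.group("protein_id"),
        "taxon_id": match.group("taxon_id"),
        "habitat": match.group("habitat"),
        "sequence": match.group("sequence"),
        # Assumption: "*" denotes a stop codon, i.e. an intact C-terminus.
        "has_stop": match.group("stop") is not None,
    }


example = (
    "Ga0137457_127762213300011443Soil"
    "KACGASLECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRV*"
)
record = parse_sequence_row(example)
print(record["protein_id"], record["taxon_id"], record["habitat"], len(record["sequence"]))
```

The habitat pattern is greedy and must end on a lowercase letter, so the split between habitat and sequence falls after the last lowercase character in the row. This relies on sequences containing only uppercase residue codes.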

Protein ID Sample Taxon ID Habitat Sequence
JGI25617J43924_1021063223300002914Grasslands SoilMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQXILSHMGARADEIVDVQTRMGRI*
Ga0055435_1005597423300003994Natural And Restored WetlandsMPEPGLGLSALWEERDYKACGASRECRGRFILGKLFDRDTRKLQSAWLAAFGTRVLGPPEIRTSEITKSQWQLILTHMGASPDEIADVQARMSRAGR*
Ga0055438_1014408413300003995Natural And Restored WetlandsLWEERDYKACGASRECRGRFILGKLFDRDTRKLQSAWLGAFGTRVLGPPEIRTSEITKSQWQLILTHMGASPDEIADVQAR
Ga0055437_1000568533300004009Natural And Restored WetlandsLWEERDYKACGASRECRGRFILGKLFDRDTRKLQSAWLAAFGTRVLGPPEIRTSEITKSQWQLILTHMGASPDEIADVQARMSRAGR*
Ga0055490_1015592213300004052Natural And Restored WetlandsMAEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLATFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIADVQTRMGRI*
Ga0055498_1011402623300004058Natural And Restored WetlandsLQPVTMPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLAAFGTRVLGPPEIRTDEITKAQWEVILSHMGASPDEIADVQTRMGRL*
Ga0055500_1001220723300004062Natural And Restored WetlandsMPEPGLGLSALWEERDYKACGVSRECRGRFILGKLFDRDTRKLQGAWLAAFGTRVLGPPEIRTSEITKSQWQLILSHMGASPDEIADVQARMSRAGR*
Ga0062593_10107817813300004114SoilLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEI
Ga0062589_10004423663300004156SoilMPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGRA*
Ga0063356_10013486133300004463Arabidopsis Thaliana RhizosphereMPEPGLGLSALWEERDYKACGGSRECRGRFILGKLFDRNTRLLQGAWLAAFGTRVLGPPEVRTAEITKAQWQVILSHMGASPGEIADVQARMGRT*
Ga0063356_10285389423300004463Arabidopsis Thaliana RhizosphereMPQPGLGLSALWEERDYKACGASKECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKAQWGMILSHMGASPDEITDVQTRMGRI*
Ga0068999_1011249613300005205Natural And Restored WetlandsMPEPGLGLSALWEERDYKACGVSRECRGRFILGKLFDRDTRKLQGAWLAAFGTRVLGPPEIRTSEITKSQWQLILSHMGASPDEIADVQARMSRA
Ga0068995_1008693013300005206Natural And Restored WetlandsMPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLAAFGTRVLGPPEIRTDEITKAQWEVILSHMGASPDEIADVQTRMGRL*
Ga0065705_1052716823300005294Switchgrass RhizosphereMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0065707_1022757833300005295Switchgrass RhizosphereMVRRRGALADSMSATGLGLSALWEERDYKACGGSMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMG
Ga0070691_1010137313300005341Corn, Switchgrass And Miscanthus RhizosphereDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAVFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT*
Ga0070703_1060649613300005406Corn, Switchgrass And Miscanthus RhizosphereGTRGRPGPEGLGRSDVEGMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN*
Ga0070705_10026092833300005440Corn, Switchgrass And Miscanthus RhizosphereMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN*
Ga0070694_10001514773300005444Corn, Switchgrass And Miscanthus RhizosphereMREAGLGLTVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT*
Ga0070708_10010585443300005445Corn, Switchgrass And Miscanthus RhizosphereCGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI*
Ga0070708_10069144823300005445Corn, Switchgrass And Miscanthus RhizosphereMPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRHTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGLAGVMRGGSPASGQR*
Ga0070707_10049920813300005468Corn, Switchgrass And Miscanthus RhizosphereMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI*
Ga0070696_10012817513300005546Corn, Switchgrass And Miscanthus RhizosphereMRESGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARPDEIVDVQTRMGRI*
Ga0070704_10002176053300005549Corn, Switchgrass And Miscanthus RhizosphereMVRRRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0070704_10021074123300005549Corn, Switchgrass And Miscanthus RhizosphereMREAGLGLTVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAVFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT*
Ga0075300_100063633300005876Rice Paddy SoilMREAGLGLSVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT*
Ga0075297_100948233300005878Rice Paddy SoilPGPQGLGPAARASMREAGLGLSVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT*
Ga0075286_106916023300005886Rice Paddy SoilARASMREAGLGLTVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT*
Ga0075417_1071141223300006049Populus RhizosphereMVRRRGALADSMSATGLGLSALWEERDYKACGGSMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0075021_10002028113300006354WatershedsMRETGLGLEVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI*
Ga0079222_1047844223300006755Agricultural SoilMREPGLGLSVWWDEREYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRLQWGLILSHMGASAGEIADVQARMGRP*
Ga0075421_10066188123300006845Populus RhizosphereMAERGLGLRALWEDRDYKACGTSMECRGRFILGKLFDRDTRMLQVAWLAAFGTRVLGPPEIRTAEITKSQWQVILSHMGASPDEIADVQARMSRA*
Ga0099791_1000125493300007255Vadose Zone SoilMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRN*
Ga0099794_1013880623300007265Vadose Zone SoilMPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA*
Ga0099794_1039443423300007265Vadose Zone SoilMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARA
Ga0105095_1069824113300009053Freshwater SedimentLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLATFGTRVLGPPEIRTDEITKAQWQVILSHMGASADEIVDVQTRMGRT*
Ga0105106_1092860723300009078Freshwater SedimentDYKACGASMECRGRFILGKLFDRDTRRLKAAWLATFGTRVLGPPEIRTDEITKAQWQVILSHMGASADEIVDVQTRMGRT*
Ga0105107_1076377623300009087Freshwater SedimentMPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLATFGTRVLGPPEIRTDEITKAQWQVILSHMGASADEIVDVQTRMGRT*
Ga0099830_1001455173300009088Vadose Zone SoilMPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWEVLLSHMGASPEEIADVQTRMSRA*
Ga0105092_1019306223300009157Freshwater SedimentMPEPGLGLSALWEERDYKACGASMECRGRFILDKLFDRDTRRLKGAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGASPDEIADVQTRMGRS*
Ga0136847_1303760023300010391Freshwater SedimentMPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRTLKAAWLAAFGTRVLGPPEIRTDEITKAQWQMILSHMGASPDEIADVQTRMGRS*
Ga0134122_1005631773300010400Terrestrial SoilMAEPGLGLSALWEERDYKACGGSRECRGRFILGKLFDRDTRLLKGAWLAAFGTRVLGPPEVRTAEITKAQWQVILSHMGASADEITDVQARMGRV*
Ga0134121_1021531643300010401Terrestrial SoilMREPGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRS*
Ga0105246_1259918123300011119Miscanthus RhizosphereMPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGATPDEIT
Ga0137391_1140407513300011270Vadose Zone SoilMSEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEITDVQTRMGRA*
Ga0137315_106022123300011395SoilMPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMGRI*
Ga0137454_106691423300011406SoilMPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKAQWGMILSHMGASPDEIADVQTRMSRV*
Ga0137457_127762213300011443SoilKACGASLECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRV*
Ga0137389_1043690023300012096Vadose Zone SoilMRETGLGLDVWWGEREYRACGASLACRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRN*
Ga0137330_102659523300012134SoilMPETGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA*
Ga0137383_1059027813300012199Vadose Zone SoilEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI*
Ga0137382_1084972823300012200Vadose Zone SoilMREPGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRLLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRT*
Ga0137399_1164422923300012203Vadose Zone SoilMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKLQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0137376_1062833323300012208Vadose Zone SoilMREPGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIV
Ga0137434_105805713300012225SoilMPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADRIVERADGVPLF
Ga0137447_105903513300012226SoilMPETGLGLRALWEERDYKACGTSMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRT*
Ga0137375_1041038813300012360Vadose Zone SoilMAARGLGLSALWEERDYKACGGSMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEITDVQTRMGRA*
Ga0137397_1001088273300012685Vadose Zone SoilMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0137397_1010786043300012685Vadose Zone SoilMPARGLGLSALWDERDYKACGVSIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGRA*
Ga0137359_1023487413300012923Vadose Zone SoilMREPGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI*
Ga0137419_1099178423300012925Vadose Zone SoilTGLGLSALWEERDYKACGAAMEGRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKLQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0137404_1016441613300012929Vadose Zone SoilMPARGLGLSALWDERDYKACGVSIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITD
Ga0153915_1233179333300012931Freshwater WetlandsGMREPGLGLDVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPHEVADVQARMGRT*
Ga0153915_124513741 | 3300012931 | Freshwater Wetlands | MREPGLGLDVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSRMGASPREVADVQARMGRT*
Ga0163162_109499391 | 3300013306 | Switchgrass Rhizosphere | MPAQGLGLSALWDERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA*
Ga0075351_10847942 | 3300014318 | Natural And Restored Wetlands | GLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLAAFGTRVLGPPEIRTDEITKAQWEVILSHMGASPDEIADVQTRMGRL*
Ga0180066_10090104 | 3300014873 | Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTQMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA*
Ga0180104_10030372 | 3300014884 | Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRI*
Ga0180063_10500472 | 3300014885 | Soil | MPERGLGLRALWEERDYKACGGSMECRGRFILGKLFDRDTRMLQVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA*
Ga0180089_10372662 | 3300015254 | Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTQMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMTRA*
Ga0132258_108717791 | 3300015371 | Arabidopsis Rhizosphere | MQEPGLGVSVWWDERDYRACGASLECRGRFIVGKLFDLDTRLLQAAWLAAFGRRLLGPPEIQAHDITRLQWGLILSHMGASAGEIADVQARMGRP*
Ga0187824_100406713 | 3300017927 | Freshwater Sediment | MREPGLGLSVWWDEREYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRLQWGLILSHMGASAGEIADVQARMGRA
Ga0187825_103867431 | 3300017930 | Freshwater Sediment | PQGLGATAAAGMREPGLGLSVWWDEREYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRLHWGLILSHMGASAGEIADVQARMGRA
Ga0184610_10057813 | 3300017997 | Groundwater Sediment | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRV
Ga0184604_103221501 | 3300018000 | Groundwater Sediment | MVRRRGALVDSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLEVFGTRVLGPPEIRTDEITKAQWQMILFHMGASPDEIADVQTRM
Ga0184608_100773242 | 3300018028 | Groundwater Sediment | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHIGASPDEIADVQTRMSRV
Ga0184634_100110047 | 3300018031 | Groundwater Sediment | MPETGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQIRMSRA
Ga0184638_11498322 | 3300018052 | Groundwater Sediment | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA
Ga0184621_103471351 | 3300018054 | Groundwater Sediment | GALAGSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIVDVQTRMGRA
Ga0184619_100718202 | 3300018061 | Groundwater Sediment | MVRRRGALADSMSAPGLGLSALWEERDYKGCGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA
Ga0184617_10989222 | 3300018066 | Groundwater Sediment | MVRRRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGTSPDEIIDVQTRMGRA
Ga0184618_100014062 | 3300018071 | Groundwater Sediment | MVRRRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIVDVQTRMGRA
Ga0184640_100252632 | 3300018074 | Groundwater Sediment | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRT
Ga0184633_101258922 | 3300018077 | Groundwater Sediment | MPELGLGLRALWEERDYKACGASIECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTTEITKSQWQVLLSHMGASPDEIADVQTRMSRA
Ga0184629_100436391 | 3300018084 | Groundwater Sediment | MPELGLGLRALWEERDYKACGTSMECRGRFILGKLFDRDTQMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGAS
Ga0190265_100101927 | 3300018422 | Soil | MPEPGLGLSALWEERDYKACGGSIECRGRFILSKLFDRDTRMLQAAWLAAFGTRVLGPPEIRTAQITKSQWQVILSRMGASPREIADVQTRMSRA
Ga0190265_100166862 | 3300018422 | Soil | MPDRGLGLSALWEERDYKACGVSMECRGRFILGKLFDRDTPMLKAAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGASPDEIADVQIRMGRA
Ga0190265_102815442 | 3300018422 | Soil | MPERELGLSALWEERDYKACGASMECRGRFILGKLFDRDTPMLKAAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGASPDEITDVQTRMGRM
Ga0190272_103304391 | 3300018429 | Soil | MVRRRDALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIVDVQTRMGRA
Ga0187894_101021822 | 3300019360 | Microbial Mat On Rocks | MAERGLGLRALWEDRDYKACGTSRECRGRFILGKLFDRDTRMLQVAWLAAFGSRVLGPPEVRTAEITKSQWQVILSHMGASPDEIADVQARMSRT
Ga0187892_100138382 | 3300019458 | Bio-Ooze | MAERGLGLRALWEDRDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVILSHMGASPDEIADVQVRMSRT
Ga0137408_12622441 | 3300019789 | Vadose Zone Soil | MVRPRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKLQWQVILSHMGASPDEIIDVQTRMGRA
Ga0193723_10171482 | 3300019879 | Soil | MPARGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGLGVMRRGSPASGQR
Ga0193743_10328465 | 3300019889 | Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRV
Ga0193711_10212802 | 3300019997 | Soil | LSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGLA
Ga0193710_10070851 | 3300019998 | Soil | LSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGLAGVMRGGSPASGQR
Ga0193730_10427251 | 3300020002 | Soil | MREPGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRP
Ga0193717_11334262 | 3300020060 | Soil | MPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKAQWQVLLSHMGASPDEITDVQ
Ga0180109_12585712 | 3300020067 | Groundwater Sediment | LRALWEERDYKACGGSMECRGRFILGKLFDRDTRMLQVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA
Ga0184649_14580842 | 3300020068 | Groundwater Sediment | LRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQIRMSRA
Ga0179592_102120552 | 3300020199 | Vadose Zone Soil | MRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRN
Ga0179596_101024161 | 3300021086 | Vadose Zone Soil | ARGRPGPEGLGRSDVERMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI
Ga0210384_109615642 | 3300021432 | Soil | MRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMG
Ga0224452_11136791 | 3300022534 | Groundwater Sediment | MVRRRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIIDVQTRMGRA
Ga0224452_11445171 | 3300022534 | Groundwater Sediment | GLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRV
Ga0209640_100382972 | 3300025324 | Soil | LRALWEERDYKACGGSMECRGRFILGKLFDRDTRMLQVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRT
Ga0210132_10237692 | 3300025538 | Natural And Restored Wetlands | MPEPGLGLSALWEERDYKACGASRECRGRFILGKLFDRDTRKLQSAWLAAFGTRVLGPPEIRTSEITKSQWQLILTHMGASPDEIADVQARMSRAGR
Ga0207653_102362211 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | MRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRI
Ga0207645_102967331 | 3300025907 | Miscanthus Rhizosphere | MPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGR
Ga0207684_101025741 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | RETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN
Ga0207660_102136671 | 3300025917 | Corn Rhizosphere | MPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKAQWGMILSHMGASPDEITDVQTRMGRT
Ga0207646_102916381 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | TGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN
Ga0207689_100117134 | 3300025942 | Miscanthus Rhizosphere | MPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGRA
Ga0210089_10189211 | 3300025957 | Natural And Restored Wetlands | ACGASMECRGRFILGKLFDRDTRRLKAAWLAAFGTRVLGPPEIRTDEITKAQWEVILSHMGASPDEIADVQTRMGRL
Ga0210070_10328862 | 3300025962 | Natural And Restored Wetlands | MPEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLAAFGTRVLGPPEIRTDEITKAQWEVILSHMGASPDEIADVQTRMGRL
Ga0210102_11452761 | 3300025971 | Natural And Restored Wetlands | MAEPGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRRLKAAWLATFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIADVQTRMGRI
Ga0208285_10033212 | 3300026005 | Rice Paddy Soil | MRETGLGLSVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT
Ga0208915_10224312 | 3300026048 | Natural And Restored Wetlands | MPEPGLGLSALWEERDYKACGVSRECRGRFILGKLFDRDTRKLQGAWLAAFGTRVLGPPEIRTSEITKSQWQLILSHMGASPDEIADVQARMSRAGR
Ga0209438_10089236 | 3300026285 | Grasslands Soil | MPARGLGLSALWDERDYKACGVSIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGRA
Ga0209131_10366683 | 3300026320 | Grasslands Soil | MPARGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGRA
Ga0209158_12109911 | 3300026333 | Soil | MRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVD
Ga0257162_10152481 | 3300026340 | Soil | RACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN
Ga0257170_10000618 | 3300026351 | Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPEEIADVQTRMSRA
Ga0257173_10056841 | 3300026360 | Soil | PELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA
Ga0257167_10217061 | 3300026376 | Soil | GARGRPGPEGLGRSDVERMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN
Ga0257171_10027463 | 3300026377 | Soil | MRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRN
Ga0257165_10628762 | 3300026507 | Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMS
Ga0257161_10055682 | 3300026508 | Soil | MREPGLGLDVWWGEREYKACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRS
Ga0257158_10050163 | 3300026515 | Soil | MREPGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARADEIVDVQTRMGRT
Ga0209854_10416332 | 3300027384 | Groundwater Sand | MPAQGLGLSALWEQRDYKACGGSMECRGRFILGKLFDRDTRRFQAAWLVAFGTRVLGPPEVQTDEITKSQWQVILSHMGASPDEITDVQARMGRA
Ga0209726_104440981 | 3300027815 | Groundwater | MPETGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTPMLKAAWLAAFGTRVLGPPEIRTAEITKSQWQVLLSHMGASPDEIADVQTRMSRA
Ga0209180_100018705 | 3300027846 | Vadose Zone Soil | MPELGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEIRTAEITKSQWEVLLSHMGASPEEIADVQTRMSRA
Ga0209382_105662382 | 3300027909 | Populus Rhizosphere | MAERGLGLRALWEDRDYKACGTSMECRGRFILGKLFDRDTRMLQVAWLAAFGTRVLGPPEIRTAEITKSQWQVILSHMGASPDEIADVQARMSRA
Ga0137415_101614373 | 3300028536 | Vadose Zone Soil | MVRRRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKLQWQVILSHMGASPDEIIDVQTRMGRA
Ga0307504_104139431 | 3300028792 | Soil | LSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASADEITDVQARMGR
Ga0247825_112613092 | 3300028812 | Soil | MPAQGLGLSALWEERDYKACGVSIECRGRFILGKLFDRDTRLLKAAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGATPDEITDVQARMGRT
Ga0307304_101397853 | 3300028885 | Soil | MVRRRVALADSMSAPGLGLSALWEERDYKGCGASMECRGRFILGKLFDRDTRMLKAAWLVAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEI
Ga0299907_103016272 | 3300030006 | Soil | MPAPGLGLHALWEERDYKGCGASMECRGRFILGKLFDRDTRRFQAAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGASPEEIADVQTRMGRA
Ga0302046_101221722 | 3300030620 | Soil | MPAPGLGLHALWEERDYKGCGASMECRGRFILGKLFDRDTRRFQAAWLAAFATRVLGPPEIRTDEITKAQWQVILSHMGASPEEIADVQTRMGRA
(restricted) Ga0255311_10035552 | 3300031150 | Sandy Soil | LSALWEERDYKACGVSIECRGRFILGKLFDRDTRLLKAAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGATPDEITDVQARMGRA
(restricted) Ga0255311_10285352 | 3300031150 | Sandy Soil | MPEPGLGLSALWEERDYKACGTSMECRGRFILGKLFDRDTRRLKAAWLATFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIADVQTRMGRI
(restricted) Ga0255310_102412002 | 3300031197 | Sandy Soil | MPERGLGLSALWDERDYKACGASMECRGRFILGKLFDRDTRKLKAAWLVAFGTRVLGPPEIRTDEITKAQWQVILSHMGASPDEITDVQTRMGRA
(restricted) Ga0255312_10653052 | 3300031248 | Sandy Soil | MPPQGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTAMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEITDVQTRMGRA
Ga0310813_105306663 | 3300031716 | Soil | PGLGVSVWWDEREYRACGTSLECRGRFIVGKLFDLDTRLLQAAWLAAFGRRLLGPPEIQAHDITRLQWGLILSHMGASAGEIADVQARMGRP
Ga0307469_101293614 | 3300031720 | Hardwood Forest Soil | MPEPGLGLRALWEERDYKACGASMECRGRFILGKLFDRDTPMLKAAWLAAFGTRVLGPPEIRTDEITKAQWQVILSHMGASPDEIADVQTRMGRI
Ga0307469_101723154 | 3300031720 | Hardwood Forest Soil | LSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSHMGASSDEITDVQARMGRA
Ga0307468_1003176181 | 3300031740 | Hardwood Forest Soil | MVRRRGALADSMSATGLGLSALWEERDYKACGASMECRGRFILGKLFDRDTRMLKVAWLAAFGTRVLGPPEVRTAEITKAQWQVILSHMGASPDEITDVQTRMGRA
Ga0307473_102676963 | 3300031820 | Hardwood Forest Soil | VEGMRETGLGLDVWWGEREYRACGASLECRGRYIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQMILSHMGARADEIVDVQTRMGRI
Ga0334722_103436072 | 3300033233 | Sediment | MPQPGLGLSALWEERDYKACGASIECRGRFILGKLFDRDTAMLKAAWLAAFGTRVLGPPEIRTDEITKSQWQVILSHMGASPDEIADVQIRMGRA
Ga0310810_1000458013 | 3300033412 | Soil | MQEPGLGVSVWWDEREYRACGTSLECRGRFIVGKLFDLDTRLLQAAWLAAFGRRLLGPPEIQAHDITRLQWGLILSHMGASAGEIADVQARMGRP
Ga0326729_10024544 | 3300033432 | Peat Soil | MREPGLGLSVWWDEREYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRFQWGLILSHMGASAGEIADVQARMGRA
Ga0326729_10448361 | 3300033432 | Peat Soil | WDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPREVADVQARMGRT
Ga0326726_103463733 | 3300033433 | Peat Soil | MREPGLGLDVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEVQAHDITRSQWGLILSHMGASPREVADVQARMGRT
Ga0316624_115869641 | 3300033486 | Soil | GRREPGLGLDVWWDERDDRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEIADVQARMGRT
Ga0326731_11017422 | 3300033502 | Peat Soil | SVWWDEREYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRFQWGLILSHMGASAGEIADVQARMGRA
Ga0316628_1001011136 | 3300033513 | Soil | MRAPGLGLSVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEIADVQARMGRT
Ga0247829_115551522 | 3300033550 | Soil | MPAQGLGLSALWDERDYKACGASIECRGRFILGKLFDRDTPLLKTAWLAAFGTRVLGPPEIRTDEITKTQWQVILSH
Ga0373948_0001947_834_1121 | 3300034817 | Rhizosphere Soil | MREAGLGLTVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQAAWLAAFGRRLLGPPEIQAHDITRSQWGLILSHMGASPDEVADVQARMGRT
Ga0373959_0108412_407_667 | 3300034820 | Rhizosphere Soil | MREAGLGLTVWWDERDYRACGASLECRGRFIVGKLFDLDTRMLQTAWLAAFGRRLLGPPEIQAHEITRSQWQLILSHMGARPDEIVD
Ga0373916_0069824_33_320 | 3300034894 | Sediment Slurry | MPEPGLGLSALWDERDYKACGASMECRGRFILGKLFDRDTRRLKAGWLAAFGTRVLGPPEIRTDEITKAQWEVILSHMGASPDEIADVQTRMGRL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice