NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F081601


Overview

Basic Information
Family ID F081601
Family Type Metagenome
Number of Sequences 114
Average Sequence Length 61 residues
Representative Sequence MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Number of Associated Samples 98
Number of Associated Scaffolds 114

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 82.46 %
% of genes near scaffold ends (potentially truncated) 21.93 %
% of genes from short scaffolds (< 2000 bps) 70.18 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
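
The quality percentages above can be converted back into approximate gene counts. The sketch below assumes each percentage is computed over all 114 family members (the "Number of Sequences" value); the page does not state this explicitly. The names `FAMILY_SIZE`, `percent_to_count`, and the dictionary keys are illustrative and are not part of NMPFamsDB.

```python
# Assumption: every percentage is relative to the 114 sequences in the family.
FAMILY_SIZE = 114

def percent_to_count(percent, total=FAMILY_SIZE):
    """Convert a reported percentage into the nearest whole number of genes."""
    return round(percent / 100 * total)

# Values copied from the Quality Assessment block.
quality_percent = {
    "valid RBS motif": 82.46,
    "near scaffold end": 21.93,
    "short scaffold (< 2000 bps)": 70.18,
}

quality_counts = {name: percent_to_count(p) for name, p in quality_percent.items()}

for name, count in quality_counts.items():
    print(f"{name}: about {count} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly four in five genes carry a valid RBS motif, while about 80 of the 114 come from scaffolds shorter than 2000 bps.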
Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (75.439 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere
(18.421 % of family members)
Environment Ontology (ENVO) Unclassified
(28.070 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(41.228 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide No
Secondary Structure distribution α-helix: 30.65%, β-sheet: 0.00%, Coil/Unstructured: 69.35%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 114 Family Scaffolds
PF01738 | DLH | 57.02
PF13378 | MR_MLE_C | 21.93
PF01425 | Amidase | 2.63
PF07729 | FCD | 1.75
PF01979 | Amidohydro_1 | 1.75
PF13649 | Methyltransf_25 | 1.75
PF07883 | Cupin_2 | 0.88
PF07978 | NIPSNAP | 0.88
PF00561 | Abhydrolase_1 | 0.88
PF00392 | GntR | 0.88
PF13518 | HTH_28 | 0.88
PF12779 | WXXGXW | 0.88
PF00005 | ABC_tran | 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 114 Family Scaffolds
COG0154 | Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 2.63
COG1802 | DNA-binding transcriptional regulator, GntR family | Transcription [K] | 1.75
COG2186 | DNA-binding transcriptional regulator, FadR family | Transcription [K] | 1.75



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 75.44 %
Unclassified | root | N/A | 24.56 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300003324|soilH2_10021486 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 12079
3300004114|Ga0062593_100063691 | All Organisms → cellular organisms → Bacteria | 2391
3300005331|Ga0070670_100006635 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 9812
3300005338|Ga0068868_100101196 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2332
3300005341|Ga0070691_10017920 | All Organisms → cellular organisms → Bacteria | 3260
3300005406|Ga0070703_10087121 | Not Available | 1076
3300005436|Ga0070713_102192282 | All Organisms → cellular organisms → Bacteria | 535
3300005440|Ga0070705_100045523 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2524
3300005445|Ga0070708_100490073 | Not Available | 1159
3300005445|Ga0070708_101040124 | Not Available | 767
3300005445|Ga0070708_101047474 | Not Available | 764
3300005467|Ga0070706_100043901 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 4129
3300005467|Ga0070706_100105385 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2623
3300005467|Ga0070706_100422580 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1240
3300005468|Ga0070707_100547480 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1119
3300005468|Ga0070707_100921558 | Not Available | 838
3300005468|Ga0070707_101095552 | Not Available | 762
3300005471|Ga0070698_100001489 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 26007
3300005471|Ga0070698_100025016 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 6223
3300005545|Ga0070695_101153869 | All Organisms → cellular organisms → Bacteria | 636
3300005557|Ga0066704_10745557 | All Organisms → cellular organisms → Bacteria | 613
3300005559|Ga0066700_10582992 | Not Available | 780
3300005618|Ga0068864_100019375 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 5689
3300005875|Ga0075293_1001980 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1760
3300005875|Ga0075293_1060314 | All Organisms → cellular organisms → Bacteria | 557
3300005876|Ga0075300_1010064 | All Organisms → cellular organisms → Bacteria | 1061
3300005878|Ga0075297_1003911 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1248
3300005883|Ga0075299_1000473 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1668
3300005884|Ga0075291_1063351 | All Organisms → cellular organisms → Bacteria | 512
3300005985|Ga0081539_10127661 | Not Available | 1254
3300006041|Ga0075023_100133145 | All Organisms → cellular organisms → Bacteria | 898
3300006050|Ga0075028_100661442 | All Organisms → cellular organisms → Bacteria | 626
3300006175|Ga0070712_101241762 | All Organisms → cellular organisms → Bacteria | 649
3300006804|Ga0079221_10058955 | All Organisms → cellular organisms → Bacteria | 1750
3300006854|Ga0075425_100085811 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3565
3300006854|Ga0075425_103167743 | All Organisms → cellular organisms → Bacteria | 500
3300006903|Ga0075426_10305779 | Not Available | 1163
3300006903|Ga0075426_10786260 | All Organisms → cellular organisms → Bacteria | 715
3300006904|Ga0075424_100249171 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1888
3300007076|Ga0075435_100028812 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4356
3300007255|Ga0099791_10572197 | All Organisms → cellular organisms → Bacteria | 551
3300007258|Ga0099793_10317743 | All Organisms → cellular organisms → Bacteria | 758
3300009088|Ga0099830_10278949 | All Organisms → cellular organisms → Bacteria | 1329
3300009088|Ga0099830_10413464 | All Organisms → cellular organisms → Bacteria | 1092
3300009090|Ga0099827_10160800 | All Organisms → cellular organisms → Bacteria | 1844
3300009156|Ga0111538_12711852 | All Organisms → cellular organisms → Bacteria | 621
3300009808|Ga0105071_1082600 | Not Available | 562
3300009814|Ga0105082_1064557 | All Organisms → cellular organisms → Bacteria | 640
3300009820|Ga0105085_1025760 | Not Available | 1027
3300009821|Ga0105064_1012310 | All Organisms → cellular organisms → Bacteria | 1533
3300010159|Ga0099796_10389964 | Not Available | 609
3300010362|Ga0126377_11029089 | All Organisms → cellular organisms → Bacteria | 891
3300010397|Ga0134124_12918996 | Not Available | 522
3300010400|Ga0134122_10659434 | All Organisms → cellular organisms → Bacteria | 976
3300010401|Ga0134121_10264682 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1510
3300011269|Ga0137392_10005773 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7855
3300012189|Ga0137388_10197072 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1813
3300012209|Ga0137379_11514916 | All Organisms → cellular organisms → Bacteria | 571
3300012362|Ga0137361_11914289 | Not Available | 510
3300012582|Ga0137358_10171864 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1478
3300012923|Ga0137359_10241340 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1612
3300012931|Ga0153915_10462104 | All Organisms → cellular organisms → Bacteria | 1446
3300012931|Ga0153915_10845732 | All Organisms → cellular organisms → Bacteria | 1062
3300012957|Ga0164303_10087260 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1507
3300015371|Ga0132258_10626790 | All Organisms → cellular organisms → Bacteria | 2702
3300015371|Ga0132258_10758290 | All Organisms → cellular organisms → Bacteria | 2445
3300017927|Ga0187824_10105805 | Not Available | 909
3300017930|Ga0187825_10032658 | All Organisms → cellular organisms → Bacteria | 1755
3300017994|Ga0187822_10025135 | All Organisms → cellular organisms → Bacteria | 1551
3300018063|Ga0184637_10067355 | Not Available | 2191
3300019458|Ga0187892_10037308 | All Organisms → cellular organisms → Bacteria | 3689
3300020002|Ga0193730_1015268 | Not Available | 2197
3300020021|Ga0193726_1116961 | Not Available | 1192
3300021418|Ga0193695_1057174 | All Organisms → cellular organisms → Bacteria | 847
3300025910|Ga0207684_10001378 | All Organisms → cellular organisms → Bacteria | 26499
3300025910|Ga0207684_10006212 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10907
3300025910|Ga0207684_10068055 | All Organisms → cellular organisms → Bacteria | 3027
3300025922|Ga0207646_10011686 | All Organisms → cellular organisms → Bacteria | 8489
3300026001|Ga0208000_105436 | Not Available | 753
3300026088|Ga0207641_10033754 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4252
3300026285|Ga0209438_1002727 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5924
3300026340|Ga0257162_1000234 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5098
3300026345|Ga0257148_1001701 | Not Available | 1322
3300026355|Ga0257149_1066122 | All Organisms → cellular organisms → Bacteria | 528
3300026361|Ga0257176_1003451 | All Organisms → cellular organisms → Bacteria | 1687
3300026475|Ga0257147_1000444 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3497
3300026480|Ga0257177_1025683 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 854
3300026481|Ga0257155_1014445 | All Organisms → cellular organisms → Bacteria | 1090
3300026494|Ga0257159_1005881 | All Organisms → cellular organisms → Bacteria | 1823
3300026499|Ga0257181_1076565 | Not Available | 578
3300026514|Ga0257168_1029137 | All Organisms → cellular organisms → Bacteria | 1172
3300026515|Ga0257158_1014496 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1261
3300026551|Ga0209648_10389776 | Not Available | 924
3300027324|Ga0209845_1017547 | Not Available | 1178
3300027651|Ga0209217_1082978 | All Organisms → cellular organisms → Bacteria | 929
3300027787|Ga0209074_10254746 | Not Available | 683
3300027894|Ga0209068_10074813 | All Organisms → cellular organisms → Bacteria | 1754
3300028047|Ga0209526_10083662 | Not Available | 2247
3300028799|Ga0307284_10452694 | Not Available | 525
(restricted) 3300031150|Ga0255311_1153354 | All Organisms → cellular organisms → Bacteria | 512
3300031231|Ga0170824_124321591 | Not Available | 807
(restricted) 3300031248|Ga0255312_1092845 | All Organisms → cellular organisms → Bacteria | 735
3300031716|Ga0310813_10751645 | All Organisms → cellular organisms → Bacteria | 874
3300031720|Ga0307469_10053730 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2551
3300031720|Ga0307469_11957117 | Not Available | 568
3300031949|Ga0214473_10007250 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13172
3300032075|Ga0310890_10475529 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 945
3300033432|Ga0326729_1000464 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 9454
3300033433|Ga0326726_10006458 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 10556
3300033480|Ga0316620_10071096 | All Organisms → cellular organisms → Bacteria | 2498
3300033513|Ga0316628_100452987 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1645
3300033814|Ga0364930_0139946 | All Organisms → cellular organisms → Bacteria | 827
3300034165|Ga0364942_0012953 | All Organisms → cellular organisms → Bacteria | 2603
3300034177|Ga0364932_0353865 | All Organisms → cellular organisms → Bacteria | 554

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
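
Each scaffold identifier above has the form `<number>|<scaffold name>`. On this page the number before the `|` matches a Taxon OID in the Associated Samples table, which would explain why 114 scaffolds map to only 98 samples. The sketch below relies on that observed pattern rather than on documented behaviour; `split_scaffold_id` and `scaffolds_per_sample` are hypothetical names chosen for illustration.

```python
from collections import Counter

# A handful of scaffold identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300003324|soilH2_10021486",
    "3300005445|Ga0070708_100490073",
    "3300005445|Ga0070708_101040124",
    "3300005445|Ga0070708_101047474",
    "3300005467|Ga0070706_100043901",
]

def split_scaffold_id(scaffold_id):
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the part before '|' is the IMG/M Taxon OID of the sample.
    """
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name

# Count how many family scaffolds come from each sample.
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in scaffolds_per_sample.most_common():
    print(taxon_oid, n)
```

Grouping by the prefix in this way shows which samples contribute more than one scaffold to the family; sample 3300005445, for example, contributes three.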




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 18.42%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.53%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 9.65%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 6.14%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 6.14%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 5.26%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 4.39%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 2.63%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 2.63%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.63%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 2.63%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 1.75%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.75%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.75%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.75%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.75%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.75%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.75%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.75%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.75%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.75%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.88%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.88%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.88%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.88%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.88%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.88%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.88%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300005331 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG | Host-Associated
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300005875 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 | Environmental
3300005876 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401 | Environmental
3300005878 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104 | Environmental
3300005883 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_302 | Environmental
3300005884 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_302 | Environmental
3300005985 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2 | Host-Associated
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009808 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 | Environmental
3300009814 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 | Environmental
3300009820 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 | Environmental
3300009821 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300019458 | Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300021418 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3s2 | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026001 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104 (SPAdes) | Environmental
3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026340 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-A | Environmental
3300026345 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-A | Environmental
3300026355 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A | Environmental
3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental
3300026475 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-A | Environmental
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental
3300026481 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-A | Environmental
3300026494 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-A | Environmental
3300026499 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-B | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026515 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027324 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 (SPAdes) | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental
3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental
3300033432 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fraction | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental
3300033480 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_B | Environmental
3300033513 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_C | Environmental
3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental
3300034165 | Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17 | Environmental
3300034177 | Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17 | Environmental


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires a license from the corresponding author(s) of the datasets.

Protein ID Sample Taxon ID Habitat Sequence
soilH2_100214864 3300003324 Sugarcane Root And Bulk Soil MTLRATAVMIANQFEGRCTCTTMDGGVDCRWCQVFREVLRTQPLDPWPARGATTPAASLSAG*
Ga0062593_1000636914 3300004114 Soil MNLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMPAPPLDPRPARKASTAAAA*
Ga0070670_10000663510 3300005331 Switchgrass Rhizosphere MNLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASTAAASRPA*
Ga0068868_1001011963 3300005338 Miscanthus Rhizosphere MNLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASTAAASLTA*
Ga0070691_100179202 3300005341 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRTAAEPAPSLSTG*
Ga0070703_100871212 3300005406 Corn, Switchgrass And Miscanthus Rhizosphere VAIANQFEGRCSCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP*
Ga0070713_1021922821 3300005436 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARAAIPPAPSLSAP*
Ga0070705_1000455232 3300005440 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP*
Ga0070708_1004900732 3300005445 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATP
Ga0070708_1010401241 3300005445 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAAS
Ga0070708_1010474741 3300005445 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCSCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAAS
Ga0070706_1000439015 3300005467 Corn, Switchgrass And Miscanthus Rhizosphere AGMNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARATIPPAPSLSAP*
Ga0070706_1001053852 3300005467 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAVAIANQFEGRCTCPTMDGGVECPWCRLFQEMLRTYPLGGGSARPAPEVSATPSMS*
Ga0070706_1004225802 3300005467 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATPSTSLSAP*
Ga0070707_1005474802 3300005468 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATPAASLSAP*
Ga0070707_1009215581 3300005468 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCSCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP*
Ga0070707_1010955521 3300005468 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAA
Ga0070698_10000148917 3300005471 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAVAIANQFEGRCTCPTMDGGVECPWCRLFQEMLRTYPLGGRSARPAPEVSATPSMS*
Ga0070698_1000250163 3300005471 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARATIPPAPSLSAP*
Ga0070695_1011538692 3300005545 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSAP*
Ga0066704_107455572 3300005557 Soil AIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATPSTSLSAP*
Ga0066700_105829922 3300005559 Soil VAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATPSTSLSAP*
Ga0068864_1000193758 3300005618 Switchgrass Rhizosphere NLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMPAPPLDPRPARKASTAAAA*
Ga0075293_10019802 3300005875 Rice Paddy Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCPWCQVFREMLRVYPLGPVAVRPARTPTATLTTS*
Ga0075293_10603142 3300005875 Rice Paddy Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRTATEPAPSLSTG*
Ga0075300_10100642 3300005876 Rice Paddy Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPPRAATEPAPSLSTS*
Ga0075297_10039112 3300005878 Rice Paddy Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRAATEPAPSLSTG*
Ga0075299_10004732 3300005883 Rice Paddy Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRPATEPAASLSTS*
Ga0075291_10633511 3300005884 Rice Paddy Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPPRAATEPAPSLSTG*
Ga0081539_101276612 3300005985 Tabebuia Heterophylla Rhizosphere MNLRATAVMIANQFEGRCTCATMDGGVDCRWCQVFREVLRTQPLDSRPVGTPRARAATLSAG*
Ga0075023_1001331451 3300006041 Watersheds MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRPTRSAATPAASLSAP*
Ga0075028_1006614422 3300006050 Watersheds MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIYPLEPRLVRSAATPAASLTP*
Ga0070712_1012417621 3300006175 Corn, Switchgrass And Miscanthus Rhizosphere EAGMNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARAAIPPAPSLSAP*
Ga0079221_100589552 3300006804 Agricultural Soil MTLRETAVIIANRFEGRCTCATMDGGVDCRWCQVFREVLRSQPLDPRPARGAPTPAASLSAG*
Ga0075425_1000858115 3300006854 Populus Rhizosphere MDLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASTAAASRPA*
Ga0075425_1031677432 3300006854 Populus Rhizosphere AAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARAASPPAPSLSAP*
Ga0075426_103057792 3300006903 Populus Rhizosphere MNLRAKAAAIANQFEGRCTCSAMDGGVECRWCQVFRELLRTYPLEPRPARAAIPPAPSLSAP*
Ga0075426_107862602 3300006903 Populus Rhizosphere MNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARAASPPAPSLSAP*
Ga0075424_1002491715 3300006904 Populus Rhizosphere GRPTMNLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASTAAASRPA*
Ga0075435_1000288122 3300007076 Populus Rhizosphere MDLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASTAAASLPT*
Ga0099791_105721972 3300007255 Vadose Zone Soil VAIANQFEGRCTCSTMDGGVECRWCQVLRELLRTHPLEPRATRSAATPAASLSAP*
Ga0099793_103177432 3300007258 Vadose Zone Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVLRELLRTHPLEPRATRSAATPAASLSAP*
Ga0099830_102789492 3300009088 Vadose Zone Soil ANQFEGRCTCSTMDSGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP*
Ga0099830_104134642 3300009088 Vadose Zone Soil MNLRAKAVAIANQFEGRCTCPTMDGGVECPWCQLFQEMLRTYPLGEAPASPAPEVAVG*
Ga0099827_101608002 3300009090 Vadose Zone Soil MNLRTKAVAIANQFEGRCTCPAMDGGVECPWCQLYQEMLRTYPLGAAPARPAPEVAVTPSMS*
Ga0111538_127118521 3300009156 Populus Rhizosphere MDLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTQPLDPRPARKASTAAASLPT*
Ga0105071_10826001 3300009808 Groundwater Sand MNLRAKAVAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGGGPARPAPEVAVG*
Ga0105082_10645572 3300009814 Groundwater Sand MNLRAKAVAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGAAPARPAPEVAVTPNMS*
Ga0105085_10257602 3300009820 Groundwater Sand MNLRAKAVAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGAAPARPAPEVAVTPSMS*
Ga0105064_10123102 3300009821 Groundwater Sand MNLRAKAAAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGAAPARPAPEVAVTPNMS*
Ga0099796_103899642 3300010159 Vadose Zone Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIYPLESRLIRSAATPAASLSTP*
Ga0126377_110290892 3300010362 Tropical Forest Soil MNLRATAVMIANQFEGRCTCATMDGGVDCRWCQVFREVLRTQPLDPRPARAAATPAASLSAG*
Ga0134124_129189962 3300010397 Terrestrial Soil MEAGMNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARAAIPPAPSLSAP*
Ga0134122_106594342 3300010400 Terrestrial Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEARPTRAAATPAASLSAP*
Ga0134121_102646822 3300010401 Terrestrial Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIYPLESRPIRSAATPAASLSTP*
Ga0137392_100057732 3300011269 Vadose Zone Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATPAASLSTP*
Ga0137388_101970722 3300012189 Vadose Zone Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRATRSAATPAASLSAP*
Ga0137379_115149162 3300012209 Vadose Zone Soil VNNWRPVMNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRATRSAATPSTSLSAP*
Ga0137361_119142891 3300012362 Vadose Zone Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASL
Ga0137358_101718642 3300012582 Vadose Zone Soil MNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARATGPPAPSLSAP*
Ga0137359_102413402 3300012923 Vadose Zone Soil MNLRARAVAIANQFEGRCTRSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP*
Ga0153915_104621042 3300012931 Freshwater Wetlands MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPWPTRPASAPAASLSTS*
Ga0153915_108457322 3300012931 Freshwater Wetlands MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRPATEPAASLSTG*
Ga0164303_100872602 3300012957 Soil MNLRARAVAIANQFEGRCTCSTMDSGVDCRWCQVFRELLRIYPLESRLIRSAATPAASLSTP*
Ga0132258_106267905 3300015371 Arabidopsis Rhizosphere MNLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASAAAASLPA*
Ga0132258_107582903 3300015371 Arabidopsis Rhizosphere MNLRARAVAIANQFEGRCSCSTMDGGVECRWCQVFRELLRTHPLDPWPTGRPSAPAASLSTS*
Ga0187824_101058052 3300017927 Freshwater Sediment MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPGPTGRTPAPAASLS
Ga0187825_100326583 3300017930 Freshwater Sediment MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPGPTGRTSAPAASLSTG
Ga0187822_100251353 3300017994 Freshwater Sediment MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPGPAGRTPAPAESLSTG
Ga0184637_100673553 3300018063 Groundwater Sediment MNLRAKAVAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGAAPARSAPEVAVTPSMS
Ga0187892_100373082 3300019458 Bio-Ooze MNLRAKAVAIANQFEGRCTCPTMDGGVACPWCQLLQEMLRTHPLGEAPARPAPAVAVG
Ga0193730_10152682 3300020002 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIYPLESRLIRSAATPAASLT
Ga0193726_11169612 3300020021 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTYPLEPRPIRSAATPAASLA
Ga0193695_10571742 3300021418 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIYPLESRLIRSAATPAASLSTR
Ga0207684_1000137811 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAVAIANQFEGRCTCPTMDGGVECPWCRLFQEMLRTYPLGGGSARPAPEVSATPSMS
Ga0207684_1000621211 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere MNLRARAVAIANQFEGRCSCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0207684_100680554 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere IANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARATIPPAPSLSAP
Ga0207646_100116865 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere MNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARATIPPAPSLSAP
Ga0208000_1054361 3300026001 Rice Paddy Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCPWCQVFREMLRVYPLGPVAVRPARTPTATLTTS
Ga0207641_100337542 3300026088 Switchgrass Rhizosphere MNLRATAMMIANQFEGRCTCATMDGGVDCRWCQLFREMLRTRPLDPRPARKASTAAASRP
Ga0209438_10027273 3300026285 Grasslands Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIHPLEPRPTRSATTPAASLSTP
Ga0257162_10002344 3300026340 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0257148_10017012 3300026345 Soil VMNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0257149_10661221 3300026355 Soil VMNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRIYPLESRPIRSAATPAASLTP
Ga0257176_10034512 3300026361 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRATRSAATPAASLSAP
Ga0257147_10004442 3300026475 Soil MNLRARAVAIANQFEGRGTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0257177_10256832 3300026480 Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVLRELLRTHPLEPRATRSAATPAASLSAP
Ga0257155_10144452 3300026481 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLT
Ga0257159_10058811 3300026494 Soil PVMNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0257181_10765652 3300026499 Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0257168_10291372 3300026514 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLQPRPTRSAATPAASLSTP
Ga0257158_10144962 3300026515 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPIRSAATPAASLT
Ga0209648_103897762 3300026551 Grasslands Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTYPLEPRPTRSAATPAASLSTP
Ga0209845_10175472 3300027324 Groundwater Sand MNLRAKAVAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGAAPARPAPEVAVTPNMS
Ga0209217_10829781 3300027651 Forest Soil GMNLRAKAAAIANQFEGRCTCSTMDGGVECRWCQLFRELLRTYPLEPRPARAAVPPAPSLSAP
Ga0209074_102547462 3300027787 Agricultural Soil MTLRATAVMIANQFEGRCTCATMDGGVDCRWCQVFREVLRSQPLDPRPARGAPTPAASLSAG
Ga0209068_100748132 3300027894 Watersheds MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLEPRPTRSAATPAASLSAP
Ga0209526_100836622 3300028047 Forest Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSSATPAGSLT
Ga0307284_104526941 3300028799 Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTYPLEPRPIRCAATPAASLA
(restricted) Ga0255311_11533541 3300031150 Sandy Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPGPPGRTPAPAASVSTS
Ga0170824_1243215912 3300031231 Forest Soil GRCTCSTMDGGVDCRWCQVFRELLRTYPLEPRPTRSAATPAASLSTP
(restricted) Ga0255312_10928452 3300031248 Sandy Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPWPTGRTPAPAASLSTS
Ga0310813_107516452 3300031716 Soil MNLRARAVAIANQFEGRCSCTTMDGGVECRWCQVFRELLRTHPLDPWPTGRTPAPAASLSTS
Ga0307469_100537301 3300031720 Hardwood Forest Soil RAVAIANQFEGRCSCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLSTP
Ga0307469_119571171 3300031720 Hardwood Forest Soil MNLRARAVAIANQFEGRCTCSTMDGGVDCRWCQVFRELLRTHPLEPRPTRSAATPAASLS
Ga0214473_1000725012 3300031949 Soil MNLRAKAVAIANQFEGRCTCPMMDGGVDCPWCRLFQEMLRTYPLGPAPARPAPQVAATPSMP
Ga0310890_104755292 3300032075 Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRAAAEPAPSLSTG
Ga0326729_10004648 3300033432 Peat Soil MNLRARAVVIANQFEGRCSCSTMDGGVECRWCQVFRELLRTHPLDPWPTGRPPAPAASLSTS
Ga0326726_100064589 3300033433 Peat Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPWPTRPVPAPAASLSTS
Ga0316620_100710963 3300033480 Soil MNLRARAVAIANQFEGRCTCSTMDGGVECRWCQVFRELLRTHPLDPWPTRPASAPAASLSTS
Ga0316628_1004529872 3300033513 Soil MNLRAKAAAIANQFEGRCTCGTMDGGVECRWCQVFRELLRTQPLDPRPTRPATEPAASLSTG
Ga0364930_0139946_579_767 3300033814 Sediment MNLRVKAVTIANQFEGRCTCPTMDGGVDCPWCRLFQEMLRTYPLGPAPARPAPEVTATPSMP
Ga0364942_0012953_206_394 3300034165 Sediment MNLRAKAVTIANQFEGRCTCPTMDGGVDCPWCRLFQEMLRTYPLGPAPARPAPEVTATPSMP
Ga0364932_0353865_1_165 3300034177 Sediment TIANQFEGRCTCPTMDGGVDCPWCRLFQEMLRTYPLGPAPARPAPEVTATPSMP
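
For readers who want to process the Family Sequences rows programmatically, here is a minimal Python sketch that splits one row into its four columns. It is a heuristic, not an NMPFamsDB API. The names `FamilyRow`, `ROW_RE` and `parse_row` are hypothetical. The pattern assumes three things inferred only from the rows listed on this page: every sample taxon ID is ten digits beginning with `3300`, every habitat label starts with an uppercase letter and ends with a lowercase one, and every sequence is an uppercase run with an optional trailing `*`. Whitespace between columns is optional, so rows whose fields run together are handled as well.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    """One row of the Family Sequences table (hypothetical record type)."""
    restricted: bool
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Heuristic pattern based on the rows shown on this page, not on a specification.
ROW_RE = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"   # optional JGI restriction flag
    r"(?P<protein_id>\S+?)\s*"               # shortest ID that lets the rest match
    r"(?P<taxon_id>3300\d{6})\s*"            # assumed: 10-digit ID starting with 3300
    r"(?P<habitat>[A-Z].*?[a-z])\s*"         # label such as "Rice Paddy Soil"
    r"(?P<sequence>[A-Z]+\*?)$"              # residues, optional trailing '*'
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Return the parsed row, or None if the line does not fit the pattern."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    return FamilyRow(
        restricted=match.group("restricted") is not None,
        protein_id=match.group("protein_id"),
        taxon_id=match.group("taxon_id"),
        habitat=match.group("habitat"),
        sequence=match.group("sequence"),
    )


if __name__ == "__main__":
    example = (
        "Ga0105071_10826001 3300009808 Groundwater Sand "
        "MNLRAKAVAIANQFEGRCTCPTMDGGVDCPWCQLFQEMLRTYPLGGGPARPAPEVAVG*"
    )
    print(parse_row(example))
```

Because the protein ID is matched lazily, an ID that itself contains `3300` followed by six digits would be split too early. Check parsed IDs against the sample list before relying on them.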




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice