NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F064908



Overview

Basic Information
Family ID F064908
Family Type Metagenome / Metatranscriptome
Number of Sequences 128
Average Sequence Length 76 residues
Representative Sequence MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTYSLAMAKKWIDDNAG
Number of Associated Samples 76
Number of Associated Scaffolds 128
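
The representative sequence can be compared with individual family members. The following is a minimal sketch, not the database's own method: it assumes that a trailing `*` in a member sequence marks the stop codon, and it does a plain position-by-position comparison without gaps rather than a real alignment. The helper names `strip_stop` and `positional_mismatches` are hypothetical; the member sequence is one of the entries listed under Family Sequences for sample 3300005445.

```python
# Representative sequence as given under Basic Information.
REPRESENTATIVE = (
    "MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTYSLAMAKKWIDDNAG"
)
# One member sequence copied from the Family Sequences table.
MEMBER = (
    "MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP*"
)


def strip_stop(seq: str) -> str:
    """Remove a trailing '*' (assumed here to mark the stop codon)."""
    return seq.rstrip("*")


def positional_mismatches(a: str, b: str) -> list[tuple[int, str, str]]:
    """Ungapped comparison over the shared prefix length.

    Returns (1-based position, residue in a, residue in b) for each difference.
    """
    a, b = strip_stop(a), strip_stop(b)
    return [
        (i + 1, x, y)
        for i, (x, y) in enumerate(zip(a, b))
        if x != y
    ]


if __name__ == "__main__":
    print(len(REPRESENTATIVE), len(strip_stop(MEMBER)))
    print(positional_mismatches(REPRESENTATIVE, MEMBER))
```

An ungapped comparison is only meaningful when the two sequences start at the same position, as they do here; members that are truncated at the N-terminus would need a proper alignment instead.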

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 87.90 %
% of genes near scaffold ends (potentially truncated) 24.22 %
% of genes from short scaffolds (< 2000 bps) 69.53 %
Associated GOLD sequencing projects 70
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.73

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Unclassified (57.031 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere
(31.250 % of family members)
Environment Ontology (ENVO) Unclassified
(35.156 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(58.594 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 18.45%, β-sheet: 31.07%, Coil/Unstructured: 50.49%

Predicted 3D Structure

pTM-score: 0.73

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.50.1.1: Double-stranded RNA-binding domain (dsRBD) | d1whna1 | 1whn | 0.65011
e.3.1.0: automated matches | d6huha1 | 6huh | 0.63607
e.3.1.1: beta-Lactamase/D-ala carboxypeptidase | d4f7ya_ | 4f7y | 0.63324
e.3.1.0: automated matches | d4ovda2 | 4ovd | 0.63171
d.179.1.1: Substrate-binding domain of HMG-CoA reductase | d4i6ya1 | 4i6y | 0.63119



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 128 Family Scaffolds
PF01381 | HTH_3 | 7.81
PF04392 | ABC_sub_bind | 4.69
PF07589 | PEP-CTERM | 2.34
PF13560 | HTH_31 | 2.34
PF07992 | Pyr_redox_2 | 1.56
PF00589 | Phage_integrase | 1.56
PF02687 | FtsX | 1.56
PF02899 | Phage_int_SAM_1 | 1.56
PF01068 | DNA_ligase_A_M | 1.56
PF02958 | EcKL | 1.56
PF09992 | NAGPA | 0.78
PF00557 | Peptidase_M24 | 0.78
PF02371 | Transposase_20 | 0.78
PF02954 | HTH_8 | 0.78
PF00528 | BPD_transp_1 | 0.78
PF01258 | zf-dskA_traR | 0.78
PF01757 | Acyl_transf_3 | 0.78
PF05192 | MutS_III | 0.78
PF13533 | Biotin_lipoyl_2 | 0.78
PF09723 | Zn-ribbon_8 | 0.78
PF12801 | Fer4_5 | 0.78
PF01255 | Prenyltransf | 0.78
PF12704 | MacB_PCD | 0.78
PF12697 | Abhydrolase_6 | 0.78
PF13411 | MerR_1 | 0.78
PF12844 | HTH_19 | 0.78
PF00483 | NTP_transferase | 0.78
PF13365 | Trypsin_2 | 0.78
PF03807 | F420_oxidored | 0.78
PF00331 | Glyco_hydro_10 | 0.78

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 128 Family Scaffolds
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 4.69
COG0510 | Thiamine kinase or a related kinase | Coenzyme transport and metabolism [H] | 1.56
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 1.56
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 1.56
COG2334 | Ser/Thr protein kinase RdoA involved in Cpx stress response, MazF antagonist | Signal transduction mechanisms [T] | 1.56
COG3173 | Predicted kinase, aminoglycoside phosphotransferase (APT) family | General function prediction only [R] | 1.56
COG3178 | Predicted phosphotransferase, aminoglycoside/choline kinase (APH/ChoK) family | General function prediction only [R] | 1.56
COG4973 | Site-specific recombinase XerC | Replication, recombination and repair [L] | 1.56
COG4974 | Site-specific recombinase XerD | Replication, recombination and repair [L] | 1.56
COG0020 | Undecaprenyl pyrophosphate synthase | Lipid transport and metabolism [I] | 0.78
COG0249 | DNA mismatch repair ATPase MutS | Replication, recombination and repair [L] | 0.78
COG1734 | RNA polymerase-binding transcription factor DksA | Transcription [K] | 0.78
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.78
COG3693 | Endo-1,4-beta-xylanase, GH35 family | Carbohydrate transport and metabolism [G] | 0.78



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 57.03 %
All Organisms | root | All Organisms | 42.97 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002245|JGIcombinedJ26739_100401971 | Not Available | 1248
3300002245|JGIcombinedJ26739_101295659 | Not Available | 619
3300004463|Ga0063356_100050684 | All Organisms → cellular organisms → Bacteria | 4182
3300005434|Ga0070709_10210594 | All Organisms → cellular organisms → Bacteria | 1382
3300005440|Ga0070705_100298396 | Not Available | 1153
3300005445|Ga0070708_100011703 | All Organisms → cellular organisms → Bacteria | 7141
3300005445|Ga0070708_100030842 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Xanthomonadaceae → Pseudolysobacter → Pseudolysobacter antarcticus | 4637
3300005445|Ga0070708_100416751 | Not Available | 1267
3300005445|Ga0070708_101694516 | Not Available | 588
3300005467|Ga0070706_100093779 | All Organisms → cellular organisms → Bacteria | 2785
3300005467|Ga0070706_100457121 | Not Available | 1188
3300005467|Ga0070706_101044855 | Not Available | 753
3300005467|Ga0070706_101139560 | All Organisms → cellular organisms → Bacteria | 717
3300005467|Ga0070706_101220939 | All Organisms → cellular organisms → Bacteria | 690
3300005468|Ga0070707_100419845 | Not Available | 1298
3300005468|Ga0070707_100701716 | Not Available | 975
3300005468|Ga0070707_101254302 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 707
3300005471|Ga0070698_100138742 | Not Available | 2384
3300005471|Ga0070698_100816402 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 877
3300005471|Ga0070698_101562550 | Not Available | 611
3300005536|Ga0070697_100077546 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2734
3300005536|Ga0070697_100340316 | Not Available | 1294
3300006058|Ga0075432_10196664 | All Organisms → cellular organisms → Bacteria | 796
3300006176|Ga0070765_100910522 | Not Available | 831
3300007255|Ga0099791_10023424 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2669
3300007255|Ga0099791_10028304 | Not Available | 2445
3300007255|Ga0099791_10247783 | Not Available | 844
3300007255|Ga0099791_10523020 | Not Available | 577
3300007258|Ga0099793_10470223 | Not Available | 623
3300007265|Ga0099794_10010005 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4014
3300007265|Ga0099794_10142376 | Not Available | 1215
3300009038|Ga0099829_10093978 | Not Available | 2321
3300009038|Ga0099829_10215714 | Not Available | 1557
3300009088|Ga0099830_11370148 | Not Available | 588
3300009088|Ga0099830_11501169 | Not Available | 561
3300009147|Ga0114129_10026546 | All Organisms → cellular organisms → Bacteria | 8202
3300009147|Ga0114129_10040459 | All Organisms → cellular organisms → Bacteria | 6571
3300010400|Ga0134122_10819839 | Not Available | 890
3300010400|Ga0134122_11533344 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Cytophagia → Cytophagales → Fulvivirgaceae → Chryseolinea | 687
3300010401|Ga0134121_12611209 | Not Available | 549
3300011270|Ga0137391_10576528 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 946
3300012174|Ga0137338_1002951 | All Organisms → cellular organisms → Bacteria | 2886
3300012202|Ga0137363_10745858 | Not Available | 829
3300012205|Ga0137362_10568282 | Not Available | 979
3300012205|Ga0137362_10653576 | Not Available | 905
3300012362|Ga0137361_10444651 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1191
3300012362|Ga0137361_10806628 | Not Available | 854
3300012362|Ga0137361_11027436 | All Organisms → cellular organisms → Bacteria | 744
3300012363|Ga0137390_10149435 | Not Available | 2320
3300012917|Ga0137395_10336286 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1074
3300012923|Ga0137359_10464182 | Not Available | 1119
3300012927|Ga0137416_10806424 | Not Available | 830
3300012929|Ga0137404_10519655 | Not Available | 1064
3300012929|Ga0137404_11341156 | Not Available | 660
3300012931|Ga0153915_10363501 | Not Available | 1631
3300012986|Ga0164304_11876426 | Not Available | 502
3300014884|Ga0180104_1033331 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1331
3300014885|Ga0180063_1014223 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycolicibacterium → Mycolicibacterium iranicum | 2137
3300015259|Ga0180085_1190930 | Not Available | 613
3300018429|Ga0190272_11676781 | Not Available | 656
3300020060|Ga0193717_1153443 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 674
3300020579|Ga0210407_10182854 | All Organisms → cellular organisms → Bacteria | 1625
3300021073|Ga0210378_10013470 | All Organisms → cellular organisms → Bacteria | 3384
3300021073|Ga0210378_10285320 | Not Available | 622
3300021088|Ga0210404_10038747 | All Organisms → cellular organisms → Bacteria | 2182
3300021088|Ga0210404_10149063 | Not Available | 1224
3300021088|Ga0210404_10289431 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 899
3300021170|Ga0210400_11185095 | Not Available | 616
3300021178|Ga0210408_10022240 | All Organisms → cellular organisms → Bacteria | 5054
3300021432|Ga0210384_10025067 | Not Available | 5649
3300021432|Ga0210384_10327910 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP1 | 1378
3300021432|Ga0210384_10383733 | Not Available | 1265
3300021559|Ga0210409_10023443 | All Organisms → cellular organisms → Bacteria | 5973
3300022525|Ga0242656_1066813 | Not Available | 652
3300025885|Ga0207653_10029094 | Not Available | 1779
3300025898|Ga0207692_10537556 | Not Available | 745
3300025906|Ga0207699_10389657 | Not Available | 990
3300025910|Ga0207684_10004468 | All Organisms → cellular organisms → Bacteria | 13165
3300025910|Ga0207684_10005027 | All Organisms → cellular organisms → Bacteria | 12331
3300025910|Ga0207684_10013738 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 6994
3300025910|Ga0207684_10066999 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3050
3300025910|Ga0207684_10116809 | Not Available | 2286
3300025910|Ga0207684_10126377 | Not Available | 2194
3300025910|Ga0207684_10950976 | Not Available | 720
3300025910|Ga0207684_11008837 | All Organisms → cellular organisms → Bacteria | 696
3300025910|Ga0207684_11013597 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 694
3300025910|Ga0207684_11016610 | Not Available | 693
3300025917|Ga0207660_10351955 | All Organisms → cellular organisms → Bacteria | 1180
3300025922|Ga0207646_10108026 | Not Available | 2496
3300025922|Ga0207646_10130775 | Not Available | 2259
3300025922|Ga0207646_10256607 | All Organisms → cellular organisms → Bacteria | 1580
3300025922|Ga0207646_10454613 | All Organisms → cellular organisms → Bacteria | 1155
3300025922|Ga0207646_11796683 | Not Available | 524
3300026354|Ga0257180_1020351 | Not Available | 858
3300026376|Ga0257167_1024883 | Not Available | 874
3300026376|Ga0257167_1065125 | Not Available | 568
3300026498|Ga0257156_1105325 | Not Available | 587
3300026514|Ga0257168_1000124 | All Organisms → cellular organisms → Bacteria | 5538
3300026535|Ga0256867_10036491 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2037
3300026551|Ga0209648_10074892 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2839
3300026551|Ga0209648_10419592 | Not Available | 859
3300026557|Ga0179587_11135313 | Not Available | 514
3300027651|Ga0209217_1043114 | Not Available | 1380
3300027655|Ga0209388_1013231 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2250
3300027727|Ga0209328_10181090 | Not Available | 638
3300027731|Ga0209592_1155230 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 819
3300027765|Ga0209073_10108929 | All Organisms → cellular organisms → Bacteria | 986
3300027862|Ga0209701_10289832 | Not Available | 944
3300027910|Ga0209583_10338662 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 696
3300028047|Ga0209526_10372674 | Not Available | 952
3300028047|Ga0209526_10516926 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 776
3300028536|Ga0137415_10375819 | Not Available | 1225
3300028792|Ga0307504_10051138 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1180
3300030006|Ga0299907_10070272 | All Organisms → cellular organisms → Bacteria | 2829
(restricted) 3300031197|Ga0255310_10191825 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 570
3300031229|Ga0299913_10080446 | All Organisms → cellular organisms → Bacteria | 3160
3300031720|Ga0307469_10427675 | Not Available | 1139
3300031740|Ga0307468_100073478 | All Organisms → cellular organisms → Bacteria | 1911
3300031820|Ga0307473_10176936 | All Organisms → cellular organisms → Bacteria | 1243
3300031962|Ga0307479_11687109 | All Organisms → cellular organisms → Bacteria | 587
3300032180|Ga0307471_102006578 | Not Available | 726
3300032180|Ga0307471_103453350 | Not Available | 559
3300032180|Ga0307471_104070248 | Not Available | 516
3300033417|Ga0214471_10128380 | Not Available | 2088
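
Each scaffold identifier above appears to consist of a sample Taxon OID and a scaffold name joined by `|`; the numeric prefixes match the Taxon OIDs listed under Associated Samples. The sketch below groups a few of the listed scaffolds by sample under that assumption. The function name `split_scaffold_id` is hypothetical and is not part of any NMPFamsDB or IMG/M tooling.

```python
from collections import defaultdict

# A handful of identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300002245|JGIcombinedJ26739_100401971",
    "3300002245|JGIcombinedJ26739_101295659",
    "3300004463|Ga0063356_100050684",
    "3300005445|Ga0070708_100011703",
    "3300005445|Ga0070708_100030842",
]


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split 'TaxonOID|ScaffoldName' into its two parts (assumed layout)."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# Group scaffold names by the sample (Taxon OID) they come from.
by_sample: dict[str, list[str]] = defaultdict(list)
for identifier in scaffold_ids:
    oid, name = split_scaffold_id(identifier)
    by_sample[oid].append(name)

if __name__ == "__main__":
    for oid, names in by_sample.items():
        print(oid, len(names), names)
```

Rows marked `(restricted)` carry that label in front of the identifier, so it would need to be stripped before splitting.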

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 31.25%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.66%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 12.50%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.47%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.69%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.69%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 3.12%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.34%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.34%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.56%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.56%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.56%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.78%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.78%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.78%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.78%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.78%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.78%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.78%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.78%
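
The habitat percentages can be turned back into approximate member counts. The sketch below assumes that each percentage is a share of the 128 family sequences reported under Basic Information; that denominator is an inference from the page, not something the habitat table states. The variable names are illustrative only.

```python
# Family size taken from "Number of Sequences" under Basic Information
# (assumed here to be the denominator of the habitat percentages).
FAMILY_SIZE = 128

# Distribution column of the habitat table, in row order.
habitat_percent = [
    31.25, 22.66, 12.50, 5.47, 4.69, 4.69, 3.12,
    2.34, 2.34, 1.56, 1.56, 1.56,
] + [0.78] * 8

# Convert each rounded percentage to the nearest whole number of sequences.
counts = [round(p / 100 * FAMILY_SIZE) for p in habitat_percent]

if __name__ == "__main__":
    print(counts)
    print(sum(counts))
```

Under this assumption the reconstructed counts add up to exactly 128, which is consistent with every family member being assigned to one habitat row.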




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012174 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2 | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300014884 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1Da | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022525 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-M (Metagenome Metatranscriptome) (v2) | Environmental
3300025885 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes) | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025917 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026354 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-B | Environmental
3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental
3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026535 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300027727 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M1 (SPAdes) | Environmental
3300027731 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031229 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the features of those sequences listed below requires a license from the corresponding author(s) of the respective datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1004019712 | 3300002245 | Forest Soil | MTRGAGKDMSEPIDIEYRGHFIVVQSYESDSKRWRPKALVSIYHAGALQQTIVSAPDDVWFDSEEDAVTHSLAAAKKWIDDHEER*
JGIcombinedJ26739_1012956591 | 3300002245 | Forest Soil | MNNSPDIEYRGHFIEVQAYQSDGRRWRPKALVSIYQGGTVHQSFVSPPVDVLFDSEDDAVTYSLLMAKK
Ga0063356_1000506841 | 3300004463 | Arabidopsis Thaliana Rhizosphere | VSEPPEIEYRGHFIAVRSESDGQRWRPKAVVSIYQRGTLRKQTVKAPDRVLLDSEEAAETYALAIAKKWIDEQ*
Ga0070709_102105942 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSTYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH*
Ga0070705_1002983964 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MKEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSKDAADTFSLAMAKKWIDDNAGSPMP*
Ga0070708_1000117036 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSQRWRPKALVSIYHSGTMHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGNPMP*
Ga0070708_1000308425 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0070708_1004167511 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEADIEYRGHFIDVQSYESDGKRWLPKAVVSIYHSGAMHTKIVPAPIEVLFDSEAAADTYSLAMAKKWIDDNS*
Ga0070708_1016945161 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEAPDIEYRGHFIEVQSYESDAKRWRPKAVVSIYHGGALHQKLVSAPIEVLFDSEVEADTYSLTMAKKWIDDNARNPT*
Ga0070708_1020210971 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VQSSESDSKRWRPKALVSIYHRGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0070706_1000937791 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEPDIEYRGHFIIVRSYGSEGTQWRPKALVSIYHSGTVHRRVLVAPVDVRFDSEDAADTCALALAKKWIDDHAGHPMP*
Ga0070706_1004571213 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEPDIEYRGHYIVVKSYESEGTRWRPKALVSIYHSGTVHRKMIVAPADVRLDSEDAADTYSLSLAKKWIDDHAGSQMP*
Ga0070706_1010448552 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEADIEYRGHFIDVQSHESDGKQWRPKAVVSIYHGGVLHQQTVAAPIEVLFDSEVAADTYSLAIAKKWIDDNREE*
Ga0070706_1011395602 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MKEPDIEYRGHYIVVQSSESDSERWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDD
Ga0070706_1012209391 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MVIPTAQGGSYMNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSIYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH*
Ga0070707_1004198451 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEADIEYRGHFIDVQSHESDGKQWRPKAVVSIYHSGALHTKIVSAPIEVLFDSEVAADTYSLAMAKKWIDDNS*
Ga0070707_1005713751 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEPDIEYRGHFIIVRSYGSEGTQWRPKALVSIYHSGTVHRRMLVAPVDVRFDSEDAADT
Ga0070707_1007017163 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEAPDIEYRGHFIEVQSYESDGKRWRPKALVSIYQAGTLHQKFVTAPVEVLCDSEEAADTYSLAMAKKWIDD
Ga0070707_1012543021 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEALDIEYRGHFIEVQSYESDGKRWRPKAVESLYHGGPLNQKVVSAPIEVLFDSKAEADTYSLAVAKKWIDDNS*
Ga0070698_1001387421 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIDHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAG
Ga0070698_1008164021 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | NACCRAVELVPRWREAMSEAPNIEYRDHFIEVQSYESDGGRWRPKALVSIYHAGTLHQKCVTAPVDVLCDSEEAADTYSLAMAKKWIDDKR*
Ga0070698_1014987352 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEPDIEYRGHYIVVKSYESEGTRWRPKALVSIYHSGTVHRKMIVAPADVRLDSEDAAD
Ga0070698_1015625502 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVLQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVRFDSEDAADTYSLAMAKKWIDDNAGSPMP*
Ga0070697_1000775461 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | EPDIEYRGHYIVVKSYESEGTRWRPKALVSIYHSGTVHRKMIVAPADVRLDSEDAADTYSLSLAKKWIDDHAGSQMP*
Ga0070697_1003403162 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWID
Ga0075432_101966642 | 3300006058 | Populus Rhizosphere | MNTSPDIEYRGHFIEVQAYQSDDRRWRPKALLSIYQGGTVHRHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH*
Ga0070765_1009105223 | 3300006176 | Soil | MNEPDIEYRGHFIIVLSYYESDGRQWRPKALVSIYHSGTVHRRIVVAPVEVRFDSEDAADTHSLAMAKKWIDDHAPMA*
Ga0099791_100234244 | 3300007255 | Vadose Zone Soil | MNEQPDIEYRGHFIVVQSYESEGQRWRPKALVSIYHSGTMHRKMVVAPVDVRFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0099791_100283045 | 3300007255 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSMYQGGALQTRLVSAPVEVLLDSEVAADTYSLAMAKKWIDDHS*
Ga0099791_102477831 | 3300007255 | Vadose Zone Soil | MSQTPDIEYRGHFIAVQAYQSDGQRWRPKALVSIYQGGTVHQTSVSAPVEVSFDSEDDAVTYSLLMAKKWIDDH*
Ga0099791_105230202 | 3300007255 | Vadose Zone Soil | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVRFDSEDAADTYSLAMAKKWIDDNAGSPMP*
Ga0099793_104702231 | 3300007258 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSMYQGGALQTRLVSAPVEVLLDSEVAADTYSLAMAKKWID
Ga0099794_100100052 | 3300007265 | Vadose Zone Soil | MSQTPDIEYRGHFIAVQAYQSDGQRWRPKALVSIYQGGTVRQTSVSAPVEVSFDSEDDAVTYSLLMAKKWIDDH*
Ga0099794_101423762 | 3300007265 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSIYQGGALQTRLVSAPVEVLLDSEVAADTYSLAMAKKWIDDHS*
Ga0099829_100939782 | 3300009038 | Vadose Zone Soil | MSEADIEYRGHFIDVQSSESDGKRWRPKAVVSIYHSGALHTKIVSAPIEVLFDSEVAADTYSLAMAKKWIDDNS*
Ga0099829_102157143 | 3300009038 | Vadose Zone Soil | MATMNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGNPMP*
Ga0099830_113701482 | 3300009088 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSIYQGGALPTRLVSAPVEVLLDSEVAADTYSLAMAKKWIDDQLEGGRLEGKTEK
Ga0099830_115011692 | 3300009088 | Vadose Zone Soil | MATMNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDGADTFSLAMAKKWIDDNAGNPMP*
Ga0114129_1002654615 | 3300009147 | Populus Rhizosphere | MSEADIEYRGHFIDVQSFESEDNHWRPKAVVSIYRSGTLHREIMSAPSSELFDSEVGADTYALEMAKKWIDDNS*
Ga0114129_100404598 | 3300009147 | Populus Rhizosphere | MSEADVEYRGHFIDVQSHESDGKRWRPKAVVSIYHSGALHQKIVSAPIEVLFDSETEADTYSLAMAKKFIDDKS*
Ga0134122_108198391 | 3300010400 | Terrestrial Soil | VPSATREEDNMSEADIEYRGHIIDVQSFESEGKWWRPKAVVSIYHGGTVHTKMVAAPIDVMFDSEVAADTYSLEMAKKWIDDKS*
Ga0134122_115333441 | 3300010400 | Terrestrial Soil | VSEPPEIEYRGHFIAVRSESDGHRWRPKAVVSIYQRGTLRKQTVKAPDRVLLDSEEAAETYALAIAKKWIDEQ*
Ga0134121_126112092 | 3300010401 | Terrestrial Soil | MNEPVDIEYRGHFIVVQSYESDDKRWRPKALVSIYRAGALQQTIVSAPVDVWFDSEEDAITHSLAAAKQWIDDHER*
Ga0137391_105765282 | 3300011270 | Vadose Zone Soil | MSEANIEYRGRFIDVQSFESGGKRWRPKAVVSMYQGGALQTRLVSAPVEVLLDSEVAADTYSLAMAKKWIDDQL
Ga0137338_10029516 | 3300012174 | Soil | MSESDIEYRGHFINVQSSESDGNRWRPNAVVSTYHRGALHTQIVSAPIEVLFDSEVAADTHSLAMAKKWIDDN
Ga0137363_107458582 | 3300012202 | Vadose Zone Soil | MNEPDIEYRGHYIVVQSSESDSKRWRPNALVSIYHSGTVHRRIVAAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0137362_105682822 | 3300012205 | Vadose Zone Soil | MNEPDIEYRGHYIIVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTYSLAMAKKWIDDNAGSPMP*
Ga0137362_106535762 | 3300012205 | Vadose Zone Soil | MNEPDIEYRGHYIVVQSFESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0137361_104446514 | 3300012362 | Vadose Zone Soil | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTYSLAMAKKWIDDNAG
Ga0137361_108066282 | 3300012362 | Vadose Zone Soil | MNEPDVEYRGHYIVVHSYESEGTRWRPKALVSIYHSGTVHRKMIVAPVDVRFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0137361_110274361 | 3300012362 | Vadose Zone Soil | MSQTPDIEYRGHFIAVQAYQSDGQRWRPKALVSIYQGGTVRQTSVSAPVEVSFDSEDDAVTYSLL
Ga0137390_101494352 | 3300012363 | Vadose Zone Soil | MSEADIEYRGHFIDVQSFESDGQRWRPKAVVSIYEGGALQTRLVAAPVEVLLDSEGAADTYALAMAKKWIDDNN*
Ga0137395_103362862 | 3300012917 | Vadose Zone Soil | MATMNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTYSLAMAKKWIDDNAGSPMP*
Ga0137395_103545631 | 3300012917 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSIYQGGALQTRLVSAPVEVLLDSEVAADTYS
Ga0137359_104641822 | 3300012923 | Vadose Zone Soil | MNEPDIEYRGHYIIVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVRFDSEDAADTYSLAMAKKWIDDNAGSPMP*
Ga0137416_108064242 | 3300012927 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSMYQGGALQTRLVSAPVEVLLDSEVAADTYSLAMAKKWIDDQLEGG
Ga0137404_105196551 | 3300012929 | Vadose Zone Soil | MSEADVEYRGHFIDVQSYESDGKRWRPKAVVSIYHSGALHQKIVSAPIEVLFDSETKADTYSLAMAKKWIDDKG*
Ga0137404_113411562 | 3300012929 | Vadose Zone Soil | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVRFDSEDAADTYSLALAKKWIDDNAGSPMP*
Ga0153915_103635012 | 3300012931 | Freshwater Wetlands | MSEADIEYRGHFIDVQSYESDGKRWRPKAVVSIYRGGALHHKIVSAPIEVLFESETAADTYSLAMAKKWIDDNS*
Ga0164304_118764262 | 3300012986 | Soil | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHRGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP*
Ga0180104_10333311 | 3300014884 | Soil | MSESDIEYRGHFINVQSSESDGNRWRPNAVVSTYHRGALHTQIVSAPIEVLFDSEVAADTHSLAMAKKWIDDNS*
Ga0180063_10142235 | 3300014885 | Soil | DIEYRGHFIDVQSYESEGRRWRPKAILSIYRSGTLHQQILSAPGEVLLESEEAAETYSLAMAKKWIDEQD*
Ga0180085_11909302 | 3300015259 | Soil | ADIEYRSHFIDVQSHESAGKQWRPKAVVSIYHGGVLHQKTVAAPIEVLFDSETAADTYSLGMAKKWIDDHS*
Ga0190272_116767811 | 3300018429 | Soil | MSEADIEYRGHFIDVQSHESDGKQWRPKAVVSSYQGGVLHQKTVAAPIEVLFDSEVAADTYSLAMAKKWIDDNS
Ga0193717_11534431 | 3300020060 | Soil | MSEADIEYRGHFIDVQASESADKRWRPNAVVSLFHGGALHQNTVSAPIGVLFDSETDADTYSLAMAKKWIDHQLVE
Ga0210407_101828541 | 3300020579 | Soil | MSEPIDIEYRGHFIVVQSYESDSKRWRPKALVSIYHAAALQQTVVSAPVDVWFDSEEDAVTYSLAAAKKWIDDQEER
Ga0210378_100134705 | 3300021073 | Groundwater Sediment | MSEAPDIEYRGHFIDVHSYESEGKRWRPKAVVSIYRSGTLHQQILSAPGEVLLESEEAAETYSLAMAKKWIDDH
Ga0210378_102853202 | 3300021073 | Groundwater Sediment | MSEADIEYRGHFIDVQSSESDGTRWRPKAVVSIYHNGALHTKIVSAPIEVLFDSEVAADTYSLAMAKKWIDDNS
Ga0210404_100387471 | 3300021088 | Soil | MNEQPDIEYRGHFIVVQSYESEGTRWRPKALVSIYHSGTVHRKMVVAPVDVRFDSEDAADTYSLALAKKWIDDNDGSPMP
Ga0210404_101490632 | 3300021088 | Soil | MSETDIEYRGHFIDVQSFESDGKRWRPKAVVSIYQGGAVQTRLVSAPVEVLLESEVAADTYSLAMAKKWIDDHS
Ga0210404_102894312 | 3300021088 | Soil | MSEAPDIEYRDHFIEVQSYESDGGRWRPKALVSIYHAGTLHQKCVTAPVDVLCDSEEAADTYSLAMAKKWIDDKR
Ga0210400_111850951 | 3300021170 | Soil | SVSTQSSSCGLTTTPGGEDMNEPVDIEYRGHFIAVQTYESDSKQWRPKALVSIYHAGALQQTIVSAPVDVRFDSEEEAVTYSLAAAKKWIDDHEP
Ga0210408_100222402 | 3300021178 | Soil | MNEPVDIEYRGHFIAVQTYESDSKQWRPKALVSIYHAGALQQTIVSAPVDVRFDSEEEAVTYSLAAAKKWIDDHEP
Ga0210384_1002506711 | 3300021432 | Soil | MNEQPDIEYRGHFIVVQSYESEGTRWRPKALVSIYHSGTVHRKMVVAPVDVRFDSEDAADTCSLALAKKWIDDNNGSPMP
Ga0210384_103279103 | 3300021432 | Soil | MNEQPDIEYRGHYIVVQSSESEGTRWRPKALVSIYHSGTVHRKMVVAPVDVRFDSEDAADTYSLALAKKWIDDNDGSPMP
Ga0210384_103837332 | 3300021432 | Soil | MNEPDIEYRGHFIIVLSYYESDGRQWRPKALVSIYHSGTVHRRIVVAPVEVRFDSEDAADTHSLAMAKKWIDDHAPMA
Ga0210409_100234439 | 3300021559 | Soil | EPIDIEYRGHFIVVQSYESDSKRWRPKALVSIYHAAALQQTVVSAPVDVWFDSEEDAVTYSLAAAKKWIDDQEER
Ga0242656_10668131 | 3300022525 | Soil | DMNEPVDIEYRGHFIAVQTYESDSKQWRPKALVSIYHAGALQQTIVSAPVDVRFDSEEEAVTYSLAAAKKWIDDHEP
Ga0207653_100290942 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHRGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP
Ga0207692_105375562 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | MNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSTYQGGTVHQHSVSAAVEVSFESEDDAVTYSLL
Ga0207699_103896572 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSTYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDD
Ga0207684_1000446815 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP
Ga0207684_1000502716 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSQRWRPKALVSIYHSGTMHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGNPMP
Ga0207684_100137387 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEAPDIEYRGHFIEVQSYEADGGRWRPKALVSIYHAGTLHQKCVTAPVDVLCDSEEAADTYSLAMAKKWIDDKR
Ga0207684_100669994 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEADIEYRGHFIDVQSYESDGKRWLPKAVVSIYHSGAMHTKIVPAPIEVLFDSEAAADTYSLAMAKKWIDDNS
Ga0207684_101168091 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVRSSESDSKRWRPKALVSIYHRGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP
Ga0207684_101263771 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHRGTVHRKIVVAPIEVLFDSKDAADTFSLAMAKKWIDDNAGSPMP
Ga0207684_109509761 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAK
Ga0207684_110088372 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MKEPDIEYRGHYIVVQSSESDSERWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLA
Ga0207684_110135972 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MVIPTTQGGSYMNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSIYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH
Ga0207684_110166102 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEPDIEYRGHYIVVKSYESEGTRWRPKALVSIYHSGTVHRKMIVAPADVRLDSEDAADTYSLSLAKKWIDDHAGSQMP
Ga0207660_103519552 | 3300025917 | Corn Rhizosphere | VSEPPEIEYRGHFIAVRSESDGQRWRPKAVVSIYQRGTLRKQTVKAPDRVLLDSEEAAETYALAIAKKWIDEQ
Ga0207646_101080267 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | DIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP
Ga0207646_101307754 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MDEPDIEYRGHFIIVRSYGSEGTQWRPKALVSIYHSGTVHRRVLVAPVDVRFDSEDAADTCALALAKKWIDD
Ga0207646_102566072 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEALDIEYRGHFIEVQSYESDGKRWRPKAVESLYHGGPLNQKVVSAPIEVLFDSKAEADTYSLAVAKKWIDDNS
Ga0207646_104546131 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MVIPTAQGGSYMNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSIYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH
Ga0207646_117966831 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEAPDIEYRGHFIEVQSYESDGKRWRPKALVSIYQAGTLHQKFVTAPVEVLCDSEEAADTYSLAMAKKWIDDKS
Ga0257180_10203513 | 3300026354 | Soil | MSEADIEYRGHFIDVQSSESDGKRWRPKAVVSIYHSGALHTKIVSAPIEVLFDSEVAADTYSLAMAKKWIDDNS
Ga0257167_10248833 | 3300026376 | Soil | EYRGHFIVVQSYESEGKRWRPKALVSTYHSGTVHRKIVVAPIEVRFDSEDAADTYSLAMAKKWIDDNAGSPMP
Ga0257167_10651251 | 3300026376 | Soil | HFIDVQSFESGGKRWRPKAVASIYQGGAVQTRLVSAPVEVLFESEIAADTYSLAMAKKWIDDHS
Ga0257156_11053252 | 3300026498 | Soil | MNEPDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVRFDSEDAADTYSLAMAKKWIDDNAGSPMP
Ga0257168_10001241 | 3300026514 | Soil | MSQTPDIEYRGHFIAVQAYQSDGQRWRPKALVSIYQGGTVRQTSVSAPVEVSFDSEDDAVTYSLLMAKKWIDDH
Ga0256867_100364913 | 3300026535 | Soil | MGEAPDIEYRGHFIEVRSYESEGKRWRPKAVVSIYRRGTLQRQILSAPGEVLVESEEAAETYSLAMAKKWIDDQ
Ga0209648_100748921 | 3300026551 | Grasslands Soil | VSETPDIEYRDHFIEVQAYQSDGQRWRPKALLSIYQGGTVIQTSVSAPGEVSFDSEDDAVTYSLLMAKKWIDDH
Ga0209648_104195922 | 3300026551 | Grasslands Soil | MNESDIEYRGHYIVVQSSESDSKRWRPKALVSIYHSGTVHRKIVVAPIEVLFDSEDAADTFSLAMAKKWIDDNAGSPMP
Ga0179587_111353131 | 3300026557 | Vadose Zone Soil | MSEADVEYRGHFIDVQSYESDGKRWRPKAVVSIYHSGALHQKIVSAPIEVLFDSETKADTYSLAMAKKWIDDKG
Ga0209217_10431145 | 3300027651 | Forest Soil | MNEQPDIEYRGHFIVVRSFESDGKRWRPKALVSIYHSGTVHRKMVVAPVDVRFDSEDAADTYSLALAKKWIDDNAGSPMP
Ga0209388_10132316 | 3300027655 | Vadose Zone Soil | MNEQPDIEYRGHFIVVQSYESEGQRWRPKALVSIYHSGTMHRKMVVAPVDVRFDSEDAADTFSLAMAKKWIDDNAGSPMP
Ga0209328_101810902 | 3300027727 | Forest Soil | MSEPIDIEYRGHFIVVQSYESDSKRWRPKALVSIYHAGALQQTIVSAPDDVWFDSEEDAVTHSLAAAKKWIDDHEER
Ga0209592_11552302 | 3300027731 | Freshwater Sediment | MSEADIEYRGHFIDVQSHESDGKQWRPKAVVSIYHGGALHQKTVAAPIEVLFDSETAADTYSLGMAKKWIDDNS
Ga0209073_101089292 | 3300027765 | Agricultural Soil | MNTSPDIEYRGHFIEVQAYQSDDRRWRPKALLSIYQGGTVHRHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH
Ga0209701_102898322 | 3300027862 | Vadose Zone Soil | MSQTPDIEYRGHFIAVQAYQSDGQRWRPKALVSIYQGGTVHQTSVSAPVEVSFDSEDDAVTYSLLMAKKWIDDH
Ga0209583_103386621 | 3300027910 | Watersheds | MSEADIEYRGHFIDVQSYESDGKRWRPKAVVSTYHGGALHQQIVSAPIEVMFDSETEADTYSLAVAKKWIDDNSER
Ga0209526_103726743 | 3300028047 | Forest Soil | MNEQPDIEYRGHFIVVRSFESDGKRWRPKALVSIYHSGTVHRKMIVAPADVRFDSEDAADTYSLALAKKWIDDNAGSPMP
Ga0209526_105169262 | 3300028047 | Forest Soil | MNNSPDIEYRGHFIEVQAYQSDGRRWRPKALVSIYQGGTVHQSFVSPPVDVLFDSEDDAVTYSLLMAKKWIDDH
Ga0137415_103758191 | 3300028536 | Vadose Zone Soil | MSEANIEYRGHFIDVQSFESGGKRWRPKAVVSMYQGGALQTRLVSAPVEVLLDSEVAADTYSLAMAKKWIDDHS
Ga0307504_100511382 | 3300028792 | Soil | VSGAPDIEYRGHFIEVQSYESDGRRWRPKALVTIYQAGTLHQKIVTAPGEVLCDSEEAAETYSLALAKQWIDAKG
Ga0299907_100702721 | 3300030006 | Soil | MGEAPDIEYRGHFIEVRSYESESKRWRPKAVVSIYRGGTLHRQTLSAPGEVLVESEEAAETYSLAMAKKWIDDQ
(restricted) Ga0255310_101918252 | 3300031197 | Sandy Soil | MSEADIEYRGHFIDVQSHESDGKQWRAKAVVSIYHGGVLHQKTVAAPIEVLFDSETAADTYSLGMAKKWIDDNS
Ga0299913_100804462 | 3300031229 | Soil | MGEAPDIEYRGHFIEVRSYESEGKRWRPKAVVSIYRGGTLHRQILSAPGEVLVESEEAAETYSLAMAKKWIDDQ
Ga0307469_104276751 | 3300031720 | Hardwood Forest Soil | EPDIEYRGHYIVVRSSESDSKRWRPKALVSIYHRGTVHRKIVVAPIEVRFDSEDAADTYSLAMAKKWIDDNAGSPMP
Ga0307468_1000734781 | 3300031740 | Hardwood Forest Soil | MSEADIEYRGHIIDVQSFESEGKWWRPKAVVSIYHGGTVHTKMVAAPIDVMFDSEVAADTYSLEMAKKWIDDKS
Ga0307473_101769361 | 3300031820 | Hardwood Forest Soil | MVIPTAQGGSYMNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSTYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH
Ga0307479_116871092 | 3300031962 | Hardwood Forest Soil | VIPMAQGGSYMNTSPDIEYRGHFIEVQAYQSDDQRWRPKALLSIYQGGTVHQHSVSAAVEVSFESEDDAVTYSLLMAKKWIDDH
Ga0307471_1020065782 | 3300032180 | Hardwood Forest Soil | MSETDVEYRGHFIDVQSFESDGARWRPKAIVSIYLGGTVHTRMVAAPIDVLFDSEVAADTHALAMAKKWIDDNSSGTKTGGAC
Ga0307471_1034533501 | 3300032180 | Hardwood Forest Soil | MNEPDIEYRGHYIVVRSSESDSQRWRPKALVSIYHRGTVHRKIVVAPIEVLFDSEDAADTYSLAMAKKWIDDN
Ga0307471_1040702482 | 3300032180 | Hardwood Forest Soil | IEYRGHFIDVQSCESEGRRWRPKAILSIYRSGTLHQQILSAAGEVLFESEEAETYSLAMAKKWIDEQG
Ga0214471_101283802 | 3300033417 | Soil | MREAPDIEYRGYFIEVQSYESEGKRWRPKAIVSIYRSGTLHQQILSAPGELLLESEEAAETYSLAMAKKWIDDH




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice