NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F057612



Overview

Basic Information
Family ID F057612
Family Type Metagenome / Metatranscriptome
Number of Sequences 136
Average Sequence Length 85 residues
Representative Sequence LSAQAESRQMAAQAQWETESENKISAAIEPFKALLVRAEKERDEANQAASERARQVQNLEKKLTEASSFLNGWRNGKPTVGAT
Number of Associated Samples 121
Number of Associated Scaffolds 136

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 1.85 %
% of genes near scaffold ends (potentially truncated) 38.24 %
% of genes from short scaffolds (< 2000 bps) 38.97 %
Associated GOLD sequencing projects 113
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.57


Most Common Taxonomy
Group Unclassified (84.559 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (22.794 % of family members)
Environment Ontology (ENVO) Unclassified (39.706 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (50.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 67.57%, β-sheet: 0.00%, Coil/Unstructured: 32.43%

Predicted 3D Structure

pTM-score: 0.57

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
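
The cutoff stated above can be expressed as a small check. This is a minimal sketch: the function name and the label strings are illustrative choices, and only the 0.7 threshold and the 0.57 score come from this page.

```python
# Threshold taken from the note above: pTM < 0.7 is treated as low confidence.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7


def model_confidence(ptm_score: float) -> str:
    """Return 'low' when the pTM-score is under the cutoff, else 'acceptable'.

    The label strings are hypothetical; the page only defines the low case.
    """
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return "low" if ptm_score < PTM_LOW_CONFIDENCE_CUTOFF else "acceptable"


# pTM-score reported for family F057612
print(model_confidence(0.57))
```

A score exactly at the cutoff is not flagged here, because the note uses a strict "less than" comparison.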



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 136 Family Scaffolds
PF00072 | Response_reg | 3.68
PF00578 | AhpC-TSA | 2.94
PF01925 | TauE | 2.21
PF12840 | HTH_20 | 2.21
PF07394 | DUF1501 | 2.21
PF03279 | Lip_A_acyltrans | 1.47
PF00005 | ABC_tran | 1.47
PF07876 | Dabb | 1.47
PF00756 | Esterase | 0.74
PF06210 | DUF1003 | 0.74
PF00847 | AP2 | 0.74
PF12399 | BCA_ABC_TP_C | 0.74
PF07878 | RHH_5 | 0.74
PF13602 | ADH_zinc_N_2 | 0.74
PF11897 | DUF3417 | 0.74
PF13360 | PQQ_2 | 0.74
PF01262 | AlaDh_PNT_C | 0.74
PF02463 | SMC_N | 0.74
PF08388 | GIIM | 0.74
PF16334 | DUF4964 | 0.74
PF02661 | Fic | 0.74
PF01061 | ABC2_membrane | 0.74
PF01053 | Cys_Met_Meta_PP | 0.74
PF07638 | Sigma70_ECF | 0.74
PF04519 | Bactofilin | 0.74
PF13417 | GST_N_3 | 0.74
PF06133 | Com_YlbF | 0.74
PF07396 | Porin_O_P | 0.74
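
The frequencies above are percentages of the 136 family scaffolds, so each one can be converted back to an approximate scaffold count. The sketch below assumes each percentage was computed as count / 136 and rounded to two decimals; that rounding rule is an assumption, not something the page states, and the function name is illustrative.

```python
FAMILY_SCAFFOLDS = 136  # total taken from the table header


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percentage frequency corresponds to."""
    return round(percent / 100 * total)


# A few rows from the Pfam table above
for name, pct in [("Response_reg", 3.68), ("AhpC-TSA", 2.94),
                  ("TauE", 2.21), ("Esterase", 0.74)]:
    print(name, scaffold_count(pct))
```

Under that assumption, the lowest frequency in the table (0.74) corresponds to a single scaffold.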

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 136 Family Scaffolds
COG0730 | Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family | Inorganic ion transport and metabolism [P] | 2.21
COG1560 | Palmitoleoyl-ACP: Kdo2-lipid-IV acyltransferase (lipid A biosynthesis) | Lipid transport and metabolism [I] | 1.47
COG4261 | Predicted acyltransferase, LPLAT superfamily | General function prediction only [R] | 1.47
COG0075 | Archaeal aspartate aminotransferase or a related aminotransferase, includes purine catabolism protein PucG | Amino acid transport and metabolism [E] | 0.74
COG0156 | 7-keto-8-aminopelargonate synthetase or related enzyme | Coenzyme transport and metabolism [H] | 0.74
COG0399 | dTDP-4-amino-4,6-dideoxygalactose transaminase | Cell wall/membrane/envelope biogenesis [M] | 0.74
COG0436 | Aspartate/methionine/tyrosine aminotransferase | Amino acid transport and metabolism [E] | 0.74
COG0520 | Selenocysteine lyase/Cysteine desulfurase | Amino acid transport and metabolism [E] | 0.74
COG0626 | Cystathionine beta-lyase/cystathionine gamma-synthase | Amino acid transport and metabolism [E] | 0.74
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.74
COG1664 | Cytoskeletal protein CcmA, bactofilin family | Cytoskeleton [Z] | 0.74
COG1921 | Seryl-tRNA(Sec) selenium transferase | Translation, ribosomal structure and biogenesis [J] | 0.74
COG1982 | Arginine/lysine/ornithine decarboxylase | Amino acid transport and metabolism [E] | 0.74
COG2008 | Threonine aldolase | Amino acid transport and metabolism [E] | 0.74
COG2873 | O-acetylhomoserine/O-acetylserine sulfhydrylase, pyridoxal phosphate-dependent | Amino acid transport and metabolism [E] | 0.74
COG3679 | Cell fate regulator YlbF, YheA/YmcA/DUF963 family (controls sporulation, competence, biofilm development) | Signal transduction mechanisms [T] | 0.74
COG3746 | Phosphate-selective porin | Inorganic ion transport and metabolism [P] | 0.74
COG4100 | Cystathionine beta-lyase family protein involved in aluminum resistance | Inorganic ion transport and metabolism [P] | 0.74
COG4420 | Uncharacterized membrane protein | Function unknown [S] | 0.74



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 84.56 %
All Organisms | root | All Organisms | 15.44 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300004092|Ga0062389_103218011 | Not Available | 611
3300004643|Ga0062591_100463634 | All Organisms → cellular organisms → Bacteria | 1073
3300005329|Ga0070683_101546746 | Not Available | 638
3300005330|Ga0070690_101261708 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter calcoaceticus/baumannii complex → Acinetobacter calcoaceticus | 591
3300005447|Ga0066689_10408571 | All Organisms → cellular organisms → Bacteria | 849
3300005458|Ga0070681_11313165 | Not Available | 646
3300005574|Ga0066694_10249269 | Not Available | 841
3300005618|Ga0068864_101519176 | Not Available | 673
3300005844|Ga0068862_100908314 | All Organisms → cellular organisms → Bacteria | 866
3300006358|Ga0068871_100399242 | Not Available | 1224
3300006755|Ga0079222_11907827 | Not Available | 580
3300006796|Ga0066665_11125262 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 598
3300006800|Ga0066660_10502474 | All Organisms → cellular organisms → Bacteria | 1019
3300009088|Ga0099830_10463152 | All Organisms → cellular organisms → Bacteria | 1031
3300009137|Ga0066709_104191321 | Not Available | 525
3300011403|Ga0137313_1064469 | Not Available | 651
3300012199|Ga0137383_10211931 | Not Available | 1419
3300012203|Ga0137399_11653789 | Not Available | 528
3300012212|Ga0150985_110303349 | All Organisms → cellular organisms → Bacteria | 653
3300012349|Ga0137387_10514626 | Not Available | 869
3300012349|Ga0137387_10625768 | Not Available | 780
3300012359|Ga0137385_10763943 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 804
3300012363|Ga0137390_10386183 | All Organisms → cellular organisms → Bacteria | 1380
3300012917|Ga0137395_10355624 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1044
3300012923|Ga0137359_10232881 | All Organisms → cellular organisms → Bacteria | 1644
3300012924|Ga0137413_10910704 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 683
3300012930|Ga0137407_11207254 | Not Available | 718
3300013296|Ga0157374_12832491 | Not Available | 512
3300013297|Ga0157378_11150052 | Not Available | 814
3300014055|Ga0119878_1211195 | Not Available | 505
3300014166|Ga0134079_10058735 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1374
3300014489|Ga0182018_10125889 | All Organisms → cellular organisms → Bacteria | 1481
3300014501|Ga0182024_11508528 | Not Available | 767
3300015241|Ga0137418_11275394 | Not Available | 513
3300016371|Ga0182034_11440524 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae → Rhodopirellula | 603
3300018431|Ga0066655_10557598 | All Organisms → cellular organisms → Bacteria → PVC group | 767
3300018482|Ga0066669_11663280 | Not Available | 585
3300024186|Ga0247688_1087915 | Not Available | 531
3300025918|Ga0207662_10975804 | Not Available | 601
3300025939|Ga0207665_10917037 | All Organisms → cellular organisms → Bacteria | 695
3300025944|Ga0207661_11709173 | Not Available | 575
3300026088|Ga0207641_11051928 | All Organisms → cellular organisms → Bacteria → PVC group | 812
3300026095|Ga0207676_11165816 | All Organisms → cellular organisms → Bacteria | 763
3300028145|Ga0247663_1024032 | Not Available | 936
3300028874|Ga0302155_10370641 | Not Available | 613
3300028909|Ga0302200_10048146 | Not Available | 2503
3300029943|Ga0311340_11410327 | Not Available | 550
3300030503|Ga0311370_11443808 | Not Available | 724
3300030987|Ga0308155_1029893 | Not Available | 541
3300031090|Ga0265760_10378933 | Not Available | 509
3300031095|Ga0308184_1042025 | All Organisms → cellular organisms → Bacteria | 568
3300031099|Ga0308181_1095519 | Not Available | 635
3300031231|Ga0170824_117676873 | Not Available | 567
3300034644|Ga0370548_092336 | Not Available | 602
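
Each scaffold identifier above joins two parts with a pipe: a numeric prefix that matches the Taxon OID values listed under Associated Samples, and a scaffold name. A minimal sketch for splitting them follows; the class, function, and field names are illustrative and not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # numeric prefix, e.g. "3300004092"
    scaffold: str   # scaffold name, e.g. "Ga0062389_103218011"


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300004092|Ga0062389_103218011")
print(ref.taxon_oid, ref.scaffold)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later matched against the sample list.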




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 8.09%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.62%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 6.62%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 5.88%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.41%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.68%
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 3.68%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.94%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.94%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 2.94%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.21%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 2.21%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 2.21%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.47%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.47%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.47%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.47%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 1.47%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.47%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.74%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.74%
Wetland | Environmental → Aquatic → Marine → Wetlands → Sediment → Wetland | 0.74%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.74%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.74%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.74%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.74%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.74%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.74%
Palsa | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Palsa | 0.74%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.74%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.74%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.74%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.74%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.74%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.74%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.74%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.74%
Sewage Treatment Plant | Engineered → Wastewater → Industrial Wastewater → Unclassified → Unclassified → Sewage Treatment Plant | 0.74%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001213 | Combined assembly of wetland microbial communities from Twitchell Island in the Sacramento Delta (Jan 2013 JGI Velvet Assembly) | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005329 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG | Environmental
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300005840 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 | Host-Associated
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006237 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2) | Host-Associated
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011403 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT166_2 | Environmental
3300011415 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT469_2 | Environmental
3300011421 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT769_2 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012905 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S013-104B-2 | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300013307 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaG | Host-Associated
3300014055 | Sewage treatment plant microbial communities from Vermont, USA - ANOX_W | Engineered
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300014489 | Permafrost microbial communities from Stordalen Mire, Sweden - 812P2M metaG | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300017948 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_10 | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300020000 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3a1 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300022522 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-11-O (Metagenome Metatranscriptome) (v2) | Environmental
3300024186 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK29 | Environmental
3300024232 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK05 | Environmental
3300024290 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08 | Environmental
3300024317 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK01 | Environmental
3300024331 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09 | Environmental
3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental
3300025939 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes) | Environmental
3300025944 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes) | Environmental
3300026035 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes) | Host-Associated
3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated
3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated
3300026116 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) | Host-Associated
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300028145 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK04 | Environmental
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300028874 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Bog_N3_1 | Environmental
3300028909 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Bog_N3_1 | Environmental
3300029913 | III_Bog_N3 coassembly | Environmental
3300029943 | I_Palsa_N3 coassembly | Environmental
3300029982 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Bog_N3_1 | Environmental
3300030503 | III_Palsa_E3 coassembly | Environmental
3300030518 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Bog_N2_2 | Environmental
3300030987 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_144 (Metagenome Metatranscriptome) | Environmental
3300030990 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome) | Environmental
3300031090 | Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome) | Environmental
3300031095 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_158 (Metagenome Metatranscriptome) | Environmental
3300031099 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_152 (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031945 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082 | Environmental
3300031946 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172 | Environmental
3300032275 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_bottom | Environmental
3300032893 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1 | Environmental
3300034644 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_123 (Metagenome Metatranscriptome) | Environmental

Geographical Distribution
(Interactive map, powered by OpenStreetMap.)




Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
JGIcombinedJ13530_1061638632	3300001213	Wetland	QLTAQAEARQMAAQTQWETESEKKARAAIEPFKMLLTRAEMERDEARQSASESNRQVQSLEKKLTEVSSFLNSWRNGKHLVGAS*
Ga0062389_1032180111	3300004092	Bog Forest Soil	DLITQLTAQAESRHMAAQVLWETELEKKTRAAIEPFKTLLSRAEKERDEAKEIASEGTRQVQNLEKKLTEASSFLNGWRNGKVLVGAA*
Ga0062389_1038581911	3300004092	Bog Forest Soil	EQELVAQITAQAEARHSAARVEWETETEIRTRAAVEPVKGLLIRTENERDEARKSVAEGARQVQDLEKKLTEASSFLNPWRNGRNVPANS*
Ga0062591_1004636343	3300004643	Soil	QASAKQREQDLVAQMTAQAEAHLMAARAQWETESEKRTRAAIEPIEALLERTQKERDEAAQSASGAVRQAQDLEKKLTEASSFLNGWKNGKHLVVAGR*
Ga0066673_105794441	3300005175	Soil	SAQAEARELAAQMEWETAFQTKTRAAIEPLKLQLARTEKERDEANQSASENARQVQNVEKKLTEASSFLNGWRNGKHLEGTA*
Ga0070683_1005744131	3300005329	Corn Rhizosphere	TAQAEARHSAAQVEWETQFQTKARFAIEPVKNQLARAEKERDEARQAASEHALQVQNLEKKLTEASTFLNGWRNGKTVAGAV*
Ga0070683_1015467462	3300005329	Corn Rhizosphere	TAQAEARQTSAQAQWEAETEKKMRVAVEPFRMLLSRTEVERDEARNSSTEALRQVQTLEKRLNEASSFLNTWRNGKHSVGTS*
Ga0070690_1012617081	3300005330	Switchgrass Rhizosphere	REQDLVAQMTAQAEARQMTVQAHWETESEKKARAATEPFKAMLARAEMDRDEARESAAESARQVQNLEKKLTEVSSFLNTWRNGKHLVGAS*
Ga0066388_1023720982	3300005332	Tropical Forest Soil	QKAVQVQWEAEFDKRARLAIEPLKVSLTRAEKERDEARIAASDAARHVQNLEKQLTEASSFLNGWRNGRNPVGA*
Ga0066388_1071634461	3300005332	Tropical Forest Soil	QKAVQVQWEAEFDKRARLAIEPLKVSLTRAEKERDEARIAASEAARHVQNLEKQLTEASSFLNGWRNGGRNAVGA*
Ga0070689_1010613671	3300005340	Switchgrass Rhizosphere	ETEFQTKTRVAIEPLKIQLARIEKERDEANQIATEHARHVQNLEKKLTEASSFLNGWRNGKPLVGA*
Ga0070701_111769781	3300005438	Corn, Switchgrass And Miscanthus Rhizosphere	REQDLVAQLSAQAEARQMAAQVEWETEFQTKTRFAIEPLKTQLARAEKERDEAKEAAAESARQLQNLEKKLAEASSFLNGWRNGKPLVGA*
Ga0070705_1014166562	3300005440	Corn, Switchgrass And Miscanthus Rhizosphere	MAAQVEWETEFQTKTRFAIEPLKTQLARAEKERDEAKEAAAESARQLQNLEKKLAEASSFLNGWRNGKPLVGA*
Ga0066689_104085712	3300005447	Soil	CEQNLVAELTAQAAARQMAAQAQWEADSEKKTRAAIEPFKVLLARAEQERDEARESASEGTRQVQHLETKLTEVSSFLNTWRNGKTVAGAA*
Ga0070681_113131651	3300005458	Corn Rhizosphere	REQDLVSLLHVQAEARQMAAQAQWEAETEKKSRAAVEPLKALLARTEKERDEARQAASEGARQVQNLEKKLTEASSFLSSWRNGRTPVGTA*
Ga0070697_1017475161	3300005536	Corn, Switchgrass And Miscanthus Rhizosphere	AKAEWETEFQTKSRIVIEPIKAQLARIEKERDEALQSASEGARRVQNLEQKLTEASTFLSNWRNGKTVSGAI*
Ga0066707_105789121	3300005556	Soil	RQMAAQAQWEADSEKKTRAAMEPFKALLARAEQERDEARESASEGTRQVQHLETKLTEVSSFLNTWRNGKTVAGAA*
Ga0066694_102492691	3300005574	Soil	KQLEQDLVAELSEQAEARQLAAKVEWETEFQTKTRAAIEPLKLQLARIERERDEAAQSASEHSRQVQTLEKKLTEASSFLNSWRNGKHVVGAS*
Ga0066708_109779653	3300005576	Soil	AAQAQWETESENKLRAAIEPFKSLLVRTEKERDEANLSASESARHMQNLEKKLAEASSFLSGWKNGKHSVGAA*
Ga0066706_110837092	3300005598	Soil	RQMAAQAQWEAESEKKTRAAIEPLKALLARAEQERDEARESASAGTRQVQHLETKLTEVSSFLNTWRNGKTVAGAA*
Ga0066706_113396242	3300005598	Soil	AQLSAQAEARQMAAQVDWETEFHTKTRVAIEPLKMQLARTEKERDEAIQAASESARQLQNLEKKLTEASSFLNGWRNGKPSVGAS*
Ga0068864_1015191761	3300005618	Switchgrass Rhizosphere	QREEELIAQLNAKAEARQMAVQAQWETEFINKTRAVVEPFKVQMARIEKERDEAKQFASERARQVQNLEKKLTEASSFLSSWRNGPTVVGTE*
Ga0068864_1023706391	3300005618	Switchgrass Rhizosphere	QFTAQTEAHLAAARTQWEAESDKKMRAAIEPFRALLARTEKERDEARQSANDSARQVETLEAQLNEASSFLSSWKNGKKLVGA*
Ga0068870_113079431	3300005840	Miscanthus Rhizosphere	AVQAHWETESEKKARAATEPFKATLARVEFERDEARESASESARQVQNLEKKLTEVSSFLNSWRNGKHLVGTS*
Ga0068862_1009083141	3300005844	Switchgrass Rhizosphere	VAQMTAQAEARQMTVQAHWETESEKKARAATEPFKAMLARAELDRDEARESAAESARQVQNLEKKLTEVSSFLNTWRNGKHLVGAS*
Ga0068862_1021677412	3300005844	Switchgrass Rhizosphere	ESEKRTRAAIEPIEALLERTQKERDEAAQSASGAVRQAQDLEKKLTEASSFLNGWKNGKHLVVAGR*
Ga0070716_1008934732	3300006173	Corn, Switchgrass And Miscanthus Rhizosphere	EARQKAAQALWEAEADKRARAAIEPFKLQLARAERERDEAKQSAFENARQAQNLEKQLTEASSFLNGWRNGGKNAVGAS*
Ga0097621_1008488142	3300006237	Miscanthus Rhizosphere	LIAQLTEQAEARHKAAQSEWETEFVTKTRVAIEPFKVQLARTEKERDEAKQSAFETARKAQSLEKKLTEASTFLSGWRNGKDWVGAE*
Ga0097621_1017874472	3300006237	Miscanthus Rhizosphere	QAEARQMAARAQWETEAEKKARAAIEPFKGLLARAESERDEAKQTAFEKSRQVESLEKKLTEASSFLNGWRNGKSLAGAA*
Ga0068871_1003992421	3300006358	Miscanthus Rhizosphere	REQELIAQMNAQTDARLSAAQAQWESESDKRSRAATEPLRALLARTEKERDDARQSAFETERQVQSLEKKLTEASTFLNGWRNGKGLVAAP*
Ga0079222_119078271	3300006755	Agricultural Soil	REQDLVAQLNAQAEVRLMAAQALWEREAEKKARASVESYKSQLARTEKERDDAKQAASEGARQVQTLEKKLTEASTFLSGWRNGRNLVED*
Ga0066665_111252621	3300006796	Soil	QAMAMQREQDLVAQLNVQAEARQLAAKVEWETEFQTKTRAAIEPLKLQLARTERERDDAAQSASENARQLQNLEKKLTEASSFLNGWRNGKHVVGAS*
Ga0066660_105024741	3300006800	Soil	QAMGKQREQDLVAQLSAQAEARLMAAQAQWETESENKMRGAIEPFKALLVRTEKERDEANQSASESARHMQNLEKKLTEASSFLNGWKNGKPSVGAA*
Ga0075434_1019014391	3300006871	Populus Rhizosphere	EARQMAVQAQWETEFINKTRAVVEPFKVQMARIEKERDEAKQFASERARQVQNLEKKLTEASSFLSSWRNGPTVVGTE*
Ga0075426_102680433	3300006903	Populus Rhizosphere	RQMAAQAVWENEAEKRTRAAIEPFKTLLARTEKERDEARQAATEGSRQVQNLEKKLTEASSFLNGWRNGGKSVMPVTS*
Ga0075436_1005809141	3300006914	Populus Rhizosphere	QAEARQMAAQAVWENEAEKRTRAAIEPFKTLLARTEKERDEARQAATEGSRQVQNLEKKLTEASSFLNGWRNGGKSVMPVTS*
Ga0075435_1013302401	3300007076	Populus Rhizosphere	AQAQWEAESEKKVHAAVEPFKAQLARVEKERDDARQSSFDTTRQVQNLEKKLTEASTFLNGWRNGKSLVGSS*
Ga0075435_1016961682	3300007076	Populus Rhizosphere	RHSAAKAEWETEFLTKSRIVIEPIKAQLARTEKERDEALQSASEGARRAQHLEQKLTEASTFLSSWKNGKTVSGAV*
Ga0066710_1022758132	3300009012	Grasslands Soil	NKEWETELGKMRNTIEPLEALLARTEKERDEARQSASESTFQVREFEKKLTEASSLLTGWKSGNGKHLVGAQNGRNGS
Ga0099829_112660843	3300009038	Vadose Zone Soil	ELTAQADVRQRAAQAQWETESEKKTRAAIEPFKSLLSRTEKERDEARQSASEAARKSQDMEKKLTEASSFLNGWKNGNTLLAETR*
Ga0099830_104631522	3300009088	Vadose Zone Soil	AKARQREQNLAAELSAQAETRQVAAQAQWEAESDRKVRAALEPLKVLLARAEQERDEARESASEGTRQVQHLETKLTEVSSFLNTWRNGKTVAGAA*
Ga0105245_125305291	3300009098	Miscanthus Rhizosphere	QMNAQTEARLSAAQAQWESESDKRTRAATEPLRALLARTEKERDDARQSAFETERQVQSLEKKLTEASTFLNGWRNGKSMVAAP*
Ga0075418_131559462	3300009100	Populus Rhizosphere	VAQAQWQTEAEKKARAAIEPFKALLARTEEERDEARQSASEGARQVQNLEKKLTEASSFLNSWRNGPTVVGAA*
Ga0066709_1041913211	3300009137	Grasslands Soil	EAAQARTRQREEELVAQLTAQAEARQTAAQAKWETETEHKTRAAIEPFKLLLARTEKERDEARQSASVAFSHVQDLEKKLTEASSFLTAWKDGKDLLAAGR*
Ga0105248_125262681	3300009177	Switchgrass Rhizosphere	AQAKDHQVAAQQWETELGMARGTIEPLRELLARTEKERDEARCSASEGVRQVQNLEKQLMEASSFLTSWRNGKNSVEA*
Ga0134086_101465622	3300010323	Grasslands Soil	AQAQWETESENKMRAAIEPFKALLVRAEKERDEANQSASESARQVQNLEKKLTEASSFLNGWKNGKHLVGAA*
Ga0134065_104134711	3300010326	Grasslands Soil	SAEAEAAKVEWETEFQTKTRAAVEPLKLQLARIERERDEAAQTATEHARQVQTLEKKLTEASSFLNSWRNGKHAVGAS*
Ga0134111_101518832	3300010329	Grasslands Soil	SAQAEARQLAAQMEWETAFQTKTRAAIEPLKLQLARTEKERDEATQSASESARQVQNMEKKLTEASSFLNGWRNGKHVVGAS*
Ga0126379_120846932	3300010366	Tropical Forest Soil	ELTAQAEERHSAAKVEWETQFLTKTRMAIDPIKAQLARAEKERDEAKESASEGVRRVQTLEQKLNEASTFLSNWRNGKTVSGAV*
Ga0137391_112693371	3300011270	Vadose Zone Soil	EARQMAAQAHWDAETEKKTRAAIEPFKALLVRAEEERNEAKQSASEAVRKVQHLEKKLTEASSFLNGWRNGDPVLTPAR*
Ga0137313_10644691	3300011403	Soil	LVAQLTAQAEARQVAAKAQWESESEARTRAAVEPYKSQLARAEVEREDARESASDAIRQVQHLEKKLNEASTFLSGLRNGNHPVVTGR*
Ga0137325_11042072	3300011415	Soil	QREQDLTAQLQAQWETESEKKAHEAIEPFKALLARTEKERDEAKQVAAGALSQVQELDKKLNEASSFLNGWRNGKSFISR*
Ga0137462_11721061	3300011421	Soil	QMAAHAQWETESDKKTRAAIEPLRARLARTEKERDEANQSASGAVSQVQDLEKKLTEASSFLNGWKNGKNLVGAGR*
Ga0137383_102119311	3300012199	Vadose Zone Soil	DLVAELTAQADVRQRAAQAQWDTESEKKTRAAVEPFKALLARAEQERDEARQSASESTRHAHHLEKKLTEASSFLNGWRNGKKLVEAA*
Ga0137399_116537891	3300012203	Vadose Zone Soil	AQLTAQAEARQMATLAQWETEAEKRLRAAVEPFKVQLARLEKERDEAKLSADEGTRQVQNLEKKLTEASTFLSSWRNGKALVGAG*
Ga0137399_118051391	3300012203	Vadose Zone Soil	QMAAQAQWEAESEKKTRAAIEPFKALLARTEKERDEAKQFASESIHHAQSLEKKLTEVSSFLNGWKNGKQLVRAS*
Ga0137374_100389541	3300012204	Vadose Zone Soil	LVAELSAEAEARQLAAKVEWETEFQTKTRAAIEPLKLQLARIERERDEAAQSASESARQLQNLEKKLTEASSFLNGWRNGKHVVGTS*
Ga0137362_107805262	3300012205	Vadose Zone Soil	DLINQLTVQMEARQTAAHAEWVSETEFKTRAAVEPFKSLLARTEKERDEAIQAASEGAHQVQSLEKKLTEASTFLNGWRNGKKLFGAAS*
Ga0137380_113203641	3300012206	Vadose Zone Soil	DLVAQLSAQAEARQMAAQSQWETESENKMRAAIEPFKALLVRAEKERDEANQSASESAHQVQNLQKKLTEASSFLNSWRDGKRLVGEA*
Ga0137376_108772592	3300012208	Vadose Zone Soil	LSAQAESRQMAAQAQWETESENKISAAIEPFKALLVRAEKERDEANQAASERARQVQNLEKKLTEASSFLNGWRNGKPTVGAT*
Ga0137377_102954133	3300012211	Vadose Zone Soil	NAQAEARQLAAQAQWETESENKMRAAIEPFKALLVRTEKERDEANHSASETAFRVQNLEEKLTEASSFLSGWRNGKHSAGPSSARNDREDKSK*
Ga0150985_1103033492	3300012212	Avena Fatua Rhizosphere	ARQREQDLVTQAVAKAEAQHLAARAQWESEAEKKALAAIEPFKAMLTRIEKERNEAEQAAAESARQVQTLEKKLTEASSFLSGWRNGKDLVRPV*
Ga0137387_105146261	3300012349	Vadose Zone Soil	AKARQREQNLVAELTAQAEARQMAAQAQWETESEKKTRAAIEPFKALLARAEQERDEARESASEGTRQVQHLETKLTEVSSFLNTWRNGKTVAGAA*
Ga0137387_106257681	3300012349	Vadose Zone Soil	KDEAAQAKARQREQDLTVQLTAQAEAHQMAAQAQWEAESEKKTRVAIEPFKALLARTEKERDEAKQFASESIHHAQSLEKKLTEVSSFLNGWKNGKQLVRAT*
Ga0137372_103019373	3300012350	Vadose Zone Soil	MAAQAQWETESERRTRAAIEPFKALLARTEKERDEARQTASENSRQVQNLEKQLTEASAFLNAWRNGNNSVASAR*
Ga0137371_106143071	3300012356	Vadose Zone Soil	ARQMAAQVDWETEFHTKTRVAIEPLKMQLARTEKERDEAIQAASESARQLQNLEKKLTEASSFLNGWRNGKPSVGAS*
Ga0137385_101525141	3300012359	Vadose Zone Soil	QAEARQMAAQTQWETESENKMRAAIEPFKALLVRAEKERDEANLSASESARHMQNLVKKLTEASSFLNGWKNAKD*
Ga0137385_104780462	3300012359	Vadose Zone Soil	LVAQLSAQAEARQMAAKVEWETEFQTKTRAAIEPFKLQLARTEKERDEANQSASESARQVQNVEKKLTEASSFLHGWKNGKPLAGAA*
Ga0137385_107639431	3300012359	Vadose Zone Soil	NLVAELSAQAETRQMASQAQWEAESEKKTRAAIEPLKVLLARAEQERDEARESASEGTRQVQHLETKLTEVSSFLNTWRNGKTVSGAA*
Ga0137360_108724541	3300012361	Vadose Zone Soil	QMAAQAQWEAESDRKVRAALEPLKVLLARAEQERDEARESASAGTRQVQHLETKLTEVSSFLNTWRNGKTVSGAA*
Ga0137390_103861831	3300012363	Vadose Zone Soil	KDETAQAKAKQREQDLVAEFTAKAEARRMAAQAQWDTELEKKTRAAIEPLKVLLARAEQERDEARESASEGTRQVQHLETKLTEVSSFLNTWRNGKTVAGAA*
Ga0157296_103648781	3300012905	Soil	LMAARAQWETESEKKARAAIEPFETLLVRTQKERDEALQSASGAVRQAQDLEKKLTEASSFLNGWKNGKHLVVAGR*
Ga0137395_103556242	3300012917	Vadose Zone Soil	GTDMRKEMQQKLDAAHAVAKQRELYLVAQLSARAEVRQMAAQAQWETASENEMRAAIEPFKALLVRTEKERDEANQSASESARHMQNLEKKLTEASSFLNSWKNGH*
Ga0137359_102328811	3300012923	Vadose Zone Soil	QARAKQREQELIAELTAQAEARHLTAQTEWETEFQTKTRVAIEPLKVQLSRIEKERDEALQSASEGARQVQNLEKKLTEASSFLNSWRNGKTVSGAA*
Ga0137359_112609121	3300012923	Vadose Zone Soil	AEWVSETEFKTRAAVEPFKSLLARTEKERDEAIQAASEGAHQVQSLEKKLTEASTFLNGWRNGKKLFGAAS*
Ga0137413_109107041	3300012924	Vadose Zone Soil	KQKEEAAHARFKQREQDLATQLTAQMEARQTAAHAEWVSETEFKTRAAVEPFKALLARTEKERDEAIQAASEGAHQVQNLEKKLTEASTFLSGWRNGKKLFGAAS*
Ga0137407_112072541	3300012930	Vadose Zone Soil	EAAQAVAKQREQELVAQLSAQAEARQISAQVEWETEFHTKTRFAIEPLKIQLARTEKERDDAIQAATESARQLQNLEKKLTEASSFLNGWRNGKPTVGAT*
Ga0137407_119478721	3300012930	Vadose Zone Soil	SAEAEARQLAAKVEWETEFQTKTRAAIEPFKLQLARTEKERDEANQSASESARQVQNVEKKLTEASSFLNGWKNGKPLVGVG*
Ga0137410_101255981	3300012944	Vadose Zone Soil	TAAHAEWVSETEFKTRAAVEPFKALLARTEKERDEAIQAASEGAHQVQNLEKKLTEASTFLSGWRNGKKLFGAAS*
Ga0157374_128324911	3300013296	Miscanthus Rhizosphere	AEARQAAAQARWETEAEKKARAAIEPFKAQVVRAEKERDEAKQAAAEHSRQMQSLEKKLTEASSFLNGLRGTKPAAENALSWALDEKVENHR*
Ga0157378_111500522	3300013297	Miscanthus Rhizosphere	QKDEAAQARVRQREQDIVAELTAQAEARQVAIQALWEAESEKKIRAAIEPLKTLLTRAEEERDAAKASALDGVRQVQHLEKKLMEVSSFLSTWGNKGVPRGS*
Ga0157372_122015192	3300013307	Corn Rhizosphere	LSAAQAQWESESDKRSRAATEPLRALLARTEKERDDARQSAFETERQVQSLEKKLTEASTFLNGWRNGKGLVAAP*
Ga0119878_12111951	3300014055	Sewage Treatment Plant	LTAQAEAQQAAAQAQWQAEAEEKTRAAVEHFKGLLARAEIERDEAKQFAAEGFRQVQNLEKKLTDASSLLAGWKNGKSAVASPGWEG*
Ga0134079_100587352	3300014166	Grasslands Soil	QAMARQREQDLVAQLSAEAEAAKVEWETEFQTKTRAAVEPLKLQLARIARERDEAAQTATEHARQVQTMEKKLTEASSFLNSWRKGKHAVGAS*
Ga0182018_101258891	3300014489	Palsa	EQDLVNQLTAQLEARHLAVLAKWEAESEKKTRAATGPLKEMLVRAEKERDEARQSASEATRQAQNLEKKLTEASSFLNGWRNGKHLVGAA*
Ga0182024_115085281	3300014501	Permafrost	REQDLVAQLTSQSEARHLAVLAKWEAESEKKTRAATEPLKEMLVRAEKERDEARQVASEGKRQLQNMEKKLAEASSFLNGWANGKILAGAG*
Ga0137418_112753941	3300015241	Vadose Zone Soil	KTRQREEDLVAQLTAQAEARQMATLAQWETEAEKRLRAAVEPFKVQLARLEKERDEAKLSADEGTRQVQNLEKKLTEASTFLSSWRNGKALVGAG*
Ga0137409_109121732	3300015245	Vadose Zone Soil	LWEAETEKRTLTAIEPFKAMLARMEKERAEAEQSAAENARQVQNLEKKLTEASSFLSGWRNGKKLVAER*
Ga0182034_114405241	3300016371	Soil	KARQREQDLVAQLTAQAEARSMASLAQWETESEKRMRAAIEPFKVLLANTEKERDDARLAASDGARQVEHLEKKLTEASSFLSSWRNGAKALVEST
Ga0182039_112760131	3300016422	Soil	QRHSAAKAEWETEFQTKTRIVLDPIKAQLSRTEKERDEARHSATEGARRVQNLEKKLTEASTFLSNWGNGKSVSGAV
Ga0187847_107181112	3300017948	Peatland	AQALWETETDKKIRLAVEPIKAQLARAEEERDEAMQAYSEGARHVQNLEKKLTEASTFLNGWKNGKHLVTGLNE
Ga0184623_102959003	3300018056	Groundwater Sediment	RQMAAQAQWETESENKMRAAIEPFKAQLVRTEKERDEANQSASESARQVQNVEKKLTEASSFLNGWKNGKHLVVAGR
Ga0066655_105575982	3300018431	Grasslands Soil	VAQLSALAEARQMAAQAQWETDSENKMRAAIEPFKALLVRAEKERDESNQFAAESARQVQNLKKKLTEASSFLNTWRDGKPTESNLSWHLDEKAGNHGR
Ga0066669_116632801	3300018482	Grasslands Soil	LTAHAEARQIAAHAKCETESEQKTRPAIEPFKRQLARTEKERDEARQAATEGARHVQNLEKKLTEASSFLNTWRDGKPAENALSWQLDEKAGNHGR
Ga0193692_10026371	3300020000	Soil	EARFVGAQAQWETESEKKLRAALEPLKAQLANAEKERDEARLSASDTARDVANLEQKLTEASSFLSAWKNGKQQLVSA
Ga0179592_100274844	3300020199	Vadose Zone Soil	LVAQLTAQMEARQTAAHAEWVSETEFKTRAAVEPFKSLLSRTEKERDEAIQAASEGAHQVQNLEKKLTEASTFLSGWRNGKKLFGAAS
Ga0210401_105725591	3300020583	Soil	QMEARQSAAHAEWVSETEFKTRAAVEPFKALLARTEKERDEAIQAASEGAHQVQNLEKKLTEASTFLSGWRNGKKLFGAAS
Ga0210391_109922401	3300021433	Soil	LAVLAKWEAESEKKTRAATEPLKEMLVRAEKERDEARQAASEGRRQLQNLEKKLAEASTFLNGWGNGKVLAGADR
Ga0242659_11125601	3300022522	Soil	QSQWETETEKKTRNAVEPFKALLVRTEKERDEALHIASESARQVQSLEKKLTEASTFLNGWRNGKNLVGADR
Ga0247688_10879152	3300024186	Soil	KEEAAQARAEHREQELVAQLTAQADARQKAVHVQWEAEFDKRTRLAIEPLKVSLTRAEKERDDARVAASEAARHVQNLEKQLTEASSFLNGWRNGGRNVVGA
Ga0247664_10245371	3300024232	Soil	WEAEADKRARAAIEPFKLQLARAERERDEAKQSAFENARQAQNLEKQLTEASSFLNGWRNGGKNAVGAS
Ga0247667_10392772	3300024290	Soil	LIAQLTGQAEARQKAAQALWEAEADKRARAAIEPFKLQLARAERERDEAKQSAFENARQAQNLEKQLTEASSFLNGWRNGGKNAVGAS
Ga0247660_10890141	3300024317	Soil	LSAEAEAAKVEWETEFQTKTRAAVEPLKLQLARIERERDEAAQSASEHARQVQTLEKKLTEASSFLNSWRNGKHAVGAS
Ga0247668_10263652	3300024331	Soil	ARQKAAQALWEAEADKRARAAIEPFKLQLARAERERDEAKQSAFENARQAQNLEKQLTEASSFLNGWRNGGKNAVGAS
Ga0207662_109758041	3300025918	Switchgrass Rhizosphere	ARARQREQDLVAELTAQAEVRQMAARALWETESEKKTRAAIEPFKTLLARTETERDEARQSVVENTRHVQDLKRKLNDASSLLASWKNGDDLVGTRS
Ga0207665_109170372	3300025939	Corn, Switchgrass And Miscanthus Rhizosphere	KARQREQDLVAQLSAQADARYLAAQAKWESDTQHITRAAVEPFKVQLARIEKERDEARQSASEGTHQVQNLEKKLTEASSFLNGWRNGGKNAVGAS
Ga0207661_117091732	3300025944	Corn Rhizosphere	TAQAEARQTSAQAQWEAETEKKMRVAVEPFRMLLSRTEVERDEARNSSTEALRQVQTLEKRLNEASSFLNTWRNGKHSVGTS
Ga0207703_114597041	3300026035	Switchgrass Rhizosphere	TLQAHWETESEKKARAATEPFKAMLARAEMDRDEARESAAESARQVQNLEKKLTEVSSFLNTWRNGKKPVGAS
Ga0207641_110519281	3300026088	Switchgrass Rhizosphere	QANFAQREQDLVTQLTAQAEARQMAARAQWETEAEKKARAAIEPFKGLLARAESERDEAKQTAFEKSRQVETLEKKLTEASSFLNGWRNGKSLAGAA
Ga0207676_111658162	3300026095	Switchgrass Rhizosphere	QREEELIAQLNAKAEARQMAVQAQWETEFINKTRAVVEPFKVQMARIEKERDEAKQFASERARQVQNLEKKLTEASSFLSSWRNGPTVVGTE
Ga0207674_118729392	3300026116	Corn Rhizosphere	QAEARHSAAQVEWETQFQTKARFAIEPVKNQLARAEKERDEARQAASEHALQVQNLEKKLTEASTFLNGWRNGKTVAGAV
Ga0209056_101741842	3300026538	Soil	SAQAETRQMAAQAQWEAESDRKVRAALEPLKVLLARAEQERDEARESASAGTRQVQHLETKLTEVSSFLNTWRNGKTVSGAA
Ga0209588_11739101	3300027671	Vadose Zone Soil	ELTAQAEARQTAAQAYWETETEKKTRAAIEPFKALLARAEEERNEAKQSASEAVRKVQHLEKKLTEASSFLNGWRNGDPVLTPAR
Ga0247663_10240322	3300028145	Soil	MQQKEEAAQARAEHREQELVAQLTAQADARQKAVHVQWEAEFDKRTRLAIEPLKVSLTRAEKERDDARVAASEAARHVQNLEKQLTEASSFLNGWRNGGRNVVGA
Ga0268265_108765063	3300028380	Switchgrass Rhizosphere	QWETESEKRTRAAIEPIEALLERTQKERDEAAQSASGAVRQAQDLEKKLTEASSFLNGWKNGKHLVVAGR
Ga0302155_103706411	3300028874	Bog	DLVAQLNSQSEARHLALLAKWEAESEKKTRAATEPLKEMLVRAEKERDEARQVASEGRRQLQNMEKKLAEASSFLNGWGNDKILAGAG
Ga0302200_100481461	3300028909	Bog	AKARQREQDLVAQLNSQSEARHLALLAKWEAESEKKTRAATEPLKEMLVRTEKERDEARQVASEGRRQLQNMEKKLAEASSFLNGWGNDKILAGAG
Ga0311362_101901363	3300029913	Bog	EARQAAAQALWETETDKKIRLAVEPIKAQLARAEEERDEAMQAYSEGARHVQNLEKKLTEASTFLNGWKNGKHLVTGLNE
Ga0311340_114103271	3300029943	Palsa	QKEEAAVDKAKLREQDLITQLNAQAESRLLAAQVQWETEAEKRSRAAIEPFKALLAKTEKERDEAKHTAFEGSRHVQDLEKKLTEASSFLNGWRNGKILVEAA
Ga0302277_12363282	3300029982	Bog	SERKTYAALETLKTALARSENERDEAKQTAAESARHVLNLEKKLTEASSFLNGWSNGKHLAEVV
Ga0311370_114438082	3300030503	Palsa	DLVAQLTAQAEARQMAAQARWETESEKTTRATIEPLKALLARAEKERDEARQTASESARQALNLEKKLTEASTFLNGWRNGNHLAVAA
Ga0302275_101591111	3300030518	Bog	QFTAQAEARQAAAQALWETETDKKIRLAVEPIKAQLARAEEERDEAMQAYSEGARHVQNLEKKLTEASTFLNGWKNGKHLVTGLNE
Ga0308155_10298931	3300030987	Soil	KQREQDLVAELSAQAEARRLAARVEWETEFQTKTRAAIEPFKLQLARTEKERDEANQSASESARQVQNVEKKLTEASSFLNGWKNGKPLVGAT
Ga0308178_10575152	3300030990	Soil	MAAEAQWEKESEKKMHAALSPFRALLVRTEKERDEARQSASVALSHVQELDTKLTEASSFLSSWKNGRDLLTRVEEKVGNHGR
Ga0265760_103789331	3300031090	Soil	QREQELVAQLTAQAEARQAAVMAQWDAESEKRTRSAIEPFKTLLSRTEMERDEARQTAAEATRHVQNLEKKLTEASSFLNGWRNGKVLVGAD
Ga0308184_10420252	3300031095	Soil	AAQAEARQREQNLVAQLTAQAEAHLMATEEQWEKESEKKMHAAISPFRALLVRTEKERDEARQSASVAVSHVQDLEKKLTEVSSFLNAWKDGKDLVAAGR
Ga0308181_10955191	3300031099	Soil	QAEARQREQSLVAQLTAQADAHLMAAEEQWEKESEKKMHAALSPFRALLARTENERDEARQSASVALSHVQELDKKLTEASSFLSAWKNGKDLLARVEEKVGNHGR
Ga0170824_1176768731	3300031231	Forest Soil	SRQREQTLIAELTSQAEARQMAAQAQWETESEKKARAAIEPFKALLARTEEERDEARQSALDGSRQVQNLEKKLTEASSFLNGWRNGTTVVGAE
Ga0170824_1224201021	3300031231	Forest Soil	ACAQWEAESEMKMRAAIEPFKASVARAEKERDEARQVSAESASHVQHLEKKLTEASSFLNSWRNGKSPVGTA
Ga0310686_1187440801	3300031708	Soil	EARQMAAQALWDMESEKRTRTAIEPFKALLARSEKERDEARQTASESARQVQTLEKKLTEASSFLNGWRNGNHLVTAA
Ga0307478_106010482	3300031823	Hardwood Forest Soil	DLVTQLTAQAEARQMAAQALWEREAEKKARAAIEPFRAQLARIEKERDEAKQSAFEGLRQVESLEKKLTEASTFLNGWRNGKNFVETD
Ga0310913_103615353	3300031945	Soil	RHEHELVTQLNAQVEARQAAARAQWESESENRTRAAVEPLRAMLGRAEKERDEAKLAASEHVRQVQNMEKKLAEASAFFNTWRNGKPMVGAS
Ga0310910_110225522	3300031946	Soil	QRHEHELVTQLNAQVEARQAAARAQWESESENRTRAAVEPLRAMLGRAEKERDEAKLAASEHVRQVQNMEKKLAEASAFFNTWRNGKPMVGAS
Ga0315270_101366802	3300032275	Sediment	EANLLAAETQWEAESEKRTLAAIEPFKVMLARMEKERSEAEQSAAESARQVQNLEKKLTEASSFLNGWKNGKHLAVAGR
Ga0335069_106680542	3300032893	Soil	EARQIAAQAQWESEAEEKARASVEFFKGLVTRPEKERDEARVSATEATRHVQTLEKKLTEASSFLSTWRNGHDLVGNSR
Ga0370548_092336_314_586	3300034644	Soil	VAKQREQDLGAQLSAQAEARQIAAQAQWEKESENKMRAAIEPFKVLLVRAEKERDEATQSASESARHMQSLEKKLTEASSFLNGWKNAKH




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice