NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F062122

Metagenome / Metatranscriptome Family F062122

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F062122
Family Type Metagenome / Metatranscriptome
Number of Sequences 131
Average Sequence Length 81 residues
Representative Sequence MNSLYRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNPPGMWIGRISTQLVLRIQRYKTAEKVPQQILRRAA
Number of Associated Samples 77
Number of Associated Scaffolds 131

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 70.23 %
% of genes near scaffold ends (potentially truncated) 39.69 %
% of genes from short scaffolds (< 2000 bps) 66.41 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score0.26

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (71.756 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil
(29.771 % of family members)
Environment Ontology (ENVO) Unclassified
(48.092 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(56.489 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 59.43%    β-sheet: 0.00%    Coil/Unstructured: 40.57%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.26
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 131 Family Scaffolds
PF06745ATPase 4.58
PF01435Peptidase_M48 3.82
PF07730HisKA_3 3.82
PF04366Ysc84 3.82
PF00882Zn_dep_PLPC 3.82
PF11752DUF3309 3.05
PF02518HATPase_c 3.05
PF13502AsmA_2 2.29
PF05163DinB 1.53
PF08448PAS_4 1.53
PF00990GGDEF 1.53
PF00072Response_reg 0.76
PF00106adh_short 0.76
PF13477Glyco_trans_4_2 0.76
PF00912Transgly 0.76
PF00196GerE 0.76
PF13426PAS_9 0.76
PF13474SnoaL_3 0.76
PF04055Radical_SAM 0.76
PF04773FecR 0.76
PF04264YceI 0.76
PF04203Sortase 0.76
PF12704MacB_PCD 0.76
PF13480Acetyltransf_6 0.76
PF08530PepX_C 0.76
PF04185Phosphoesterase 0.76
PF07689KaiB 0.76
PF01263Aldose_epim 0.76

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 131 Family Scaffolds
COG2930Lipid-binding SYLF domain, Ysc84/FYVE familyLipid transport and metabolism [I] 3.82
COG3850Signal transduction histidine kinase NarQ, nitrate/nitrite-specificSignal transduction mechanisms [T] 3.82
COG3851Signal transduction histidine kinase UhpB, glucose-6-phosphate specificSignal transduction mechanisms [T] 3.82
COG4564Signal transduction histidine kinaseSignal transduction mechanisms [T] 3.82
COG4585Signal transduction histidine kinase ComPSignal transduction mechanisms [T] 3.82
COG2318Bacillithiol/mycothiol S-transferase BstA/DinB, DinB/YfiT family (unrelated to E. coli DinB)Secondary metabolites biosynthesis, transport and catabolism [Q] 1.53
COG0676D-hexose-6-phosphate mutarotaseCarbohydrate transport and metabolism [G] 0.76
COG0744Penicillin-binding protein 1B/1F, peptidoglycan transglycosylase/transpeptidaseCell wall/membrane/envelope biogenesis [M] 0.76
COG2017Galactose mutarotase or related enzymeCarbohydrate transport and metabolism [G] 0.76
COG2353Polyisoprenoid-binding periplasmic protein YceIGeneral function prediction only [R] 0.76
COG2936Predicted acyl esteraseGeneral function prediction only [R] 0.76
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 0.76
COG3764Sortase (surface protein transpeptidase)Cell wall/membrane/envelope biogenesis [M] 0.76
COG4953Membrane carboxypeptidase/penicillin-binding protein PbpCCell wall/membrane/envelope biogenesis [M] 0.76
COG5009Membrane carboxypeptidase/penicillin-binding proteinCell wall/membrane/envelope biogenesis [M] 0.76


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms71.76 %
UnclassifiedrootN/A28.24 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001593|JGI12635J15846_10017056All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium6020Open in IMG/M
3300001661|JGI12053J15887_10556624Not Available546Open in IMG/M
3300002680|Ga0005483J37271_124290Not Available506Open in IMG/M
3300002681|Ga0005471J37259_110732All Organisms → cellular organisms → Bacteria684Open in IMG/M
3300002908|JGI25382J43887_10379232Not Available597Open in IMG/M
3300002914|JGI25617J43924_10253667Not Available596Open in IMG/M
3300004099|Ga0058900_1431819All Organisms → cellular organisms → Bacteria1240Open in IMG/M
3300004100|Ga0058904_1023663All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia5188Open in IMG/M
3300004100|Ga0058904_1398503All Organisms → cellular organisms → Bacteria755Open in IMG/M
3300004100|Ga0058904_1425753All Organisms → cellular organisms → Bacteria835Open in IMG/M
3300004102|Ga0058888_1442230All Organisms → cellular organisms → Bacteria530Open in IMG/M
3300004104|Ga0058891_1538591All Organisms → cellular organisms → Bacteria651Open in IMG/M
3300004116|Ga0058885_1000466Not Available548Open in IMG/M
3300004132|Ga0058902_1232165All Organisms → cellular organisms → Bacteria1038Open in IMG/M
3300004133|Ga0058892_1008811All Organisms → cellular organisms → Bacteria819Open in IMG/M
3300004133|Ga0058892_1312484All Organisms → cellular organisms → Bacteria1082Open in IMG/M
3300004135|Ga0058884_1006481Not Available557Open in IMG/M
3300004135|Ga0058884_1011274All Organisms → cellular organisms → Bacteria1185Open in IMG/M
3300004135|Ga0058884_1013439All Organisms → cellular organisms → Bacteria → Proteobacteria3761Open in IMG/M
3300004135|Ga0058884_1380935All Organisms → cellular organisms → Bacteria521Open in IMG/M
3300004135|Ga0058884_1407246All Organisms → cellular organisms → Bacteria998Open in IMG/M
3300004135|Ga0058884_1409497Not Available535Open in IMG/M
3300004139|Ga0058897_10859526All Organisms → cellular organisms → Bacteria884Open in IMG/M
3300004139|Ga0058897_11141592All Organisms → cellular organisms → Bacteria640Open in IMG/M
3300004139|Ga0058897_11182436All Organisms → cellular organisms → Bacteria → Proteobacteria1484Open in IMG/M
3300004139|Ga0058897_11184870All Organisms → cellular organisms → Bacteria668Open in IMG/M
3300004631|Ga0058899_12190875Not Available548Open in IMG/M
3300005537|Ga0070730_10000671All Organisms → cellular organisms → Bacteria37315Open in IMG/M
3300005537|Ga0070730_10041725All Organisms → cellular organisms → Bacteria3376Open in IMG/M
3300005542|Ga0070732_10003897All Organisms → cellular organisms → Bacteria8037Open in IMG/M
3300005542|Ga0070732_10915902Not Available536Open in IMG/M
3300005586|Ga0066691_10054293All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2156Open in IMG/M
3300006173|Ga0070716_100000118All Organisms → cellular organisms → Bacteria31177Open in IMG/M
3300006173|Ga0070716_100003519All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium KBS 1467382Open in IMG/M
3300006806|Ga0079220_10307550Not Available983Open in IMG/M
3300006903|Ga0075426_10000245All Organisms → cellular organisms → Bacteria36081Open in IMG/M
3300007255|Ga0099791_10084845All Organisms → cellular organisms → Bacteria1447Open in IMG/M
3300007258|Ga0099793_10091827All Organisms → cellular organisms → Bacteria1396Open in IMG/M
3300009038|Ga0099829_10955215Not Available711Open in IMG/M
3300009088|Ga0099830_10061628All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2687Open in IMG/M
3300009088|Ga0099830_11615513Not Available540Open in IMG/M
3300009143|Ga0099792_10317457All Organisms → cellular organisms → Bacteria930Open in IMG/M
3300010379|Ga0136449_100963527All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1379Open in IMG/M
3300011120|Ga0150983_10886340All Organisms → cellular organisms → Bacteria3844Open in IMG/M
3300011120|Ga0150983_12877988All Organisms → cellular organisms → Bacteria1243Open in IMG/M
3300011120|Ga0150983_12908319All Organisms → cellular organisms → Bacteria → Acidobacteria1326Open in IMG/M
3300011120|Ga0150983_13971679Not Available596Open in IMG/M
3300011120|Ga0150983_14260458Not Available505Open in IMG/M
3300011120|Ga0150983_15027964All Organisms → cellular organisms → Bacteria766Open in IMG/M
3300011120|Ga0150983_15384157All Organisms → cellular organisms → Bacteria740Open in IMG/M
3300011120|Ga0150983_15883501All Organisms → cellular organisms → Bacteria → Proteobacteria1192Open in IMG/M
3300012202|Ga0137363_10576349All Organisms → cellular organisms → Bacteria949Open in IMG/M
3300012202|Ga0137363_11670533Not Available529Open in IMG/M
3300012203|Ga0137399_11440351Not Available575Open in IMG/M
3300012361|Ga0137360_10915775Not Available756Open in IMG/M
3300012683|Ga0137398_10486122All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium848Open in IMG/M
3300012918|Ga0137396_10687578Not Available755Open in IMG/M
3300012923|Ga0137359_10011600All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia7361Open in IMG/M
3300012923|Ga0137359_10012678All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfuromonadales7062Open in IMG/M
3300015245|Ga0137409_10043055All Organisms → cellular organisms → Bacteria4321Open in IMG/M
3300020170|Ga0179594_10039282All Organisms → cellular organisms → Bacteria1553Open in IMG/M
3300020199|Ga0179592_10004706All Organisms → cellular organisms → Bacteria5781Open in IMG/M
3300020199|Ga0179592_10039919All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2127Open in IMG/M
3300020579|Ga0210407_10000295All Organisms → cellular organisms → Bacteria → Acidobacteria59529Open in IMG/M
3300020579|Ga0210407_10028300All Organisms → cellular organisms → Bacteria4168Open in IMG/M
3300020579|Ga0210407_10612999All Organisms → cellular organisms → Bacteria848Open in IMG/M
3300020580|Ga0210403_10631968All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300020581|Ga0210399_10044293All Organisms → cellular organisms → Bacteria → Proteobacteria3568Open in IMG/M
3300020581|Ga0210399_10658501Not Available863Open in IMG/M
3300021046|Ga0215015_10040592All Organisms → cellular organisms → Bacteria → Proteobacteria1225Open in IMG/M
3300021046|Ga0215015_10464278Not Available998Open in IMG/M
3300021046|Ga0215015_10834557All Organisms → cellular organisms → Bacteria945Open in IMG/M
3300021088|Ga0210404_10139828All Organisms → cellular organisms → Bacteria1260Open in IMG/M
3300021088|Ga0210404_10471139Not Available707Open in IMG/M
3300021171|Ga0210405_10000306All Organisms → cellular organisms → Bacteria → Acidobacteria68726Open in IMG/M
3300021171|Ga0210405_10002480All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae19092Open in IMG/M
3300021178|Ga0210408_10103317All Organisms → cellular organisms → Bacteria2241Open in IMG/M
3300021178|Ga0210408_10831104Not Available722Open in IMG/M
3300021479|Ga0210410_11584050Not Available548Open in IMG/M
3300021559|Ga0210409_10015598All Organisms → cellular organisms → Bacteria7489Open in IMG/M
3300021559|Ga0210409_10165161All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus2025Open in IMG/M
3300021559|Ga0210409_10652425All Organisms → cellular organisms → Bacteria922Open in IMG/M
3300025939|Ga0207665_10000108All Organisms → cellular organisms → Bacteria54035Open in IMG/M
3300026359|Ga0257163_1077158Not Available539Open in IMG/M
3300026499|Ga0257181_1083937Not Available554Open in IMG/M
3300026514|Ga0257168_1153162Not Available514Open in IMG/M
3300026557|Ga0179587_10014212All Organisms → cellular organisms → Bacteria → Acidobacteria4190Open in IMG/M
3300027381|Ga0208983_1045786Not Available849Open in IMG/M
3300027591|Ga0209733_1019513All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1828Open in IMG/M
3300027643|Ga0209076_1034086All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1424Open in IMG/M
3300027645|Ga0209117_1001500All Organisms → cellular organisms → Bacteria → Acidobacteria8296Open in IMG/M
3300027663|Ga0208990_1015640All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2510Open in IMG/M
3300027667|Ga0209009_1105345Not Available716Open in IMG/M
3300027671|Ga0209588_1205189All Organisms → cellular organisms → Bacteria613Open in IMG/M
3300027678|Ga0209011_1001391All Organisms → cellular organisms → Bacteria → Acidobacteria8779Open in IMG/M
3300027835|Ga0209515_10108151All Organisms → cellular organisms → Bacteria → Acidobacteria1842Open in IMG/M
3300027842|Ga0209580_10012907All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria3624Open in IMG/M
3300027857|Ga0209166_10000011All Organisms → cellular organisms → Bacteria303297Open in IMG/M
3300027857|Ga0209166_10000155All Organisms → cellular organisms → Bacteria90909Open in IMG/M
3300030991|Ga0073994_10030026All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium852Open in IMG/M
3300030991|Ga0073994_10032978All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1706Open in IMG/M
3300031057|Ga0170834_105120728Not Available573Open in IMG/M
3300031231|Ga0170824_114826151Not Available842Open in IMG/M
3300031446|Ga0170820_10165892Not Available600Open in IMG/M
3300031718|Ga0307474_11185146All Organisms → cellular organisms → Bacteria604Open in IMG/M
3300031720|Ga0307469_11458423Not Available654Open in IMG/M
3300031740|Ga0307468_101654505Not Available600Open in IMG/M
3300031753|Ga0307477_10089887All Organisms → cellular organisms → Bacteria2132Open in IMG/M
3300031753|Ga0307477_11048463Not Available533Open in IMG/M
3300031754|Ga0307475_10010422All Organisms → cellular organisms → Bacteria → Acidobacteria6152Open in IMG/M
3300031754|Ga0307475_10039871All Organisms → cellular organisms → Bacteria3471Open in IMG/M
3300031754|Ga0307475_10156704All Organisms → cellular organisms → Bacteria1810Open in IMG/M
3300031754|Ga0307475_10157072All Organisms → cellular organisms → Bacteria1808Open in IMG/M
3300031754|Ga0307475_10226693All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1496Open in IMG/M
3300031823|Ga0307478_10376398All Organisms → cellular organisms → Bacteria1173Open in IMG/M
3300031962|Ga0307479_10000574All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae33017Open in IMG/M
3300031962|Ga0307479_10009313All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae9103Open in IMG/M
3300031962|Ga0307479_10034156All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4857Open in IMG/M
3300031962|Ga0307479_10047809All Organisms → cellular organisms → Bacteria4112Open in IMG/M
3300031962|Ga0307479_10073329All Organisms → cellular organisms → Bacteria3310Open in IMG/M
3300031962|Ga0307479_10160469All Organisms → cellular organisms → Bacteria2207Open in IMG/M
3300031962|Ga0307479_10409172All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1341Open in IMG/M
3300031962|Ga0307479_10456217All Organisms → cellular organisms → Bacteria1262Open in IMG/M
3300031962|Ga0307479_10776816Not Available934Open in IMG/M
3300031962|Ga0307479_11449702All Organisms → cellular organisms → Bacteria644Open in IMG/M
3300031962|Ga0307479_12127137Not Available509Open in IMG/M
3300032160|Ga0311301_10725428All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1389Open in IMG/M
3300032180|Ga0307471_100033544All Organisms → cellular organisms → Bacteria3999Open in IMG/M
3300032180|Ga0307471_102492145All Organisms → cellular organisms → Bacteria654Open in IMG/M
3300032180|Ga0307471_104256072Not Available505Open in IMG/M
3300032205|Ga0307472_100030366All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria3124Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil29.77%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil19.85%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil16.03%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil16.03%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil5.34%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.29%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil2.29%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.29%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil1.53%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.53%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.76%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.76%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.76%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.76%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002680Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF132 (Metagenome Metatranscriptome, Counting Only)EnvironmentalOpen in IMG/M
3300002681Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF120 (Metagenome Metatranscriptome, Counting Only)EnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300004099Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF236 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004100Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF244 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004102Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF212 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004104Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF218 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004116Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF206 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004132Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF240 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004133Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF220 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004135Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF204 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004631Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026359Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-AEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027381Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027591Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027835Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300030991Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1001705633300001593Forest SoilMNLLFRSLNSLFPPRLPRFGFEDASGTAAYYNRLARHDEVTMNPPGLWTGRISTQLVARIQPLKTAEKVPQMILRRAA*
JGI12053J15887_1055662413300001661Forest SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTVNPPGLWTGRISMQQVARIQPFKTAEKVPQMILRRAA*
Ga0005483J37271_12429013300002680Forest SoilIHPSPSVSEAQGQMNLIFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARHDEVTMNPPGLWTGRISMQQVVRIQPFKTAEQAPQMILRRAA*
Ga0005471J37259_11073213300002681Forest SoilVRSGISIRAHMLEAQGQMNSFLRFLWSLFLPRLPRLGPEEIACTAEYYNRLARLDEAAMNPPGVWKRRISTRLTVRIQVFKATEKARQRFLRRAA*
JGI25382J43887_1037923213300002908Grasslands SoilMNLLFRSLYILFPPRLPRLGPEDVADTAEYYNRLARHDEAAMDPPSMWIDRISARLTVRIRPFKIAAKVSQTILRCVAPRSKSLAERLE
JGI25617J43924_1025366713300002914Grasslands SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVARIQLFKTAEKVPQMILRRAA*
Ga0058900_143181913300004099Forest SoilLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARHDEVTMNPPGLWTGRISMQQVARIQPFKTAEQAPQMILRRAA*
Ga0058904_102366393300004100Forest SoilMNLFFRFLWSLFLPRLPRLGPEDVASTAEYCNRLARLDEAAMIPPGVWTRRISRQLTVRIQVFKAAEKARQKILRRAA*
Ga0058904_139850323300004100Forest SoilMNSLYRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNPPGMWIGRISTQLVLHIQRYKTAEKVPQQILRRAA*
Ga0058904_142575323300004100Forest SoilVSEAQGQMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVARIQPFKTAEQAPQMILRRAA*
Ga0058888_144223013300004102Forest SoilIHPSPDVPEAQGQMNSLCRFLWSLFLPRLPRFGPEDVAGTAEYYNRLARNDEPMNPPGMWIGCISTQLVLRIQRYNTAEKVPQQILRRAA*
Ga0058891_153859113300004104Forest SoilEAQGQMNSFLRFLWSLFLPRLPRLGPEEIACTAEYYNRLARLDEAAMNPPGVWKRRISTRLTVRIQVFKATEKARQRFLRRAA*
Ga0058885_100046613300004116Forest SoilHPSPDVPEAQGQMNSLCRFLWSLFLPRLPRFGPEDVAGTAEYYNRLARHDEVTMNPPGMSTRRISTHLVLRIQPPKPAEKVRQQILRRAA*
Ga0058902_123216513300004132Forest SoilEAQGQMNFLFRFLCILFPPRLPRIGPEDVAGTAEYYNRLARHREATINPPGVWVGRISTELAVRIQLFRTVKKVRQQILGRAA*
Ga0058892_100881113300004133Forest SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARHDEVTMNPPGLWTGRISMQQVVRIQPFKTAEQAPQMILRRAA*
Ga0058892_131248413300004133Forest SoilMLEAQGQMNSFLRFLWSLFLPRLPRLGPEEIACTAEYYNRLARLDEAAMNPPGVWKRRISTRLTVRIQVFKATEKARQRFLRRAA*
Ga0058884_100648113300004135Forest SoilLPRLGPEDMAGSAEYYNRLARHHEATLNPPGAWVGRISTELAMRIQLFKAVKKVRQQILRRAA*
Ga0058884_101127433300004135Forest SoilPPRLPRLGPEDMAGSAEYYNRLARHHGATINPPGAWVGRISTELAMRIQLFKTVKKVRQQILRRAA*
Ga0058884_101343913300004135Forest SoilHIHPSPNMSEAQEQMNSLCQFLWSLFLPRLPRFGPEDVASIAEYSDRLARLDETAMNPTGAWKRRISMQLVLHIRRFKAVEKVPQQILHRAA*
Ga0058884_138093513300004135Forest SoilDVPEAQGQMNSLCRFLWSLFLPRLPRFGPEDVAGTAEYYNRLARNDEPMNPPGMWIGCISTQLVLCIQRYNTAEKVPQQILRRAA*
Ga0058884_140724633300004135Forest SoilEAQGQMNFLFRSLCILFPPRLPRLGPEDVGGTAEYYNRLARHHEAIINPPGVWVGRISTQLAVRIQLFRTVSKVRQQFLRRAA*
Ga0058884_140949713300004135Forest SoilWSLFLPRLPRLGPEDVASTAEYCNRLARLDDAAMNPQGRWTRRISTQLVLRIQRHKTAEKVPPQVLRRAA*
Ga0058897_1085952613300004139Forest SoilLPRLPRFGPDDVAGTAEYYNRLARHDEASMNPPREWRGRISMQLTLHMQRFKTAEKASQKILRHAAQQQSCGATAGH*
Ga0058897_1114159223300004139Forest SoilVPEAQGQMNYLYRSLYILILPRLPRFGPEDVAGTAEYYNRLARRDEAASNPAGAWIGRISTRLALRIRPFKTAKKIPQEILRRAA*
Ga0058897_1118243613300004139Forest SoilHPSPDVSEAQGQMNSFFRFLWSLFLPRLPRLGPEDIAGTAEYHNRLARLDEAALNPPGVWKRRISRPLTLRIQVFKVAEKAHQKFLRRVA*
Ga0058897_1118487033300004139Forest SoilDVSEAWGHMNSVCRFLWSLFLPRLPKLQREDVASTAEYCNRLARLDEAATNSPGVWKRRISTRLTVRIHVFKAAKKVPQKFLRRAA*
Ga0058899_1219087513300004631Forest SoilMKYLFRFLYQLSLPRLPRLGPEDVAGTADYYNRLARHEEATINPPDVWIGRISTQPAMGAQPLKIVKEIPQEILRPAA*
Ga0070730_10000671303300005537Surface SoilMNLLARFFENLFLPRLPKLEREDVASTAEYCNRLARLDEAALNSPDVWERHISRQLTVRIQVFKAAENTRQKFLRRAA*
Ga0070730_1004172533300005537Surface SoilMNFLFRFLRKLFLPRLPKLEREDVASTAEFCNRLARLDEAALNTQDVWEWRISRQLTVHIHMFNSAKKVPRKFLRRAA*
Ga0070732_1000389723300005542Surface SoilMNSVCRFLWSLFLPRLPKLEREDVASTAEYCNRLARLDEAAMNLPGVWKRRISTRLTVRIHGFKAAKKIPQRFLLRAAQQQRRA*
Ga0070732_1091590213300005542Surface SoilMNLLARFFENLFLPRLPKLEREDVASTAEYCNRLARLDEAALNSQDVWEWQISRQLTVRIQVFKAAENTRQKFLRRAA*
Ga0066691_1005429323300005586SoilMNRLFRFLYILVLPRLPRLGPEDVAGTAEYYNRLARRDEAASNPPDAWMRRIPKRLALRIRPFKTAKKIPQKILRRAS*
Ga0070716_100000118253300006173Corn, Switchgrass And Miscanthus RhizosphereMNSLSRFLWSLFLPRLPRLGPEDIAGTAEYCNRLARLDEASMNPPREWRGRISMQLALHIQRFETAEKVSQEILRHAAQQQSCGATAGH*
Ga0070716_10000351963300006173Corn, Switchgrass And Miscanthus RhizosphereMNRLFRFLYILFLPRLPRLGPEDVAGTAEYCNRLARRDEVASNPSGAWTGRIPTQLALRIRPLKTAKKMPQKILYRAA*
Ga0079220_1030755013300006806Agricultural SoilMNSVCRFLWNLLLPRLPKLEREDVASTAEYCNRLARLDEAAMNPPGVWKRRISTRLTVRIHVFKTAKKIPQKFLRRSAQQQP
Ga0075426_10000245283300006903Populus RhizosphereMNHLFRSLYSLFLPRLPRLGPEDVAGTAEYCNRLARRDEAASNPTGAWTGRIPTRLAMRIRPFKTTKKMPQNILHRAA*
Ga0099791_1008484513300007255Vadose Zone SoilMNLLFRSLYILFPPRLPRLGPEDVADTAEYYNCLARHDEAAMDPPSMWIDRISARLTVRIRPFKIAAKVSQTILRCVSRVARV
Ga0099793_1009182713300007258Vadose Zone SoilQGQMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTVNPPGLWTGRISMQQVARIQPFKTAEKVPQMILRRAA*
Ga0099829_1095521523300009038Vadose Zone SoilMNFVCRFLWNLFLPRLPRLGPEDVSGTAEYYNRLARHDEASMNSPREWRGHISMQLGLRMQRYKIAEKAPQQILRRAA*
Ga0099830_1006162833300009088Vadose Zone SoilMNRLFRSLYILFLPRLPRLGPEDVAGTAEYYNRLARCDEAASNPPGAWMGRISTRLALRIRPFKTAKKIPQEILRHAA*
Ga0099830_1161551313300009088Vadose Zone SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTTNPPGLWTGRISMQQVARMQLFKTAEKVPQMILRRAA*
Ga0099792_1031745723300009143Vadose Zone SoilVSEALGQMNVLSQSLGSLFPPRLPRFGREDVAGTAEYYNRLARRDEAASNPPDAWMRRIPKRLALRIRPFKTAKKIPQKILRRAS*
Ga0136449_10096352713300010379Peatlands SoilMNYLFRSLYFLFLPRLPRLGPEDVAGTAEYYNRMARHQEATIHPPGVWIGRISTQLMLGAQPFKTAKKMPQKFLRRAA*
Ga0150983_1088634013300011120Forest SoilMSEAQEQMNSLCQFLWSLFLPRLPRFGPEDVASIAEYSDRLARLDETAMNPTGAWKRRISMQLVLHIRRFKAVEKVPQQILHRAA*
Ga0150983_1287798813300011120Forest SoilRHTHPSPNVSGAQGQMNSFSRFLWSLFLPRLPRFGPDDVAGTAEYYNRLARHDEASMNPPREWRGRISMQLTLHMQRFKTAEKVSQKILRHAAQQQSCGATAGH*
Ga0150983_1290831933300011120Forest SoilRLDPPYPSEPTSEAQGQMNFPFRFLCILFPPRLPRIGPEDVAGTAEYYNRLARHREATINPPGVWVGRISTELAVRIQLFRTVKKVRQQILGRAA*
Ga0150983_1397167913300011120Forest SoilAQGQMNSFFRFLWSLFLPRLPRLGPEDVASTAEYCNRLARLDEAAMNPAGVWKRRISRRLTVRIHVFKAAEKAHQNFLRRAA*
Ga0150983_1426045813300011120Forest SoilRHIHPSPDVPEAQGQMNSLFRVLCILFPPRLPRIGPEDVAGTAEYYNRLARHHEATINSPGVWVGRISRELAVRILLFRTAKKVRQQILGRAA*
Ga0150983_1502796413300011120Forest SoilAVRSGISIRAHMLEAQGQMNSFLRFLWSLFLPRLPRLGPEEIACTAEYYNRLARLDEAAMNPPGVWKRRISTRLTVRIQVFKATEKARQRFLRRAA*
Ga0150983_1538415713300011120Forest SoilAQGQMNSLYRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNPPGMWIGRISTQLVLHIQRYKTAEKVPQQILRRAA*
Ga0150983_1588350133300011120Forest SoilDLSEAQGQMNSFFRFLWSLFLPRLPRLGPEDIAGTAEYHNRLARLDEAALNPPGVWKRRISRPLTLRIQVFKVAEKAHQKFLRRVA*
Ga0137363_1057634913300012202Vadose Zone SoilMNLLFRSLYILFPPRLPRLGPEDVADTAEYYNRLARHDEAAMDPPSMWIDRISARLTVRIRPFKIAAKVSQTILRCVAPRSK
Ga0137363_1167053313300012202Vadose Zone SoilMSEAQEQMNSLCQFLWSLFLPRLPRFGPEDVASIAEYSNRLARLDEAAMNPTGAWKSRISTQLVLHIRRLKTVEKVPQQILHRAA*
Ga0137399_1144035113300012203Vadose Zone SoilLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVARIQLFKTAEKVPQMILRRAA*
Ga0137360_1091577513300012361Vadose Zone SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDDVTVNPPGLWTGRISMQQVARIQPFKTAEKVPQMILRRAA*
Ga0137398_1048612223300012683Vadose Zone SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARQDQVAMNPPGLWTGRISMQQVAR
Ga0137396_1068757813300012918Vadose Zone SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARQDQVAMNPPGLWTGRISMQQVARVQPFKTAEKVPQMILRRAA*
Ga0137359_1001160033300012923Vadose Zone SoilLEAQGQMNRLFRFLYILVLPRLPRLGPEDVAGTAEYYNRLARRDEAASNPPDAWMRRIPKRLALRIRPFKTAKKIPQKILRRAA*
Ga0137359_1001267833300012923Vadose Zone SoilMNSLFRFLWSLFLPRLPKLGPEDVASTAEYCNRLARLDEAAMNPPGVWKRRISRRLTVRIQVFKPAEKAGQKFLRRAA*
Ga0137409_1004305543300015245Vadose Zone SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVARIQLFKTAEKVPQMILRRAA*
Ga0179594_1003928233300020170Vadose Zone SoilLRRLDPPYPSEPNVSEAQGQMNSLFRFLWSLFLPRLPKLGPEDVASTAEYCNRLARLDEAAMNPPGVWKRRISRRLTVRIQVFKPAEKAGQKFLRRAA
Ga0179592_1000470693300020199Vadose Zone SoilMNSLYRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEGTMNPPGMWIGRISTQLVLRIQRYKTAEKVPQQILRRAA
Ga0179592_1003991913300020199Vadose Zone SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARQDQVAMNPPGLWTGRISMQQVARVQPFKTA
Ga0210407_10000295503300020579SoilMLEAQGQMNSFLRFLWSLFLPRLPRLGPEEIACTAEYYNRLARLDEAAMNPPGVWKRRISTRLTVRIQVFKATEKARQRFLRRAA
Ga0210407_1002830013300020579SoilMNYLYRSLYILILPRLPRFGPEDVAGTAEYYNRLARRDEAASNPAGAWIGRISTRLALRIRPFKTAKKIPQEILRRAA
Ga0210407_1061299913300020579SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARHDEVTMNPPGLWTGRISMQQVARIQPFKTAEQAPQMILRRAA
Ga0210403_1063196813300020580SoilFLWSLFLPRLPRLGPEEIACTAEYYNRLARLDEAAMNPPGVWKRRISTRLTVRIQVFKATEKARQRFLRRAA
Ga0210399_1004429323300020581SoilMNSFFRFLWSLFLPRLPRLGPEDVASTAEYCNRLARLDDAAMNPQGRWTRRISTQLVLRIQRHKTAEKVPPQVLRRAA
Ga0210399_1065850123300020581SoilVSEAWGHMNSVCRFLWSLFLPRLPKLQREDVASTAEYCNRLARLDEAATNSPGVWKRRISTRLTVRIHVFKAAKKV
Ga0215015_1004059233300021046SoilFLWSLFLPRLPKLEREDVASTAEYCNRLARLDEAAMNPPGVWKRRISRRLTVRIQVFNTAEKAPQKFLRRSA
Ga0215015_1046427823300021046SoilMNSLFRVLCVLFPPRLPRIGPEDVAGTAEYYNRLARHHEAIINPPGLWVGRISRELAVRIQLFRTVKKVRQQILGRAA
Ga0215015_1083455713300021046SoilMNLLFRSLCILFPPRLPRIGPEDVAGTAEYYNRLARHHEATINPPGVWVGRISTELALRIQLFRAAKKVRQQILRRAA
Ga0210404_1013982823300021088SoilMNSLYRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNLPGMKIGRISTQLVLHIQRYKTAEKVPQQILRRAA
Ga0210404_1047113913300021088SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARHDEVTMNPPGLWTGRISMQQVVRIQPFKTAEQAPQMILRRAA
Ga0210405_10000306433300021171SoilMNYLYRSLYILILPRLPRLGPEDVAGTAEYYNRLARRDEAASNPAGAWISRISTRLALRIRPFKTAKKAPQEILGRAA
Ga0210405_1000248023300021171SoilMNYLYRSLYILILPRLPRLGPEDVAGTAEYYNRLARRDEAASNPAGAWIGRISTRLALRIRPFKTAKKIPQEILRRAA
Ga0210408_1010331743300021178SoilMNLFFRFLWSLFLPRLPRLGPGDVAGTAEYYNRLARNNEATMNPPGMWTGRISTQLILHIQRYKIAEKVPQQILRRAA
Ga0210408_1083110413300021178SoilMNLLSRSLCILFPPRLPRLRPEDMAASAEYYNRLARHHGATINPPGAWVGRISTELAMRIQLFKTVKKVRQQILRRAA
Ga0210410_1158405013300021479SoilMNRLFRSLYILFLPRLPRLRPEDVAVIAEYYNRLARRQEAAMDPPGVWRGRISMRLALRTPPFKTAEKVPQEILRRVA
Ga0210409_1001559893300021559SoilMNSLCRFLWSLFLPRLPRFGPEDVAGTAEYYNRLARNDEPMNPPGMWIGCISTQLVLCIQRYNTAEKVPQQILRRAA
Ga0210409_1016516113300021559SoilMNYLYRSLYILILPRLPRLGPEDVADTAEYNDRLARRDEAGSNPAGAWIGRISTRLALRIRPFKTAKKIPQEILGRAA
Ga0210409_1065242533300021559SoilRFVWSLFLPRLPRFGPEDVASTAEYYNRLARHDEMTMNPPGMSTRRISTHLVLRIQPPKPAEKVRQQILRRAA
Ga0207665_10000108223300025939Corn, Switchgrass And Miscanthus RhizosphereMNSLSRFLWSLFLPRLPRLGPEDIAGTAEYCNRLARLDEASMNPPREWRGRISMQLALHIQRFETAEKVSQEILRHAAQQQSCGATAGH
Ga0257163_107715813300026359SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVARIQLFKTAEKVPQMILRRAA
Ga0257181_108393713300026499SoilMNLLFRSLCRLFPPQLPRFGFEDVSGTAEYYNRLARLDEVTVNPPGLWTGRISMQQVARIQLFKTAEKVPQMILRRAA
Ga0257168_115316213300026514SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVACIQLFKTAEKVPQMILRRAA
Ga0179587_1001421233300026557Vadose Zone SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARQDQVAMNPPGLWTGRISMQQVARVQPFKTAEKVPQMILRRAA
Ga0208983_104578613300027381Forest SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTVNPPGLWTGRISMQQVARIQPFKTAEKVPQMILRRAA
Ga0209733_101951323300027591Forest SoilMNLLFRSLNSLFPPRLPRFGFEDASGTAEYYNRLARHDEVTMNPPGLWTGRISTQLVARIQPLKTAEKVPQMILRRAA
Ga0209076_103408613300027643Vadose Zone SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTVNPPGLWTGRISMQQVARIQPFKTAEKVPQ
Ga0209117_100150063300027645Forest SoilMNLLFRSLNSLFPPRLPRFGFEDASGTAAYYNRLARHDEVTMNPPGLWTGRISTQLVARIQPLKTAEKVPQMILHRAA
Ga0208990_101564033300027663Forest SoilMNLLFRSLCRLFPPRLPRFGFEDVSGTAEYYNRLARLDEVTMNPPGLWTGRISMQQVARIQLFKTAEKVPQMILRRAA
Ga0209009_110534513300027667Forest SoilMNLLFQSLCTLFPPRLPRFGPEDVAGTAEYYNRLARHYEPAKDPPGAWIGRISMQLAVGKQPLKTAEKLPQKILRRV
Ga0209588_120518923300027671Vadose Zone SoilLEAQGQMNRLFRFLYILVLPRLPRLGPEDVAGTAEYYNRLARRDEAASNPPDAWMRRIPKRLALRIRPFKTAKKIPQKILRRAS
Ga0209011_100139163300027678Forest SoilMNLLFRSLNSLFPPRLPRFGFEDASGTAEYYNRLARHDEVTMNPPGLWTGRISTQLVARIQPLKTAGKVPQMILRHAA
Ga0209515_1010815133300027835GroundwaterMNLLSRSLWILSPPRLPRCGPEDVAGTAEYYNRLARHSEATMNPPGVWIGRISTQLAVRIQPFKTAEKVPQKILRRAA
Ga0209580_1001290733300027842Surface SoilMNSVCRFLWSLFLPRLPKLEREDVASTAEYCNRLARLDEAAMNLPGVWKRRISTRLTVRIHGFKAAKKIPQRFLLRAAQQQRRA
Ga0209166_100000111213300027857Surface SoilMNLLARFFENLFLPRLPKLEREDVASTAEYCNRLARLDEAALNSPDVWERHISRQLTVRIQVFKAAENTRQKFLRRAA
Ga0209166_10000155753300027857Surface SoilMNFLFRFLRKLFLPRLPKLEREDVASTAEFCNRLARLDEAALNTQDVWEWRISRQLTVHIHMFNSAKKVPRKFLRRAA
Ga0073994_1003002613300030991SoilMNLLFRSLSSLFPRRLRRFGFEDVSGTTEYYNRLARQDQVAMNPPGLWTGRISMQQVARVQP
Ga0073994_1003297813300030991SoilMNLLFRSLSSLFPPRLPRFGFEDVSGTAEYYNRLARQDQVAMNPPGLWTGRISMQQVARVQPFKTAEKVPQMIPRRAA
Ga0170834_10512072813300031057Forest SoilPFEPNGLEAQGQMNSFCWFLWSLFRPRLPKLEREDVASTAEYCNRLARLDEAAMNSPGVWKRRTSRPLTVRIRVFGAAGKARQKFLCRAA
Ga0170824_11482615113300031231Forest SoilMNSLCRFLWSLLRPRLPKLEREDVASTAEYCNRLARLDEAAMNSPGVWKRRISRQLTVRIQVFGATGKARQKFLRRAA
Ga0170820_1016589213300031446Forest SoilGSPRFSSQAHAVRSAIPSGPNVSQAQGQMNSLCRFLWSLLRPRLPKLEREDVASTAEYCNRLARLDEAAMNSPGVWKRRISRQLTVRIQVFGATGKARQKFLRRAA
Ga0307474_1118514623300031718Hardwood Forest SoilMNRLYRFLYILFLPRLPRLGPEDVGDIAEYCNRLARRDEAGSSPAGAWMWRISTRLALRIRPFKTAKKIPQKILRHAA
Ga0307469_1145842323300031720Hardwood Forest SoilMRVKEVQNSFRRLTRLDSPYPSEPNVSEAQGQMNSFCRFLWGLFLPRLPKLEPEDVASTAEYCNRLARLDEAAMNYTGVWKRRISRQLTVRIHVFKAAKNAPQKSLRRVASRRAA
Ga0307468_10165450513300031740Hardwood Forest SoilMIQFVGLHGWIRQINRSPGVLGAQGLMNSFCRFLWSLFLPRLPKLEREDVASTAEYCNRLARLDEAAMNSPGVWKRRISRQLTVRIQVFQAAQKTPQKFLRRAA
Ga0307477_1008988733300031753Hardwood Forest SoilLQAYAASFATFIPARQYRKQAQMNRLYRFLYILFLPRLPRLGPEDVGDIAEYCNRLARRDEAGSNPSGAWMWRIPTRLALRIRPFKTAKKIPQEILHRAA
Ga0307477_1104846313300031753Hardwood Forest SoilMSSLFRFIWSLFLPRLPRFGPEDVASIAEYSNRLARLEEAAMNPPGMWIGRISTQLVLRVQRYKIAEKVP
Ga0307475_1001042253300031754Hardwood Forest SoilMKYLFRSLYILFLPRLPTFRPEDVAGTAEYYNRMARQHEMAIHLPGEWKGYISTKLALHIQPFRTEKKIPQQILRHTA
Ga0307475_1003987123300031754Hardwood Forest SoilMSSLFRFLWSLFLPRLPRLGPEDVASTAEYCNRLARLDEAAMNPPGVWKRRISRQLTVRIQVFRAVGKARQKFLRRVA
Ga0307475_1015670423300031754Hardwood Forest SoilMNSLYRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNPPGMWIGRISTQLVLRIQRYKTAEKVPQQILRRAA
Ga0307475_1015707233300031754Hardwood Forest SoilMSSLFRFIWSLFLPRLPRFGPEDVASIAEYSNRLARLEEAAMNPPGMWIGRISTQLVLRVQRYKIAEKVPQQILRRAA
Ga0307475_1022669313300031754Hardwood Forest SoilMNLLFQSLCSLFPPRLPRFGPEDVAGTAEYYNRLARHREANITPPGMWIGRIATERAARIRPFKAAEKVPQRIVRRAASQES
Ga0307478_1037639823300031823Hardwood Forest SoilMNSLYQFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNPPGMWIGRISTQLVLRIQRYKTAEKVPQQILRRAA
Ga0307479_1000057443300031962Hardwood Forest SoilMNSLYRFLYILFLPRLPRFGPEDVAGTAEYYNRLARHQEATIKSPALWIGRTSTQLALRIQPFKTARNIPQKILHRAA
Ga0307479_1000931343300031962Hardwood Forest SoilMNRLYRSLYILFLPRLPRLGPEDVAGTAEYYNRLARREEPGSNPPGAWIGRISTWLAMRIRPFKTEKNIPQEIVRRAA
Ga0307479_1003415633300031962Hardwood Forest SoilMNYLYRSLYVLFLPRLPRLGPEDVADTAEYCDRLARNEEAARNPPGAWIGRISTRLSLRIRSFKSAKKIPQRILGRAA
Ga0307479_1004780923300031962Hardwood Forest SoilMSEAQEQMNSLCEFLWSLFLPRLPRFGPEDVASIAEYSNRLARLDEAAMNPTGAWKRRISTQLVLHIRRLKTDEKVPQQILHRAA
Ga0307479_1007332913300031962Hardwood Forest SoilNSLCEFLWSLFLPRLPRFGPEDVASIAEYSNRLARLDEAAMNPTGAWKRHISTQLVLHIRRFKTVEKVPQQLLHRAA
Ga0307479_1016046933300031962Hardwood Forest SoilMNLFFRFLYSLFLPRLPRLGPGDVAGTAEYYNRLARNHEATMNPSGMWTRRISTQLVLRIQRYKTAEKAPQQILRRAA
Ga0307479_1040917223300031962Hardwood Forest SoilMNRLFRALYILFLPRLPRLGPEDVAGTAEYYNRVARCNEAASNPPGAWMGRISTRLALRIRQFKTAKKIPQKILGHAA
Ga0307479_1045621723300031962Hardwood Forest SoilLQAYAASFATFIPARQYRKQAQMNRLYRFLYILFLPRLPRLGPEDVGDTAEYCNRLARRDEAGSNPSGAWMWRIPTRLALRIRPFKTAKKIPQEILHRAA
Ga0307479_1077681623300031962Hardwood Forest SoilMNSVGRFLWSLFLPRLPRLGPEDVANTAEYCNRLARLDEGAMNPEGVWKRHISTHLTMRIQVFKTAGKVPQQILRRAA
Ga0307479_1144970223300031962Hardwood Forest SoilMNYLYRSLYILILPRLPRLGPEDVAGTAEYYNRLARRDEAASDSPGAWMGRISTRLALRIRPFKTAKKIPQEILRRAA
Ga0307479_1212713713300031962Hardwood Forest SoilMNFVSQFLWSLFLPRLPRLGPGDVAGTAAYYNRLARLDEPSMNPPREWRGRISMQLALRMQRFKTAENAAQKILHRAPQQESCGETSW
Ga0311301_1072542823300032160Peatlands SoilMNYLFRSLYFLFLPRLPRLGPEDVAGTAEYYNRMARHQEATIHPPGVWIGRISTQLMLGAQPFKTAKKMPQKFLRRAA
Ga0307471_10003354463300032180Hardwood Forest SoilMNSLFRFLWSLFLPRLPRLGPEDVAGTAEYYNRLARNDEATMNPPGMWIGRISTQLVLHIRRCKSGEKAPQQIQRRAA
Ga0307471_10249214513300032180Hardwood Forest SoilDVREAQGQMNFLSRSLCILFPPRLPRLRPEDMAGSAEYYNRLARHHAATINPPGAWVGRISTELAMRIQLFKAVKKVRQQILRRAA
Ga0307471_10425607223300032180Hardwood Forest SoilMNVLFQSLCSLFPPRLPRFGREDVAGTAEYYNRLARHYEATINPPGMWMGRISTERAVRMRIRPRKTAVRTSKSNRKL
Ga0307472_10003036633300032205Hardwood Forest SoilMNSFFRFLWSLFLPRLPRLGPEDIAGTAEYHNRLARLDEAALNPPGVWKRRISRPLTLRIQVFKVAEKAHQKFLRRVA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.