NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F061705


Overview

Basic Information
Family ID F061705
Family Type Metagenome / Metatranscriptome
Number of Sequences 131
Average Sequence Length 139 residues
Representative Sequence MKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSTEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAHEDIKNARS
Number of Associated Samples 105
Number of Associated Scaffolds 131
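
The representative sequence can be compared with the reported average length of 139 residues by measuring it directly. The following is a minimal Python sketch, not part of NMPFamsDB itself; the function name `summarize` is hypothetical, and the sequence is copied from the Basic Information block above.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = (
    "MKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSTEEKALTLQLQALLPAHTTLKDAC"
    "TLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARN"
    "AEKSAHEDIKNARS"
)

AVERAGE_FAMILY_LENGTH = 139  # "Average Sequence Length" reported for the family


def summarize(seq):
    """Return (length, composition), where composition maps each residue
    to its percentage of the sequence, most frequent residue first."""
    counts = Counter(seq)
    length = len(seq)
    composition = {aa: round(100 * n / length, 2) for aa, n in counts.most_common()}
    return length, composition


length, composition = summarize(REPRESENTATIVE)
print(f"Representative length: {length} residues "
      f"(family average: {AVERAGE_FAMILY_LENGTH})")
for aa, pct in list(composition.items())[:5]:
    print(f"{aa}: {pct} %")
```

A representative longer than the family average would be consistent with the Quality Assessment figures, which flag many members as potentially truncated at scaffold ends.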

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 84.73 %
% of genes near scaffold ends (potentially truncated) 61.83 %
% of genes from short scaffolds (< 2000 bps) 77.86 %
Associated GOLD sequencing projects 100
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
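
The percentages above can be turned back into approximate gene counts. This sketch assumes that each percentage is taken over the 131 family members listed under Number of Sequences; the page does not state the denominator explicitly, and `percent_to_count` is a hypothetical helper.

```python
FAMILY_SIZE = 131  # "Number of Sequences"; assumed denominator for the percentages

quality_percentages = {
    "valid RBS motif": 84.73,
    "near scaffold end (potentially truncated)": 61.83,
    "short scaffold (< 2000 bps)": 77.86,
}


def percent_to_count(percent, total=FAMILY_SIZE):
    """Nearest whole number of genes implied by a rounded percentage."""
    return round(percent * total / 100)


for label, pct in quality_percentages.items():
    print(f"{label}: about {percent_to_count(pct)} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to 111, 81 and 102 genes respectively.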
Hidden Markov Model (profile logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (98.473 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(25.954 % of family members)
Environment Ontology (ENVO) Unclassified
(37.405 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.412 % of family members)



Multiple Sequence Alignments

Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide Yes
Secondary Structure distribution α-helix: 65.58%, β-sheet: 0.00%, Coil/Unstructured: 34.42%


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 131 Family Scaffolds
PF04519 | Bactofilin | 6.11
PF02129 | Peptidase_S15 | 3.82
PF05973 | Gp49 | 3.82
PF00903 | Glyoxalase | 3.05
PF13649 | Methyltransf_25 | 3.05
PF01063 | Aminotran_4 | 1.53
PF01402 | RHH_1 | 1.53
PF00266 | Aminotran_5 | 1.53
PF08327 | AHSA1 | 1.53
PF03928 | HbpS-like | 1.53
PF01925 | TauE | 0.76
PF00583 | Acetyltransf_1 | 0.76
PF04055 | Radical_SAM | 0.76
PF00375 | SDF | 0.76
PF05170 | AsmA | 0.76
PF13570 | PQQ_3 | 0.76
PF07045 | DUF1330 | 0.76
PF13744 | HTH_37 | 0.76
PF09837 | DUF2064 | 0.76
PF00144 | Beta-lactamase | 0.76
PF13641 | Glyco_tranf_2_3 | 0.76
PF00990 | GGDEF | 0.76
PF00487 | FA_desaturase | 0.76
PF08530 | PepX_C | 0.76
PF13411 | MerR_1 | 0.76
PF13473 | Cupredoxin_1 | 0.76

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 131 Family Scaffolds
COG1664 | Cytoskeletal protein CcmA, bactofilin family | Cytoskeleton [Z] | 6.11
COG3657 | Putative component of the toxin-antitoxin plasmid stabilization module | Defense mechanisms [V] | 3.82
COG4679 | Phage-related protein gp49, toxin component of the Tad-Ata toxin-antitoxin system | Defense mechanisms [V] | 3.82
COG0115 | Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase | Amino acid transport and metabolism [E] | 3.05
COG0730 | Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family | Inorganic ion transport and metabolism [P] | 0.76
COG1398 | Fatty-acid desaturase | Lipid transport and metabolism [I] | 0.76
COG1680 | CubicO group peptidase, beta-lactamase class C family | Defense mechanisms [V] | 0.76
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 0.76
COG2367 | Beta-lactamase class A | Defense mechanisms [V] | 0.76
COG2936 | Predicted acyl esterase | General function prediction only [R] | 0.76
COG2982 | Uncharacterized conserved protein AsmA involved in outer membrane biogenesis | Cell wall/membrane/envelope biogenesis [M] | 0.76
COG3239 | Fatty acid desaturase | Lipid transport and metabolism [I] | 0.76
COG5470 | Uncharacterized conserved protein, DUF1330 family | Function unknown [S] | 0.76


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 98.47 %
Unclassified | root | N/A | 1.53 %

Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
3300001174|JGI12679J13547_1008174All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300002245|JGIcombinedJ26739_100046302All Organisms → cellular organisms → Bacteria3941Open in IMG/M
3300002245|JGIcombinedJ26739_101569528All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300002910|JGI25615J43890_1072653All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300002916|JGI25389J43894_1028888All Organisms → cellular organisms → Bacteria → Acidobacteria960Open in IMG/M
3300005186|Ga0066676_10040712All Organisms → cellular organisms → Bacteria2589Open in IMG/M
3300005332|Ga0066388_101359216All Organisms → cellular organisms → Bacteria → Acidobacteria1230Open in IMG/M
3300005451|Ga0066681_10709214All Organisms → cellular organisms → Bacteria612Open in IMG/M
3300005541|Ga0070733_10478977All Organisms → cellular organisms → Bacteria832Open in IMG/M
3300005553|Ga0066695_10282828All Organisms → cellular organisms → Bacteria1047Open in IMG/M
3300005555|Ga0066692_10107236All Organisms → cellular organisms → Bacteria1665Open in IMG/M
3300005556|Ga0066707_10295838All Organisms → cellular organisms → Bacteria1060Open in IMG/M
3300005557|Ga0066704_10169594All Organisms → cellular organisms → Bacteria1469Open in IMG/M
3300005559|Ga0066700_10642080All Organisms → cellular organisms → Bacteria736Open in IMG/M
3300005598|Ga0066706_10105626All Organisms → cellular organisms → Bacteria2053Open in IMG/M
3300006052|Ga0075029_100527869Not Available782Open in IMG/M
3300006796|Ga0066665_10272110All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → Verrucomicrobium → unclassified Verrucomicrobium → Verrucomicrobium sp.1349Open in IMG/M
3300006797|Ga0066659_10853187All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300007788|Ga0099795_10449275All Organisms → cellular organisms → Bacteria593Open in IMG/M
3300009012|Ga0066710_101205049All Organisms → cellular organisms → Bacteria1173Open in IMG/M
3300009012|Ga0066710_102426376All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300009137|Ga0066709_100651753All Organisms → cellular organisms → Bacteria → Acidobacteria1508Open in IMG/M
3300009137|Ga0066709_101626914All Organisms → cellular organisms → Bacteria922Open in IMG/M
3300010043|Ga0126380_10048797All Organisms → cellular organisms → Bacteria2273Open in IMG/M
3300010043|Ga0126380_12030291All Organisms → cellular organisms → Bacteria527Open in IMG/M
3300010046|Ga0126384_11826936All Organisms → cellular organisms → Bacteria577Open in IMG/M
3300010048|Ga0126373_10417599All Organisms → cellular organisms → Bacteria1370Open in IMG/M
3300010048|Ga0126373_10851227All Organisms → cellular organisms → Bacteria975Open in IMG/M
3300010320|Ga0134109_10255550All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300010329|Ga0134111_10134965All Organisms → cellular organisms → Bacteria968Open in IMG/M
3300010359|Ga0126376_10000293All Organisms → cellular organisms → Bacteria23518Open in IMG/M
3300010359|Ga0126376_10006897All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6874Open in IMG/M
3300010360|Ga0126372_10009845All Organisms → cellular organisms → Bacteria5027Open in IMG/M
3300010360|Ga0126372_10308945All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1394Open in IMG/M
3300010362|Ga0126377_10154867All Organisms → cellular organisms → Bacteria2157Open in IMG/M
3300010362|Ga0126377_10704933All Organisms → cellular organisms → Bacteria → Acidobacteria1062Open in IMG/M
3300010366|Ga0126379_11128286All Organisms → cellular organisms → Bacteria890Open in IMG/M
3300010376|Ga0126381_100068551All Organisms → cellular organisms → Bacteria → Acidobacteria4415Open in IMG/M
3300010398|Ga0126383_10145923All Organisms → cellular organisms → Bacteria2199Open in IMG/M
3300011269|Ga0137392_10064017All Organisms → cellular organisms → Bacteria → Acidobacteria2808Open in IMG/M
3300011271|Ga0137393_10401694All Organisms → cellular organisms → Bacteria1173Open in IMG/M
3300012198|Ga0137364_10409411All Organisms → cellular organisms → Bacteria1015Open in IMG/M
3300012203|Ga0137399_10944448All Organisms → cellular organisms → Bacteria726Open in IMG/M
3300012205|Ga0137362_10465664All Organisms → cellular organisms → Bacteria1094Open in IMG/M
3300012211|Ga0137377_11523115All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300012351|Ga0137386_10482621All Organisms → cellular organisms → Bacteria893Open in IMG/M
3300012361|Ga0137360_10027480All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3893Open in IMG/M
3300012362|Ga0137361_11474297All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300012582|Ga0137358_10423975All Organisms → cellular organisms → Bacteria899Open in IMG/M
3300012683|Ga0137398_10032095All Organisms → cellular organisms → Bacteria2985Open in IMG/M
3300012685|Ga0137397_10174983All Organisms → cellular organisms → Bacteria1594Open in IMG/M
3300012685|Ga0137397_10229421All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1382Open in IMG/M
3300012923|Ga0137359_10815602All Organisms → cellular organisms → Bacteria808Open in IMG/M
3300012924|Ga0137413_10087079All Organisms → cellular organisms → Bacteria1914Open in IMG/M
3300012925|Ga0137419_10063722All Organisms → cellular organisms → Bacteria2429Open in IMG/M
3300012929|Ga0137404_10846075All Organisms → cellular organisms → Bacteria832Open in IMG/M
3300012930|Ga0137407_10917555All Organisms → cellular organisms → Bacteria829Open in IMG/M
3300012944|Ga0137410_10022132All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4389Open in IMG/M
3300012944|Ga0137410_10159976All Organisms → cellular organisms → Bacteria1722Open in IMG/M
3300012944|Ga0137410_11621675All Organisms → cellular organisms → Bacteria567Open in IMG/M
3300012975|Ga0134110_10237969All Organisms → cellular organisms → Bacteria772Open in IMG/M
3300014166|Ga0134079_10040527All Organisms → cellular organisms → Bacteria1603Open in IMG/M
3300015241|Ga0137418_10090159All Organisms → cellular organisms → Bacteria2771Open in IMG/M
3300015245|Ga0137409_10142413All Organisms → cellular organisms → Bacteria2200Open in IMG/M
3300018482|Ga0066669_10101593All Organisms → cellular organisms → Bacteria1998Open in IMG/M
3300020140|Ga0179590_1091412All Organisms → cellular organisms → Bacteria815Open in IMG/M
3300020199|Ga0179592_10056851All Organisms → cellular organisms → Bacteria1783Open in IMG/M
3300020579|Ga0210407_10193328All Organisms → cellular organisms → Bacteria1579Open in IMG/M
3300020579|Ga0210407_10350981All Organisms → cellular organisms → Bacteria1154Open in IMG/M
3300020579|Ga0210407_10872009All Organisms → cellular organisms → Bacteria691Open in IMG/M
3300020580|Ga0210403_10001585All Organisms → cellular organisms → Bacteria21415Open in IMG/M
3300020581|Ga0210399_10052523All Organisms → cellular organisms → Bacteria3275Open in IMG/M
3300021168|Ga0210406_10152630All Organisms → cellular organisms → Bacteria1938Open in IMG/M
3300021168|Ga0210406_10469808All Organisms → cellular organisms → Bacteria998Open in IMG/M
3300021168|Ga0210406_10929039All Organisms → cellular organisms → Bacteria652Open in IMG/M
3300021171|Ga0210405_10004982All Organisms → cellular organisms → Bacteria12275Open in IMG/M
3300021178|Ga0210408_10965046All Organisms → cellular organisms → Bacteria661Open in IMG/M
3300021180|Ga0210396_10013967All Organisms → cellular organisms → Bacteria7484Open in IMG/M
3300021180|Ga0210396_10941698All Organisms → cellular organisms → Bacteria734Open in IMG/M
3300021181|Ga0210388_10390077All Organisms → cellular organisms → Bacteria1224Open in IMG/M
3300021404|Ga0210389_10692387All Organisms → cellular organisms → Bacteria798Open in IMG/M
3300021406|Ga0210386_11417789All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300021407|Ga0210383_10678164All Organisms → cellular organisms → Bacteria886Open in IMG/M
3300021407|Ga0210383_10916223All Organisms → cellular organisms → Bacteria746Open in IMG/M
3300021420|Ga0210394_10000192All Organisms → cellular organisms → Bacteria146962Open in IMG/M
3300021420|Ga0210394_10236618All Organisms → cellular organisms → Bacteria1597Open in IMG/M
3300021420|Ga0210394_10796585All Organisms → cellular organisms → Bacteria825Open in IMG/M
3300021420|Ga0210394_11778323All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300021432|Ga0210384_10140719All Organisms → cellular organisms → Bacteria2165Open in IMG/M
3300021475|Ga0210392_10013409All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4471Open in IMG/M
3300021475|Ga0210392_10517004All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium880Open in IMG/M
3300021476|Ga0187846_10173768All Organisms → cellular organisms → Bacteria907Open in IMG/M
3300021478|Ga0210402_10543546All Organisms → cellular organisms → Bacteria1078Open in IMG/M
3300021478|Ga0210402_11149768All Organisms → cellular organisms → Bacteria703Open in IMG/M
3300021479|Ga0210410_10035167All Organisms → cellular organisms → Bacteria4348Open in IMG/M
3300021479|Ga0210410_11760990Not Available513Open in IMG/M
3300021559|Ga0210409_10271862All Organisms → cellular organisms → Bacteria1531Open in IMG/M
3300021559|Ga0210409_10845016All Organisms → cellular organisms → Bacteria788Open in IMG/M
3300021560|Ga0126371_10238461All Organisms → cellular organisms → Bacteria1929Open in IMG/M
3300022507|Ga0222729_1072662All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300022510|Ga0242652_1033941All Organisms → cellular organisms → Bacteria597Open in IMG/M
3300024288|Ga0179589_10056089All Organisms → cellular organisms → Bacteria → Acidobacteria1499Open in IMG/M
3300025906|Ga0207699_11434880All Organisms → cellular organisms → Bacteria511Open in IMG/M
3300025939|Ga0207665_11309821All Organisms → cellular organisms → Bacteria577Open in IMG/M
3300026295|Ga0209234_1141918All Organisms → cellular organisms → Bacteria861Open in IMG/M
3300026301|Ga0209238_1028571All Organisms → cellular organisms → Bacteria2082Open in IMG/M
3300026301|Ga0209238_1047982All Organisms → cellular organisms → Bacteria1541Open in IMG/M
3300026319|Ga0209647_1063615All Organisms → cellular organisms → Bacteria → Acidobacteria1906Open in IMG/M
3300026320|Ga0209131_1112830All Organisms → cellular organisms → Bacteria → Acidobacteria1418Open in IMG/M
3300026332|Ga0209803_1057219All Organisms → cellular organisms → Bacteria1700Open in IMG/M
3300026551|Ga0209648_10573348All Organisms → cellular organisms → Bacteria626Open in IMG/M
3300027073|Ga0208366_1029771All Organisms → cellular organisms → Bacteria608Open in IMG/M
3300027548|Ga0209523_1115421All Organisms → cellular organisms → Bacteria555Open in IMG/M
3300027857|Ga0209166_10617644All Organisms → cellular organisms → Bacteria550Open in IMG/M
3300027867|Ga0209167_10357304All Organisms → cellular organisms → Bacteria794Open in IMG/M
3300027903|Ga0209488_10630004All Organisms → cellular organisms → Bacteria775Open in IMG/M
3300027908|Ga0209006_10115947All Organisms → cellular organisms → Bacteria → Acidobacteria2369Open in IMG/M
3300028536|Ga0137415_10691275All Organisms → cellular organisms → Bacteria833Open in IMG/M
3300030776|Ga0075396_1813729All Organisms → cellular organisms → Bacteria536Open in IMG/M
3300031057|Ga0170834_105792051All Organisms → cellular organisms → Bacteria654Open in IMG/M
3300031122|Ga0170822_15400934All Organisms → cellular organisms → Bacteria625Open in IMG/M
3300031231|Ga0170824_100336002All Organisms → cellular organisms → Bacteria1281Open in IMG/M
3300031573|Ga0310915_10843068All Organisms → cellular organisms → Bacteria644Open in IMG/M
3300031718|Ga0307474_10104733All Organisms → cellular organisms → Bacteria2116Open in IMG/M
3300031720|Ga0307469_11867554All Organisms → cellular organisms → Bacteria581Open in IMG/M
3300031740|Ga0307468_101293389All Organisms → cellular organisms → Bacteria663Open in IMG/M
3300031753|Ga0307477_10166536All Organisms → cellular organisms → Bacteria → Acidobacteria1537Open in IMG/M
3300031954|Ga0306926_12455309All Organisms → cellular organisms → Bacteria573Open in IMG/M
3300032180|Ga0307471_100140687All Organisms → cellular organisms → Bacteria2298Open in IMG/M
3300032180|Ga0307471_103881886All Organisms → cellular organisms → Bacteria528Open in IMG/M
3300032261|Ga0306920_101728527All Organisms → cellular organisms → Bacteria885Open in IMG/M
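
Each scaffold identifier above has the form `prefix|name`, and the prefix appears to match the Taxon OID column of the Associated Samples table (for example 3300001174). Assuming that reading is right, the following Python sketch splits identifiers and counts scaffolds per sample; `split_scaffold_id` is a hypothetical helper, and only a few identifiers from the table are included.

```python
from collections import Counter

# A few identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300001174|JGI12679J13547_1008174",
    "3300002245|JGIcombinedJ26739_100046302",
    "3300002245|JGIcombinedJ26739_101569528",
    "3300021420|Ga0210394_10000192",
]


def split_scaffold_id(scaffold_id):
    """Split 'prefix|name' into (prefix, name).

    The prefix is assumed to be the IMG/M Taxon OID of the sample.
    """
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    return taxon_oid, scaffold_name


# Number of family scaffolds contributed by each sample.
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
for taxon_oid, n in scaffolds_per_sample.items():
    print(taxon_oid, n)
```

Several samples contribute more than one scaffold, which would explain why the family has 131 scaffolds but only 105 associated samples.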



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 25.95%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.14%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 11.45%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 9.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.40%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.58%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.58%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.05%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.29%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.29%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.53%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.53%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.76%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.76%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.76%
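
The habitat rows above can be rolled up to a coarser level of the GOLD ecosystem path. This is an illustrative Python sketch only: `share_by_level` is a hypothetical helper, the rows are transcribed from the table, and the level index counts path components from zero.

```python
from collections import defaultdict

# (habitat, GOLD ecosystem path, % of family members), transcribed from the table.
habitat_rows = [
    ("Soil", "Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 25.95),
    ("Vadose Zone Soil", "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 22.14),
    ("Tropical Forest Soil", "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 11.45),
    ("Grasslands Soil", "Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 9.92),
    ("Soil", "Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 8.40),
    ("Hardwood Forest Soil", "Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 4.58),
    ("Forest Soil", "Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 4.58),
    ("Grasslands Soil", "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 3.05),
    ("Surface Soil", "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 2.29),
    ("Forest Soil", "Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil", 2.29),
    ("Soil", "Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 1.53),
    ("Corn, Switchgrass And Miscanthus Rhizosphere", "Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 1.53),
    ("Watersheds", "Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 0.76),
    ("Tropical Forest Soil", "Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil", 0.76),
    ("Biofilm", "Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm", 0.76),
]


def share_by_level(rows, level):
    """Sum the percentages of all rows that share the same path component
    at position `level` (0 = 'Environmental', 1 = category, 2 = type)."""
    totals = defaultdict(float)
    for _habitat, path, pct in rows:
        parts = [part.strip() for part in path.split("→")]
        totals[parts[level]] += pct
    return {name: round(total, 2) for name, total in totals.items()}


by_type = share_by_level(habitat_rows, 2)
print(by_type)
```

Grouped at level 2, nearly all family members fall under Soil, with single-scaffold contributions from Sediment and Cave.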



Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300001174Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M1EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020140Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022507Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022510Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-14-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027073Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF010 (SPAdes)EnvironmentalOpen in IMG/M
3300027548Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027867Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030776Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 Emin (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031122Oak Spring Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M

Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12679J13547_100817413300001174Forest SoilMKYKTLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPD
JGIcombinedJ26739_10004630263300002245Forest SoilGDNMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCXASLXASSNLQIKFNCLKWDVTAVRPNGDLKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS*
JGIcombinedJ26739_10156952813300002245Forest SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQQLAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASSNLQIKFNCLKWDITAVRPNGDIKSCQAPPKALSLEKAILALKPVVNARGEARNAEKSAREDIKNAHS*
JGI25615J43890_107265313300002910Grasslands SoilHDTAWPTTINPAPTPRKGHNMKYKTLFLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSVEEKALTLQLQALLPAXTTLKDACTLFKTLDDCVASLHASGNLQIKFNCXKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKNAHEDIKNARS*
JGI25389J43894_102888813300002916Grasslands SoilGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFRSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDANAKVESRNAERRAHEDIKDASS*
Ga0066676_1004071233300005186SoilSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0066388_10135921623300005332Tropical Forest SoilMKYKWMVIGAAAAGMLLCAAEAHARSYNPMKWIKKPTASQHLAANSKADKDLAMQLQALLPAHTTLKDACTAFKTLQDCVSSLHASSTLQLKFNCLKWDMTAVRPDGDVKSCEAPPKAQSLAQAIRGLKAGVNAKGEAKNAEMSAREEIKNAGF*
Ga0066681_1070921413300005451SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDAD
Ga0070733_1047897713300005541Surface SoilMKCKSLVLGAAGAAILFGAATTGARSYNPMKWIKKPSASQELAANSAEEKALTLQLQAMLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNA
Ga0066695_1028282833300005553SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLRDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0066692_1010723633300005555SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPGVNARGEARNAEKSAREDIKNARS*
Ga0066707_1029583823300005556SoilMKNKWSVIGAAGFCILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0066704_1016959423300005557SoilMKNKWSVIGAAGFCILLGASGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLRDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASA*
Ga0066700_1064208023300005559SoilMKNKWSVIGAAGFCILLGASGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLRDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0066706_1010562623300005598SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0075029_10052786923300006052WatershedsMKYKSLVLGAAGAAILLGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFN
Ga0066665_1027211033300006796SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAEEKKLTLELQALLPPHTTLKDACTAFKSLDDCVASLHAGRNLNIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0066659_1085318713300006797SoilMKHKWSVIGVAGFCILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLRDACTAFKSLDDCVASLHASHNLKIKFNCLKWDMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASA*
Ga0099795_1044927513300007788Vadose Zone SoilKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDANAKVESRNAERRAHEDIKDASS*
Ga0066710_10120504923300009012Grasslands SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS
Ga0066710_10242637613300009012Grasslands SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSN
Ga0066709_10065175313300009137Grasslands SoilMKNKWSVIGAAGFCILLGASGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLRDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASA*
Ga0066709_10162691423300009137Grasslands SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNK
Ga0126380_1004879733300010043Tropical Forest SoilMKNRLSVMGAAGIAILLGASGASAISYNPLNWIKKPTASQQLAAKPEQEKRLSMGLQAILPPRTTLKDACTEFKSLDDCVASLHASNNLKIKFNCLKWNMTAIRPNGDVKSCEAPAKVMSLDKAIRVLKPEADAKTEAKNAERRAREAIKDASS*
Ga0126380_1203029113300010043Tropical Forest SoilMKYKLITIAAIGAGILPGAAGTAARSYNPVKWIKRPTASEQLAANSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHASSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTR
Ga0126384_1182693613300010046Tropical Forest SoilMKSKWIVIGVAGLGILFGAAGAAARSLNPVNWLKKPTASQQLAAKPEEEKKLSLQLQAILPPRTSLKDACTAFKNLDDCVASLYVSHNLKIKFSCLKWDMTAVRPGGDVKSCEAP
Ga0126373_1041759923300010048Tropical Forest SoilMKYKVITIAAIGAGILLGAAGTAARSYNPMKWIKRPTASEQLAANSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHVSSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTRAMSLYRTIRVMKPYAEARTEARNAERSAREDIKNAR*
Ga0126373_1085122723300010048Tropical Forest SoilMKYKLMLIGAAGILLGATATAARSYNPMKWIKKPTASQQLAANSKADKDLAIQLQALLPAHTTLKDACTAFKTLQDCVSSLHASSTLQLKFNCLKWDMTAVRPDGDVKSCEAPPKALSLAQTIRGLKADVNAKGEAKNAEISAREEIKNAGF*
Ga0134109_1025555023300010320Grasslands SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACKAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLY
Ga0134111_1013496523300010329Grasslands SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLRDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0126376_1000029363300010359Tropical Forest SoilMKNRLAAMGAAGIAILLGASGASAISYNPLNWIKRPTASQQLAAKPEQEKRLSMGLQAILPPRTTLKDACTEFKSLDDCVASLHASSNLKIKFNCLKWNITAVRPTGDVKSCEAPARAMSLDRAIRLLKPEADAKTEAKNAERRAKEAIKDASS*
Ga0126376_1000689753300010359Tropical Forest SoilMRYKVLFIGVAAAGFLLGAAGTAARSYNPMKWMKKPTANQQLEANSKAEKDLTMQLQALLPAHSNLKDACTAFKTLQDCVASLHASTTLKIKFNCLKWDMTAVRPNGDVKSCEAPPKALSLAQAIRGLEPNVTAKVEAKNAEKSAGEDIRNAGF*
Ga0126372_1000984563300010360Tropical Forest SoilMRYKVLFIGVAAAGFLLGAAGTAARSYNPMKWMKKPTANQQLEANSKAEKDLTMQLQALLPAHSNLKDACTAFKTLQDCVASLHASTTLKIKFNCLKWDMTAVRPNGDVKSCEAPPKALSLAQAIRGLEPNVTAKLEAKNAEKSAREDIRNAGF*
Ga0126372_1030894533300010360Tropical Forest SoilARAGVPVTVSAGILLGAGGTAARSYNPMKWIKRPTASEQLATNSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHVSSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTRAMSLYRTIRVMKPYADARAEARNAERSAREDIKNAR*
Ga0126377_1015486733300010362Tropical Forest SoilMKNRLAVIGAAGIAMLLGVAEASAISHNPLKWIKKPTASQQLAANPDQEKKLSVGLQAILPARTTLKDACTAFKSLDDCVASLHASNNLKIKFNCLKWDVTAVRPNGDVKSCEVPTKAMSLSRAIHDLKPETDAKSEARNAERRAREAIKDASS*
Ga0126377_1070493323300010362Tropical Forest SoilMKNRLSVMGAAGIAILLGASGASAISYNPLNWIKKPTASQQLAAKPEQEKRLSMGLQAILPPRTTLKDACTEFKSLDDCVASLHASNNLKIKFNCLKWNMTAIRPNGDVKSCEAPAKVMSLDKAIRVLKPEADAKTEAKNAERRAREAIKDASS
Ga0126379_1112828623300010366Tropical Forest SoilMKYKVITIAAIGAGILLGAGGTAARSYNPMKWIKRPTASEQLATNSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHVSSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTRAMSLYRTIRVMKPYADARAEARNAERSAREDIKNAR*
Ga0126381_10006855113300010376Tropical Forest SoilLGATATAARSYNPMKWIKKPTASQQLAANSKADKDLAMQLQALLPAHTTLKDACTAFKTLQDCVSSLHASSTLQLKFNCLKWDMTAVRPDGDVKSCEAPPKALSLAQAIRGLKAGVNAKGEAKNAEMSAREEIKNAGF*
Ga0126383_1014592333300010398Tropical Forest SoilMKSKLVAIAAAGIAISLGATGAAAISYNPLKWIKKPTASQQLAANSEQEKKLSVGLQAILPARTTLKDACTAFKTLDDCVASLHASSNLKIKFNCLKWDMTAVRPSGDVKSCEAPAKALNLNKAIHALKPDADAKTETRNAERR
Ga0137392_1006401723300011269Vadose Zone SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSAEEKTLALQLQALLPAHTTLKDACTLFKTLDDCAASLHASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPGVNARGEARNAEKGAHEDIKNARS*
Ga0137393_1040169423300011271Vadose Zone SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSVEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKGAHEDIKNARS*
Ga0137364_1040941113300012198Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPAGTTLKDACTVFKSLDDCVASLHASHNLKVKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0137399_1094444813300012203Vadose Zone SoilMKNKWSVIGAGGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSGDVKSCEAPPRVLSLNKAIRALKPDADAKVESRNAERRAHEDIKDASS*
Ga0137362_1046566423300012205Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0137377_1152311513300012211Vadose Zone SoilGAAGFGILLGAAGASAHSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLRDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKMESRNAERRAHEDIKDASS*
Ga0137386_1048262113300012351Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHAGRNLNIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNA
Ga0137360_1002748053300012361Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGAAAHSYNPLKWIKKPTASQQLAANSAEEKKLTLELQALLPPHTTLKDTCTAFKSLDDCVASLHAGRNLNIKFNCLKWDMTAVRPSGDVKSCEAPPKALPLSKAIHALKPDADAKTESRNAERRAHEDIKDASS*
Ga0137361_1147429713300012362Vadose Zone SoilMKYKILVLGAAGAAILLGAVTVAARSHNPMKWIKKPTASQELAANSVEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKP
Ga0137358_1042397523300012582Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0137398_1003209523300012683Vadose Zone SoilMKNKWSVIGAAGFGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQGLLPPGATLKDACTAFKSLDDCVAALHASRNLKVKFNCLKWNMTAVRPSGDVKSCEAPPRVLSLNKAIRALKPDADAKVESRNAERRAHEDIKDASS*
Ga0137397_1017498313300012685Vadose Zone SoilMKYKTLLLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPGVNARGEARNAEKSAREDIKNARS*
Ga0137397_1022942113300012685Vadose Zone SoilMKYKTLFLGAAGAAILLGAGTAAARSYNPMKWIKKPTASQELAASSAKEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQ
Ga0137359_1081560213300012923Vadose Zone SoilMKYKTLFLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSVEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPNARSLEKASLGLK
Ga0137413_1008707913300012924Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAV
Ga0137419_1006372213300012925Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDAD
Ga0137404_1084607513300012929Vadose Zone SoilMKYKTLFLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAI
Ga0137407_1091755523300012930Vadose Zone SoilMRNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKRPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAV
Ga0137410_1002213263300012944Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEEKKLTLELQALLPPHTALKDACTAFKSLDDCVASLHAGRNLNIKFNCLKWNMTAVRPSGDVKSCEAPPKALPLNKAIHALKPDADAKAESRNAERRAHEDIKDASS*
Ga0137410_1015997633300012944Vadose Zone SoilLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSVEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPGVNARGEARNAEKSAREDIKNARS*
Ga0137410_1162167513300012944Vadose Zone SoilMKYKTLFLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAASSAKEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPP
Ga0134110_1023796913300012975Grasslands SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS*
Ga0134079_1004052723300014166Grasslands SoilMKNKWSVIGAAGFCILLGAAGASARSYNPLKWIKKPTASQQLASNSAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKTESRNAERRAHEDIKDASS*
Ga0137418_1009015923300015241Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNGERRAHEDIKDASS*
Ga0137409_1014241313300015245Vadose Zone SoilMKYKTLFLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSVEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASGNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPGVNARGEARNAEKSAREDIKNARS*
Ga0066669_1010159313300018482Grasslands SoilMKNKWSVIGAAGFCILLGAAGASAHSYNPLKWIKRPTASQQLAANSAQERKLTLELQALLPAGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS
Ga0179590_109141213300020140Vadose Zone SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWDMTAVRPNGD
Ga0179592_1005685133300020199Vadose Zone SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS
Ga0210407_1019332813300020579SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTA
Ga0210407_1035098123300020579SoilMKYKCIIILAVGTGILVGTAGAAARSYNPITWIKKPTASQQLAANSGQEKKLTVELQALLPPKTTLKEACTFYKRLEDCVASLHVSQNLKIKFNCLKWDMTAIQPMGDVKSCEAPGKAMTLHKAIRTLKPDADARAEANHAERRAREDIKDAGS
Ga0210407_1087200913300020579SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWNMTAVRPNGDIKSCQAPAKALTLEKAILALKPDVNARGEARNA
Ga0210403_1000158513300020580SoilMKYKSLVLGAAGAAILFGAATTAARSYNPMKWIKKPTASQELAANSAEEKALALQVQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQALPK
Ga0210399_1005252313300020581SoilMKYKSLVLGAAGAAILFGAATTAARSYNPMKWIKKPTASQELAANSAEEKALALQVQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNM
Ga0210406_1015263013300021168SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSAEEKALTLQVQALLPARTTLKDACTLFKTLDDCIASLHASSNLQIKFNCLKWDMT
Ga0210406_1046980813300021168SoilPAPTPKKGDSMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKSILALKPDVNARGEARNAEKSAREDIKNARS
Ga0210406_1092903913300021168SoilILFGAATVAARSYNPMKWIKKPTASQELAANGAEEKALTLQLQALLPARTTLKDACTLFKTLDGCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0210405_1000498253300021171SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLRASGDLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0210408_1096504613300021178SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVR
Ga0210396_10013967103300021180SoilMKYKTLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAASSAKEKALTLQLQALLPAHTTLKDACTLFKTLDDCFASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0210396_1094169813300021180SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLQKAILALK
Ga0210388_1039007723300021181SoilMKYKSLVLGAAGAAILFGAATASARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILAL
Ga0210389_1069238723300021404SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWN
Ga0210386_1141778913300021406SoilMKYKSLVLGAAGAAILFGATAVAARSYNPMKWIKKPTASQELAASSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASGNLQLKFNCLKWNITAVRPSGDIKSCQAPPKALS
Ga0210383_1067816413300021407SoilMKYKTLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAASSAKEKALTLQLQALLPAHTTLKDACTLFKTLDDCFASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVN
Ga0210383_1091622313300021407SoilMKYKSLVLGAAGAAILLGAATAAARSYNPMKWIKKPTASQELAASSAEEKALTLQLQALLPAHTALKDACTLFKTLDDCVASLHASSGLQIKFNCLKWNMTAVRPNGDIKSCQAPPK
Ga0210394_10000192183300021420SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSTEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAHEDIKNARS
Ga0210394_1023661813300021420SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILAL
Ga0210394_1079658513300021420SoilMKYKTLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAASSAKEKALMLQLQALLPAHTTLKDACTLFKTLDDCFASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNA
Ga0210394_1177832313300021420SoilMKYKSLVLGAAGAAILLGAATAAARSYNPMKWIKKPTASQELAASSAEEKALTLQLQALLPAHTALKDACTLFKTLDDCVASLHASSGLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALS
Ga0210384_1014071933300021432SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELVANSAEEKALTLQVQALLPARTTLKDACTLFKTLDDCIASLHASSNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPDV
Ga0210392_1001340963300021475SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWNMTAVRPNGDIKSCQAPAK
Ga0210392_1051700423300021475SoilMKYKSLVLGAAGAAILFGAATTAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAI
Ga0187846_1017376813300021476BiofilmMRNKWIVIGAAGIGILFGAAGAAARSYNPINWIKKPTASQQLAAKPEEERKLASQLQAVLPPRTTLKDACTAFKSLNDCVASIQASHNLKIKFNCLKWDVTAVRPGGDVKSCEAPPRAYPLVKAISVLKPDADAKTEAKSAERRAREIIKDASS
Ga0210402_1054354613300021478SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGD
Ga0210402_1114976813300021478SoilMKYKSLLLGAAGAAILFGAATTAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNAR
Ga0210410_1003516713300021479SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKTLTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALTLEKAILALKPDVNARGEARNAE
Ga0210410_1176099013300021479SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWDMT
Ga0210409_1027186233300021559SoilMKYKSLVLGAAGAAILFGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTSLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWNMTAVRPN
Ga0210409_1084501613300021559SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNPRGEARNAEKSARE
Ga0126371_1023846113300021560Tropical Forest SoilMKNRLAVVAAAGIGMFFCASGAFAISYNPLNWIKRPTASQQLAANADQEKKLSVGLQAILPARTTLKDACTVFKSLDDCVASLHASNNLQIKFNCLKWSVTGVRPTGDVKSCEAPAKALSLDRAIRALKSDVNAKAEARNAERRAKEDIKDASS
Ga0222729_107266213300022507SoilRKASMKYKCIIILAVGTGILVGTAGAAARSYNPITWIKKPTASQQLAANSGQEKKLTVELQALLPPKTTLKEACTVYKRLEDCVASLHVSQNLKIKFNCLKWDMTAIQPMGDVKSCEAPGKAMTLHKAIRILKPDADARAEANHAERRAREDIKDAGS
Ga0242652_103394113300022510SoilGHSMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALMPARTTLKDACTLFKTLEDCVASLRASSDLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0179589_1005608913300024288Vadose Zone SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCAASLHASGNLQIKFNCLKWDMTAVRTNGEITSC
Ga0207699_1143488013300025906Corn, Switchgrass And Miscanthus RhizosphereMKYKSLVLGAAGAAILFGATTVAARSYNPMKWIKKPTASQELAANSAVEKALALQLQALLPARTTLKDACTLFRTLDDCVASLHASGDLQIKFNCLKWDMTAVRPNGDIKSCQAPPK
Ga0207665_1130982113300025939Corn, Switchgrass And Miscanthus RhizosphereMKYKSLVLGAAGAAILFGATTVAARSYNPMKWIKKPTASQELAANSAVEKALALQLQALLPARTTLKDACTLFRTLDDCVASLHASGDLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKTILALK
Ga0209234_114191813300026295Grasslands SoilSPPPSRKASMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACTAFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS
Ga0209238_102857123300026301Grasslands SoilMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFRSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPKALSLNKAIHALKPDANAKVESRNAERRAHEDIKDASS
Ga0209238_104798243300026301Grasslands SoilMKNKWSVIGAAGFCILFGAVGTSAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPAGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSNDAKSCAAPPRALSLNKAIHALKPDADAKIESRNAERRAHEDIKDASS
Ga0209647_106361533300026319Grasslands SoilMKNKWSVIGAAGFGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQGLLPPGATLKDACTAFKSLDDCVAALHASRNLKVKFNCLKWNMTAVRPSGDVKSCEAPPRVLSLNKAIRALKPDADAKVESRNAERRAHEDIKDASS
Ga0209131_111283013300026320Grasslands SoilILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQGLLPPGATLKDACTAFKSLDDCVAALHASRNLKVKFNCLKWNMTAVRPSGDVKSCEAPPRVLSLNKAIRALKPDADAKVESRNAERRAHEDIKDASS
Ga0209803_105721933300026332SoilMKGPMKNKWIVIGAVGISVLLGAAGAAGRSYNPMKWIKKSPGPTASEQLAVNKEEEKKLTLQLQALLPPRTTLKDACATFKSLDDCVAALHVSRNLKIKFNCLKWDLTAARPNGDVKSCEAPPRDRALTLN
Ga0209648_1057334813300026551Grasslands SoilMKNKWIVIGAVGISVLLGAAGAAGRSYNPMKWIKKPTASQQLAANTEEEKKLTLQLQALLPPRTTLKDACTAFKSLNDCVASLHASHNLKIKFNCLKWDMTAVRPS
Ga0208366_102977113300027073Forest SoilGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAASSAKEKALTLQLQALLPAHTTLKDACTLFKTLDDCFASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0209523_111542113300027548Forest SoilMKYKTLFLGAAGAAILFSAATAAARSYNPMKWIKKPTASQELAANSTEEKALTLQLQALLPAHTTLKDACTLFKTLDDCVASLHASSNLQIKFNCLKWDMTAVRPSGDIKSCQA
Ga0209166_1061764423300027857Surface SoilLLGASEASAISYNPLNWIKRPTASEQLAAKPDQEKKLSVGLQAILPARTSLKDACSAFKSLDDCVASLHASNNLKIKFNCLKWNMTAVRPNGDVKSCEAPAKAMSLDKAIRVLKPDADA
Ga0209167_1035730413300027867Surface SoilMKCKSLVLGAAGAAILFGAATTGARSYNPMKWIKKPSASQELAANSAEEKALTLQLQAMLPAHTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDI
Ga0209488_1063000413300027903Vadose Zone SoilSRKARMKNKWSVIGAAGLGILLGAAGASAHSYNPLKWIKKPTASQQLAANSAEERKLTLELQALLPPGTTLKDACTVFKSLDDCVASLHASHNLKIKFNCLKWNMTAVRPSSDAKSCAAPPKALSLNKAIHALKPDADAKVESRNAERRAHEDIKDASS
Ga0209006_1011594723300027908Forest SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSNLQIKFNCLKWDVTAVRPNGDLKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0137415_1069127513300028536Vadose Zone SoilMKNKWSVIGAAGFCILLGAAGASAHSYNPLKWIKKPTASQQLAANGAQERKLTLELQALLPPGTTLKDACAVFRSLDDCVASLHASHNLKIRFNCLKWNMTAVRPSSDAKS
Ga0075396_181372913300030776SoilRKGYSMKYKMLVFGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANGTEEKALTLQLQALMPAHTTLKDACTLFKTLDDCVASLRASSDLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0170834_10579205113300031057Forest SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSAEEKALTLQMQALLPARTTLKDACTLFKTLDDCVASLHASTNLQIKFNCLKWDVTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAR
Ga0170822_1540093413300031122Forest SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLRASSTLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLDKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0170824_10033600213300031231Forest SoilMKYKSLVLGAAGAAILFGAATAAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASTNLQIKFNCLKWDVTAVRPNGDIKSCQAPPKALSLEKAILALKPDV
Ga0310915_1084306813300031573SoilSYNPMKWIKRPTAGEQLAANSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHASSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTRAMSLYRTIRVMKPNADARAEARNAERSAREDIKNAR
Ga0307474_1010473333300031718Hardwood Forest SoilMKYKSLVLGAAGAAILLGAATAGARSYNPMKWIKKPTASQELAANSAEEKALTLQLQAMLPAHTTLKDACTLFKTLDDCVASLRASSDLQIKFNCLKWNMTAVRPNGDIKSCQAPPKALSLEKAILALKPDVNARGEARNAEKSAREDIKNARS
Ga0307469_1186755413300031720Hardwood Forest SoilMKYKSLVLGAAGAAILFGAATVAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASSDLQIKFNCLKWDMTAVRPNGDIK
Ga0307468_10129338913300031740Hardwood Forest SoilMKYKSLVLGAAGAAILLGAATAAARSYNPMKWIKKPTASQELAANSAEEKALALQLQALLPARTTLKDACTLFKTLDDCVASLHASSDLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKLDVNARGEA
Ga0307477_1016653623300031753Hardwood Forest SoilMKHRWIVILAAGTGMIASAAGAAAHSYNPITWIKKPTASQQLAANSEQEKKLTVELQAILPPKTTLKEACTVYKRLEDCVASLHVSQNLKVKFNCLKWDMTAIQPMGDIKSCEAPGKAMSMHKAIRALKPDADARSEANNAERRAREDIKDAGY
Ga0306926_1245530913300031954SoilMKYKVITIAAIGAGILLGAAGTAARSYNPMKWIKRPTAGEQLAANSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHASSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTRAMSLYRTIRVMKPNADARAEARNAERSAR
Ga0307471_10014068733300032180Hardwood Forest SoilMKYKSLVLGAAGAAILFGAATTAARSYNPMKWIKKPTASQELAANSAEEKALTLQLQALLPARTTLKDACTLFKTLDDCVASLHASGDLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILAL
Ga0307471_10388188613300032180Hardwood Forest SoilMKYKSFLLGAAGAAILLGAATAPARSYNPMKWIKKPTASQELAANSAEEKALALQLQALLPARTTLKDACTLFKTLDDCVASLHASSDLQIKFNCLKWDMTAVRPNGDIKSCQAPPKALSLEKAILALKPD
Ga0306920_10172852723300032261SoilMKYKVITIAAIGAGILLGAAGTAARSYNPMKWIKRPTAGEQLAANSKQEKDLSLQLQALLPAHATLKDACTAFKSLEDCVASLHASSNLKIKFNCLKWDMTGVRPDGDVKSCEAPTRAMSLYRTIRVMKPYGDARAEARNAERSAREDIKNAR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice