NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097108


Overview

Basic Information
Family ID F097108
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 238 residues
Representative Sequence MAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Number of Associated Samples 81
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 43.69 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 54.81 %
Associated GOLD sequencing projects 68
AlphaFold2 3D model prediction No


Most Common Taxonomy
Group Bacteria (99.038 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (41.346 % of family members)
Environment Ontology (ENVO) Unclassified (63.462 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (80.769 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix 13.88%, β-sheet 40.00%, Coil/Unstructured 46.12%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF03544 | TonB_C | 34.62
PF02397 | Bac_transf | 15.38
PF07238 | PilZ | 7.69
PF04368 | DUF507 | 1.92
PF04773 | FecR | 1.92
PF07719 | TPR_2 | 1.92
PF01075 | Glyco_transf_9 | 1.92
PF05050 | Methyltransf_21 | 0.96
PF00392 | GntR | 0.96
PF01757 | Acyl_transf_3 | 0.96
PF10531 | SLBB | 0.96
PF01592 | NifU_N | 0.96
PF03061 | 4HBT | 0.96
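
Each frequency in the table above is a rounded share of the 104 family scaffolds, so it can be converted back to a whole-number scaffold count. The following is a minimal sketch of that arithmetic, not part of NMPFamsDB; the name `scaffold_count` and the dictionary layout are assumptions made for illustration.

```python
FAMILY_SCAFFOLDS = 104  # "Number of Associated Scaffolds" from the Overview


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover a whole-number count from a percentage rounded to two decimals."""
    return round(percent / 100 * total)


# A few Pfam frequencies copied from the table above.
pfam_percent = {
    "TonB_C": 34.62,
    "Bac_transf": 15.38,
    "PilZ": 7.69,
    "FecR": 1.92,
    "4HBT": 0.96,
}

# Hypothetical helper output: domain name -> number of scaffolds carrying it.
pfam_counts = {name: scaffold_count(p) for name, p in pfam_percent.items()}

for name, count in pfam_counts.items():
    print(f"{name}: {count} of {FAMILY_SCAFFOLDS} scaffolds")
```

The same conversion applies to the COG table and to the "% of family members" figures, since they share the denominator of 104.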

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG0810 | Periplasmic protein TonB, links inner and outer membranes | Cell wall/membrane/envelope biogenesis [M] | 34.62
COG2148 | Sugar transferase involved in LPS biosynthesis (colanic, teichoic acid) | Cell wall/membrane/envelope biogenesis [M] | 15.38
COG0859 | ADP-heptose:LPS heptosyltransferase | Cell wall/membrane/envelope biogenesis [M] | 1.92
COG0822 | Fe-S cluster assembly scaffold protein IscU, NifU family | Posttranslational modification, protein turnover, chaperones [O] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.04 %
Unclassified | root | N/A | 0.96 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002909|JGI25388J43891_1006696 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2268
3300005166|Ga0066674_10255921 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 829
3300005167|Ga0066672_10142991 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1495
3300005167|Ga0066672_10325335 | All Organisms → cellular organisms → Bacteria | 1002
3300005171|Ga0066677_10053817 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2021
3300005172|Ga0066683_10237342 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1129
3300005174|Ga0066680_10023144 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3426
3300005176|Ga0066679_10021172 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3449
3300005177|Ga0066690_10192432 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1355
3300005179|Ga0066684_10022940 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3280
3300005179|Ga0066684_10116341 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1659
3300005180|Ga0066685_10079904 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2156
3300005186|Ga0066676_10132917 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1548
3300005187|Ga0066675_10471230 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 936
3300005450|Ga0066682_10813018 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 564
3300005451|Ga0066681_10006468 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 5481
3300005451|Ga0066681_10067250 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1994
3300005454|Ga0066687_10837744 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Hoeflea → unclassified Hoeflea → Hoeflea sp. 108 | 548
3300005537|Ga0070730_10009573 | All Organisms → cellular organisms → Bacteria | 8025
3300005540|Ga0066697_10041499 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 2584
3300005540|Ga0066697_10066401 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2065
3300005553|Ga0066695_10000319 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 13973
3300005554|Ga0066661_10729222 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 581
3300005559|Ga0066700_10107584 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1837
3300005561|Ga0066699_10296450 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1150
3300005568|Ga0066703_10034887 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2735
3300005575|Ga0066702_10556650 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 693
3300005576|Ga0066708_10202269 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1247
3300006032|Ga0066696_10059743 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2172
3300006755|Ga0079222_10043564 | All Organisms → cellular organisms → Bacteria | 2030
3300006755|Ga0079222_10123264 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1416
3300006797|Ga0066659_10065969 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2349
3300006797|Ga0066659_11491057 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 566
3300006804|Ga0079221_10267783 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 982
3300006804|Ga0079221_10379614 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 866
3300006806|Ga0079220_11183852 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 628
3300006914|Ga0075436_100050843 | All Organisms → cellular organisms → Bacteria | 2860
3300009137|Ga0066709_100152009 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2962
3300009137|Ga0066709_100279289 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2255
3300010100|Ga0127440_1021835 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 595
3300010320|Ga0134109_10033140 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1651
3300010320|Ga0134109_10127583 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 903
3300010322|Ga0134084_10082467 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1001
3300010323|Ga0134086_10215319 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 722
3300010325|Ga0134064_10125942 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 866
3300010358|Ga0126370_10001782 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 9941
3300010361|Ga0126378_10973868 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 953
3300010361|Ga0126378_11112228 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 891
3300010361|Ga0126378_11979312 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 664
3300010364|Ga0134066_10101924 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 835
3300010366|Ga0126379_12010270 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 680
3300010376|Ga0126381_100002037 | All Organisms → cellular organisms → Bacteria | 21796
3300010376|Ga0126381_101056476 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1172
3300012199|Ga0137383_11070991 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 585
3300012207|Ga0137381_10191364 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1771
3300012207|Ga0137381_10322807 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1345
3300012208|Ga0137376_10427741 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1150
3300012209|Ga0137379_10048020 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4116
3300012211|Ga0137377_10157270 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2178
3300012211|Ga0137377_10709190 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 940
3300012349|Ga0137387_10001531 | All Organisms → cellular organisms → Bacteria | 11480
3300012349|Ga0137387_10081827 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2222
3300012357|Ga0137384_11461944 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 532
3300012359|Ga0137385_10321405 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1329
3300012976|Ga0134076_10211157 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 817
3300014150|Ga0134081_10014774 | All Organisms → cellular organisms → Bacteria | 2144
3300014157|Ga0134078_10263504 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 728
3300014166|Ga0134079_10019360 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2180
3300014166|Ga0134079_10055350 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1408
3300015356|Ga0134073_10000712 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 6136
3300015357|Ga0134072_10071713 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1010
3300015357|Ga0134072_10321628 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 584
3300018431|Ga0066655_10004035 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 5815
3300018433|Ga0066667_10002018 | All Organisms → cellular organisms → Bacteria | 7939
3300018468|Ga0066662_10076486 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2268
3300018482|Ga0066669_10026624 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3342
3300018482|Ga0066669_10413144 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1147
3300021560|Ga0126371_10014435 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 7197
3300026308|Ga0209265_1032766 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1606
3300026309|Ga0209055_1002074 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 12850
3300026317|Ga0209154_1001294 | All Organisms → cellular organisms → Bacteria | 14825
3300026318|Ga0209471_1000075 | All Organisms → cellular organisms → Bacteria | 76128
3300026318|Ga0209471_1139127 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1016
3300026329|Ga0209375_1021584 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3678
3300026330|Ga0209473_1000326 | All Organisms → cellular organisms → Bacteria | 29883
3300026330|Ga0209473_1038576 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2131
3300026332|Ga0209803_1002116 | All Organisms → cellular organisms → Bacteria | 12610
3300026523|Ga0209808_1102397 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1206
3300026540|Ga0209376_1064623 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2022
3300026542|Ga0209805_1060754 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1860
3300026547|Ga0209156_10015610 | All Organisms → cellular organisms → Bacteria | 4720
3300026547|Ga0209156_10145466 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1150
3300027748|Ga0209689_1012277 | All Organisms → cellular organisms → Bacteria | 5601
3300027765|Ga0209073_10122386 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 938
3300027787|Ga0209074_10363109 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 597
3300027857|Ga0209166_10038543 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2858
3300031753|Ga0307477_10676276 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 692
3300031754|Ga0307475_10160222 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1790
3300031962|Ga0307479_10001654 | All Organisms → cellular organisms → Bacteria | 20365
3300031962|Ga0307479_10942216 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 834
3300032180|Ga0307471_100003719 | All Organisms → cellular organisms → Bacteria | 9201
3300032180|Ga0307471_100778299 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1123
3300032205|Ga0307472_100034667 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2974
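
In the rows above, each scaffold identifier appears to join the sample's Taxon OID and the scaffold name with a `|`, and the Quality Assessment treats scaffolds under 2000 bp as short. Below is a minimal sketch of splitting the identifier and applying that threshold. `ScaffoldRef`, `parse_scaffold_ref`, and `is_short` are hypothetical helper names, and the three rows are a hand-picked subset of the table, not the full family.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # sample identifier, as listed under Associated Samples
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold_ref(ref: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_ID' at the first pipe character."""
    taxon_oid, scaffold_id = ref.split("|", 1)
    return ScaffoldRef(taxon_oid, scaffold_id)


def is_short(length_bp: int, threshold_bp: int = 2000) -> bool:
    """True when the scaffold is below the 2000 bp cutoff used in the Quality Assessment."""
    return length_bp < threshold_bp


# Illustrative subset: (scaffold identifier, length in bp) copied from the table.
rows = [
    ("3300002909|JGI25388J43891_1006696", 2268),
    ("3300005166|Ga0066674_10255921", 829),
    ("3300026318|Ga0209471_1000075", 76128),
]

parsed = [(parse_scaffold_ref(ref), length) for ref, length in rows]
short_ids = [ref.scaffold_id for ref, length in parsed if is_short(length)]
print(short_ids)
```

Keeping the Taxon OID as a string avoids losing leading characters if identifiers with a different shape ever appear.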




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 41.35%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 14.42%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.54%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 7.69%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 7.69%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 6.73%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.73%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.96%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.96%
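
The habitat paths above are hierarchical, so the distribution can be rolled up to a coarser level by summing percentages that share the same leading path segments. The sketch below shows one way to do that; the function `aggregate` and the `habitat_rows` list are illustrative assumptions, not how NMPFamsDB computes its level selector.

```python
from collections import defaultdict

SEP = " → "

# (GOLD ecosystem path, % of family members) copied from the table above.
habitat_rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 41.35),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 14.42),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 11.54),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 7.69),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 7.69),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil", 6.73),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 6.73),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 1.92),
    ("Environmental → Terrestrial → Soil → Loam → Grasslands → Soil", 0.96),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 0.96),
]


def aggregate(rows, depth):
    """Sum percentages over the first `depth` levels of each ecosystem path."""
    totals = defaultdict(float)
    for path, percent in rows:
        key = SEP.join(path.split(SEP)[:depth])
        totals[key] += percent
    return dict(totals)


top_level = aggregate(habitat_rows, depth=1)
for name, percent in top_level.items():
    print(f"{name}: {percent:.2f}%")
```

Because the published percentages are rounded to two decimals, rolled-up totals can differ from 100% by a few hundredths.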




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010100 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental
3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300026308 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental
3300026330 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25388J43891_100669623300002909Grasslands SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGXVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0066674_1025592113300005166SoilSPFLSFVASCQRPKQKGIAPIAFVLAAPVLILQVSLPTLTVPANAATPSQEQKQSPKREPIGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRSSDGGLRGPLKSRTGWTLLGIAGAGAGAAALVLGHGGGTPVSPSGP*
Ga0066672_1014299123300005167SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0066672_1032533523300005167SoilEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066677_1005381723300005171SoilVFVWREIFIRLLSLKLRSVCLAARSQRQKANCVARTILAVVLTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066683_1023734213300005172SoilVLTVHETFMKLKSKSPFLSFVASCQRPKQKGIAPIAFVLAAPVLILQVSLPTLTVPANAATPSQEQKQSPKREPIGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQSQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRAGGVVVVPAVRSRVTAAKIERQADGSFLVTCLDGDISTIPLQGTSGRLLEAGQSVSASPQGELFVQKQSGVRSSDGGLRGPLKSRTGWTLLGIAGAGAGAAALVLGHGGGTPVSPSGP*
Ga0066680_1002314423300005174SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0066679_1002117213300005176SoilPAIALGLALVPALQVFVLAGPAHAATTVQEPTQNGKRVPIGSLSATGEVYVNEKPVPVESTVFAGDSVRTGQNSTAVFTLPGNGTLKIGPQTQVVISNNPQFTAELQAGTAVIDSISGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELSVQKQSGGTSTGPGLPGPLKGKSGWSLLGLAGAGAGAAALAVGHGGTKPVSPSGP*
Ga0066690_1019243223300005177SoilAVFTLPGNGTLKIGPQTQVVISNNPQFTAELQAGTAVIDSISGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELSVQKQSGGTSTGPGLPGPLKGKSGWSLLGLAGAGAGAAALAVGHGGTKPVSPSGP*
Ga0066684_1002294033300005179SoilMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066684_1011634113300005179SoilLTAQNTFAKPSRHKLRPLPPGLAVMFVLQIFLFTTPARAAAVSQEQAQNPKRVPVGSLSATGEVYVNEKPVPVESTVFAGDTIRTGQNSTAIFTMTGNGTLKIGAQTQMVISGDPQFPAELQTGTAVINSISGPSGVKLRVGEYVVVPAVRSRVTSAKIERQPDGSFLVTCMDGDISTIPLEGTSGQLLEAGQSVDISPQGRLSVQKQSGIRSTGGGGHGAIKSRTGWTLLGLAGAGAGVAALVVGHGGKQPVSPSGP*
Ga0066685_1007990423300005180SoilMQENTLELCSGSVLQNRSSLSVSPASRGESVVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPIGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0066676_1013291713300005186SoilPAFEMQENTLELCSGSVLQNRSSLSVSPASRGESVVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAPIAFVLAAPVVILQVSLPTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0066675_1047123013300005187SoilGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQSQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0066682_1081301813300005450SoilNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGG
Ga0066681_1000646823300005451SoilMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066681_1006725023300005451SoilMQENTLELCSGSVLQNRSSLSVSPASRGESVVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0066687_1083774413300005454SoilEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGYGVPGTLKGKKGWTLLGLAGGGAGVAAVV
Ga0070730_1000957323300005537Surface SoilVSVSRETFRKGPGLKLCSVSLVAVGQSKQLKRMVGSALILFLTIGLQGFFLTTPVNAAGAKQDQAQNAKKQPLGSLSSTGEVYVNDKPVPIESTVFVGDTVRTGANSTAVFTMTGNGTLKIGAQTQVALTGDPQFAAELQSGTAVIDSLSGPSGIKLRAGEYVVVPAVRSRVTSAKIERQADGSFLVTCLDGDISTLALQGTAGRLLEATQSVTLTPSGQMLVQKQSGGNSAGSGIAGPLSSRKGWTMLGLAGGGAGLAALVLGHGGKPPVSPSGP*
Ga0066697_1004149933300005540SoilVENANWGERFSRLRTSCQCREHRMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0066697_1006640123300005540SoilVFVWREIFIKLLSLKLRSLCLAARSQRQKFNWVVRTILAVVLTTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066695_1000031943300005553SoilVENANWGERFSRLRTSCQCREHRMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0066661_1072922213300005554SoilGNGTLKIGPQAQVVISNNSQFTAELQAGTAVIDSLSGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELSVQKQSGGTSTGPGLPGPLKGKSGWSLLGLAGAGAGAAALAVGHGGTKPVSPSGP*
Ga0066700_1010758413300005559SoilKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0066699_1029645023300005561SoilMTPAIALGLALVPALQIFVLAGPAHAATTVQEPTQNGKRVPIGSLSATGEVYVNEKPVPVESTVFADDSVRTGQNSTAVFTLTGNGTLKIGPQAQVVISNNSQFTAELQAGTAVIDSLSGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELVVQKQSGGTSTGPGLPGPFKGKSGWTLLGLAGAGAGTAALAVAHGSSKPVSPSGP*
Ga0066703_1003488723300005568SoilMQHSNWKPPFLCLAHVYEPLRRRMPPAIALGLALVPALQVFVLAGPAHAATTVQEPTQNGKRVPIGSLSATGEVYVNEKPVPVESTVFAGDSVRTGQNSTAVFTLPGNGTLKIGPQTQVVISNNPQFTAELQAGTAVIDSISGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELSVQKQSGGTSTGPGLPGPLKGKSGWSLLGLAGAGAGAAALAVGHGGTKPVSPSGP*
Ga0066702_1055665013300005575SoilSLSATGEVYVNEKPVPVESTVFADDSVRTGQNSTAVFTLTGNGTLKIGPQAQVVISNNSQFTAELQAGTAVIDSLSGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELVVQKQSGGTSTGPGLPGPFKGKSGWTLLGLAGAGAGTAALAVAHGSSKPVSPSGP*
Ga0066708_1020226923300005576SoilVLTVHETFMKLRSKTPLLCLVASYRRPKPKGIAPIAFALAAQMLILQMFLLTVPANAAVHSQDQKQSPKREPIGSLSATGEVYVNDKPAPVESTVFVGDTVRTGENSAAIFSMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLRGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSG
Ga0066696_1005974323300006032SoilLLTAQNTFAKPSRHKLRPLPPGLAVMFVLQIFLFTTPARAAAVSQEQAQNPKRVPVGSLSATGEVYVNEKPVPVESTVFAGDTIRTGQNSTAIFTMTGNGTLKIGAQTQMVISGDPQFPAELQTGTAVINSISGPSGVKLRVGEYVVVPAVRSRVTSAKIERQPDGSFLDTCMDGDISTIPLEGTSGQLLEAGQSVDISPQGRLSVQKQSGIRSTGGGGHGAIKSRTGWTLLGLAGAGAGVAALVVGHGGKQPVSPSGP*
Ga0079222_1004356423300006755Agricultural SoilVSVLRELFKKGLGLKLCPVSLVAVGHPKQRKRIGGSVLAFFLALGLQSFLMPAPANAVGAKQDQTQSAKKLPLGSLSSTGEVYVNDKPVPVESTVFVGDTVRTGANSTAVFTMTGNGTLKIGAQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRAGEYVVVPAVRSRVTSAKIERQADGNFLVTCLDGDISTLALRGTAGRLLEASQSVSLTPNGQMIVQKQSGGKSLDSGIPGVLQGRKGWTLLGLAGAGAGAAALVLGHGGKPPVSPSGP*
Ga0079222_1012326423300006755Agricultural SoilVESTVFPGETIRTGANSTAVFTMTGNGTLKIGSQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEGGQAVSVTPSGQMLVQKQSGGKSIGSGIPGPLQGKRGWTLLTLAGGGAGVAALVLGHSSKPPVSPSGP*
Ga0066659_1006596913300006797SoilLLTAQNTFAKPSRHKLRPLPPGLAVMFVLQIFLFTTPARAAAVSQEQAQNPKRVPVGSLSATGEVYVNEKPVPVESTVFAGDTIRTGQNSTAIFTMTGNGTLKIGAQTQMVISGDPQFPAELQTGTAVINSISGPSGVKLRVGEYVVVPAVRSRVTSAKIERQPDGSFLVTCMDGDISTIPLEGTSGQLLEAGQSVDISPQGRLSVQKQSGIRSTGGGGHGAIKSRTGWTLLGLAGAGAGVAALVVGHGGKQPVSPSGP*
Ga0066659_1149105713300006797SoilYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGV
Ga0079221_1026778323300006804Agricultural SoilTGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKMGSQTQVLLAGDPQFAAELQSGTAVIDSLSGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEGGQAVSVTPSGQMLVQKQSGGKSIGSGIPGPLQGKRGWTLLTLAGGGAGVAALVLGHSSKPPVSPSGP*
Ga0079221_1037961413300006804Agricultural SoilSAASVSQEQAQSTKRQPLGSLTATGEVYVNDKLAPAESTIFVGDTIRTGDAGTAIFTMAGNGALKIGAQTEVVISGDPEFAAELQSGTAVIDSISGPSGIKLRVGNVAVVPAVRSRVTSAKIQGQPGGTFQVTCLNGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0079220_1118385213300006806Agricultural SoilVRTGANSTAVFTMTGNGTLKIGAQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRAGEYVVVPAVRSRVTSAKIERQADGNFLVTCLDGDISTLALRGTAGRLLEASQSVSLTPNGQMIVQKQSGGKSLDSGIPGVLQDRKGWTLLGLAGAGAGAAALVLGHGGKPPVSPSGP*
Ga0075436_10005084323300006914Populus RhizosphereVFVWREVFMKLVSVKLRSLSLAATSQRQKSNWAVRTILAVVLTMGLQGFFVSTPASAAAPAQDQAQNSKKYPLGSLSATGEVYVNEKPVPVESTVFPGETIRTGANSTAVFTMTGNGTLKIGSQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTFLVACLDGDISTLSLEGTAGRLLEAGQAVSVTPGGQMLVQKQSGGKSIGSGMPGPLQGKRGWTLLTLAGGGAGVAALVLGHGGKTPVSPSGP*
Ga0066709_10015200933300009137Grasslands SoilVFVWREIFIKLLSLKLRSLCLAARSQRQKFNWVVRTILAVVLTTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066709_10027928923300009137Grasslands SoilVLTVHETFMKLRSKTPLLCLVASYRRPKPKGIAPIAFALAAKMLILQMFLLTVPANAAVHSQDQKQSPKREPIGSLSATGEVYVNDKPAPVESTVFVGDTVRTGENSAAIFSMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQVDGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLRGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0127440_102183513300010100Grasslands SoilVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0134109_1003314023300010320Grasslands SoilVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0134109_1012758313300010320Grasslands SoilVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0134084_1008246713300010322Grasslands SoilMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVV
Ga0134086_1021531913300010323Grasslands SoilPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQSQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGSFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRHSGGGLRGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP*
Ga0134064_1012594213300010325Grasslands SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPV
Ga0126370_1000178273300010358Tropical Forest SoilLLAPTCAAAASAQEQPQSSRKYPLGSLSATGEVYVNDKPVPVESTVFEGDTVRTGENSTAIFTMTGNGTLKIGAQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRVGGAVVVPAVRSRVTSAKVERQADGTYLVTCLDGDISTLSLQGTAGRLLEGGQSVSVTPTGQMIVQKQSGGKSLGSGITGPLKGKTGWTLMGLAGGGAGVAALVLGHGGKPPVSPSGP*
Ga0126378_1097386823300010361Tropical Forest SoilVGTAAARRQQGQHSRKYPLGSLSATGEVYVNDKPVPVESTVFAGDTIRTGENSTAVFTMTGNGALKIAALTQVALPGEPQFAAELQSGAAVIDSISGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTYVVTCLNGDISTLALQGTAGRLLEAGQAVSVTPAGQMLVQKESGGKPLSSVTGALEGKKGWTMLALAGAGAGVAALVLGHGGKPPVSPSGP*
Ga0126378_1111222813300010361Tropical Forest SoilLAAFLFSGPASAAAAKQEQAQHSRKYPLGSLSATGEVYVNETAVPVESTVFPGDTIRTGENSTAVFTMTGNGTLKIGAQTQVALPGEPQFAAELQSGTALIDSISGPSGIKLRVGGVVVVPAVRSRVTSAKIERLVDGTYVVTCLDGDVSTLALQGTAGRLLEAGQAVSVTPSGQMFAQKESGGKSLGNVTSALQGKKGWTMLALAGGGAGVAALVLGHGGKPPVSP
Ga0126378_1197931213300010361Tropical Forest SoilLLLTLQVSLLITSADAGAVSQEQPQSTRRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGGASTAIFTMTGNGTLKIGAQSQVVISGDPQFAAELQSGTAVIDSISGPSGIKLRVGSFAVVPAVRSRVTSAKIEAQAQPGGTFQVTCLDGDISTLPLQGGGSGRLLEAGQWVSISPSGGLVAPKQSGVRSSGSSNNGPFKGRTGWTLLGLAGAGAGAA
Ga0134066_1010192413300010364Grasslands SoilAFKMQENTLELCSGSVLQNRSSLSVSPASRGESVVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLK
Ga0126379_1201027013300010366Tropical Forest SoilARQQQAQHSRKYPLGSLSATGEVYVNDKPVPVESTVFAGDTIRTGENSTAVFTMTGNGALKIAALTQVALPGEPQFAAELQSGAAVIDSISGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTYVVTCLDGDISTLALQGTAGRLLEAGQAVSVTPGGQMLVQKESGGRPLGGVTSTLQGKKGWTMLALAGGGAGLAAVVLGHGGKPPVSPSGP*
Ga0126381_100002037183300010376Tropical Forest SoilLLTAGLAAFLFSGPASAAAAKQEQAQHSRKYPLGSLSATGEVYVNETAVPVESTVFPGDTIRTGENSTAVFTMTGNGTLKIGAQTQVALPGEPQFAAELQSGTALIDSISGPSGIKLRVGGVVVVPAVRSRVTSAKIERLVDGTYVVTCLDGDVSTLALQGTAGRLLEAGQAVSVTPSGQMFAQKESGGKSLGNVTSALQGKKGWTMLALAGGGAGVAALVLGHGGKPPVSPSGP*
Ga0126381_10105647613300010376Tropical Forest SoilMPFLCLVNSGQRLKSKSSVSTAFAFLLTAGLTFFFLVPAGAAAARQQQAQQSRKYPLGSLSATGEVYVNDKPVPVESTVFAGDTIRTGENSTAVFTMTGNGALKIGAQTQVALPGEPQFAAELQSGTAVIDSISGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTYVVTCLNGDISTLALQGTAGRLLEAGQAVTVTPAGQMLVQKESGGKPLSSVTGALEGKKGWTMLALAGGGAGVAALVLGHGGKPPVSPSGP*
Ga0137383_1107099113300012199Vadose Zone SoilSTAVFTMTGNGTLKIGSQTQLVISGDLQFSADLQAGTAVIDSISGPSGIKVRAGGVVVIPAVRSRVTSAKIGRQADGSFLVTCLDGDISTIQLGGTSGLLLEAGQSVNISPRGELSAQKHSGVTSAGGGQLGPLKGKTGWTLLGLAGAGAGAAALVLGHGGSTPVSPSGP*
Ga0137381_1019136413300012207Vadose Zone SoilSASAASVSQDQAQGTKSQPLGSLTATGEVYVTDKLAPAESTIFVGDTIRTGDAGTAIFTMAGNGTLKIGAHTEVVISGDPEFAAELHSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIQGLPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0137381_1032280713300012207Vadose Zone SoilSPASRGGSIVLTVQETFMKLKSESPLLCLVASSPRQQRKGIASIVLCFAGPILILPMFLLTVPANAAAPSQDQRQNPKREPIGSLSATGEVYVNEKPVPIESTVFVGDTVRTGESSTAVFTMTGNGTVKIGSQTKLVISGDLQFSADLQAGTAVIDSISGPSGIKVRAGGVVVIPAVRSRVTSAKIGRQADGSFLVTCLDGDISTIQLGGTSGLLLEAGQSVNISPRGELSAQKHSGVTSTGGGQLGPLKGKTGWTLLGLAGAGAGAAALVLGHGGSTPVSPSGP*
Ga0137376_1042774113300012208Vadose Zone SoilVLIVHETFMKLRSKTPLLCLVASYRRPKPKGIAPIAFALAAQMLILQMFLLTVPANAAVHSQDQKQSPKREPIGSLSATGEVYVNDKPAPVESTVFVGDTVRTGENSAAIFSMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLRGPLKSRTGWTLLGI
Ga0137379_1004802023300012209Vadose Zone SoilVLTVQETFMKLKSESPLLCLVASSPRQQRKGIASIVLCFAGPILILPMFLLTVPANAAAPSQDQRQNPKREPIGSLSATGEVYVNEKPVPIESTVFVGDTVRTGESSTAVFTMTGNGTVKIGSQTKLVISGDLQFSADLQAGTAVIDSISGPSGIKVRAGGVVVIPAVRSRVTSAKIGRQADGSFLVTCLDGDISTIQLGGTSGLLLEAGQSVNISPRGELSAQKHSGVTSTGGGQLGPLKGKTGWTLLGLAGAGAGAAALVLGHGGSTPVSPSGP*
Ga0137378_1021458223300012210Vadose Zone SoilVLTVQETFMKLKSESPLLRLVASSPRQQRKGIASIVLCFAGPILILPMFLLTVPANAAAPSQDQRQSPKREPIGSLSATGEVYVNEKPVPLESTVFVGDTVRTGENSTAVFTMTGNGTVKIGSQTKLVISGDLQFSADLQAGTAVIDSISGPSGIKVRAGGVVVIPAVRSRVTSAKIGRQADGSFLVTCLDGDISTIQLGGTSGLLLEAGQSVNISPRGELSAQKHSGVTSAGGGQLGPLKGKTGWTL
Ga0137377_1015727033300012211Vadose Zone SoilVYVNDKLAPAESTIFVGDTIRTGDAGTAIFTMAGNGTLKIGAHTEVVISGDPEFAAELHSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIQGLPGGTFQVTCLDGDISTLPLQGGSGQLLEAGQSVSISPGGGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTP
Ga0137377_1070919013300012211Vadose Zone SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTP
Ga0137387_1000153113300012349Vadose Zone SoilVYVNDKLAPAESTIFVGDTIRTGDAGTAIFTMAGNGTLKIGAHTEVVISGDPEFAAELHSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIQGLHGGTFQVTCLDGDISTLPLQGGSGQLLEAGQSVSISPGGGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTP
Ga0137387_1008182713300012349Vadose Zone SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTP
Ga0137384_1146194413300012357Vadose Zone SoilGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGA
Ga0137385_1032140513300012359Vadose Zone SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPS
Ga0134076_1021115723300012976Grasslands SoilISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0134081_1001477423300014150Grasslands SoilMAPAISFGLILMLTLQVSVLTTSANSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0134078_1026350413300014157Grasslands SoilMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVV
Ga0134079_1001936023300014166Grasslands SoilVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRAGGVVVVPAVRSRVTAAKIERQADGSFLVTCLDGDISTIPLQGTSGRLLEAGQSVSASPQGELFVQKQSGVRSSDGGLRGPLKSRTGWTLLGIAGAGAGAAALVLGHGGGTPVSPSGP*
Ga0134079_1005535023300014166Grasslands SoilDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0134073_1000071223300015356Grasslands SoilMFVLQIFLFTTPARAAAVSQEQAQNPKRVPVGSLSATGEVYVNEKPVPVESTVFAGDTIRTGQNSTAIFTMTGNGTLKIGAQTQMVISGDPQFPAELQTGTAVINSISGPSGVKLRVGEYVVVPAVRSRVTSAKIERQPDGSFLVTCMDGDISTIPLEGTSGQLLEAGQSVDISPQGRLSVQKQSGIRSTGGGGHGAIKSRTGWTLLGLAGAGAGVAALVVGHGGKQPVSPSGP*
Ga0134072_1007171313300015357Grasslands SoilRMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP*
Ga0134072_1032162813300015357Grasslands SoilTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP*
Ga0066655_1000403523300018431Grasslands SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0066667_1000201863300018433Grasslands SoilVFVWREIFIKLLSLKLRSLCLAARSQRQKVNWVVRTILAVVLTTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP
Ga0066662_1007648623300018468Grasslands SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0066669_1002662413300018482Grasslands SoilRSQRQKVNWVVRTILAVVLTTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVLLAGDPQFAAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP
Ga0066669_1041314423300018482Grasslands SoilTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQSQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRAGGVVVVPAVRSRVTAAKIERQADGSFLVTCLDGDISTIPLQGTSGRLLEAGQSVSASPQGELFVQKQSGVRSSDGGLRGPLKSRTGWTLLGIAGAGAGAAALVLGHGGGTPVSPSGP
Ga0126371_1001443593300021560Tropical Forest SoilLHFLLTAGLAAFLFSGPASAAAAKQEQAQHSRKYPLGSLSATGEVYVNETAVPVESTVFPGDTIRTGENSTAVFTMTGNGTLKIGAQTQVALPGEPQFAAELQSGTALIDSISGPSGIKLRVGGVVVVPAVRSRVTSAKIERLVDGTYVVTCLDGDVSTLALQGTAGRLLEAGQAVSVTPSGQMFAQKESGGKSLGNVTSALQGKKGWTMLALAGGGAGVAALVLGHGGKPPVSPSGP
Ga0209265_103276623300026308SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPST
Ga0209055_100207423300026309SoilVENANWGERFSRLRTSCQCREHRMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGQVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0209154_100129483300026317SoilMQHSNWKPPFLCLAHVYEPLRRRMPPAIALGLALVPALQVFVLAGPAHAATTVQEPTQNGKRVPIGSLSATGEVYVNEKPVPVESTVFAADSVRTGQNSTAVFTLPGNGTLKIGPQTQVVISNNPQFTAELQAGTAVIDSISGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELSVQKQSGGTSTGPGLPGPLKGKSGWSLLGLAGAGAGAAALAVGHGGTKPVSPSGP
Ga0209471_1000075253300026318SoilMQHSNWKPPFLCLAHVYEPLRRRMPPAIALGLALVPALQVFVLAGPAHAATTVQEPTQNGKRVPIGSLSATGEVYVNEKPVPVESTVFAGDSVRTGQNSTAVFTLPGNGTLKIGPQTQVVISNNPQFTAELQAGTAVIDSISGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELSVQKQSGGTSTGPGLPGPLKGKSGWSLLGLAGAGAGAAALAVGHGGTKPVSPSGP
Ga0209471_113912723300026318SoilFLCLAHVYKPLRGGMTPAIALGLALVPALQIFVLAGPAHAATTVQEPTQNGKRVPIGSLSATGEVYVNEKPVPVESTVFADDSVRTGQNSTAVFTLTGNGTLKIGPQAQVVISNNSQFTAELQAGTAVIDSLSGPSGIKLRAGSVVVVPAVRSRVTSAKIEGQPGGAFLVTCVDGDISTIPLSGTSGQLLEAGQSVNISPRGELVVQKQSGGTSTGPGLPGPFKGKSGWTLLGLAGAGAGTAALAVAHGSSKPVSPSGP
Ga0209375_102158423300026329SoilVGALATFDTGRSKPSLKAPCRSVDKVHPAFKMQENTLELCSGSVLQNRSSLSVSPASRGESVVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP
Ga0209473_1000326293300026330SoilMFVLQIFLFTTPARAAAVSQEQAQNPKRVPVGSLSATGEVYVNEKPVPVESTVFAGDTIRTGQNSTAIFTMTGNGTLKIGAQTQMVISGDPQFPAELQTGTAVINSISGPSGVKLRVGEYVVVPAVRSRVTSAKIERQPDGSFLVTCMDGDISTIPLEGTSGQLLEAGQSVDISPQGRLSVQKQSGIRSTGGGGHGAIKSRTGWTLLGLAGAGAGVAALVVGHGGKQPVSPSGP
Ga0209473_103857633300026330SoilVFVWREIFIRLLSLKLRSVCLAARSQRQKANCVARTILAVVLTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP
Ga0209803_100211623300026332SoilVENANWGERFSRLRTSCQCREHRMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0209808_110239723300026523SoilLKLRSVCLAARSQRQKANCVARTILAVVLTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTLKIGSQTQVSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP
Ga0209376_106462323300026540SoilVLTVHETFMKLKSKSPLLSFVASCQRPKQKGIAAIAFVLAAPVLILQVSLLTLTVPANAATPSQEQKQSPKREPMGSLSATGEVYVNDKPAPVESTVFVGDTIRTGENSAAIFTMTGNGTLKIGSQTQVVITGDPQFAAELQAGTAVIDSISGPSGIKVRVGGVVVVPAVRSRVTAAKIERQADGLFLVTCLDGDISTIPLQGTSGQLLEAGQSVSASPRGELFVQKQSGVRPSGGGLPGPLKSRTGWTLLGIAGAGAGAAALLLGHGGGTPVSPSGP
Ga0209805_106075423300026542SoilVFVWREIFIRLLSLKLRSVCLAARSQRQKANCVARTILAVVLTMGLQGSFLSAPASGAAPTQDQAQNSRKYPLGSLSATGEVYVNDKPVPVESTVFPGDTVRTGENSTAVFTMTGNGTSLAGDPQFVAELQTGTAVIDSLSGPSGIKVRVGGVVVVPAVRSRVTSAKIERQADGTFLVTCLDGDISTLSLEGTAGRLLEAGQAVGVTPSGQMLVQKQSGGKSLGSGVPGTLKGKKGWTLLGLAGGGAGVAAVVLGHGGKAPVSPSGP
Ga0209156_1001561033300026547SoilVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0209156_1014546613300026547SoilLTAQNTFAKPSRHKLRPLPPGLAVMFVLQIFLFTTPARAAAVSQEQAQNPKRVPVGSLSATGEVYVNEKPVPVESTVFAGDTIRTGQNSTAIFTMTGNGTLKIGAQTQMVISGDPQFPAELQTGTAVINSISGPSGVKLRVGEYVVVPAVRSRVTSAKIERQPDGSFLVTCMDGDISTIPLEGTSGQLLEAGQSVDISPQGRLSVQKQSGIRSTGGGGHGAIKSRTGWTLLGLAGAGAGVAALVVGHGGKQPVSPSGP
Ga0209689_101227713300027748SoilKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVSSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0209073_1012238613300027765Agricultural SoilVFVWREVFMKLVSVKLRSLSLAATSQRQKSNWAVRTILAVVLTMGLQGFFVSTPASAAAPAQDQAQNSKKYPLGSLSATGEVYVNEKPVPVESTVFPGETIRTGANSTAVFTMTGNGTLKIGSQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTFLVACLDGDISTLSLEGTAGRLLEAGQAVSVTPGGQMLVQKQSGGKSIGSGMPGPLQGKRG
Ga0209074_1036310913300027787Agricultural SoilTVFPGETIRTGANSTAVFTMTGNGTLKIGSQTQVVLTGDPQFAAELQSGTAVIDSLSGPSGIKLRVGGVVVVPAVRSRVTSAKIERQADGTFLVACLDGDISTLSLEGTAGRLLEAGQAVSVTPGGQMLVQKQSGGKSIGSGMPGPLQGKRGWTLLTLAGGGAGVAALVLGHGGKTPVSPSGP
Ga0209166_1003854333300027857Surface SoilVSVSRETFRKGPGLKLCSVSLVAVGQSKQLKRMVGSALILFLTIGLQGFFLTTPVNAAGAKQDQAQNAKKQPLGSLSSTGEVYVNDKPVPIESTVFVGDTVRTGANSTAVFTMTGNGTLKIGAQTQVALTGDPQFAAELQSGTAVIDSLSGPSGIKLRAGEYVVVPAVRSRVTSAKIERQADGSFLVTCLDGDISTLALQGTAGRLLEATQSVTLTPSGQMLVQKQSGGNSAGSGIAGPLSSRKGWTMLGLAGGGAGLAALVLGHGGKPPVSPSGP
Ga0307477_1067627613300031753Hardwood Forest SoilDKLAPAESTIFVGDTIRTGDAGTAIFTMAGNGALKIGAQTEVVISGDPEFAAELQSGTAVIDSISGPSGIKLRVGNVAVVPAVRSRVTSAKIQGQPGGTFQVTCLNGDISTLPLQGGSGQLLEAGQSVSISPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0307475_1016022213300031754Hardwood Forest SoilVVSTADERFLEHPNSERSFRCLRTLCQYPERRTAPAISFGLVLMLTFQVSLLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVTPAESTIFVGDTIRTGDAGTAIFTMAGNGALKIGAQTEVVISGDPEFAAELQSGTAVIDSISGPSGIKLRVGNVAVVPAVRSRVTSAKIQGQPGGTFQVTCLNGDISTLPLQGGSGQLLEAGQSVSISPGGGLVAQKGSGLKLSGSSHTEVLKGRTGWTLLGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0307479_1000165423300031962Hardwood Forest SoilVSTADERFLEHPNSERSFRCLRSLWQYPERRTAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQVATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGRTFQVTCLTGDISALPLQGGSGQLLEAGQSVSISPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0307479_1094221613300031962Hardwood Forest SoilAPAISFGLVLMLTFQVSLLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVTPAESTIFVGDTIRTGDAGTAIFTMAGNGALKIGAQTEVVISGDPEFAAELQSGTAVIDSISGPSGIKLRVGNVAVVPAVRSRVTSAKIQGQPGGTFQVTCLNGDISTLPLQGGSGQLLEAGQSVSISPGGGLVAQKGSGLKLSGSSHTEVLKGRTGWTLLGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0307471_10000371983300032180Hardwood Forest SoilVSTADERFLEHPNSERSFRCLRTLCQYPERRTAPAISFGLVLMLAFQVSLLTISASAASVPQEQAQSTKRQPLGSLTATGEVYVNDKLAPAESTIFVGDTIRTGDAGTAIFTMAGNGALKIGAQTEVVISGDPEFAAELQSGTAVIDSISGPSGIKLRVGNVAVVPAVRSRVTSAKIQGQPGGTFQVTCLNGDISTLPLQGGSGQLLEAGQSVSISPGGGLVAQKGSGLKLSGSSHTEVLKGRTGWTLLGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0307471_10077829913300032180Hardwood Forest SoilMAPAISFGLILMLTLQVSVLTTSADSASVSQEQTQSTKRQPLGSLTATGEVYVNDKVAPAESTVFVGDTIRTGEASTAIFTMTGNGTLKIGAQSQVVISGDPQFATELQSGTAVIDSISGPSGIKLRVGNFAVVPAVRSRVTSAKIEAQPGGTFQVTCLTGDISTLPLQGGSGQLLEAGQSVRSSPGRGLVAQKQSGLKLSGSSHTEALKGRTGWTLFGLAGAGAGAAALALTHGGGTPVSPSTP
Ga0307472_10003466723300032205Hardwood Forest SoilNDKVTPAESTIFVGDTIRTGDAGTAIFTMAGNGALKIGAQTEVVISGDPEFAAELQSGTAVIDSISGPSGIKLRVGNVAVVPAVRSRVTSAKIQGQPGGTFQVTCLNGDISTLPLQGGSGQLLEAGQSVSISPGGGLVAQKGSGLKLSGSSHTEVLKGRTGWTLLGLAGAGAGAAALALTHGGGTPVSPSTP
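
In the rows above, the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) run together without separators. The following is a minimal sketch of one way to split such a row in Python. It rests on assumptions drawn only from the rows shown here, not on any documented export format: the sample taxon ID is taken to be exactly 10 digits beginning with 3300, the habitat is taken to be one or more capitalized words, and the sequence is taken to be a run of uppercase letters with an optional trailing `*`. The names `FamilyRow`, `ROW_RE`, and `parse_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str
    has_stop: bool  # True when the row ends with '*'


# Assumed layout: <protein id><10-digit taxon id starting with 3300>
# <Capitalized Habitat Words><UPPERCASE SEQUENCE>[*]
ROW_RE = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z][a-z]+(?: [A-Z][a-z]+)*)"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<stop>\*?)$"
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row; return None if it does not fit the assumed layout."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return FamilyRow(
        protein_id=m["protein_id"],
        taxon_id=m["taxon_id"],
        habitat=m["habitat"],
        sequence=m["sequence"],
        has_stop=(m["stop"] == "*"),
    )


# Sequence shortened here for brevity; real rows carry the full sequence.
example = parse_row("Ga0134076_1021115723300012976Grasslands SoilISGDPQFATELQ*")
```

Everything before the taxon ID is treated as the protein ID, so if the page fuses an additional field there, the pattern would need adjusting. Lines that do not fit, such as the column header, return `None` rather than raising.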

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice