NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F093610

Metagenome Family F093610

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F093610
Family Type Metagenome
Number of Sequences 106
Average Sequence Length 63 residues
Representative Sequence GKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN
Number of Associated Samples 81
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 28.30 %
% of genes from short scaffolds (< 2000 bps) 26.42 %
Associated GOLD sequencing projects 72
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (72.642 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(33.962 % of family members)
Environment Ontology (ENVO) Unclassified
(53.774 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(60.377 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 74.60%    β-sheet: 0.00%    Coil/Unstructured: 25.40%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 106 Family Scaffolds
PF02830V4R 58.49
PF06505XylR_N 1.89
PF01882DUF58 0.94
PF07726AAA_3 0.94
PF03952Enolase_N 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 106 Family Scaffolds
COG1719Predicted hydrocarbon binding protein, contains 4VR domainGeneral function prediction only [R] 58.49
COG0148EnolaseCarbohydrate transport and metabolism [G] 0.94
COG1721Uncharacterized conserved protein, DUF58 family, contains vWF domainFunction unknown [S] 0.94


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A72.64 %
All OrganismsrootAll Organisms27.36 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005167|Ga0066672_10748840All Organisms → cellular organisms → Archaea620Open in IMG/M
3300005167|Ga0066672_10748841All Organisms → cellular organisms → Archaea620Open in IMG/M
3300005174|Ga0066680_10220516All Organisms → cellular organisms → Archaea1201Open in IMG/M
3300005174|Ga0066680_10930483All Organisms → cellular organisms → Archaea513Open in IMG/M
3300005176|Ga0066679_10849841All Organisms → cellular organisms → Archaea579Open in IMG/M
3300005178|Ga0066688_10813200All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon583Open in IMG/M
3300005554|Ga0066661_10700876All Organisms → cellular organisms → Archaea594Open in IMG/M
3300005586|Ga0066691_10485195All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon738Open in IMG/M
3300006794|Ga0066658_10562098All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon620Open in IMG/M
3300006806|Ga0079220_11801910All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon539Open in IMG/M
3300009012|Ga0066710_104739691All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon508Open in IMG/M
3300009088|Ga0099830_11425027All Organisms → cellular organisms → Archaea576Open in IMG/M
3300009089|Ga0099828_11144704All Organisms → cellular organisms → Archaea691Open in IMG/M
3300010321|Ga0134067_10061554All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1228Open in IMG/M
3300012201|Ga0137365_10889259All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon650Open in IMG/M
3300012350|Ga0137372_10136038All Organisms → cellular organisms → Archaea2022Open in IMG/M
3300012362|Ga0137361_11285479All Organisms → cellular organisms → Archaea656Open in IMG/M
3300012972|Ga0134077_10389142All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon600Open in IMG/M
3300017659|Ga0134083_10379972All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon613Open in IMG/M
3300018468|Ga0066662_10338105All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1288Open in IMG/M
3300018468|Ga0066662_12977587All Organisms → cellular organisms → Archaea503Open in IMG/M
3300026306|Ga0209468_1032555All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1816Open in IMG/M
3300026313|Ga0209761_1304099All Organisms → cellular organisms → Archaea553Open in IMG/M
3300026325|Ga0209152_10039817All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon1627Open in IMG/M
3300026342|Ga0209057_1221776All Organisms → cellular organisms → Archaea537Open in IMG/M
3300026361|Ga0257176_1062963All Organisms → cellular organisms → Archaea594Open in IMG/M
3300026536|Ga0209058_1256650All Organisms → cellular organisms → Archaea611Open in IMG/M
3300031962|Ga0307479_10054000Not Available3868Open in IMG/M
3300032180|Ga0307471_103715555All Organisms → cellular organisms → Archaea540Open in IMG/M
3300032180|Ga0307471_104039661All Organisms → cellular organisms → Archaea518Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil33.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil33.02%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil12.26%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.49%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.72%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.89%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.94%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.94%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand0.94%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026361Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1011792813300002558Grasslands SoilVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEATQPAN*
JGI25383J37093_1006546013300002560Grasslands SoilKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATHTAN*
JGI25384J37096_1004229533300002561Grasslands SoilLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSAQQSSL*
JGI25382J37095_1008898523300002562Grasslands SoilLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEATQPAN*
JGI25382J43887_1008715023300002908Grasslands SoilLATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSGFAHKINEAAEMLTRELAEATQPAN
Ga0066672_1074884023300005167SoilPGVRIKDAVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVGEAVETLERELSGATKIAN*
Ga0066672_1074884123300005167SoilPGVRIKDAVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVGEAVETLERELSGATKIAN*
Ga0066680_1022051633300005174SoilPGVRIKDAVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN*
Ga0066680_1093048313300005174SoilAVKVGKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSVQQSSL*
Ga0066679_1018357413300005176SoilGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0066679_1084984113300005176SoilRFLQAFEFQLDVINNLSRSVFVIDNPNSAFAKKIVEAVETLERELAGATKIAN*
Ga0066679_1094950823300005176SoilELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0066690_1027481413300005177SoilVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKIGEAVEILERELSGATKIAN*
Ga0066688_1081320013300005178SoilGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN*
Ga0066685_1026981513300005180SoilKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKIGEAVETLERELSGATKIAN*
Ga0066685_1053162113300005180SoilKVGKELADLFKSKFLEAFEFQIDVINNLGRSVFILDNPNSTFARKVGDAADYLVRELGEAPKIAN*
Ga0066686_1104950623300005446SoilKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSAQQSSL*
Ga0066689_1001316553300005447SoilIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEATQPAN*
Ga0066697_1006382113300005540SoilSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN*
Ga0066701_1024218513300005552SoilSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN*
Ga0066661_1039364513300005554SoilLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN*
Ga0066661_1070087613300005554SoilLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN*
Ga0066704_1054439323300005557SoilLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN*
Ga0066698_1022599233300005558SoilKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0066698_1029946313300005558SoilRFLQAFEFQMDVINNLSRSVFVIDNPDSGFAHKVTEAADMLTKELEEIPKAAN*
Ga0066698_1058488213300005558SoilKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN*
Ga0066700_1031423013300005559SoilFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN*
Ga0066708_1032426613300005576SoilKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0066691_1048519513300005586SoilGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDYPDSGFAHKINEAAEMLINELAEATQPAN*
Ga0066656_1086889213300006034SoilGKELAVLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN*
Ga0066658_1056209823300006794SoilLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0079220_1180191013300006806Agricultural SoilKELATLFRSHFLQAFEFQIDVINNLSRSVFVVDNPDSPFAHKVSEAVELLTQELAEVPKSAN*
Ga0099791_1038870813300007255Vadose Zone SoilLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN*
Ga0099794_1033835413300007265Vadose Zone SoilPGVRIKDAIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKLAN*
Ga0066710_10473969113300009012Grasslands SoilVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN
Ga0099830_1133719613300009088Vadose Zone SoilDAMKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN*
Ga0099830_1142502723300009088Vadose Zone SoilELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN*
Ga0099828_1048035213300009089Vadose Zone SoilQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPYSSFAKKIGEAVETLERELSGATKIAN*
Ga0099828_1049809713300009089Vadose Zone SoilQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN*
Ga0099828_1097972623300009089Vadose Zone SoilFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN*
Ga0099828_1114470413300009089Vadose Zone SoilPPGVRIKDAVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN*
Ga0099827_1057081323300009090Vadose Zone SoilFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKIGEAVETLERELSGATKIAN*
Ga0066709_10451969613300009137Grasslands SoilVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0126373_1080967013300010048Tropical Forest SoilATLFKSHFLQAFEFQIDVINNLSRSVFVIDNPDSPFAHKVSEAVELLTKELEEAPKTAN*
Ga0134067_1006155423300010321Grasslands SoilPPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0134067_1014980913300010321Grasslands SoilLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN*
Ga0134080_1000545013300010333Grasslands SoilKELAVLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN*
Ga0126379_1032750913300010366Tropical Forest SoilGKELATLFKSHFLQAFEFQIDVINNLSRSVFVIDNPDSPFAHKVSEAVELLTKELEEAPKTAN*
Ga0126383_1158815613300010398Tropical Forest SoilKFLAAFEFQIDVINNLSRSVFVMDNPNSSFAKKVDEAVGNLEKELNGSS*
Ga0137391_1033501913300011270Vadose Zone SoilLFRSKFLEAFEFQIDVINNLSRSVFVLDHPTSTFARKVGEAADFLVKELGEAPKIAN*
Ga0137391_1129354113300011270Vadose Zone SoilTLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN*
Ga0137393_1088365123300011271Vadose Zone SoilHSRFLQAFEFQMDVINNLSRSVFVVDNPDSGFAHKINEAAEMLTRELAEEPQPAN*
Ga0137388_1025870743300012189Vadose Zone SoilLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKIGEAVETLERELSGATKIAN*
Ga0137388_1079038423300012189Vadose Zone SoilVRIKDAIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN*
Ga0137365_1074735313300012201Vadose Zone SoilFRSRFLQAFEFQLDVINNLSRSVFVIDNPDSSFAKKIGEAVETLERELSGATKIAN*
Ga0137365_1088925913300012201Vadose Zone SoilTLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSGFAHKINEAAEMLINELAEVTQPAN*
Ga0137362_1075387313300012205Vadose Zone SoilDAVKVGRELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEVTQPAN*
Ga0137380_1090432613300012206Vadose Zone SoilVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKVTEAADILERELSGATKIAN*
Ga0137381_1091834323300012207Vadose Zone SoilELADLFKSKFLEAFEFQIDVINNLSRSVFVLDHPNSTFARKVGDAADFLVKELGEAPRIAN*
Ga0137379_1046858333300012209Vadose Zone SoilVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADILERELSGATKIAN*
Ga0137378_1050596213300012210Vadose Zone SoilGKELATLFRSRFLQAFDFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAELLTKELAEVPATAN*
Ga0137387_1009113213300012349Vadose Zone SoilGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN*
Ga0137387_1015111913300012349Vadose Zone SoilVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN*
Ga0137387_1132665113300012349Vadose Zone SoilKELAILFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN*
Ga0137372_1013603813300012350Vadose Zone SoilDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN*
Ga0137386_1106409913300012351Vadose Zone SoilKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKVTEAADILERELSGATKIAN*
Ga0137369_1089252823300012355Vadose Zone SoilAVKVGKELAMLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAVELLMQELKEVPDSAN*
Ga0137384_1138210313300012357Vadose Zone SoilADLFRSKFLEAFEFQIDVINNLSRSVFVLEHPNSTFARKVGEAADFLVKELGEAPKIAN*
Ga0137361_1128547923300012362Vadose Zone SoilGVRIKDAVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN*
Ga0137390_1019802843300012363Vadose Zone SoilLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN*
Ga0137390_1044021623300012363Vadose Zone SoilLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN*
Ga0137395_1082732013300012917Vadose Zone SoilDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSGFAHKVTEAAEMLTKELEEIPKAAN*
Ga0137419_1194515313300012925Vadose Zone SoilRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN*
Ga0134077_10000126153300012972Grasslands SoilGKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSVQQSSL*
Ga0134077_1038914223300012972Grasslands SoilVPPGVRIKDAMKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN*
Ga0134087_1045367513300012977Grasslands SoilRLKDAVKVGKELAVLFRSRFLQAFDFQIDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN*
Ga0134078_1010460513300014157Grasslands SoilVRLKDAVKVGKELAVLFRSRFLQAFDFQIDVINNLSRSVFVIDNPDSAFAHKVNEAVELLTKELAEVPATAN*
Ga0134085_1060495713300015359Grasslands SoilAIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVIDNPDSSFAKKIGEAVETLERELSGATKIAN*
Ga0134083_1037997223300017659Grasslands SoilVPPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATHTAN
Ga0066662_1033810523300018468Grasslands SoilPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN
Ga0066662_1297758713300018468Grasslands SoilDAVKVGKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSAQQSSL
Ga0137417_116724913300024330Vadose Zone SoilRIKDAMKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN
Ga0209350_103443713300026277Grasslands SoilFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN
Ga0209237_116246523300026297Grasslands SoilGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATHPAN
Ga0209236_119429413300026298Grasslands SoilAIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN
Ga0209468_103255533300026306SoilVPPGVRLKDAVKVGKELAVLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN
Ga0209761_130409913300026313Grasslands SoilGRELAQLFRSKFLAAFEFQMDVINNLSRSVFVVDNPNSTFAHKVSGAAENLIRELGEAAKVAN
Ga0209686_117384913300026315SoilGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN
Ga0209152_1003981723300026325SoilVPPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN
Ga0209267_119849623300026331SoilGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN
Ga0209804_121718723300026335SoilFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN
Ga0209057_118460523300026342SoilLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSGFAHKVTEAADMLTKELEEIPKAAN
Ga0209057_122177623300026342SoilSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN
Ga0257176_106296323300026361SoilKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN
Ga0257181_101487423300026499SoilMKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN
Ga0209806_100584193300026529SoilGKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLERELAGMSAQQSSL
Ga0209806_108669313300026529SoilTLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN
Ga0209058_125665023300026536SoilLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKIGEAVETLERELSGATKIAN
Ga0209588_115708823300027671Vadose Zone SoilIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN
Ga0209180_1061757313300027846Vadose Zone SoilKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN
Ga0209868_101231413300027947Groundwater SandAVKVGRELAQLFRSRFLEAFEFQLDVINNLSRSVFVLDQPNSSFAKKISEAAENLTRQIGEIGETPKVAN
Ga0307479_1005400063300031962Hardwood Forest SoilATLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVSEAAEMLTNELREAPRAAN
Ga0307471_10069223413300032180Hardwood Forest SoilLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVREAAEMLTNELTEAPKPAN
Ga0307471_10371555523300032180Hardwood Forest SoilVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN
Ga0307471_10403966113300032180Hardwood Forest SoilPGVRIKDAMKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKVTEAVDTLERELSGATKIAS
Ga0307472_10107543823300032205Hardwood Forest SoilVGKELATLFKSHFLQAFEFQMDVINNLSRSVFVIDNPDSSFAHKVTEAAEMLTNELREAPASAN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.