MGnifams

A metagenomics-derived protein families resource


Protein Family: MGYF0000000007

Overview

This is the top-scoring MGnify protein (along with its specific region if not whole) that was recruited in the family through hmmsearch. Links to the MGnify Proteins site. Family representative sequence MGYP006891090331/1-516
# Amino Acids (AA) Representative length 516
The total number of MGnify sequences that have been iteratively recruited in the family through a series of processes such as: creating a seed alignment from the family's initial cluster, building an HMM model, and finally recruiting and aligning sequences from MGnify Proteins with the family HMM model. Total number of sequences in the family 3618
Denotes if FunFam functional annotation hits were identified via the hmmer/hmmsearch tool. Sequence-HMM FunFam matches
Denotes if Pfam functional annotation hits were identified via the hmmer/hmmsearch tool. Sequence-HMM Pfam matches
Denotes if Pfam domain annotation hits were identified through model searching with the hhsuite/hhblits tool. Profile-profile Pfam matches
Denotes if structure homologs of the family's representative sequence have been identified in the AlphaFoldDB or PDB databases through the foldseek tool. Structure-structure hits

ESMFold structure

Predicted 3D protein structure through the Meta AI ESMFold model. ESMFold uses the representations from a large language model (ESM2) to generate an accurate structure prediction from the sequence of a protein.

For more information visit:

Download CIF file

  Very high (pLDDT ≥ 90)   High (90 > pLDDT ≥ 70)   Low (70 > pLDDT ≥ 50)   Very low (pLDDT < 50)

pLDDT corresponds to the model's prediction of its score on the per-residue Local Distance Difference Test. It is a measure of local accuracy. Confidence bands are used to colour-code the residues in the 3D viewer. The exact per-residue pLDDT value is shown when you mouseover the structure. Average structure plddt score: 69.9
The pTM score (predicted Template Modeling score) is a confidence metric that estimates how accurate the global topology of a predicted protein structure is likely to be. pTM score: 0.499

Predicted secondary structure The secondary structure prediction was carried out with the s4pred software.

α-helices:  0.78%
β-strands:  49.22%
coils:      50.0%

The protein is likely rich in β-strands, which may indicate a β-sheet-dominated architecture.

Download features JSON file

Predicted transmembrane regions The transmembrane region prediction was carried out with the DeepTMHMMM software.

inside:     0.0%
membrane-α: 0.0%
outside:    96.9%
signal:     3.1%
membrane-β: 0.0%
periplasm:  0.0%

This does not seem to be a transmembrane protein.

Download transmembrane JSON file

Multiple Sequence Alignment (Seed) This is the seed alignment that was used to create the HMM model of the family. It is different to the full alignment, which incorporates all MGnify sequences that have been recruited in the family after searching with the HMM model against the sequence pool. The full alignment is usually quite larger than the seed one and can be downloaded via the FTP.

Download seed MSA file

HMM viewer The family HMM is visualized via the Skylign API.

The height of each stack represents the information content (also known as relative entropy) at that position, while the size of each letter within the stack reflects its estimated probability. Click on a stack to highlight the corresponding column in the seed MSA viewer above.

Download HMM file

Biomes distribution An interactive sunburst plot showing the biomes where the family's underlying MGnify proteins were detected.

Download biomes CSV file

Domain architecture

The top 15 most prevalent domain architectures (including MGnifams and Pfams) found in the full alignment sequences of the family. The numbers on the left indicate how many MGnify sequences share each domain architecture.

Download domains JSON file

Functional annotation through Funfam matches

The family representative sequence was searched against the FunFam database (ver. 4.3.0) with hmmer/hmmsearch.

No FunFam hits found

Functional annotation through Pfam matches

The family representative sequence was searched against the Pfam database (ver. 38.0) with hmmer/hmmsearch.

Pfam Name E-value Score HMM from HMM to Alignment from Alignment to Envelope from Envelope to Accuracy
PF19049 SidE_DUB 0.00011 12.5 29 134 183 288 160 306 0.71

Profile-profile Pfam matches

This MGnifam HMM profile was searched against the HH-suite profile Pfam database (ver. 35.0) with HHsearch.

No MGnifam model Pfam hits found

Structure-structure hits

This MGnifam 3D structure was searched against the Alphafold/UniProt and PDB databases with foldseek.

Rank Target Structure Target DB Aligned Length Query Start Query End Target Start Target End E-value
1 R5RW47 AlphaFold 513 3 515 507 1.904e-41
2 R6ESW7 AlphaFold 515 1 515 501 9.18e-38
3 A0A843CFZ3 AlphaFold 484 32 515 832 3.441e-08
4 A0A662RLU0 AlphaFold 493 31 515 834 3.802e-08
5 A0A7V0ST75 AlphaFold 484 31 514 1059 1.198e-07
6 A0A2N5ZHC4 AlphaFold 465 50 514 1048 1.198e-07
7 A0A2N2WDR4 AlphaFold 484 31 514 1035 1.259e-07
8 A0A1J5AW95 AlphaFold 485 32 516 1039 1.698e-07
9 A0A7C3DPZ1 AlphaFold 462 53 514 1054 2.291e-07
10 D8MR81 AlphaFold 621 48 516 998 2.797e-07
11 A0A850NKJ1 AlphaFold 516 1 516 502 3.965e-07
12 F1VW93 AlphaFold 516 1 516 674 4.606e-07
13 A0A7X8KCC2 AlphaFold 485 30 514 442 4.841e-07
14 A0A354R943 AlphaFold 514 2 515 493 4.841e-07
15 A0A2J6HER8 AlphaFold 462 53 514 1036 5.349e-07
16 A0A662F6H3 AlphaFold 516 1 516 477 5.349e-07
17 A0A0S2HZ20 AlphaFold 462 53 514 1050 5.623e-07
18 A0A1F3IYK6 AlphaFold 462 53 514 1035 6.53e-07
19 A0A845TS82 AlphaFold 516 1 516 674 7.216e-07
20 A0A3D0VIV0 AlphaFold 462 53 514 1035 7.216e-07
21 A0A847Q4A2 AlphaFold 462 53 514 1034 7.585e-07
22 A0A1H6P341 AlphaFold 519 30 515 592 9.26e-07
23 A0A2N2XWF4 AlphaFold 462 53 514 1079 1.075e-06
24 A0A1Z9IQ63 AlphaFold 379 136 514 803 1.13e-06
25 A0A2E5UQG1 AlphaFold 366 149 514 932 1.249e-06
26 A0A2G6BPP1 AlphaFold 463 52 514 1053 1.249e-06
27 A0A7Y7TRF8 AlphaFold 457 58 514 1035 1.249e-06
28 A0A2F0AFG9 AlphaFold 379 136 514 602 1.313e-06
29 A0A0Q1A9T4 AlphaFold 485 30 514 696 1.38e-06
30 A0A7Y7TR01 AlphaFold 459 56 514 1067 1.771e-06
31 A0A2E5SHP9 AlphaFold 375 140 514 924 2.057e-06
32 A0A1I3EKM2 AlphaFold 570 1 516 771 2.057e-06
33 A0A4R3TWV8 AlphaFold 549 8 516 770 2.775e-06
34 A0A2E2VV05 AlphaFold 366 149 514 932 3.066e-06
35 A0A1V6I5Y2 AlphaFold 462 53 514 1034 3.066e-06
36 A0A4V2AZH4 AlphaFold 191 355 515 332 3.223e-06
37 A0A1D8P7M8 AlphaFold 507 54 515 648 3.743e-06
38 A0A4Q1KGX9 AlphaFold 513 58 515 1114 3.935e-06
39 A0A7X8Q9P1 AlphaFold 487 28 514 1051 4.57e-06
40 A0A1R2CNE6 AlphaFold 501 46 515 507 6.48e-06
41 8ic1 PDB 449 67 515 475 6.995e-06
42 8hhv PDB 424 92 515 475 7.729e-06
43 W2DIY6 AlphaFold 503 31 516 1245 8.741e-06
44 A0A2K8THB2 AlphaFold 516 1 516 465 8.741e-06
45 A0A662A4B1 AlphaFold 453 64 516 1112 9.188e-06
46 A0A225EB15 AlphaFold 129 387 515 341 9.658e-06
47 A0A6L9KM00 AlphaFold 577 29 516 833 9.658e-06
48 A0A2E1QA83 AlphaFold 209 323 514 596 1.067e-05
49 A0A7J4JMI2 AlphaFold 367 31 397 389 1.067e-05
50 A0A538ETN0 AlphaFold 222 301 515 649 1.239e-05
51 A0A7Y7Z0Q1 AlphaFold 581 51 516 1243 1.303e-05
52 A0A2N5YWY9 AlphaFold 459 56 514 738 1.303e-05
53 A0A4P9VRM9 AlphaFold 513 2 514 387 1.369e-05
54 A0A4V0GQK0 AlphaFold 514 1 514 465 1.513e-05
55 A0A0G1NH71 AlphaFold 381 31 411 391 1.757e-05
56 A0CFE6 AlphaFold 512 153 515 564 1.847e-05
57 A0A2N2WDT8 AlphaFold 462 53 514 1055 1.942e-05
58 7kwc PDB 223 300 515 241 2.096e-05
59 A0A0X8PM41 AlphaFold 514 1 514 465 2.255e-05
60 6v1v PDB 463 54 516 750 2.435e-05
61 A0A2J6HER9 AlphaFold 464 53 516 1058 2.492e-05
62 A0A4U1HQY1 AlphaFold 467 48 514 785 2.492e-05
63 A0A7X8FBD4 AlphaFold 214 301 514 751 2.619e-05
64 A0A2D7NEV3 AlphaFold 209 322 514 595 2.753e-05
65 A0A1N6Q7J3 AlphaFold 453 64 516 630 3.904e-05
66 6vls PDB 487 30 516 950 4.01e-05
67 K2H303 AlphaFold 503 13 515 496 4.104e-05
68 6tfk PDB 466 51 516 681 4.43e-05
69 A0A1Z9PSK7 AlphaFold 208 322 514 594 4.535e-05
70 A0A3E0TY78 AlphaFold 485 74 516 645 5.267e-05
71 4qaw PDB 222 293 514 523 5.409e-05
72 A0A2F0AFW7 AlphaFold 209 323 514 603 5.536e-05
73 4qaw PDB 222 293 514 527 5.686e-05
74 A0A2D8S7Z1 AlphaFold 209 323 514 603 5.819e-05
75 A0A3E0TQY7 AlphaFold 477 84 516 644 6.43e-05
76 A0A1R2BV65 AlphaFold 487 29 515 518 6.43e-05
77 A0A0M0F7N3 AlphaFold 451 66 516 626 6.43e-05
78 4zmh PDB 298 242 514 912 6.604e-05
79 A0A3E0UF02 AlphaFold 485 74 516 645 7.468e-05
80 A0A2U2AKU2 AlphaFold 350 167 516 269 7.85e-05
81 6tfj PDB 467 50 516 764 8.062e-05
82 A0A6A6QD54 AlphaFold 476 58 515 739 8.251e-05
83 5awq PDB 215 354 515 593 9.842e-05
84 A0A2N5YTC3 AlphaFold 350 165 514 325 0.0001007
85 A0A0H2L953 AlphaFold 453 64 516 626 0.0001007
86 6tfj PDB 467 50 516 765 0.0001035
87 A0A3E0TQ89 AlphaFold 475 83 516 645 0.0001059
88 4qaw PDB 213 302 514 526 0.0001202
89 5x7q PDB 463 53 515 1081 0.0001263
90 A0A2E6VD14 AlphaFold 355 162 516 340 0.0001293
91 A0A6N7D6H5 AlphaFold 498 19 516 816 0.0001293
92 A0A1Q7VYD9 AlphaFold 183 352 515 187 0.0001359
93 6bqm PDB 467 50 516 455 0.0001396
94 A0A7V0SSX1 AlphaFold 459 56 514 1046 0.0001501
95 2vzr PDB 125 390 514 124 0.0001542
96 4qaw PDB 223 292 514 529 0.0001542
97 A0A1Y1RIY7 AlphaFold 481 34 514 682 0.0001578
98 6vls PDB 487 30 516 954 0.0001621
99 A0A8B6H6I3 AlphaFold 468 110 515 689 0.0001744
100 A0A6A6AJW6 AlphaFold 147 370 515 355 0.0001833
101 A0A2G8KQ99 AlphaFold 561 56 515 642 0.0001927
102 A0A2D6RL76 AlphaFold 149 387 516 745 0.0002025
103 A0A3D6B276 AlphaFold 295 222 516 242 0.0002025
104 A0A2A2QFP4 AlphaFold 518 195 515 536 0.0002025
105 A0A3E0TXI2 AlphaFold 477 84 516 644 0.0002025
106 A0A3E0UEJ3 AlphaFold 477 84 516 644 0.0002025
107 A0A099KVZ6 AlphaFold 489 84 516 656 0.0002025
108 J2ZT18 AlphaFold 231 286 516 887 0.0002352
109 5cag PDB 250 13 240 251 0.0002416
110 6v1v PDB 463 54 516 748 0.0002416
111 A0A841W8N0 AlphaFold 129 390 515 187 0.0002872
112 A0A4P5QUE5 AlphaFold 130 387 516 223 0.0002872
113 2w3j PDB 128 389 516 127 0.000295
114 3wnk PDB 135 388 514 518 0.000295
115 A0A0F8LLC1 AlphaFold 278 258 515 503 0.0003173
116 A0A0F8MXM7 AlphaFold 278 258 515 506 0.0003173
117 4moa PDB 303 293 516 592 0.0003259
118 A0A0J8GT16 AlphaFold 481 84 516 651 0.0003506
119 F9D772 AlphaFold 566 1 515 900 0.0003506
120 2c4x PDB 229 301 516 249 0.0003785
121 A0A1W6DMZ1 AlphaFold 453 64 516 630 0.0003874
122 A0A8B6GPV7 AlphaFold 597 1 515 736 0.0003874
123 A0A0F8DTV0 AlphaFold 278 258 515 506 0.0004072
124 A0A4D6MLR7 AlphaFold 366 150 515 350 0.0004072
125 E1IB39 AlphaFold 139 377 515 710 0.000428
126 A0A0E3RP74 AlphaFold 254 300 515 494 0.000473
127 A0A0F8SUG7 AlphaFold 254 300 515 503 0.000473
128 A0A1Z5TVH3 AlphaFold 256 266 515 306 0.000473
129 A0A7M7T2W0 AlphaFold 568 56 515 612 0.000473
130 7bys PDB 429 88 516 427 0.0004857
131 A0A2C6W4Z5 AlphaFold 129 390 515 187 0.0004972
132 A0A2D6RDT6 AlphaFold 477 84 516 681 0.0004972
133 7byx PDB 452 65 516 427 0.0005106
134 A0A2V5ZZU5 AlphaFold 106 413 516 174 0.0005226
135 A0A1Z4JTN0 AlphaFold 132 385 515 190 0.0005226
136 A0A0F8M8Y5 AlphaFold 278 258 515 506 0.0005226
137 7byv PDB 487 30 516 428 0.0005367
138 A0A235IIQ3 AlphaFold 129 390 515 196 0.0005493
139 A0A4V0HXS3 AlphaFold 131 400 515 397 0.0005493
140 2w1w PDB 115 405 516 128 0.0005642
141 K9WRI3 AlphaFold 133 390 515 228 0.0006069
142 4qb6 PDB 119 396 514 127 0.0006234
143 A0A1Z4H361 AlphaFold 132 385 515 190 0.000638
144 A0A4V4KSR2 AlphaFold 167 376 515 268 0.000638
145 A0A6L8L281 AlphaFold 161 382 516 177 0.000638
146 A0A0D2NF21 AlphaFold 199 318 516 277 0.000638
147 4p99 PDB 455 30 484 379 0.0006552
148 A0A4R7BMV1 AlphaFold 484 50 515 675 0.0006706
149 3c7o PDB 451 100 515 484 0.000724
150 6e57 PDB 462 55 516 399 0.000724
151 A0A4S9KBF7 AlphaFold 167 376 515 268 0.000741
152 A0A7C5FF89 AlphaFold 391 122 512 366 0.000741
153 A0A2C6VXZ5 AlphaFold 129 390 515 195 0.0007789
154 A0A3C1V7M9 AlphaFold 370 146 515 384 0.0007789
155 A0A4S9E2W9 AlphaFold 328 188 515 290 0.0008187
156 A0A2L2N240 AlphaFold 129 390 515 187 0.0008606
157 A0A3M7IHW7 AlphaFold 240 315 515 306 0.0008606
158 A0A553L4H9 AlphaFold 474 84 516 674 0.0008606
159 A0A818M9G0 AlphaFold 747 1 516 894 0.0008606
160 4qaw PDB 229 301 516 527 0.0008839
161 A0A1H0JPZ8 AlphaFold 271 269 515 498 0.0009046
162 A0A235J816 AlphaFold 129 390 515 187 0.0009509
163 A0A4S9W740 AlphaFold 167 376 515 268 0.0009996
164 A0A2V6JPU8 AlphaFold 178 380 515 179 0.0009996
165 A0A1Z1FA44 AlphaFold 213 300 512 515 0.0009996

Family Representative Sequence Viewer

Amino acid position: -

MKKQYIAAFACILAAACSKAELNEVDPGPSGESMTLTLPQTKVAVSGDSYQFELDDEITAIASNGSRAILKPNAASSGAHTATNYFTGTFDKPVTDGSTISFYYNAKSIADNGTATFEQNGDPWLVSTGNKFTRTEDRQISVTATLAAPENVRAIAVIFTDNEGIESFEFHAKDQSIKLGTFDGTSFSGNSTVSQNVLSHQSEGIEFMRSNIVYVPKDMEGGFWIKAIKGQQAMYKSYATKNPIENTKVAISSFVPAKVDIDVNISGFATSYSYYVANEGIEGISAKDVNKANNVSNDWMGEGKATYTISREGIPAALLTFDSFKLTVDGKEYTGGEATKAISVAAGNGHTTWGQKDIVATVVYKDLDDNEYTGTQTIVRHITGLPYAANPPKDSGDNAWSKSSWNVKLESSYVQLGAVTGTGEPSIKSPTFNMPAAVDITIKSDVKAEKYTFITEIKTIYKVYVNGIEVSSKQGSFSRTTLSANTSFSSGSNSLKCESSYKLAGPSVTLYSVSIL

HMM Consensus MKkiyialaalaalvgCskeelnevateekgesvsltidetrVaiegeeykfeagDkItlvseaGssatLtaeassgaatvrFtGefskpaeddtyyaiyynaksidaeGtvtfeQnGdpwlvavakavtresdeqisisaefkppnallavaVssagveslekaefkakdgsilastfdgtsfsgestvsgivlsaasgdeagffvslPadmegGfwikltdgnnvmyksYatktfientkvnveefvpakVdldvqisgfaTSYsYYaanegiegiakdvatansvandvigegkasctitksgIqsalisvasvgltvdGeeytadeatktisvnattghttWgqksikayVvyktkdGkeytstntltrhiTGLPYtanppknsgeagwkksgeniswknsylqlkestsstggsiispefnipsninvkvsvkasaysagvelkttltisvgsttvssesstkskettsenatltssknsikiestyatagpsvkvsslsik