Prokaryotic Protein Subcellular Localization Prediction (ProtLocPro)

ProtLocPro is a subcellular localization prediction tool for prokaryotic (Archaea / Gram-positive / Gram-negative) protein sequences. The model is built on protein language model features and combines multi-scale local pattern extraction with global long-range dependency modeling to perform 6-class localization prediction for input protein sequences, helping users quickly screen candidate localizations and support downstream functional annotation and experiment design.

1. Protein Sequences (up to 10 FASTA entries):

Parsed sequences: 0, total residues: 0



Supported Localization Classes (6)

  • Cytoplasmic
  • CytoplasmicMembrane
  • Extracellular
  • Periplasmic
  • OuterMembrane
  • Cellwall

Use Cases

  • Functional annotation of prokaryotic proteins
  • Candidate screening for secreted/membrane proteins
  • Early-stage prioritization for protein engineering and target discovery
  • Automated localization prediction module in omics projects

Model Performance Metrics

==================================================
   ---  Overall Test Set Performance Report ---
==================================================
  - Accuracy: 0.9111
  - Weighted F1: 0.9102
  - Macro F1: 0.8279
  - Matthews Correlation Coefficient (MCC): 0.8631

Per-Class MCC:
  - MCC for class 'Cytoplasmic': 0.9196
  - MCC for class 'Periplasmic': 0.7596
  - MCC for class 'CYtoplasmicMembrane': 0.8864
  - MCC for class 'Extracellular': 0.8029
  - MCC for class 'Cellwall': 0.6789
  - MCC for class 'OuterMembrane': 0.8069

Classification Report:
                     precision    recall  f1-score   support

        Cytoplasmic       0.95      0.98      0.96       507
        Periplasmic       0.74      0.80      0.77        46
CYtoplasmicMembrane       0.92      0.90      0.91       217
      Extracellular       0.84      0.81      0.82       104
           Cellwall       0.65      0.72      0.68        18
      OuterMembrane       0.92      0.73      0.81        75

           accuracy                           0.91       967
          macro avg       0.84      0.82      0.83       967
       weighted avg       0.91      0.91      0.91       967


Confusion Matrix:
                     Cytoplasmic  Periplasmic  CYtoplasmicMembrane  Extracellular  Cellwall  OuterMembrane
Cytoplasmic                  496            4                    3              1         0              3
Periplasmic                    3           37                    3              3         0              0
CYtoplasmicMembrane           11            3                  196              3         3              1
Extracellular                  8            4                    4             84         3              1
Cellwall                       1            0                    1              3        13              0
OuterMembrane                  5            2                    6              6         1             55
        

Last updated: 2026-04-01