Skip to content

ADATSK

ADATSK extends TSK with an adaptive softmin antecedent that stabilizes high-dimensional fuzzy inference while preserving first-order TSK consequents.

Reference

G. Xue, Q. Chang, J. Wang, K. Zhang and N. R. Pal, "An Adaptive Neuro-Fuzzy System With Integrated Feature Selection and Rule Extraction for High-Dimensional Classification Problems," in IEEE Transactions on Fuzzy Systems, vol. 31, no. 7, pp. 2167-2181, July 2023, doi: 10.1109/TFUZZ.2022.3220950

Mathematical Formulation

ADATSK extends TSK fuzzy inference by using an adaptive softmin antecedent (Ada-softmin) together with first-order linear consequents.

Antecedent

Each rule-term membership is typically computed with a Gaussian function:

\[ \mu_{r,d}(x_d)=\exp\left(-\frac{(x_d-c_{r,d})^2}{2\sigma_{r,d}^2}\right) \]

where \(c_{r,d}\) is the center and \(\sigma_{r,d}>0\) is the spread.

In highFIS, the paper-strict ADATSKClassifier path uses a dedicated ADATSKGaussianMF antecedent, with:

\[ \mu(x)=\exp\left(-\frac{(x-c)^2}{\sigma^2}\right) \]

and applies sigma=1 before training, yielding the paper's simplified form:

\[ \mu(x)=\exp\left(-(x-c)^2\right) \]

Adaptive Ada-softmin aggregation

ADATSK computes rule firing strengths with an adaptive softmin based on the minimum antecedent membership for each rule:

\[ \hat{q}_r = \left\lceil \frac{690}{\ln (\min_{d}\mu_{r,d}(x_d))} \right\rceil, \quad \hat{q}_r \in [-1000, -1] \]
\[ \phi_r = \left( \frac{1}{D} \sum_{d=1}^{D}\mu_{r,d}(x_d)^{\hat{q}_r} \right)^{1/\hat{q}_r} \]

The exponent \(\hat{q}_r\) is recomputed on every forward pass and is clamped for numerical stability, which avoids the fixed-parameter softmin problems of underflow and fake minimum.

Normalization

Rule firing strengths are normalized by simple sum normalization:

\[ \bar{\phi}_r = \frac{\phi_r}{\sum_{k=1}^{R} \phi_k} \]

Consequent

In the ADATSK paper, the base formulation is presented for classification, with first-order TSK consequents:

\[ \mathbf{y}_r = W_r \mathbf{x} + \mathbf{b}_r \]

Output aggregation

The final prediction is the normalized weighted sum of rule consequents:

For classification:

\[ \mathbf{y} = \sum_{r=1}^{R} \bar{\phi}_r \mathbf{y}_r \]

Code ↔ Paper Correspondence

Equation Class / Method Description
Adaptive softmin highfis.layers.AdaSoftminRuleLayer Computes per-rule softmin exponents from the minimum membership value
Normalization highfis.defuzzifiers.SumBasedDefuzzifier Standard sum-based rule strength normalization
Consequent ClassificationConsequentLayer First-order linear consequents for classification
Membership functions highfis.memberships.ADATSKGaussianMF Paper-style Gaussian antecedent used by ADATSKClassifier

Implementation notes

In highFIS, ADATSKClassifierModel implements the paper-aligned ADATSK core by replacing the standard product antecedent with the adaptive softmin operator.

Model classes

  • ADATSKClassifierModel uses highfis.layers.AdaSoftminRuleLayer to compute rule strengths.
  • The classifier consequent remains first-order linear and is normalized with highfis.defuzzifiers.SumBasedDefuzzifier.
  • ADATSKClassifierModel does not expose the feature-selection/ rule-extraction gates of FSRE-ADATSK.

Estimator wrappers

  • ADATSKClassifier is the paper-strict default wrapper.
  • Default settings follow the paper protocol: n_mfs=3, mf_init="grid", rule_base="coco", full-batch updates (batch_size=None), and no shuffling.
  • For grid initialization, MF centers are placed directly on [V_{\min}, V_{\max}] with no margin padding.
  • In the pre-train hook, Gaussian spreads are set to sigma=1 and frozen.
  • Consequent parameters are initialized to zero in the ADATSK classifier paper-strict path.
  • For high-dimensional inputs (default threshold 1000 features), antecedent parameters are frozen by default to match the paper's experimental protocol.

Membership functions

  • The primary antecedent MFs are highfis.memberships.ADATSKGaussianMF objects in the paper-strict classifier path.
  • Gaussian spreads are fixed at sigma=1 in the paper-strict ADATSK path, reproducing Eq. (3).

Training in the paper vs. highFIS

  • The paper trains ADATSK end-to-end using full-batch gradient descent and MSE-style classification error.
  • The paper-strict ADATSK default in highFIS uses nn.MSELoss() for the classifier and SGD-based full-batch optimization.
  • eps is used to clamp membership values and stabilize log-space computations in AdaSoftminRuleLayer.

  • FSRE-ADATSK is documented separately in docs/models/fsre-adatsk.md.

Framework extensions (outside paper-strict scope)

  • ADATSKRegressorModel and ADATSKRegressor are provided as framework extensions for regression workflows.
  • highfis.memberships.CompositeGaussianMF remains available as an engineering alternative for custom experiments.

Alignment with the paper

  • The paper's key ADATSK contribution is the adaptive softmin antecedent operator to avoid numeric underflow and fake minimum effects.
  • highFIS implements this via AdaSoftminRuleLayer with a per-rule exponent derived from the rule's minimum antecedent membership.
  • The default ADATSKClassifier now follows the paper protocol (CoCo rule base, Eq. (3)-style Gaussian antecedent with sigma=1 fixed, full-batch GD, MSE-style classification loss, and zero-initialized consequents).
  • Regression and alternative MF variants are treated as explicit framework extensions, not part of the strict ADATSK paper baseline.