Merged
Commits
20 commits
6ce2dcd
Adding LuMamba to BioFoundation
danaebroustail Mar 31, 2026
a957956
added explicit hydra output_dir parameter in defaults.yaml
danaebroustail Apr 1, 2026
690bd65
LuMamba fine-tuning and pre-training fixes
danaebroustail Apr 1, 2026
ce991dc
Clarify classifier_option default in LuMamba_finetune config
danaebroustail Apr 1, 2026
e62a87f
Added LuMamba logo
danaebroustail Apr 1, 2026
8c41ba9
Created LuMamba model docs
danaebroustail Apr 1, 2026
d393022
LuMamba model docs edit
danaebroustail Apr 1, 2026
3c42310
Add png version of LuMamba logo
danaebroustail Apr 10, 2026
578f142
Added LuMamba details and updated authors list in README
danaebroustail Apr 12, 2026
aa755f2
Added data preprocessing code for APAVA and TDBrain
danaebroustail Apr 13, 2026
d139d0b
Removing extra comments and adding author list
danaebroustail Apr 13, 2026
52d642c
Added MoBI, Mumtaz and MODMA preprocessing scripts. Added make_datase…
danaebroustail Apr 14, 2026
f3d97f7
Update Hugging Face links in README.md
danaebroustail Apr 14, 2026
8c6a83c
Added author list information
danaebroustail Apr 14, 2026
bc5ea50
LuMamba model fixes
danaebroustail Apr 14, 2026
db2d0d4
Added normalization default for luMamba and LUNA
danaebroustail Apr 14, 2026
fe0afea
Fixed shape rearrangement for LeJEPA
danaebroustail Apr 14, 2026
9b0c810
Fixed content of mumtaz dataset
danaebroustail Apr 14, 2026
efcdce0
Added wandb as a requirement (wandb support with LuMamba)
danaebroustail Apr 14, 2026
89e2675
Removed None option for num_blocks
danaebroustail Apr 14, 2026
82 changes: 69 additions & 13 deletions README.md
@@ -10,23 +10,29 @@
<a href="https://arxiv.org/abs/2512.15729">
<img src="https://img.shields.io/badge/arXiv-2512.15729-b31b1b.svg" alt="TinyMyo Paper">
</a>
<a href="https://huggingface.co/thorir/FEMBA">
<a href="https://arxiv.org/abs/2603.19100">
<img src="https://img.shields.io/badge/arXiv-2603.19100-b31b1b.svg" alt="LuMamba Paper">
</a>
<a href="https://huggingface.co/PulpBio/FEMBA">
<img src="https://img.shields.io/badge/HuggingFace-FEMBA-%23ffcc4d?logo=huggingface&logoColor=black" alt="Hugging Face: FEMBA">
</a>
<a href="https://huggingface.co/thorir/LUNA">
<a href="https://huggingface.co/PulpBio/LUNA">
<img src="https://img.shields.io/badge/HuggingFace-LUNA-%23ffcc4d?logo=huggingface&logoColor=black" alt="Hugging Face: LUNA">
</a>
<a href="https://huggingface.co/PulpBio/TinyMyo">
<img src="https://img.shields.io/badge/HuggingFace-TinyMyo-%23ffcc4d?logo=huggingface&logoColor=black" alt="Hugging Face: TinyMyo">
</a>
<a href="https://huggingface.co/PulpBio/LuMamba">
<img src="https://img.shields.io/badge/HuggingFace-LuMamba-%23ffcc4d?logo=huggingface&logoColor=black" alt="Hugging Face: LuMamba">
</a>
<a href="https://github.com/pulp-bio/BioFoundation">
<img src="https://img.shields.io/github/stars/pulp-bio/BioFoundation?style=social" alt="GitHub Stars">
</a>
</p>

Copyright (C) 2025 ETH Zurich, Switzerland. SPDX-License-Identifier: Apache-2.0. See LICENSE file for details.

Authors: Thorir Mar Ingolfsson, Anna Tegon, Berkay Döner, Xiaying Wang, Matteo Fasulo, Yawei Li & Luca Benini.
Authors: Thorir Mar Ingolfsson, Anna Tegon, Berkay Döner, Xiaying Wang, Matteo Fasulo, Danaé Broustail, Yawei Li & Luca Benini.

## About

@@ -40,20 +40,22 @@ Looking for ready-to-use weights of models? We host them on Hugging Face:

### Currently available

- **FEMBA** ([paper](https://arxiv.org/abs/2502.06438)) [![HF Model Card](https://img.shields.io/badge/Model%20Card-FEMBA-ffcc4d?logo=huggingface&logoColor=black)](https://huggingface.co/thorir/FEMBA)
- **LUNA** ([paper](https://arxiv.org/abs/2510.22257)) [![HF Model Card](https://img.shields.io/badge/Model%20Card-LUNA-ffcc4d?logo=huggingface&logoColor=black)](https://huggingface.co/thorir/LUNA)
- **FEMBA** ([paper](https://arxiv.org/abs/2502.06438)) [![HF Model Card](https://img.shields.io/badge/Model%20Card-FEMBA-ffcc4d?logo=huggingface&logoColor=black)](https://huggingface.co/PulpBio/FEMBA)
- **LUNA** ([paper](https://arxiv.org/abs/2510.22257)) [![HF Model Card](https://img.shields.io/badge/Model%20Card-LUNA-ffcc4d?logo=huggingface&logoColor=black)](https://huggingface.co/PulpBio/LUNA)
- **TinyMyo** ([paper](https://arxiv.org/abs/2512.15729)) [![HF Model Card](https://img.shields.io/badge/Model%20Card-TinyMyo-ffcc4d?logo=huggingface&logoColor=black)](https://huggingface.co/PulpBio/TinyMyo)
- **LuMamba** ([paper](https://arxiv.org/abs/2603.19100)) [![HF Model Card](https://img.shields.io/badge/Model%20Card-LuMamba-ffcc4d?logo=huggingface&logoColor=black)](https://huggingface.co/PulpBio/LuMamba)

#### Why FEMBA?

- **Scales to long EEG** with linear-time Mamba (no quadratic attention).
- **Strong results** on TUAB/TUAR/TUSL with ready task-specific checkpoints.
- **Simple fine-tune path:** set `CHECKPOINT_DIR`, run `+experiment=FEMBA_finetune`.

**➡️ Model hub:** <https://huggingface.co/thorir/FEMBA>
**📄 Model card:** [FEMBA on Hugging Face](https://huggingface.co/thorir/FEMBA) — benchmarks, protocols, and efficiency notes.
**➡️ Model hub:** <https://huggingface.co/PulpBio/FEMBA>
**📄 Model card:** [FEMBA on Hugging Face](https://huggingface.co/PulpBio/FEMBA) — benchmarks, protocols, and efficiency notes.
**📜 Weights license:** CC BY-ND 4.0 (use + redistribute **unmodified** weights with attribution; no redistribution of **modified** weights)
**🧑‍🍳 PR-gated improvements:** If you fine-tune internally and want your variant to become an **official** FEMBA release, open a PR with configs, logs, and evals. We’ll review together; if it looks good, we’ll retrain/validate and publish an **official** FEMBA checkpoint.

**What you’ll find on the hub**

- `TUAB/` → abnormal EEG (base/large)
@@ -70,7 +78,7 @@ pip install huggingface_hub
from huggingface_hub import snapshot_download

# downloads all task folders (TUAB/TUAR/TUSL) and safetensors into ./checkpoints/FEMBA
snapshot_download(repo_id="thorir/FEMBA", repo_type="model", local_dir="checkpoints/FEMBA")
snapshot_download(repo_id="PulpBio/FEMBA", repo_type="model", local_dir="checkpoints/FEMBA")
```

Use the paths directly in your runs, e.g.:
@@ -88,8 +96,8 @@ python -u run_train.py +experiment=FEMBA_finetune
- **Pretrained on >21k hours** (TUEG + Siena) with masked-patch reconstruction; strong transfer across datasets/montages.
- **Simple fine-tune path:** pick model size with `LUNA_{base,large,huge}.yaml`, set `pretrained_safetensors_path`, run `+experiment=LUNA_finetune`.

**➡️ Model hub:** <https://huggingface.co/thorir/LUNA>
**📄 Model card:** [LUNA on Hugging Face](https://huggingface.co/thorir/LUNA) — variants, configs, and fine-tuning walkthrough.
**➡️ Model hub:** <https://huggingface.co/PulpBio/LUNA>
**📄 Model card:** [LUNA on Hugging Face](https://huggingface.co/PulpBio/LUNA) — variants, configs, and fine-tuning walkthrough.
**📜 Weights license:** CC BY-ND 4.0 (use + redistribute **unmodified** weights with attribution; no redistribution of **modified** weights)
**🧑‍🍳 PR-gated improvements:** If you fine-tune internally and want your variant to become an **official** LUNA release, open a PR with configs, logs, and evals. We’ll review; if it looks good, we’ll retrain/validate and publish an **official** LUNA checkpoint.

@@ -108,7 +116,7 @@ pip install huggingface_hub
from huggingface_hub import snapshot_download

# downloads LUNA folders and .safetensors into ./checkpoints/LUNA
snapshot_download(repo_id="thorir/LUNA", repo_type="model", local_dir="checkpoints/LUNA")
snapshot_download(repo_id="PulpBio/LUNA", repo_type="model", local_dir="checkpoints/LUNA")
```

Use the paths directly in your runs, e.g.:
@@ -180,6 +188,45 @@ python -u run_train.py +experiment=TinyMyo_finetune \
- **Generic Neuromotor Interface**
- Codebase: [MatteoFasulo/generic-neuromotor-interface](https://github.com/MatteoFasulo/generic-neuromotor-interface)

#### Why LuMamba?

- **Merges topology-invariant** EEG representations with **Mamba-based** temporal modeling, jointly achieving **linear-time efficiency** and **channel invariance**.
- **Pretrained on >21k hours** (TUEG) with **LeJEPA** (Balestriero and LeCun, 2025), a recent joint-embedding predictive approach, adapted to biosignals in this repository to enhance cross-montage robustness.

**➡️ Model hub:** <https://huggingface.co/PulpBio/LuMamba>
**📄 Model card:** [LuMamba on Hugging Face](https://huggingface.co/PulpBio/LuMamba) — variants, configs, and fine-tuning walkthrough.
**📜 Weights license:** CC BY-ND 4.0 (use + redistribute **unmodified** weights with attribution; no redistribution of **modified** weights)
**🧑‍🍳 PR-gated improvements:** If you fine-tune internally and want your variant to become an **official** LuMamba release, open a PR with configs, logs, and evals. We’ll review; if it looks good, we’ll retrain/validate and publish an **official** LuMamba checkpoint.

**What you’ll find on the hub**

- `LeJEPA-only`, `Reconstruction-only`, `Mixed LeJEPA-Reconstruction` pre-trained checkpoints → Pre-training design variants.
- Instructions to get started on fine-tuning experiments.

Quick download with `huggingface_hub`:

```bash
pip install huggingface_hub
```

```python
from huggingface_hub import snapshot_download

# downloads all pre-trained variants and safetensors into ./checkpoints/LuMamba
snapshot_download(repo_id="PulpBio/LuMamba", repo_type="model", local_dir="checkpoints/LuMamba")
```
Pass the safetensors checkpoint path and launch fine-tuning from the command line:
```bash
python -u run_train.py +experiment=LuMamba_finetune \
pretrained_safetensors_path=/absolute/path/to/checkpoints/LuMamba/LuMamba.safetensors
```

Tips:
- Data preprocessing scripts for various downstream datasets are provided in `/make_datasets`; see `make_datasets/README.md` for instructions on getting started.
- Adapt the configuration file `config/experiment/LuMamba_finetune.yaml` to your task: set the dataset paths, the classification type (`mcc` for multi-class classification and regression, `bc` for binary classification) together with `model.num_classes`, the I/O settings, trainer parameters, etc.
- Ensure `data_module:train/test/val` are initialized with a compatible dataset class.
- The configuration file includes `#CHANGEME` tags and further instructions for a working example.
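
The config ships with `#CHANGEME` placeholder tags that must all be filled in before a run will work. A tiny hypothetical pre-flight helper (not part of the repo) that flags any leftover placeholders could look like this:

```python
# Hypothetical pre-flight check (illustration only, not part of the repo):
# scan a config's text for leftover "#CHANGEME" placeholders before launching.
def find_changeme_lines(config_text: str) -> list[int]:
    """Return 1-based line numbers that still contain a #CHANGEME tag."""
    return [i for i, line in enumerate(config_text.splitlines(), start=1)
            if "#CHANGEME" in line]

# Demo on a line copied from the shipped config:
sample = "hdf5_file: '#CHANGEME'  # path to finetuning dataset training split"
print(find_changeme_lines(sample))  # → [1]
```

Running it over `config/experiment/LuMamba_finetune.yaml` before launching catches forgotten placeholders earlier than a Hydra instantiation error would.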

## Features

- **Modular Design**: The repository is organized into modules for data loading, models, training tasks, and more, making it easy to extend and adapt for new research projects.
@@ -264,7 +311,7 @@ python -u run_train.py +experiment=FEMBA_finetune
```

> **Tip:** Pretrained FEMBA weights (TUAB/TUAR/TUSL folders) are available on 🤗 Hugging Face:
> <https://huggingface.co/thorir/FEMBA>
> <https://huggingface.co/PulpBio/FEMBA>
> Set `CHECKPOINT_DIR` to the desired `.safetensors` (e.g., `.../TUAR/base.safetensors`) before launching.

Note that in both cases you must make sure the dataset used by the specific experiment is downloaded and available at the correct path.
@@ -399,10 +446,19 @@ If you find this work useful, please cite the respective papers:
primaryClass={eess.SP},
url={https://arxiv.org/abs/2512.15729},
}
@misc{broustail2026lumambalatentunifiedmamba,
title={LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling},
author={Danaé Broustail and Anna Tegon and Thorir Mar Ingolfsson and Yawei Li and Luca Benini},
year={2026},
eprint={2603.19100},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2603.19100},
}
```

## License

This project is licensed under the Apache License 2.0. See the [LICENSE](./LICENSE) file for details.

**Note on model weights:** Pretrained weights are hosted at <https://huggingface.co/thorir/FEMBA>, <https://huggingface.co/thorir/LUNA>, and <https://huggingface.co/PulpBio/TinyMyo> and licensed under **CC BY-ND 4.0**. You may use and redistribute the **unmodified** weights with attribution. Redistribution of **modified** weights is not permitted. To upstream improvements, please open a PR; accepted changes will be released as **official** checkpoints.
**Note on model weights:** Pretrained weights are hosted at <https://huggingface.co/PulpBio/FEMBA>, <https://huggingface.co/PulpBio/LUNA>, <https://huggingface.co/PulpBio/TinyMyo>, and <https://huggingface.co/PulpBio/LuMamba> and licensed under **CC BY-ND 4.0**. You may use and redistribute the **unmodified** weights with attribution. Redistribution of **modified** weights is not permitted. To upstream improvements, please open a PR; accepted changes will be released as **official** checkpoints.
4 changes: 4 additions & 0 deletions config/defaults.yaml
@@ -52,6 +52,10 @@ io:
version: 0
base_output_path: "#CHANGEME"

hydra:
run:
dir: ${env:CHECKPOINT_DIR}/outputs/${now:%Y-%m-%d}/${now:%H-%M-%S}

trainer:
num_nodes: ${num_nodes}
devices: ${gpus}
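
The added `hydra.run.dir` default resolves `${env:CHECKPOINT_DIR}` at launch time, so the variable must be exported beforehand. A quick shell sketch of the path Hydra will produce (the export value below is a placeholder):

```shell
# CHECKPOINT_DIR must be set before launching; Hydra resolves ${env:CHECKPOINT_DIR}
# (the path below is a placeholder — use your own scratch location)
export CHECKPOINT_DIR=/tmp/biofoundation_ckpt

# Mirror the interpolation in defaults.yaml: ${now:%Y-%m-%d}/${now:%H-%M-%S}
run_dir="$CHECKPOINT_DIR/outputs/$(date +%Y-%m-%d)/$(date +%H-%M-%S)"
echo "$run_dir"
```

If `CHECKPOINT_DIR` is unset, Hydra's `env` resolver raises an error at startup, so exporting it is a hard prerequisite for every run.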
116 changes: 116 additions & 0 deletions config/experiment/LuMamba_finetune.yaml
@@ -0,0 +1,116 @@
# @package _global_
#*----------------------------------------------------------------------------*
#* Copyright (C) 2025 ETH Zurich, Switzerland *
#* SPDX-License-Identifier: Apache-2.0 *
#* *
#* Licensed under the Apache License, Version 2.0 (the "License"); *
#* you may not use this file except in compliance with the License. *
#* You may obtain a copy of the License at *
#* *
#* http://www.apache.org/licenses/LICENSE-2.0 *
#* *
#* Unless required by applicable law or agreed to in writing, software *
#* distributed under the License is distributed on an "AS IS" BASIS, *
#* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. *
#* See the License for the specific language governing permissions and *
#* limitations under the License. *
#* *
#* Author: Danaé Broustail *
#* Author: Thorir Mar Ingolfsson *
#*----------------------------------------------------------------------------*

tag: LuMamba_finetune
model_size: tiny
gpus: 1
num_nodes: 1
num_workers: 32
batch_size: 512

training: True
final_validate: True
final_test: True
finetune_pretrained: True
resume: False

layerwise_lr_decay: 0.75
scheduler_type: cosine
classification_type: "bc"
# Change for different tasks: "mcc" for multi-class classification and regression tasks, "bc" for binary classification.

pretrained_checkpoint_path: null
pretrained_safetensors_path: null

callbacks:
progress_bar:
_target_: 'pytorch_lightning.callbacks.TQDMProgressBar'
refresh_rate: 50
early_stopping:
_target_: 'pytorch_lightning.callbacks.EarlyStopping'
monitor: 'val_loss'
patience: 15
mode: 'min'
verbose: True


input_normalization:
normalize: True

finetuning:
freeze_layers: False

io:
checkpoint_dirpath: ${env:CHECKPOINT_DIR}/checkpoints
version: ${tag}_${model_size}_finetune
base_output_path: "${env:CHECKPOINT_DIR}/logs"

defaults:
- override /data_module: finetune_data_module
- override /model: LuMamba_tiny
- override /scheduler: cosine
- override /task: "#CHANGEME" # override with "finetune_task_LUNA" for classification and "finetune_regression_task_LuMamba" for regression tasks
- override /criterion: finetune_criterion

# Change _target_ to corresponding dataset wrapper for finetuning dataset
data_module:
train:
_target_: 'datasets.#CHANGEME.#CHANGEME'
hdf5_file: '#CHANGEME' # path to finetuning dataset training split
finetune: True
# channel_subset: False # applicable to MoBI and MODMA datasets; set to True to use a subset of the channels
val:
_target_: 'datasets.#CHANGEME.#CHANGEME'
hdf5_file: '#CHANGEME' # path to finetuning dataset validation split
finetune: True
# channel_subset: False # applicable to MoBI and MODMA datasets; set to True to use a subset of the channels
test:
_target_: 'datasets.#CHANGEME.#CHANGEME'
hdf5_file: '#CHANGEME' # path to finetuning dataset test split
finetune: True
# channel_subset: False # applicable to MoBI and MODMA datasets; set to True to use a subset of the channels

model:
num_classes: 2 # Change for corresponding finetuning dataset
classification_type: ${classification_type} # Options: "mcc" for multi-class classification and regression, "bc" for binary classification.
classifier_option: null # Options: "mamba", "linear"; `null` defaults to the LUNA classifier.

trainer:
enable_model_summary: True
accelerator: gpu
num_nodes: ${num_nodes}
devices: ${gpus}
strategy: ddp
max_epochs: 50
precision: bf16-mixed

model_checkpoint:
save_last: True
monitor: "val_loss"
mode: "min"
save_top_k: 1
every_n_epochs: 1

optimizer:
optim: 'AdamW'
lr: 5.0e-4
betas: [0.9, 0.999]
weight_decay: 0.05
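
The config comments above describe a contract between `classification_type` and `model.num_classes` (`bc` is binary; `mcc` covers multi-class classification and regression). A hypothetical sanity check (illustration only) of that contract:

```python
# Hypothetical sanity check (not part of the repo) for the contract described
# in LuMamba_finetune.yaml: "bc" = binary classification, "mcc" = multi-class
# classification or regression.
def check_task_config(classification_type: str, num_classes: int) -> None:
    if classification_type not in ("bc", "mcc"):
        raise ValueError(f"unknown classification_type: {classification_type!r}")
    if num_classes < 1:
        raise ValueError("num_classes must be positive")
    if classification_type == "bc" and num_classes != 2:
        raise ValueError("binary classification ('bc') expects num_classes=2")

# Matches the shipped LuMamba_finetune defaults; raises nothing:
check_task_config("bc", 2)
```

Running such a check before instantiating the model surfaces a mismatched `classification_type`/`num_classes` pair immediately, rather than as a shape error mid-training.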