
[Feature]: Generalize Prediction pipeline for Lightning CLI models#148

Draft
aditya0by0 wants to merge 11 commits into `dev` from `feature/general_pred_pipeline`

Conversation

@aditya0by0
Member

@aditya0by0 aditya0by0 commented Jan 30, 2026

Generalize prediction logic

Please merge the PRs below after this PR:

Related Discussion

Related bugs rectified in Lightning for the pipeline

Additional changes

@aditya0by0 aditya0by0 added the enhancement New feature or request label Jan 30, 2026
@aditya0by0 aditya0by0 requested a review from sfluegel05 February 3, 2026 10:12
@aditya0by0 aditya0by0 marked this pull request as ready for review February 3, 2026 10:12
@aditya0by0
Member Author

@sfluegel05,

Could you confirm our agreed approach for handling old checkpoint files which don't have classification labels stored in them?

  1. Update the code to handle legacy checkpoints: this requires adding logic to prediction.py and the chebifier repo to ingest external class files.

     • Concern: adds boilerplate and permanent complexity to handle a temporary issue.

  2. Patch the old checkpoints (preferred): use the one-time script below to inject labels into the existing files.

     • Benefit: keeps the codebase clean and ensures all checkpoints follow a standardized schema.

I'm in favor of Option 2 to avoid carrying technical debt in the prediction logic. Does this match your understanding?

For Option 2, I'm willing to add this script to the repo, along with a small README note about old checkpoints.


import sys

import torch


def add_class_labels_to_checkpoint(input_path: str, classes_file_path: str) -> None:
    """Inject class labels from a classes file into an existing checkpoint."""
    with open(classes_file_path, "r") as f:
        class_labels = [line.strip() for line in f if line.strip()]

    assert len(class_labels) > 0, "The classes file is empty."

    # 1. Load the checkpoint on CPU so no GPU is required.
    checkpoint = torch.load(
        input_path, map_location=torch.device("cpu"), weights_only=False
    )

    if "classification_labels" in checkpoint:
        print(
            "Warning: 'classification_labels' key already exists in the checkpoint "
            "and will be overwritten."
        )

    # 2. Add the labels under the standardized key.
    checkpoint["classification_labels"] = class_labels

    # 3. Save the modified checkpoint next to the original.
    output_path = input_path.replace(".ckpt", "_modified.ckpt")
    torch.save(checkpoint, output_path)
    print(f"Successfully added classification_labels and saved to {output_path}")


if __name__ == "__main__":
    if len(sys.argv) < 3:  # both arguments are required
        print("Usage: python modify_checkpoints.py <input_checkpoint> <classes_file>")
        sys.exit(1)

    input_ckpt = sys.argv[1]
    classes_file = sys.argv[2]

    add_class_labels_to_checkpoint(input_path=input_ckpt, classes_file_path=classes_file)



Copilot AI left a comment


Pull request overview

This PR introduces a new generalized prediction pipeline intended to work with LightningCLI-saved models/checkpoints, including persisting classification label names into checkpoints for consistent prediction output formatting.

Changes:

  • Add checkpoint persistence of classification_labels (derived from a dataset classes.txt) and wire the dataset path into model init via LightningCLI argument linking.
  • Introduce a new SMILES prediction entrypoint (chebai/result/prediction.py) that reconstructs model/datamodule from checkpoint hyperparameters.
  • Refactor XYBaseDataModule.predict_dataloader to build a prediction dataloader from an in-memory SMILES list, plus update docs/tests and add VS Code workspace files.

Reviewed changes

Copilot reviewed 11 out of 12 changed files in this pull request and generated 11 comments.

Show a summary per file
File Description
tests/unit/cli/testCLI.py Adjusts CLI unit test model args (smaller hidden layer).
tests/unit/cli/mock_dm.py Adds classes_txt_file_path for CLI linking in tests.
tests/unit/cli/classification_labels.txt Adds sample classification labels used by CLI unit tests.
chebai/trainer/CustomTrainer.py Removes prior bespoke prediction logic and overrides predict().
chebai/result/prediction.py Adds new prediction script/class for SMILES/file inference from checkpoint.
chebai/preprocessing/datasets/base.py Refactors prediction dataloader flow and adds classes_txt_file_path.
chebai/models/base.py Adds label-file loading + saving classification_labels into checkpoints.
chebai/cli.py Links data.classes_txt_file_path into model.init_args.classes_txt_file_path.
README.md Updates prediction instructions to use the new prediction script.
.vscode/settings.json Adds VS Code project settings (currently invalid JSON).
.vscode/extensions.json Adds recommended VS Code extensions.
.gitignore Stops ignoring the entire .vscode directory (only ignores launch.json).


Comment on lines +440 to +444
    def _process_input_for_prediction(
        self, smiles_list: list[str], model_hparams: Optional[dict] = None
    ) -> tuple[list, list]:
        """
        Process input data for prediction.

Copilot AI Feb 9, 2026


The new prediction preprocessing path (predict_dataloader returning (DataLoader, valid_indices) plus _process_input_for_prediction / _preprocess_smiles_for_pred) isn't covered by unit tests. Since XYBaseDataModule already has unit tests, adding coverage for valid-index scattering and the empty/all-filtered case would help prevent regressions.

Comment on lines 408 to +412
    def predict_dataloader(
-       self, *args, **kwargs
-   ) -> Union[DataLoader, List[DataLoader]]:
+       self,
+       smiles_list: List[str],
+       model_hparams: Optional[dict] = None,
+       **kwargs,

Copilot AI Feb 9, 2026


XYBaseDataModule.predict_dataloader now requires smiles_list (and returns a tuple), which is incompatible with PyTorch Lightning's expected predict_dataloader(self) signature. Any use of Trainer.predict(...) / LightningCLI predict will fail with a missing-argument error. Keep the Lightning-compatible predict_dataloader() signature and add a separate helper (e.g., predict_dataloader_from_smiles(...)) or make smiles_list optional and fall back to the previous behavior when it is None.
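A backward-compatible shape for this could look like the following sketch. It is a minimal stand-in, not the actual chebai API: `DataModuleSketch`, `predict_dataloader_from_smiles`, and the placeholder return values are all hypothetical, chosen only to illustrate the optional-argument fallback the review suggests.

```python
from typing import List, Optional


class DataModuleSketch:
    """Minimal stand-in (not the real XYBaseDataModule) illustrating a
    Lightning-compatible predict_dataloader with an opt-in SMILES path."""

    def _default_predict_dataloader(self):
        # Placeholder for the previous behaviour that Lightning expects
        # when it calls predict_dataloader() with no arguments.
        return "default-dataloader"

    def predict_dataloader_from_smiles(self, smiles_list, model_hparams=None):
        # Hypothetical helper: build a loader from an in-memory SMILES list
        # and report which input indices survived preprocessing.
        return ("smiles-dataloader", list(range(len(smiles_list))))

    def predict_dataloader(
        self,
        smiles_list: Optional[List[str]] = None,
        model_hparams: Optional[dict] = None,
        **kwargs,
    ):
        # Lightning invokes predict_dataloader(self) with no arguments, so
        # making smiles_list optional keeps Trainer.predict(...) working
        # while still allowing the new SMILES-based path.
        if smiles_list is None:
            return self._default_predict_dataloader()
        return self.predict_dataloader_from_smiles(smiles_list, model_hparams)
```

With this shape, `Trainer.predict(...)` keeps working unchanged, and the SMILES path is reached only when callers pass a list explicitly.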

Comment on lines +58 to +65
        self._model_hparams = ckpt_file["hyper_parameters"]
        self._model_hparams.pop("_instantiator", None)
        self._model_hparams.pop("classes_txt_file_path", None)
        self._model: ChebaiBaseNet = instantiate_module(
            ChebaiBaseNet, self._model_hparams
        )
        self._model.to(self.device)
        print("*" * 10, f"Loaded model class: {self._model.__class__.__name__}")

Copilot AI Feb 9, 2026


The predictor instantiates a fresh model via instantiate_module(...) but never loads the checkpoint weights (ckpt_file["state_dict"]) into it. As written, inference will run with randomly initialized weights. Load the state_dict into self._model (and consider strict=False with a clear warning if keys mismatch), or use Lightning's load_from_checkpoint(...) so weights are restored correctly.
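The missing step can be sketched as follows; a toy `nn.Linear` stands in for the reconstructed model, and the `ckpt_file` dict is a simulated checkpoint (the real code would apply `load_state_dict` to `self._model` with the actual `ckpt_file["state_dict"]`).

```python
import torch
from torch import nn

torch.manual_seed(0)

# Stand-in for the freshly instantiated model (random weights at this point).
model = nn.Linear(4, 2)

# Stand-in for a Lightning checkpoint: trained weights live under "state_dict".
ckpt_file = {"state_dict": nn.Linear(4, 2).state_dict()}

# Restore the trained weights; strict=False tolerates renamed/extra keys,
# but any mismatch should be surfaced loudly rather than silently ignored.
missing, unexpected = model.load_state_dict(ckpt_file["state_dict"], strict=False)
if missing or unexpected:
    print(f"Warning: state_dict key mismatch: {missing=} {unexpected=}")
```

Without this call the model would run inference with its random initialization, which is exactly the bug the review points out.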

        print("Below are the modules loaded from the checkpoint:")

        self._dm_hparams = ckpt_file["datamodule_hyper_parameters"]
        self._dm_hparams.pop("splits_file_path")

Copilot AI Feb 9, 2026


self._dm_hparams.pop("splits_file_path") will raise KeyError for checkpoints/datamodules that don't have this key. Use a default (pop(..., None)) or gate this removal on key presence so inference works across datamodules.

Suggested change
-        self._dm_hparams.pop("splits_file_path")
+        self._dm_hparams.pop("splits_file_path", None)

Comment on lines +80 to +82
        raise NotImplementedError(
            "CustomTrainer.predict is not implemented."
            "Use `Prediction.predict_from_file` or `Prediction.predict_smiles` from `chebai/result/prediction.py` instead."

Copilot AI Feb 9, 2026


CustomTrainer.predict() now unconditionally raises NotImplementedError, which breaks Lightning's predict workflow and the existing chebai CLI predict subcommand (it will always crash). If prediction is intentionally unsupported through the Trainer, remove/disable the CLI predict subcommand; otherwise implement predict() by delegating to super().predict(...) (or a supported prediction path). Also, the error message points to Prediction.* but the new class is named Predictor.

Suggested change
-        raise NotImplementedError(
-            "CustomTrainer.predict is not implemented."
-            "Use `Prediction.predict_from_file` or `Prediction.predict_smiles` from `chebai/result/prediction.py` instead."
-        )
+        """
+        Runs prediction using the underlying Lightning Trainer implementation.
+        This restores the default `Trainer.predict` behavior so that Lightning's
+        prediction workflow and the `chebai` CLI `predict` subcommand work as expected.
+        """
+        return super().predict(
+            model=model,
+            dataloaders=dataloaders,
+            datamodule=datamodule,
+            return_predictions=return_predictions,
+            ckpt_path=ckpt_path,
+        )
Comment on lines +440 to +457
    def _process_input_for_prediction(
        self, smiles_list: list[str], model_hparams: Optional[dict] = None
    ) -> tuple[list, list]:
        """
        Process input data for prediction.

        Args:
            smiles_list (List[str]): List of SMILES strings.
            model_hparams (Optional[dict]): Model hyperparameters.
                Some prediction pre-processing pipelines may require these.

        Returns:
            tuple[list, list]: Processed input data and valid indices.
        """
        data, valid_indices = [], []
        num_of_labels = int(model_hparams["out_dim"])
        self._dummy_labels: list = list(range(1, num_of_labels + 1))


Copilot AI Feb 9, 2026


_process_input_for_prediction treats model_hparams as optional, but then immediately indexes model_hparams["out_dim"]. If model_hparams is omitted/None this will crash. Either make model_hparams required in the public API (and type it accordingly) or add a clear validation/error early in predict_dataloader/_process_input_for_prediction.
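One way to fail early is a small up-front guard, sketched below. The function name `require_out_dim` and the error wording are hypothetical; the point is to validate before indexing so callers get an actionable message instead of a `TypeError`.

```python
from typing import Optional


def require_out_dim(model_hparams: Optional[dict]) -> int:
    """Hypothetical guard: validate model_hparams up front so callers get an
    actionable error instead of a TypeError deep in the preprocessing path."""
    if not model_hparams or "out_dim" not in model_hparams:
        raise ValueError(
            "model_hparams with an 'out_dim' entry is required to build "
            "dummy labels for prediction."
        )
    return int(model_hparams["out_dim"])
```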

Comment on lines +67 to +72
        self._classification_labels: list = ckpt_file.get("classification_labels")
        print(f"Loaded {len(self._classification_labels)} classification labels.")
        assert len(self._classification_labels) > 0, (
            "Classification labels list is empty."
        )
        assert len(self._classification_labels) == self._model.out_dim, (

Copilot AI Feb 9, 2026


ckpt_file.get("classification_labels") can return None; the next line calls len(self._classification_labels) which will raise TypeError. Handle the missing key explicitly (e.g., default to [] and raise a clear error if absent) so users get an actionable message when using older checkpoints.
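A guarded lookup along the lines suggested could look like this sketch (the helper name `load_classification_labels` is hypothetical, not part of the PR):

```python
def load_classification_labels(ckpt_file: dict) -> list:
    """Hypothetical guard for checkpoints that predate 'classification_labels'."""
    labels = ckpt_file.get("classification_labels") or []
    if not labels:
        raise ValueError(
            "Checkpoint contains no 'classification_labels'. Patch older "
            "checkpoints (e.g. with the one-time script in this PR) before "
            "running prediction."
        )
    return labels
```

This turns both the missing-key case (`None`) and the empty-list case into the same clear, user-facing error.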

aditya0by0 and others added 3 commits February 10, 2026 00:32
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@aditya0by0 aditya0by0 marked this pull request as draft February 17, 2026 12:52

Labels

enhancement New feature or request priority: high
