Skip to content

Conversation

@ziadhany
Copy link
Collaborator

@ziadhany ziadhany commented Aug 30, 2025

I created an initial script to parse Git commit messages that can be easily integrated with our model. The script takes a Git repository as input, parses all commits, and returns the CVEs along with their corresponding fixed commits.

Issues:

results:
vulnerabilities_packagecommitpatch2.zip


def clone(self):
"""Clone the repository."""
self.repo_url = "https://github.com/torvalds/linux"
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This part should not be static

Copy link
Member

@keshav-space keshav-space left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @ziadhany, see some suggestions.

@ziadhany
Copy link
Collaborator Author

ziadhany commented Jan 31, 2026

This is the list of repositories I think we should collect and parse Git messages from:

Ignored:

Copy link
Member

@keshav-space keshav-space left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ziadhany thanks, see some feedback below.

Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Add a test for CollectRepoFixCommitPipeline

Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Signed-off-by: ziad hany <ziadhany2016@gmail.com>
…ct fixed_by_commit_patches correctly.

Signed-off-by: ziad hany <ziadhany2016@gmail.com>
… multiple repositories

Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Remove CollectOpensslFixCommits Pipeline

Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Signed-off-by: ziad hany <ziadhany2016@gmail.com>
Copy link
Member

@keshav-space keshav-space left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @ziadhany, LGTM! Do we have an estimate of how long it takes to run CollectLinuxFixCommitsPipeline?

@ziadhany
Copy link
Collaborator Author

ziadhany commented Feb 12, 2026

Thanks @ziadhany, LGTM! Do we have an estimate of how long it takes to run CollectLinuxFixCommitsPipeline?

The Linux kernel pipeline takes around 5 minutes if the network call is ignored.

@ziadhany ziadhany merged commit 7975a38 into aboutcode-org:main Feb 12, 2026
4 of 5 checks passed
@ziadhany ziadhany deleted the parsing-commit branch February 12, 2026 12:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants