Mining Security Indicators

This repository contains the replication package for the extended abstract Where do developers admit their security-related concerns?. The work was conducted by Moritz Mock, Thomas Forrer, and Barbara Russo.

Link to preprint: doi.org/10.48550/arXiv.2405.10902

Link to publication (open access): https://doi.org/10.1007/978-3-031-72781-8_21

Abstract

Developers use different means to document the security concerns of their code. Because of all of these opportunities, they may forget where the information is stored, or others may not be aware of it, and leave it unmaintained for so long that it becomes obsolete, if not useless. In this work, we analyzed different sources of code documentation from four large-scale, real-world, open-source projects in an industrial setting to understand where developers report their security concerns. In particular, we manually inspected 2.559 instances taken from code comments, commit messages, and issue trackers. Overall, we found that developers prefer to document security concerns in source code comments and issue trackers. We also found that the longer the comments stay unfixed, the more likely they remain unfixed. Thus, to create awareness among developers, we implemented a pipeline to remind them about the introduction or removal of comments pointing to a security problem.

Installation

pip install -r req.txt

Commands to execute the code

Traversing commit messages

python traverse_commits.py --path <project> --path_logs <folder_for_the_logs>

Traverse code for comments

Traverses the most recent code for security patterns.

python traverse_code_for_comments.py --path <project> --path_logs <folder_for_the_logs>

Traverse code history for comments

Traverses the complete history of a project for security or MAT patterns.

To prepare the data for the visualization, run the second script.

python traverse_history_for_comments.py --path <project> --path_logs <folder_for_the_logs> --MAT <TRUE/FALSE>
python eval_historical_data.py --path <project> --path_logs <folder_for_the_logs>

Extract tags of a project

python extract_tags_of_project.py --path <project> --path_logs <folder_for_the_logs>

Traverse code tags for comments

Traverse the complete tags of a project for security or MAT patterns.

To prepare the data for the visualization, run the second script.

python traverse_tags_for_comments.py --path <project> --path_logs <folder_for_the_logs> --MAT <TRUE/FALSE>
python eval_historical_data.py --path <project> --path_logs <folder_for_the_logs>
python include_information_about_the_number_of_tags.py --path <project> --path_logs <folder_for_the_logs>

How to cite the work

Preprint:

@misc{mock2024developers,
      title={Where do developers admit their security-related concerns?}, 
      author={Moritz Mock and Thomas Forrer and Barbara Russo},
      year={2024},
      eprint={2405.10902},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      doi={10.48550/arXiv.2405.10902}
}

Preprint:

@InProceedings{10.1007/978-3-031-72781-8_21,
      author="Mock, Moritz
      and Forrer, Thomas
      and Russo, Barbara",
      editor="Marchesi, Lodovica
      and Goldman, Alfredo
      and Lunesu, Maria Ilaria
      and Przyby{\l}ek, Adam
      and Aguiar, Ademar
      and Morgan, Lorraine
      and Wang, Xiaofeng
      and Pinna, Andrea",
      title="Where Do Developers Admit their Security-Related Concerns?",
      booktitle="Agile Processes in Software Engineering and Extreme Programming -- Workshops",
      year="2025",
      publisher="Springer Nature Switzerland",
      address="Cham",
      pages="189--195",
      abstract="Developers use different means to document the security concerns of their code. Because of all of these opportunities, they may forget where the information is stored, or others may not be aware of it, and leave it unmaintained for so long that it becomes obsolete, if not useless. In this work, we analyzed different sources of code documentation from four large-scale, real-world, open-source projects in an industrial setting to understand where developers report their security concerns. In particular, we manually inspected 2.559 instances taken from source code comments, commit messages, and issue trackers. Overall, we found that developers prefer to document security concerns in source code comments and issue trackers. We also found that the longer the comments stay unfixed, the more likely they remain unfixed. Thus, to create awareness among developers, we implemented a pipeline to remind them about the introduction or removal of comments pointing to a security problem.",
      isbn="978-3-031-72781-8"
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
plots		plots
.gitignore		.gitignore
GLPI_eval_duration_comments_stayed.png		GLPI_eval_duration_comments_stayed.png
eval_commit_messages_data.py		eval_commit_messages_data.py
eval_historical_data.py		eval_historical_data.py
eval_issue_tracker_data.py		eval_issue_tracker_data.py
extract_tags_of_project.py		extract_tags_of_project.py
include_information_about_the_number_of_tags.py		include_information_about_the_number_of_tags.py
readme.md		readme.md
req.txt		req.txt
traverse_code_for_comments.py		traverse_code_for_comments.py
traverse_commits.py		traverse_commits.py
traverse_github_issues.py		traverse_github_issues.py
traverse_history_for_comments.py		traverse_history_for_comments.py
traverse_jira_issues.py		traverse_jira_issues.py
traverse_tags_for_comments.py		traverse_tags_for_comments.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mining Security Indicators

Abstract

Installation

Commands to execute the code

Traversing commit messages

Traverse code for comments

Traverse code history for comments

Extract tags of a project

Traverse code tags for comments

How to cite the work

Preprint:

Preprint:

About

Languages

moritzmock/MiningSecurityIndicators

Folders and files

Latest commit

History

Repository files navigation

Mining Security Indicators

Abstract

Installation

Commands to execute the code

Traversing commit messages

Traverse code for comments

Traverse code history for comments

Extract tags of a project

Traverse code tags for comments

How to cite the work

Preprint:

Preprint:

About

Resources

Stars

Watchers

Forks

Languages