This program is a program that analyzes the similarity of text, document, and code files. The currently supported file formats are as follows.
- Documents (Only image check is supported, under development)
docx,pdf
- Code (Under development)
c, cpp, h, hpp, py, java, mat, m, cs, asm, js, v, vhd, vhdl, r
- Text (Under development)
txt, csv, json, xml, html, css, yml, yaml
src/
├── common.py
├── config.py
├── main.py
tools/
├── compare/
│ ├── code.py
│ ├── image.py
│ ├── text.py
├── extract/
│ ├── parse_files.py
├── transform.py
README.md
requirements.txt
- (Optional) Create a virtual environment.
python -m venv VIRTUAL_ENV_NAME
source VIRTUAL_ENV_NAME/bin/activate
- Install the required packages.
pip install -r requirements.txt
- Place the files you want to compare (ex. source code, document, text) in the
submission
directory.
submission/
├── studentname/
│ ├── code1.cpp
│ ├── code2.cpp
│ ├── report.docx
│ ├── report.pdf
├── couldbeanyname/
│ ├── code1.cpp
│ ├── code2.java
│ ├── code3.py
│ ├── report.pdf
- (Optional) If you have assignments that previously submitted by students, place them in the
reference
directory.
reference/
├── prev_studentname/
│ ├── code1.cpp
│ ├── code2.cpp
│ ├── report.docx
│ ├── report.pdf
├── prev_studentname2/
│ ├── code1.cpp
│ ├── code2.java
│ ├── code3.py
│ ├── report.pdf
- Run the program.
python src/main.py
- The results will be saved in the
result
directory.
result/
├── result.csv
- The program is currently under development and may not work properly.
- This program is intended to be used as a tool to assist in the detection of plagiarism. It is not a substitute for human judgment, and it is not a guarantee of plagiarism. It is the responsibility of the user to verify the results and determine whether plagiarism has occurred. The authors of this program are not responsible for any consequences that may arise from the use of this program.
- Improve image comparison efficiency
- Speed up image comparison by image compression
- Compute similarity using block matrix inner product
- Implement document comparison
- Implement code comparison
- Implement text comparison
This project is licensed under the MIT License - see the LICENSE file for details.
Contributions are welcome! Please feel free to submit issues, feature requests, or pull requests. If you have any suggestions for improvement or new features, please let us know.
If you have any questions or need further assistance, please contact us at the following:
- Email: jwlee@linux.com
- Issue Tracker: GitHub Issues
- Discussion Forum: GitHub Discussions
- Jangsoo Park - Initial work
- Jiwoon Lee - Initial work, Maintainer
- Jihyun Ha - Beta tester