Skip to content

Commit fc5e681

Browse files
Create README.md
1 parent 7fa97e9 commit fc5e681

File tree

1 file changed

+28
-0
lines changed

1 file changed

+28
-0
lines changed

README.md

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
# Fraudulent Transaction Detection Using Decision Tree
2+
3+
This project focuses on implementing various tasks related to decision tree classification using the `scikit-learn` library in Python. The goal is to process and analyze a dataset, train decision tree models, and optimize their performance. We will build a model that can detect fraudulent transactions. The steps involve:
4+
5+
1. **Reading and filtering the data**
6+
2. **Splitting the data into training and testing groups**
7+
3. **Training a decision tree**
8+
9+
After the decision tree is trained, we will prune it by increasing the cost complexity parameter, which helps optimize the model’s complexity. Different depths of the tree will be tested and compared, and while deeper trees may capture more complex patterns, they don’t always produce more accurate models.
10+
11+
### Dataset Overview
12+
13+
The dataset contains a series of transactions and associated features. Each row corresponds to a transaction, and the columns include the following:
14+
15+
- **Transaction Amount**: Amount involved in the transaction.
16+
- **Time Of Day**: The time of day when the transaction occurred.
17+
- **Transaction Frequency**: How many times a transaction happened in a row.
18+
- **Day Of Week**: The day of the week when the transaction took place.
19+
- **Labels**: Binary labels where 1 indicates a fraudulent transaction, and 0 indicates a legitimate one.
20+
21+
### Features Used in the Decision Tree Model
22+
23+
- **Transaction Amount**: The amount involved in the transaction.
24+
- **Time Of Day**: The time at which the transaction occurred.
25+
- **Day Of Week**: The day on which the transaction occurred.
26+
- **Transaction Frequency**: Frequency of transactions within a period.
27+
28+
These features are input into the decision tree model, which outputs whether the transaction is fraudulent or not.

0 commit comments

Comments
 (0)