Multi-label-classification

step 1: run the file "tf_idf_and_output.py". It takes two input files, input_genre2_formated.csv and movie_data_formated.csv. The separator in our case is "^" for these two input files and "," for the rest of the files. input_genre2_formated.csv holds 10892 movies, each with only 2 labels as output. movie_data_formated.csv is the actual file, with 44000 movies that have a variable number of genres. We used movie_data_formated.csv to find all genres and picked only those genres that have a frequency of more than 1000.
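The genre selection in step 1 can be sketched roughly as follows. This is a minimal illustration, not the code in "tf_idf_and_output.py": the row layout (title ^ synopsis ^ genre list), the "|" separator between genres, and the function name `frequent_genres` are assumptions made for the example. Only the "^" field separator and the idea of a frequency threshold come from the description above.

```python
import csv
import io
from collections import Counter


def frequent_genres(lines, min_count, sep="^", genre_sep="|"):
    """Return the genres that occur more than min_count times, sorted by name.

    Assumed (hypothetical) row layout: title ^ synopsis ^ genre list.
    """
    counts = Counter()
    for row in csv.reader(lines, delimiter=sep):
        if len(row) < 3:
            continue  # skip malformed or empty rows
        for genre in row[2].split(genre_sep):
            genre = genre.strip()
            if genre:
                counts[genre] += 1
    return sorted(g for g, n in counts.items() if n > min_count)


# Tiny in-memory stand-in for movie_data_formated.csv (made-up rows).
sample = io.StringIO(
    "Movie A^A heist goes wrong^Crime|Drama\n"
    "Movie B^Two friends on a road trip^Comedy|Drama\n"
    "Movie C^A detective hunts a killer^Crime|Thriller\n"
)
selected = frequent_genres(sample, min_count=1)
print(selected)
```

On the real data one would pass an open file handle instead of the in-memory sample and set `min_count=1000`, adjusting the column index to the actual layout of the file.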

step 2: running "tf_idf_and_output.py" gives you the normalized TF-IDF of the movies, based on a bag of words, in tf_idf_csv_normalized.csv, together with output_genre2_normalized.csv. It is better to try it on a small data size first, e.g. 1000.
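For readers unfamiliar with the term, a normalized bag-of-words TF-IDF can be sketched as below. The exact weighting used by "tf_idf_and_output.py" is not stated here, so this sketch assumes one common variant: term frequency divided by document length, idf = log(N / document frequency), and L2 normalization of each row. The function name `tf_idf_normalized` and the sample synopses are made up for the example.

```python
import math
from collections import Counter


def tf_idf_normalized(docs):
    """Return (vocabulary, rows); each row is an L2-normalized TF-IDF vector."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({word for tokens in tokenized for word in tokens})
    n_docs = len(docs)
    # Document frequency: in how many documents each word appears.
    doc_freq = Counter(word for tokens in tokenized for word in set(tokens))

    rows = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        length = len(tokens) or 1  # avoid division by zero for empty documents
        vec = [
            (term_freq[word] / length) * math.log(n_docs / doc_freq[word])
            for word in vocab
        ]
        norm = math.sqrt(sum(v * v for v in vec))
        rows.append([v / norm if norm else 0.0 for v in vec])
    return vocab, rows


synopses = ["a heist goes wrong", "a road trip", "a detective story"]
vocab, rows = tf_idf_normalized(synopses)
for synopsis, row in zip(synopses, rows):
    print(synopsis, [round(v, 3) for v in row])
```

A word that appears in every document (here "a") gets weight 0, and every non-empty row has unit length, which keeps long and short synopses comparable.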

step 3: now run "our_model.py". It takes "tf_idf_csv_normalized.csv" to build the training and testing data and "output_genre2_normalized.csv" for the label data. It creates 3 files: the actual output, the predicted output, and the actual and predicted output together, separated by "_". The combined file is used to create the confusion matrix and the accuracy measures.
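The combined file from step 3 might look like the output of the sketch below. The exact layout written by "our_model.py" is not documented here, so this assumes one cell per label in the form actual_predicted, with 0/1 label values and one row per movie. The function name `write_actual_and_predicted` and the sample vectors are hypothetical.

```python
import csv
import io


def write_actual_and_predicted(actual, predicted, out):
    """Write one row per movie, each cell being 'actual_predicted' for one label."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same number of rows")
    writer = csv.writer(out, lineterminator="\n")
    for actual_row, predicted_row in zip(actual, predicted):
        writer.writerow([f"{a}_{p}" for a, p in zip(actual_row, predicted_row)])


# Made-up label vectors for two movies and three genres.
actual = [[1, 0, 1], [0, 1, 0]]
predicted = [[1, 0, 0], [0, 1, 1]]

buffer = io.StringIO()
write_actual_and_predicted(actual, predicted, buffer)
print(buffer.getvalue())
```

Keeping both values in one cell makes it easy to count true positives, false positives and false negatives per label in a single pass over the file.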

step 4: now run "cal_accurracy.py" and give it the file "label_and_output_pca.csv", which has the actual output and predicted output together, as input. It produces a result file with precision, recall, f1-score, etc.

Note: as some files were larger than 100 MB, I have kept them in zipped form.
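The metrics from step 4 can be computed along the lines of the sketch below. It is not the code in the repository: it assumes a comma-separated input in which each cell is actual_predicted with 0/1 values, one column per label, and the function name `per_label_scores` is made up for the example.

```python
import csv
import io


def per_label_scores(lines):
    """Return a list of (precision, recall, f1) tuples, one per label column."""
    counts = []  # per column: [true positives, false positives, false negatives]
    for row in csv.reader(lines):
        if not counts:
            counts = [[0, 0, 0] for _ in row]
        for i, cell in enumerate(row):
            actual, predicted = cell.split("_")
            if actual == "1" and predicted == "1":
                counts[i][0] += 1
            elif actual == "0" and predicted == "1":
                counts[i][1] += 1
            elif actual == "1" and predicted == "0":
                counts[i][2] += 1

    scores = []
    for tp, fp, fn in counts:
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        scores.append((precision, recall, f1))
    return scores


# Made-up combined file: three movies, three labels.
combined = io.StringIO("1_1,0_0,1_0\n0_0,1_1,0_1\n1_1,0_1,1_1\n")
scores = per_label_scores(combined)
for label, (p, r, f1) in enumerate(scores):
    print(f"label {label}: precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
```

Labels that are never predicted or never present get a score of 0.0 here rather than raising a division error; that convention is a choice of this sketch.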

SwagarikaGiri/Multi-label-classification
