It contains my course work and project for Data Science, Data Mining, Bioinformatics, and Digital Image Processing classes.
Description: I build a model to classify and analysis post-storm images from NOAA to help researchers to study the effects of storms. I created python code to compress large images and to find the center of an image. I did statistical analysis with the data such as basic statistics, distribution modeling, hypothesis testing, correlation and coveriance.
Work type: group project.
Programming language and tools: Python 3, Jupyter Notebook, PyCharm.
Frames and libraries: pandas, numpy, csv, matplotlib, sklearn, seaborn, scipy, math.
Description: Using Joining, Groupby, Sorting, find the best date night movie and titanic passengers survival ratio by age, class, and sex.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: OS, pandas, HTML.
Description: I created a algorithm of random team generator based on the given team number.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Description: Mining interesting association rule between movies based on it's genre and rating using Apriori algorithm.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, mlxtend, apriori, association_rules, transactionEncoder.
Description: Find the features of the data, smoothing, mormalization and correlation analysis among the attributes.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, numpy, scipy, sklearn.
Description: Find top 20 topics out of the whole dataset and find top 10 words of each topic. Then using the top 10 words, try to figure out the topic. Also figure out how to topic has changed over time using topic modeling.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, csv, gensim, nltk, pprint, LdaModel, WordNetLemmatizer, corpora.
Description: Performed sentiment analysis on the tweets data using machine learning Logistic Regression classification.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, csv, re, string, nltk, itertools, sklearn, LogisticRegression.
Description: Created null hypothesis, conducted t-statistic, p-value and then calculated mean.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, scipy.
Description: Applied KNN classification model into the iris dataset.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, numpy, sklearn, KNeibhborsClassifier.
Description: Using multiple linear regression model predict semantic similarity based on the given input data. Calculated R-square value and drawn residual plots.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, statsmodels, seaborn, matplotlib.
Description: Calculate semantic similarity between pair of data by using Jaccard, Resnik, AllPair, BestPair similarity algorithms.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: pandas, numpy, colections.
Description: I created python script for Z and KMP algorithm to check if a pattern is present in the given text or not.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Description: Using the given input image file, I created python script and performed subtraction, negation, and translation on the images.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: numpy.
Description: I have written python script for co-occurrence matrix, correlation coefficient, downsampling and upsampling algorithms. Then using the given image file, I performed those algorithms on the images.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: numpy, math.
Description: I have written python script for histrogram processing, gaussian filtering, unsharp masking and highpass filtering. Then using the given image file, I performed those algorithms on the images.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: numpy, math, subtract.
Description: I have written python script for dilation, erosion, boundary extraction, edge detection by using gradient operators. Then using the given image file, I performed those algorithms on the images.
Work type: Individual course work.
Programming language and tools: Python 3, Jupyter Notebook.
Frames and libraries: numpy, math.