MeDUSA is modular, customizable, and user friendly toolset in R to ease the data processing of Direct Infusion Untarged Single-cell Analysis.
Included is a dockerfile that compiles all R requirements, some common compound databases, and an Rstudio.
docker pull thefollyllama/medusa
docker run -e PASSWORD=medusa -p 8787:8787 -v .:/home/rstudio/local thefollyllama/medusa
in a browser navigate to "localhost:8787" usr:rstudio pwd:medusa
It is suggested to run via dockerhub, as building the image can take over an hour. However, local build instructions can be found in the Dockerfile
MeDUSA's modularity is achived via common data objects that are interchangeable.
DataFrame where:
- Rownames = Float: MZ
- Colnames = String: sample_names
- Data = Float: Intensities
DataFrame where:
- Rownames = Float: MZ
- Colnames = Float: ScanTime
- Data = Float: Intensities
DataFrame where:
- Rownames = Float: MZ
- Colnames = String: sample_names
- Data = Float: log2(Intensities)
Programatically: the methods for [mz-obj, mzT-obj, mzLog-obj] are all interchangeable....proceed with caution.
DataFrame where columns are:
- Float : MZ
- String: ScanTime
- Float : intensity
Dataframe required columns are:
- Integer: measurement
- String : sample_name (Must match above)
- String : type (i.e. cell, media_cell, solvent)
- String : phenotype Additional columns could be helpful. Such as:
- Integer: time
- Integer: plate
- Integer: cell_count
- String : sampler
- String : polarity
- String : sampling_day
- String : filtered_out
MeDUSA's files and methods are named to quickly identify input and output expectations.
file_name : [suggested order][input object type][description].R i.e. "2_mz_quality_check.R"
public function : [input_object]_[category]_magic suggested method to handle everything in the file category (i.e. "mz_post_magic")
public function : [input object][subcategory][description] (i.e. "mzlog_analysis_fold")
private funciton : [type].[description] (i.e. "extract.binning")
file: z_[description].R
- common methods written by ehetzel file: zz_[description].R
- common methods written by others
Also in this repo are Rproj adjacent tools
docs: JOSS paper materials examples : a small set of data and an example of ideal usage scripts : mostly data massaging scripts to provide compound data in the docker container validation: TBD Dockerfile: An optimized docker image ~2.5gb Dockerfile_unoptimized: More debug friendly dockerfile ~12gb