GilbertLabUCSF
diff --git a/‎README.rst‎
Lines changed: 94 additions & 0 deletions b/‎README.rst‎
Lines changed: 94 additions & 0 deletions
diff --git a/‎docs/environment.yaml‎
Lines changed: 3 additions & 1 deletion b/‎docs/environment.yaml‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎docs/requirements.txt‎
Lines changed: 1 addition & 0 deletions b/‎docs/requirements.txt‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/source/CanDI.rst‎
Lines changed: 13 additions & 38 deletions b/‎docs/source/CanDI.rst‎
Lines changed: 13 additions & 38 deletions
diff --git a/‎docs/source/README.rst‎
Lines changed: 94 additions & 0 deletions b/‎docs/source/README.rst‎
Lines changed: 94 additions & 0 deletions
diff --git a/‎brca_heatmap.ipynb‎ renamed to ‎docs/source/brca_heatmap.ipynb‎
Lines changed: 8 additions & 14 deletions b/‎brca_heatmap.ipynb‎ renamed to ‎docs/source/brca_heatmap.ipynb‎
Lines changed: 8 additions & 14 deletions
@@ -0,0 +1,94 @@
+CanDI - A global cancer data integrator
+=======================================
+
+|Documentation Status|
+
+Package Installation
+--------------------
+
+First, you need to clone this repository to use CanDI.
+
+.. code:: bash
+
+   git clone https://github.com/GilbertLabUCSF/CanDI.git
+
+We suggest to use `Conda <https://docs.conda.io/en/latest/>`__ as a
+package manager and environment management system. You can create a
+fresh conda environment with all ``CanDI``\ ’s requirements using bellow
+command:
+
+.. code:: bash
+
+   conda env create -f CaDI/candi.yml -n candi
+
+Prepare Datasets
+~~~~~~~~~~~~~~~~
+
+The python command from CanDI will automatically download and modify
+datasets.
+
+.. code:: bash
+
+   python CanDI/CanDI/install.py
+
+Downloaded and formatted datasets would organize this way:
+
+.. code::
+
+   .
+   ├── config.ini # modified after Installation 
+   ├── depmap
+   │   ├── CCLE_expression.csv
+   │   ├── CCLE_fusions.csv
+   │   ├── CCLE_gene_cn.csv
+   │   ├── CCLE_mutations.csv
+   │   ├── CCLE_RNAseq_reads.csv
+   │   ├── CRISPR_gene_dependency.csv
+   │   ├── CRISPR_gene_effect.csv
+   │   └── sample_info.csv
+   ├── genes
+   │   └── gene_info.csv
+   └── locations
+       └── merged_locations.csv
+
+Package Usage
+-------------
+
+Import CanDI into python
+~~~~~~~~~~~~~~~~~~~~~~~~
+
+To import ``CanDI``, your active directory in python must be same as the
+cloned folder.
+
+.. code:: python
+
+   import CanDI as can
+
+**OR**, you can add path to the `CanDI` directory if you want to use it from other directories.
+
+.. code:: python
+
+   import sys
+   sys.path.append("path-to-candi-directory")
+
+   import CanDI as can
+
+CanDI Objects
+~~~~~~~~~~~~~
+
+-  ``data`` : Container for all candi datasets. All access to datasets
+   go through data object.
+-  ``Gene`` : Provides cross dataset indexing from the gene perspective.
+-  ``CellLine`` : Provides cross dataset indexing from the cell line
+   perspective.
+-  ``Cancer`` : Provides cross dataset indexing by a group of cell lines
+   that are all the same tissue.
+-  ``Organelle``: Provides cross dataset indexing for a group of genes
+   whose proteins localize to the same organelle.
+-  ``CellLineCluster`` : Provides cross dataset indexing for a group of
+   user defined cell lines.
+-  ``GeneCluster`` : Provides cross dataset indexing for a group of user
+   defined genes.
+
+.. |Documentation Status| image:: https://readthedocs.org/projects/candi/badge/?version=latest
+   :target: https://candi.readthedocs.io/en/latest/?badge=latest
@@ -4,8 +4,10 @@ channels:
   - conda-forge
   - defaults
 dependencies:
+  - python
   - sphinx==3.2.1
   - pandas
-  - nbsphinx==0.8.1
+  - pip
   - pip:
+    - nbsphinx
     - sphinx_rtd_theme==0.4.3
@@ -1,5 +1,6 @@
 sphinx==3.2.1
 sphinx_rtd_theme==0.4.3
+nbsphinx
 readthedocs-sphinx-search==0.1.0rc3
 pandas
 configparser
 
@@ -1,51 +1,26 @@
-CanDI package
-=============
-
 CanDI.candi module
-------------------
+==================
 
-.. automodule:: CanDI.candi
+.. automodule:: CanDI.candi.candi
    :members:
    :undoc-members:
    :show-inheritance:
 
 CanDI.data module
------------------
-
-.. automodule:: CanDI.data
-   :members:
-   :undoc-members:
-   :show-inheritance:
-
-CanDI.handlers module
----------------------
-
-.. automodule:: CanDI.handlers
-   :members:
-   :undoc-members:
-   :show-inheritance:
+=================
+The data class is instantiated at import. This class contains paths to all data downloaded with CanDI.
+It has internal methods for loading datasets into memory as pandas dataframes.
+There are 3 index tables that candi relies on for fetch all data:
 
-CanDI.install module
---------------------
+- cell_lines
+- genes
+- locations
 
-.. automodule:: CanDI.install
-   :members:
-   :undoc-members:
-   :show-inheritance:
-
-CanDI.manager module
---------------------
-
-.. automodule:: CanDI.manager
-   :members:
-   :undoc-members:
-   :show-inheritance:
+These tables are automatically loaded as pandas dataframes upon import of CanDI
+It is highly recommended the user familiarize themself with the columns and indexes of these tables.
+All candi classes operate through these index tables.
 
-CanDI.reset\_config module
---------------------------
-
-.. automodule:: CanDI.reset_config
+.. automodule:: CanDI.candi.data
    :members:
    :undoc-members:
    :show-inheritance:
-
@@ -0,0 +1,94 @@
+CanDI - A global cancer data integrator
+=======================================
+
+|Documentation Status|
+
+Package Installation
+--------------------
+
+First, you need to clone this repository to use CanDI.
+
+.. code:: bash
+
+   git clone https://github.com/GilbertLabUCSF/CanDI.git
+
+We suggest to use `Conda <https://docs.conda.io/en/latest/>`__ as a
+package manager and environment management system. You can create a
+fresh conda environment with all ``CanDI``\ ’s requirements using bellow
+command:
+
+.. code:: bash
+
+   conda env create -f CaDI/candi.yml -n candi
+
+Prepare Datasets
+~~~~~~~~~~~~~~~~
+
+The python command from CanDI will automatically download and modify
+datasets.
+
+.. code:: bash
+
+   python CanDI/CanDI/install.py
+
+Downloaded and formatted datasets would organize this way:
+
+.. code::
+
+   .
+   ├── config.ini # modified after Installation 
+   ├── depmap
+   │   ├── CCLE_expression.csv
+   │   ├── CCLE_fusions.csv
+   │   ├── CCLE_gene_cn.csv
+   │   ├── CCLE_mutations.csv
+   │   ├── CCLE_RNAseq_reads.csv
+   │   ├── CRISPR_gene_dependency.csv
+   │   ├── CRISPR_gene_effect.csv
+   │   └── sample_info.csv
+   ├── genes
+   │   └── gene_info.csv
+   └── locations
+       └── merged_locations.csv
+
+Package Usage
+-------------
+
+Import CanDI into python
+~~~~~~~~~~~~~~~~~~~~~~~~
+
+To import ``CanDI``, your active directory in python must be same as the
+cloned folder.
+
+.. code:: python
+
+   import CanDI as can
+
+**OR**, you can add path to the `CanDI` directory if you want to use it from other directories.
+
+.. code:: python
+
+   import sys
+   sys.path.append("path-to-candi-directory")
+
+   import CanDI as can
+
+CanDI Objects
+~~~~~~~~~~~~~
+
+-  ``data`` : Container for all candi datasets. All access to datasets
+   go through data object.
+-  ``Gene`` : Provides cross dataset indexing from the gene perspective.
+-  ``CellLine`` : Provides cross dataset indexing from the cell line
+   perspective.
+-  ``Cancer`` : Provides cross dataset indexing by a group of cell lines
+   that are all the same tissue.
+-  ``Organelle``: Provides cross dataset indexing for a group of genes
+   whose proteins localize to the same organelle.
+-  ``CellLineCluster`` : Provides cross dataset indexing for a group of
+   user defined cell lines.
+-  ``GeneCluster`` : Provides cross dataset indexing for a group of user
+   defined genes.
+
+.. |Documentation Status| image:: https://readthedocs.org/projects/candi/badge/?version=latest
+   :target: https://candi.readthedocs.io/en/latest/?badge=latest
@@ -1,9 +1,15 @@
 {
  "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# _BRCA_ Heatmap"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 1,
-   "id": "a63ef233",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -16,7 +22,6 @@
   },
   {
    "cell_type": "markdown",
-   "id": "4f21e048",
    "metadata": {},
    "source": [
     "### Cancer Object Instantiation\n",
@@ -27,7 +32,6 @@
   {
    "cell_type": "code",
    "execution_count": 2,
-   "id": "21f19aab",
    "metadata": {},
    "outputs": [
     {
@@ -51,7 +55,6 @@
   },
   {
    "cell_type": "markdown",
-   "id": "02119bf2",
    "metadata": {},
    "source": [
     "### Subsetting by mutation status\n",
@@ -64,7 +67,6 @@
   {
    "cell_type": "code",
    "execution_count": 3,
-   "id": "96b1f2c1",
    "metadata": {},
    "outputs": [
     {
@@ -461,7 +463,6 @@
   },
   {
    "cell_type": "markdown",
-   "id": "13705e7a",
    "metadata": {},
    "source": [
     "I want to look at BRCA1 mutations in these types of cancers. I start by using the mutated function to identify ovarian and breast cancer cell lines with BRCA1 mutations. A cancer object's mutated method's default behavior is to output a list of depmap ids corresponding to celllines containing any mutation within the given genes. I then instantiate CellLineCluster objects of exclusively mutated or wild type cell lines for both breast and ovarian cancer. This makes comparing these cell lines easier.\n",
@@ -471,7 +472,6 @@
   {
    "cell_type": "code",
    "execution_count": 4,
-   "id": "4726efb9",
    "metadata": {},
    "outputs": [
     {
@@ -508,7 +508,6 @@
   },
   {
    "cell_type": "markdown",
-   "id": "103f3f3e",
    "metadata": {},
    "source": [
     "### Cross Referencing Mutation and Gene Knockout Data\n",
@@ -519,7 +518,6 @@
   {
    "cell_type": "code",
    "execution_count": 5,
-   "id": "ddcdd407",
    "metadata": {},
    "outputs": [],
    "source": [
@@ -569,7 +567,6 @@
   },
   {
    "cell_type": "markdown",
-   "id": "7d3d1277",
    "metadata": {},
    "source": [
     "### Fanconi Anemia Genes Knockout Effect in Ovarian Cancer\n",
@@ -579,7 +576,6 @@
   {
    "cell_type": "code",
    "execution_count": 6,
-   "id": "0194cc45",
    "metadata": {},
    "outputs": [
     {
@@ -618,7 +614,6 @@
   },
   {
    "cell_type": "markdown",
-   "id": "f8d4d556",
    "metadata": {},
    "source": [
     "### Fanconi Anemia Genes Knockout Effect in Breast Cancer\n",
@@ -628,7 +623,6 @@
   {
    "cell_type": "code",
    "execution_count": 7,
-   "id": "e1d3cbab",
    "metadata": {},
    "outputs": [
     {
@@ -665,7 +659,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.9.2"
+   "version": "3.8.5"
   }
  },
  "nbformat": 4,