method to to dump reader easily (#255)

andrelmfarias · web-flow · commit bf75db5ee577 · 2019-09-13T12:16:42.000+02:00
diff --git a/README.md b/README.md
@@ -14,9 +14,9 @@ An End-To-End Closed Domain Question Answering System.
 
 ## cdQA in details
 
-If you are interested in understanding how the system works and it is implemented, we wrote an [article on Medium](https://towardsdatascience.com/how-to-create-your-own-question-answering-system-easily-with-python-2ef8abc8eb5) with a high-level explanation.
+If you are interested in understanding how the system works and its implementation, we wrote an [article on Medium](https://towardsdatascience.com/how-to-create-your-own-question-answering-system-easily-with-python-2ef8abc8eb5) with a high-level explanation.
 
-We also made a presentation during the \#9 NLP Breakfast organised by [Feedly](feedly.com). You can check out the video of the presentation [here](https://blog.feedly.com/nlp-breakfast-9-closed-domain-question-answering/).
+We also made a presentation during the \#9 NLP Breakfast organised by [Feedly](feedly.com). You can check it out [here](https://blog.feedly.com/nlp-breakfast-9-closed-domain-question-answering/).
 
 ## Table of Contents <!-- omit in toc -->
 
@@ -130,6 +130,10 @@ cdqa_pipeline = QAPipeline(model='bert_qa_vGPU-sklearn.joblib')
 cdqa_pipeline.fit_reader('path-to-custom-squad-like-dataset.json')
 ```
 
+Save the reader model after the fine-tune:
+```python
+cdqa_pipeline.dump_reader('path-to-save-bert-reader.joblib')
+``
 ### Making predictions
 
 To get the best prediction given an input query:
diff --git a/cdqa/pipeline/cdqa_sklearn.py b/cdqa/pipeline/cdqa_sklearn.py
@@ -179,7 +179,7 @@ def predict(self, X=None, return_logit=False, n_predictions=None):
             )
 
     def to(self, device):
-        """ Send reade to CPU if device=='cpu' or to GPU if device=='cuda'
+        """ Send reader to CPU if device=='cpu' or to GPU if device=='cuda'
         """
         if device not in ("cpu", "cuda"):
             raise ValueError("Attribute device should be 'cpu' or 'cuda'.")
@@ -202,6 +202,11 @@ def cuda(self):
         self.reader.device = torch.device("cuda")
         return self
 
+    def dump_reader(self, filename):
+        """ Dump reader model to a .joblib object
+        """
+        joblib.dump(self.reader, filename)
+
     @staticmethod
     def _expand_paragraphs(df):
         # Snippet taken from: https://stackoverflow.com/a/48532692/11514226
diff --git a/setup.py b/setup.py
@@ -8,7 +8,7 @@ def read(file):
 
 setup(
     name="cdqa",
-    version="1.1.3c",
+    version="1.1.4a",
     author="Félix MIKAELIAN, André FARIAS, Matyas AMROUCHE, Olivier SANS, Théo NAZON",
     description="An End-To-End Closed Domain Question Answering System",
     long_description=read("README.md"),