documents column reordering

methal-project · Oct 9, 2023 · 4135ff7 · 4135ff7
1 parent 2d7a9fd
commit 4135ff7
Show file tree

Hide file tree

Showing 3 changed files with 109 additions and 110 deletions.
diff --git a/README.md b/README.md
@@ -34,12 +34,12 @@ For more details about the categorization, see [our paper](https://univoak.eu/is
 | ---- | ---- | --- | 
 | 0 | speaker | Character name | 
 | 1 | gender | Character gender | 
-| 2 | author | author name | 
-| 3 | date | A date for the play | 
-| 4 | date_type | When it was written, first printed, or print date for the edition we used | 
-| 5 | social_class | Character social class, we estimated this based on information in the *dramatis personæ* | 
-| 6 | job | Character's profession as in the *dramatis personæ* | 
-| 7 | job_category | Professional category using our own taxonomy | 
+| 2 | social_class | Character social class, we estimated this based on information in the *dramatis personæ* | 
+| 3 | job | Character's profession as in the *dramatis personæ* | 
+| 4 | job_category | Professional category using our own taxonomy | 
+| 5 | author | author name | 
+| 6 | date | A date for the play | 
+| 7 | date_type | When it was written, first printed, or print date for the edition we used | 
 | 8 | segment_number | For emotion analysis, the plays get divided into homogeneous segments. This field can be ignored for other purposes. | 
 | 9 | play_short_name | Corresponds to the play's filename in the TEI directories (without *.xml*) | 
 | 10 | genre | We have comedy, drama, volksstueck, tale (*Märel*) | 

diff --git a/metadata_analysis.ipynb b/metadata_analysis.ipynb
diff --git a/pre_treatment/script/postprocess_character_speech_df.py b/pre_treatment/script/postprocess_character_speech_df.py
@@ -130,7 +130,6 @@
 
 # some speaker names had trailing whitespace
 outdf['speaker'] = outdf.speaker.apply(lambda x:x.strip())
-#outdf['date'] = outdf.date.astype(int)
 
 # write out
 outdf.to_csv(outdf_path, index=False)