NHSDigital
diff --git a/.gitignore b/.gitignore
Lines changed: 178 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 105 additions & 0 deletions
diff --git a/requirements.txt b/requirements.txt
Lines changed: 37 additions & 0 deletions
diff --git a/srh_code/__init__.py b/srh_code/__init__.py
@@ -0,0 +1,178 @@
+
+# Avoid committing any data. If you need to make an exception for one file you can do that separately
+*.csv
+*.xlsx
+*.xls
+
+# Linting files:
+.pylint.d/
+
+# Data:
+*.csv
+*.sas7bdat
+
+# Notebooks:
+*.ipynb
+*.ipynb_checkpoints
+
+# Working file
+working.py
+temp.py
+
+# Cached files
+cached_dataframes/
+*.ft
+
+# System generated
+desktop.ini
+
+# Cached files
+*.ft
+cached_dataframes/
+
+####################################################################
+# GENERIC GITIGNORE FILE FOR PYTHON 
+# Copied from https://github.com/github/gitignore/blob/master/Python.gitignore
+# This is a useful thing to just copy and paste since it should work for most use cases. 
+# If you have additional things to exclude from the .gitignore file you should add it above
+# this section.
+#####################################################################
+
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+# C extensions
+*.so
+
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+cover/
+
+# Translations
+*.mo
+*.pot
+
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+
+# Flask stuff:
+instance/
+.webassets-cache
+
+# Scrapy stuff:
+.scrapy
+
+# Sphinx documentation
+docs/_build/
+
+# PyBuilder
+.pybuilder/
+target/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+
+# IPython
+profile_default/
+ipython_config.py
+
+# pyenv
+#   For a library or package, you might want to ignore these files since the code is
+#   intended to run in multiple environments; otherwise, check them in:
+# .python-version
+
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow
+__pypackages__/
+
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+
+# SageMath parsed files
+*.sage.py
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+
+# Spyder project settings
+.spyderproject
+.spyproject
+
+# Rope project settings
+.ropeproject
+
+# mkdocs documentation
+/site
+
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+
+# Pyre type checker
+.pyre/
+
+# pytype static type analyzer
+.pytype/
+
+# Cython debug symbols
+cython_debug/
@@ -0,0 +1,105 @@
+Warning - this repository is a snapshot of a repository internal to NHS England. This means that links to videos and some URLs may not work.
+
+Repository owner: Analytical Services: Population Health, Clinical Audit and Specialist Care
+
+Email: lifestyles@nhs.net
+
+To contact us, raise an issue on GitHub or email us, and we will respond promptly.
+
+# Background
+
+This project produces the required publication outputs for the Sexual and
+Reproductive Health Services (Contraception) publication: Data tables, charts,
+and map data.
+
+Data is sourced from the Sexual and Reproductive Health Activity Dataset (SRHAD),
+NHS hospital admissions data, NHS Business Services Authority (NHSBSA)
+prescription cost analysis data, and the NHS corporate reference datasets.
+
+# Initial package set up
+
+Set up is done using the requirements.txt file.
+
+Before running the command below, it is advised to delete the folder that holds your currently
+stored user packages: `C:\Users\YOUR_SHORTCODE\AppData\Roaming\Python\Python39`.
+This resets your packages to the default install versions before they are updated for this project.
+
+Then run the following command in a terminal to set up the package:
+```
+pip install --user --no-warn-script-location -r requirements.txt
+```
+
+
+# Directory structure:
+```
+srh-services-rap
+│   README.md
+│   requirements.txt                      - Used to install the python dependencies
+│
+├───srh_code                              - This is the main code directory for this project
+│   │   create_publication.py             - This script runs the entire publication
+│   │   parameters.py                     - Contains parameters that define how the publication will run
+│   │
+│   └───sql_code                          - This folder contains all the SQL queries used in the import data stage
+│           │   query_ahas.sql
+│           │   query_asset_reporting.sql
+│           │   query_asset.sql
+│           │   query_imd_decile.sql
+│           │   query_imd_lsoa.sql
+│           │   query_la_ref.sql
+│           │   query_lsoa_ref.sql
+│           │   query_org_daily.sql
+│           │   query_org_sites.sql
+│           │   query_population.sql
+│           │
+│       utilities                          - This folder contains all the main modules used to create the publication
+│           │   charts.py                  - Defines the arguments needed to create and export chart outputs
+│           │   data_connections.py        - Defines the df_from_sql function, used when importing SQL data
+│           │   field_definitions.py       - Defines any derived fields added during processing.
+│           │   filter_definitions.py      - Defines pre-set pipeline filters.
+│           │   helpers.py                 - Contains generalised functions used within the project
+│           │   load.py                    - Contains functions for reading in the required data
+│           │   logger_config.py           - The configuration functions for the publication logger
+│           │   pre-processing.py          - Contains the core pre-processing functions
+│           │   publication_files.py       - Contains functions used to create publication ready outputs and save in relevant folders
+│           │   tables.py                  - Defines the arguments needed to create and export Excel table outputs
+│           │   
+│           └───processing
+│                 │   processing_publication.py    - Contains the core functions used to produce publication outputs
+│                write                     - This folder contains all the main modules used to write the outputs to external files
+│                     write_data.py        - Contains functions for writing the data to external files
+│                     write_format.py      - Contains functions for formatting the external files
+└───tests
+    └────unittests                         - Unit tests for Python functions
+            │   test_field_definitions.py
+            │   test_filter_definitions.py            
+            │   test_helpers.py
+            │   test_pre_processing.py        
+            │   test_processing_publication.py
+ 
+```
+# Running the pipeline:
+
+There are two main files that users running the process will need to interact with:
+
+    * The `parameters.py` 
+    * The `create_publication.py`
+
+The file `parameters.py` contains everything that we expect to change from one publication
+to the next. If the methodology has not changed, this should be the only file you need
+to modify. A few elements require updating each year (e.g. the reporting year), but most
+are likely to require only occasional updates (e.g. file paths, default codes).
+It also lets the user control which parts of the publication the pipeline produces.
+
+The publication process is run using the top-level script, create_publication.py.
+This script imports and runs all the required functions from the sub-modules.
+
+# Link to publication
+https://digital.nhs.uk/data-and-information/publications/statistical/sexual-and-reproductive-health-services
+
+# Licence
+The NHS England Sexual and Reproductive Health Services (Contraception) National Statistics publication codebase is released under the MIT License.
+
+Copyright © 2023, NHS England
+
+You may re-use this document/publication (not including logos) free of charge in any format or medium, under the terms of the Open Government Licence v3.0. Information Policy Team, The National Archives, Kew, Richmond, Surrey, TW9 4DU; email: psi@nationalarchives.gsi.gov.uk
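
Review note on the README hunk above: its "Running the pipeline" section describes `parameters.py` as holding the per-publication settings and the switches for which outputs to produce, with `create_publication.py` as the top-level runner. The actual contents of those two files are not shown in this diff, so the following is only a minimal sketch of that pattern. Every name in it (`Parameters`, `run_tables`, `run_charts`, `run_map_data`, the stage functions, and the example reporting year) is a hypothetical placeholder, not taken from the repository.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Parameters:
    """Hypothetical stand-in for the settings kept in parameters.py."""
    reporting_year: str = "2022-23"  # illustrative value only
    run_tables: bool = True          # assumed switch: produce data tables
    run_charts: bool = True          # assumed switch: produce charts
    run_map_data: bool = False       # assumed switch: produce map data


# Placeholder stages; the real pipeline would import these from its sub-modules.
def create_tables(params: Parameters) -> str:
    return f"tables for {params.reporting_year}"


def create_charts(params: Parameters) -> str:
    return f"charts for {params.reporting_year}"


def create_map_data(params: Parameters) -> str:
    return f"map data for {params.reporting_year}"


def create_publication(params: Parameters) -> List[str]:
    """Run only the stages whose switch is enabled, in a fixed order."""
    stages: List[Tuple[str, Callable[[Parameters], str]]] = [
        ("run_tables", create_tables),
        ("run_charts", create_charts),
        ("run_map_data", create_map_data),
    ]
    outputs = []
    for flag_name, stage in stages:
        if getattr(params, flag_name):
            outputs.append(stage(params))
    return outputs


if __name__ == "__main__":
    for result in create_publication(Parameters()):
        print(result)
```

Keeping the switches in one frozen settings object means a routine publication run only edits that object, which matches the README's statement that `parameters.py` should be the only file to change when the methodology is unchanged.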
@@ -0,0 +1,37 @@
+
+-e .
+
+# Libraries used for running the SRH services pipeline
+# Package versions frozen as of 13DEC2022
+# Based on Python version below
+# Python version = 3.9.12
+
+# Data manipulation
+numpy==1.21.5
+pandas==1.4.2
+sidetable==0.9.0
+
+# Excel output
+xlwings==0.24.9
+openpyxl==3.0.9
+pywin32==303
+XlsxWriter==3.0.3
+
+# Word outputs (if needed)
+# python-docx==0.8.11
+# docx-mailmerge==0.5.0
+
+# SQL
+sqlalchemy==1.4.32
+pyodbc==4.0.32
+
+# Testing
+pytest==7.1.3
+pytest-html==3.1.1
+
+# Additional dependencies of the above packages
+importlib-resources==5.4.0
+pathlib==1.0.1
+simplegeneric==0.8.1
+tzlocal==4.1
+pyarrow==10.0.1
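
Review note on the requirements.txt hunk above: because every package is pinned with `==`, it can be useful to confirm that the active environment matches the pins after running the install command from the README. The sketch below is one possible way to do that with the standard library's `importlib.metadata` (available on the Python 3.9 version the file targets). The function names are hypothetical helpers, not part of the repository, and the comparison is a plain string comparison, so a pin written as `3.0.09` would be reported as differing from an installed `3.0.9`.

```python
from importlib import metadata
from typing import Dict, Optional, Tuple


def parse_pins(requirements_text: str) -> Dict[str, str]:
    """Return {package: version} for every 'name==version' line."""
    pins: Dict[str, str] = {}
    for raw_line in requirements_text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and whitespace
        if "==" not in line or line.startswith("-"):
            continue  # skip blanks, options such as '-e .', and unpinned entries
        name, version = line.split("==", 1)
        pins[name.strip().lower()] = version.strip()
    return pins


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins: Dict[str, str]) -> Dict[str, Tuple[str, Optional[str]]]:
    """Return {package: (pinned, installed)} wherever the two strings differ."""
    mismatches = {}
    for name, pinned in pins.items():
        installed = installed_version(name)
        if installed != pinned:
            mismatches[name] = (pinned, installed)
    return mismatches


if __name__ == "__main__":
    with open("requirements.txt", encoding="utf-8") as handle:
        for name, (pinned, found) in find_mismatches(parse_pins(handle.read())).items():
            print(f"{name}: pinned {pinned}, installed {found}")
```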