-
Notifications
You must be signed in to change notification settings - Fork 0
/
chapter2.tex
141 lines (98 loc) · 9.97 KB
/
chapter2.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
\chapter{Background}
\label{cha:back}
In this chapter I will cover the context of my project to allow the reader to understand the more complex parts of the report to come. First I will explain the different symptoms of hay fever and how they might be caused by different allergens. I will then talk about the main dataset used throughout and how I interpreted the data to produce meaningful results.
\section{Seasonal Allergies}
A massive 44\% of Britons suffer from at least one allergy, the majority suffering from some form of a seasonal allergy \cite{mintelallergy}. Allergies are on the rise, even with modern allergy medication continuously improving \cite{mintelallergy}. It is clear that something needs to be done to help those who suffer.\\
\subsection{Asthma}
When talking about hay fever, most people assume that it only affects the nose. However, in a study by Eur Respir J, it was found that 20.4\% of allergy sufferers also self-reported suffering from asthma \cite{rhinitis}. By 2025, it is predicated that asthma will represent the most prevalent chronic childhood disease and result in one of the highest health care costs, primarily due to the need for ongoing treatment throughout the patients life \cite{childhood}. Despite many decades of research, we do not have a complete working solution to the problem. The best we can do is limit exposure, Inhale will help with this by highlighting areas with unusually high occurrences of particular symptoms.\\
Listed below are the main causes of allergy induced asthma, the effects of which can impact everyday life of the sufferer.\\
\begin{itemize}
\item Allergens: including pollen, dust mites, animal fur ("dander") or feathers
\item Airborne Irritants: including cigarette smoke, fumes and pollution
\item Emotions: including stress or laughter
\item Weather Conditions: including sudden changes in temperature, cold air, windy days, thunderstorms and hot, humid days
\item Infections: particularly infections of the upper airways, such as colds and flu
\end{itemize}\begin{center}Adapted from \cite{urlasthmacauses}\end{center}
There is a large variety of factors contributing to allergy induced asthma. It would be impossible to cover all aspects of these in this report. I am going to focus on the effects of Airborne Irritants and Allergens.
\section{Datasets}
In this section, I will describe the structure and features of the datasets used by Inhale.\\
\subsection{Britain Breathing}
Britain Breathing is a Citizen Science project run by the British Society for Immunology, Royal Society of Biology and the University of Manchester Immunology Group, who conduct research into the causes and treatment of allergies. To aid their research, they collect data on seasonal allergies using a simple but effective mobile application available on the Android operating system. Their aim is to get allergy sufferers to submit their symptoms regularly so that they can build solutions to the problem.\\
When setting up the application, you are asked standard registration questions such as your age, gender, and whether you suffer from pre-existing allergy related issues such as asthma and hay fever. In everyday use of the Britain Breathing app you are asked to rate the severity of your allergy related symptoms on each day. You are also asked whether you have taken your allergy medication for that day. This data is collected along with your location to be stored by Britain Breathing.
\begin{figure}[H]
\begin{center}
\includegraphics[width=6cm, height=8cm]{bbapp}
\includegraphics[width=6cm, height=8cm]{bbapp2}
\caption{Screenshots of the data collection screen on the Britain Breathing Android app}
\label{fig:bbscrn}
\end{center}
\end{figure}
Dr Vigo works with the Britain Breathing project, and was kind enough to provide me with their dataset from September 2016. This dataset contains 18,386 records from all over the UK. The data contains values corresponding to the questions asked in the application. They also ask whether or not the user has taken their medication today, this field is particularly useful as it can be used as a multiplier. If someone has severe nose, eye and breathing symptoms and they have taken their medication, then this should be expressed in the final visualisation as they're clearly being exposed to a significant amount of allergens.\\% avoid 'obviously', rephrase
The dataset covers nose, eye and breathing symptoms, see Table \ref{bbdatatable}. As these parts of the body are all interlinked in very close proximity, it makes sense to try to combine them. In Section \ref{sec:anal}, I will go into more detail as to how I combine these problematic areas into one value that summarises them.\\
\begin{table}[H]
\begin{center}
\begin{tabular}{|c|c|c|c|c|c|c|c|c|c}\hline\hline
Breathing&Eyes&How Feeling&Nose&Taken Meds&Asthma&Hay Fever&Latitude&Longitude\\\hline
0&0&0&0&1&1&0&54.10&-2.39\\
3&0&2&3&1&1&1&53.89&-2.79\\
3&1&0&3&1&1&1&53.24&-2.34\\\hline\hline
\end{tabular}
\end{center}
\caption{A Simplified sample of the 18,386 record Britain Breathing dataset}
\label{bbdatatable}
\end{table}
The Britain Breathing dataset will be the base for the entire project. Most of my time was spent with this set establishing a good hotspot identification algorithm and presenting it on a map.
\subsection{Other Datasets}
Whilst early testing and development involved many datasets, I decided on two main sets to use for comparison to the Britain Breathing dataset.\\
The data.gov website provides counts for Road Traffic data available for public use. The set contains 750,000 entries from major road traffic counts at locations all over the UK. They give a location of the Centre Point of the count, that is, where the counter was when recording the traffic. The northmost and southmost junctions on either side of the Centre Point. For each record, the counts for different vehicle types are given. Vehicles are categorised into Cars, Buses, Large Goods Vehicles (LGV), Heavy Goods Vehicle Rigid axle (HGVRx) and HGVAx for articulated HGV's where x is the number of axles of the HGV. See Table \ref{RoadTrafficData} for a simplified version of the dataset.\\
\begin{table}[H]
\begin{center}
\begin{tabular}{|c|c|c|c|c|c|c|c|c|c|c|}\hline\hline
S Ref E&S REF N&Road&A REF E&A REF N&Hour&CAR&BUS&LGV&HGVR2&...\\\hline
90200&10585&A3112&90320&10530&7&24&6&13&5&...\\
380927&405391&M60&382819&405920&15&7642&18&300&64&...\\
380927&405391&M60&382819&405920&17&10042&32&654&103&...\\\hline\hline
\end{tabular}
\caption{Simplified example of the 750,000 record Road Traffic dataset. 20 columns have been removed for viewing purposes.}\label{RoadTrafficData}
\end{center}
\end{table}
The second set used is a dataset of the large urban areas in the UK. The data is fairly simple consisting of a location feature in the form of a Latitude and Longitude and a name for the area. I used this dataset as a guide in the hope that it might help highlight interesting areas. Areas that are not in a large urban area but are indicated as a hotspot should be investigated further. However, it is not used for any particular research benefit, nor is it used to draw any conclusions.\\
\section{Using Datasets}
The number of people suffering from allergic rhinitis is rising, and whilst increased carbon emissions and global warming are lengthening the pollen season, there are not significantly more pollen producing plants and trees year on year \cite{co2pollen, allergyrising}.\\
The main pollutants emitted by combustion engine vehicles are:
\begin{itemize}
\item CO
\item $SO_2$
\item $NO_x$
\end{itemize}
\begin{center}
Adapated from \cite{vehcemis}
\end{center}
It has been proven that these pollutants, whilst perhaps not associated with hay fever symptoms by most, are actually all directly associated with allergic rhinitis\cite{airPollution}. With this in mind, we can compare the Road Traffic dataset with the Britain Breathing dataset on the same map to try and find some correlation.\\
\section{Existing Applications}
\label{sec:diagrams}
There are few directly relevant applications. The reason for this is that in order to produce a meaningful application a well-formed dataset is required. In order for a dataset to be relevant, it needs to be of a reasonable size, have a good range of locations and a diverse demographic. Without collecting data from the people directly affected by allergens it's not possible to make educated conclusions on allergy hotspots. Currently there are no existing applications that satisfy this criteria.\\
\subsection{Britain Breathing}
Britain Breathing have a Data Visualiser on their website. It provides a simple display of the allergy data collected in a particular data range, for a particular data value. At the time of writing this report, the Visualiser is not working so I can only display a screenshot of the interface without the map containing the visualisation.
\begin{figure}[H]
\begin{center}
\includegraphics[width=0.75\textwidth]{bbvisinterface}
\caption{Britain Breathing Visualiser Interface}
\end{center}
\end{figure}
\subsection{Pollen forecast}
The most comparable applications to Inhale are pollen forecast services such as Pollen.com's allergy forecast map. As the name suggests it only displays pollen forecasts, not allergy symptoms. Whilst pollen is a major contributor to allergy symptoms, it needs to be combined with the other problematic allergens to be make credible conclusions.\\
Therefore, without the data from an organisation like Britain Breathing, it is not possible to make a comparable application.
\begin{figure}[H]
\begin{center}
\includegraphics[width=0.75\textwidth]{pollenforecast}
\caption{Pollen.com Allergy Forecast Map}
\end{center}
\end{figure}
\section{Remarks}
After looking at many other mapping applications, it has become clear that using the standard Marker to represent points on a map can be used if and only if there are a very low density of points. Any other scenario results in a map that is totally incomprehensible.
\begin{figure}[H]
\centering
\includegraphics[width=0.05\textwidth]{marker}
\caption{Marker}\label{fig:marker}
\end{figure}