Applied Modeling and Quantitative Methods
Assessing the Cost of Reproduction between Male and Female Sex Functions in Hermaphroditic Plants
The cost of reproduction refers to the use of resources for the production of offspring that decreases the availability of resources for future reproductive events and other biological processes. Models of sex-allocation provide insights into optimal patterns of resource investment in male and female sex functions and have been extended to include other components of the life history, enabling assessment of the costs of reproduction. These models have shown that, in general, costs of reproduction through female function should usually exceed costs through male function. However, those previous models only considered allocations from a single pool of shared resources. Recent studies have indicated that the type of resource currency can differ for female and male sex functions, and that this might affect costs of reproduction via effects on other components of the life history. Using multiple invasibility analysis, this study examined resource allocation to male and female sex functions, while simultaneously considering allocations to survival and growth. Allocation patterns were modelled using both shared and separate resource pools. Under shared resources, allocation patterns to male and female sex function followed the results of earlier models. When resource pools were separate, however, allocations to male function often exceeded allocations to female function, even if fitness gains increased less strongly with investment in male function than with investment in female function. These results demonstrate that the costs of reproduction are affected by (1) the types of resources needed for reproduction via female or male function and (2) via trade-offs with other components of the life history. Future studies of the costs of reproduction should examine whether allocations to reproduction via female versus male function usually entail the use of different types of resources.
Author Keywords: Cost of Reproduction, Gain Curve, Life History, Resource Allocation Patterns, Resource Currencies
Time Series Algorithms in Machine Learning - A Graph Approach to Multivariate Forecasting
Forecasting future values of time series has long been a field with many and varied applications, from climate and weather forecasting to stock prediction and economic planning to the control of industrial processes. Many of these problems involve not only a single time series but many simultaneous series which may influence each other. This thesis provides methods based on machine learning of handling such problems.
We first consider single time series with both single and multiple features. We review the algorithms and unique challenges involved in applying machine learning to time series. Many machine learning algorithms when used for regression are designed to produce a single output value for each timestamp of interest with no measure of confidence; however, evaluating the uncertainty of the predictions is an important component for practical forecasting. We therefore discuss methods of constructing uncertainty estimates in the form of prediction intervals for each prediction. Stability over long time horizons is also a concern for these algorithms as recursion is a common method used to generate predictions over long time intervals. To address this, we present methods of maintaining stability in the forecast even over large time horizons. These methods are applied to an electricity forecasting problem where we demonstrate the effectiveness for support vector machines, neural networks and gradient boosted trees.
We next consider spatiotemporal problems, which consist of multiple interlinked time series, each of which may contain multiple features. We represent these problems using graphs, allowing us to learn relationships using graph neural networks. Existing methods of doing this generally make use of separate time and spatial (graph) layers, or simply replace operations in temporal layers with graph operations. We show that these approaches have difficulty learning relationships that contain time lags of several time steps. To address this, we propose a new layer inspired by the long-short term memory (LSTM) recurrent neural network which adds a distinct memory state dedicated to learning graph relationships while keeping the original memory state. This allows the model to consider temporally distant events at other nodes without affecting its ability to model long-term relationships at a single node. We show that this model is capable of learning the long-term patterns that existing models struggle with. We then apply this model to a number of real-world bike-share and traffic datasets where we observe improved performance when compared to other models with similar numbers of parameters.
Author Keywords: forecasting, graph neural network, LSTM, machine learning, neural network, time series
Capital Ratios and Liquidity Creation: Evidence from Canadian Big Six Banks
Using quarterly data from the six largest Canadian banks, we investigate the relationship between regulatory capital ratio and on-balance sheet liquidity created in the Canadian economy by "Big Six". We find a significant positive relationship between Tier 1 capital ratio and on-balance sheet liquidity creation for Canadian big six banks, implying that large banks in Canada favor risks and rely on capital to fund illiquid assets. In contrast, for smaller banks, the relationship is significantly negative. Our results are robust to dynamic panel regression using 2-Step GMM, two exogenous shocks - COVID-19 crisis and the Global Financial Crisis (2007-2009), mergers & acquisitions activities in the banking industry, and core deposits financing. The COVID-19 pandemic and core deposits adversely impact the Tier 1 capital ratio's relationship with on-balance-sheet liquidity creation, while the global financial crisis (2007-2009) effect on the association is insignificant.
Author Keywords: Big Six, COVID -19, Deposits, Liquidity Creation, Tier 1 Capital Ratio,
Characteristics of Models for Representation of Mathematical Structure in Typesetting Applications and the Cognition of Digitally Transcribing Mathematics
The digital typesetting of mathematics can present many challenges to users, especially those of novice to intermediate experience levels. Through a series of experiments, we show that two models used to represent mathematical structure in these typesetting applications, the 1-dimensional structure based model and the 2-dimensional freeform model, cause interference with users' working memory during the process of transcribing mathematical content. This is a notable finding as a connection between working memory and mathematical performance has been established in the literature. Furthermore, we find that elements of these models allow them to handle various types of mathematical notation with different degrees of success. Notably, the 2-dimensional freeform model allows users to insert and manipulate exponents with increased efficiency and reduced cognitive load and working memory interference while the 1-dimensional structure based model allows for handling of the fraction structure with greater efficiency and decreased cognitive load.
Author Keywords: mathematical cognition, mathematical software, user experience, working memory
Stability Properties of Disease Models under Economic Expectations
Comprehending the dynamics of infectious diseases is very important in formulating public health policies to tackling their prevalence. Mathematical epidemiology (ME) has played a very vital role in achieving the above. Nevertheless, classical mathematical epidemiological models do not explicitly model the behavioural responses of individuals in the presence of prevalence of these diseases. Economic epidemiology (EE) as a field has stepped in to fill this gap by integrating economic and mathematical concepts within one framework. This thesis investigated two issues in this area. The methods employed are the standard linear analysis of stability of dynamical systems and numerical simulation. Below are the investigations and the findings of this thesis:
Firstly, an investigation into the stability properties of the equilibria of EE
models is carried out. We investigated the stability properties of modified EE systems studied by Aadland et al. [6] by introducing a parametric quadratic utility function into the model, thus making it possible to model the maximum number of contacts made by rational individuals to be determined by a parameter. This parameter in particular influences the level of utility of rational individuals. We have shown that if rational individuals have a range of possible contacts to choose from, with the maximum of the number of contacts allowable for these individuals being dependent on a parameter, the variation in this parameter tends to affect the stability properties of the system. We also showed that under the assumption of permanent recovery for
disease coupled with individuals observing or not observing their immunity, death
and birth rates can affect the stability of the system. These parameters also have
effect on the dynamics of the EE SIS system.
Secondly, an EE model of syphilis infectivity among &ldquo men who have sex with men &rdquo (MSM) in detention centres is developed in an attempt at looking at the effect of behavioural responses on the disease dynamics among MSM. This was done by explicitly incorporating the interplay of the biology of the disease and the behaviour of the inmates. We investigated the stability properties of the system under rational expectations where we showed that: (1) Behavioural responses to the prevalence of
the disease affect the stability of the system. Therefore, public health policies have the tendency of putting the system on indeterminate paths if rational MSM have complete knowledge of the laws governing the motion of the disease states as well as a complete understanding on how others behave in the system when faced with risk-benefit trade-offs. (2) The prevalence of the disease in the long run is influenced by incentives that drive the utility of the MSM inmates. (3) The interplay between the dynamics of the biology of the disease and the behavioural responses of rational MSM tends to put the system at equilibrium quickly as compared to its counterpart (that is when the system is solely dependent on the biology of the disease) when subjected to small perturbation.
Author Keywords: economic and mathematical epidemiology models, explosive path, indeterminate-path stability, numerical solution, health gap, saddle-path stability, syphilis,
Prescription Drugs: From Paper to Database with Application to Air Pollution-Related Public Health Risk
Medication used to treat human illness is one of the greatest developments in human history. In Canada, prescription drugs have been developed and made available to treat a wide variety of illnesses, from infections to heart disease and so on. Records of prescription drug fulfillment at coarse Canadian geographic scales were obtained from Health Canada in order to track the use of these drugs by the Canadian population.
The obtained prescription drug fulfillment records were in a variety of inconsistent formats, including a large selection of years for which only paper tabular records were available (hard copies). In this work, we organize, digitize, proof and synthesize the full available data set of prescription drug records, from paper to final database. Extensive quality control was performed on the data before use. This data was then analyzed for temporal and spatial changes in prescription drug use across Canada from 1990-2013.
In addition, one of major research areas in environmental epidemiological studies is the study of population health risk associated with exposure to ambient air pollution. Prescription drugs can moderate public health risk, by reducing the drug user's physiological symptoms and preventing acute health effects (e.g., strokes, heart attacks, etc.). The cleaned prescription drug data was considered in the context of a common model to examine its influence on the association between air pollution exposure and various health outcomes. Since, prescription drug data were available only at the provincial level, a Bayesian hierarchical model was employed to include the prescription drugs as a covariate at regional level, which were then combined to estimate the association at national level. Although further investigations are required, the study results suggest that the prescription drugs influenced the air pollution related public health risk.
Author Keywords: Data, Error checking, Population health, Prescriptions
An Investigation of the Impact of Big Data on Bioinformatics Software
As the generation of genetic data accelerates, Big Data has an increasing impact on the way bioinformatics software is used. The experiments become larger and more complex than originally envisioned by software designers. One way to deal with this problem is to use parallel computing.
Using the program Structure as a case study, we investigate ways in which to counteract the challenges created by the growing datasets. We propose an OpenMP and an OpenMP-MPI hybrid parallelization of the MCMC steps, and analyse the performance in various scenarios.
The results indicate that the parallelizations produce significant speedups over the serial version in all scenarios tested. This allows for using the available hardware more efficiently, by adapting the program to the parallel architecture. This is important because not only does it reduce the time required to perform existing analyses, but it also opens the door to new analyses, which were previously impractical.
Author Keywords: Big Data, HPC, MCMC, parallelization, speedup, Structure
THE PROPENSITY TOWARD EXTREMIST MIND-SET AS PREDICTED BY PERSONALITY, MOTIVATION, AND SELF-CONSTRUAL
ABSTRACT
The Propensity Toward Extremist Mind-Set as Predicted
by Personality, Motivation, and Self-Construal
Nick Fauset
Multivariate regression analyses were used to determine the effects of Personality (Neuroticism, Extraversion, Openness, Agreeableness, and Conscientiousness), Motivation (External, Amotivation, Intrinsic, and Identified), and Self-Construal (Independent and Interdependent) on three domains of Extremist Mind-Set (Proviolence, Vile World, and Divine Power). Participants consisted of first year undergraduate students (209 females, 76 males) enrolled in Introductory Psychology (N=279) and/or Introductory Economics (N=7), whom participated for course credit. The Motivation measure was problematic for students to complete and this variable was dropped from the model due to missing data. Decreases in Neuroticism, Openness, Agreeableeness, and Interdependent were significantly correlated with increases in Proviolence. Decreases in Agreeableness were correlated with increases in Vile World. Decreases in Openness, and increases in Agreeableness and Interdependent were significantly correlated with increases in Divine Power. These observations provide an interesting perspective on the types of Canadian undergraduate students who are more likely to score highly on measures of Extremism.
Keywords: Militant Extremist Mental Mind-Set, Extremism, Personality, Five Factor Model, Motivation, Intrinsic, Extrinsic, Self-Construal, Independent, Interdependent
Author Keywords: Extremism, Militant Extremist Mental Mind-Set, Motivation, Personality, Self-Construal
An Investigation of Load Balancing in a Distributed Web Caching System
With the exponential growth of the Internet, performance is an issue as bandwidth is often limited. A scalable solution to reduce the amount of bandwidth required is Web caching. Web caching (especially at the proxy-level) has been shown to be quite successful at addressing this issue. However as the number and needs of the clients grow, it becomes infeasible and inefficient to have just a single Web cache. To address this concern, the Web caching system can be set up in a distributed manner, allowing multiple machines to work together to meet the needs of the clients. Furthermore, it is also possible that further efficiency could be achieved by balancing the workload across all the Web caches in the system. This thesis investigates the benefits of load balancing in a distributed Web caching environment in order to improve the response times and help reduce bandwidth.
Author Keywords: adaptive load sharing, Distributed systems, Load Balancing, Simulation, Web Caching
An Emprirical Investigation into the Relationship Between Education and Health
Health literature has long noted a positive correlation between health and levels of education. Two competing theories have been advanced to explain this phenomenon: (1) education "causes" health by allowing individuals to process complex information and act on it; and, (2) education and health are merely correlated through some third underlying characteristic.
Determining which of these two theories is correct is of importance to public policy. But that task is empirically difficult because, from the standard, static perspective, the theories are observationally equivalent.
We exploit a way in which the two theories have different implications regarding the sort of behaviour we should observe over time. We use smoking as a measure of health behaviour and find that smoking rates between "high" and "low" educated individuals expand when information is hard to process, and then contract as it becomes more easily processable. This approach is then repeated using physical activity as a measure of health-related behaviour to address limitations of the smoking model.
Our novel approach to estimating the differences in the behavioural responses to changes in the processability of health-related information, across education groups, provides strong evidence in support of the view that education and health are causally linked.
Author Keywords: applied statistics, education, health economics, public health, public policy, smoking