Hurley, Richard

Cloud Versus Bare Metal: A comparison of a high performance computing cluster running in a commercial cloud and on a traditional hardware cluster using OpenMP and OpenMPI

Type:
Names:
Creator (cre): Bilaniuk, Vicky, Thesis advisor (ths): McConnell, Sabine, Degree committee member (dgc): Hurley, Richard, Degree granting institution (dgg): Trent University
Abstract:

A comparison of two high performance computing clusters running on AWS and Sharcnet was done to determine which scenarios yield the best performance. Algorithm complexity ranged from O (n) to O (n3). Data sizes ranged from 195 KB to 2 GB. The Sharcnet hardware consisted of Intel E5-2683 and Intel E7-4850 processors with memory sizes ranging from 256 GB to 3072 GB. On AWS, C4.8xlarge instances were used, which run on Intel Xeon E5-2666 processors with 60 GB per instance. AWS was able to launch jobs immediately regardless of job size. The only limiting factors on AWS were algorithm complexity and memory usage, suggesting a memory bottleneck. Sharcnet had the best performance but could be hampered by the job scheduler. In conclusion, Sharcnet is best used when the algorithm is complex and has high memory usage. AWS is best used when immediate processing is required.

Author Keywords: AWS, cloud, HPC, parallelism, Sharcnet

2019

Development of a Cross-Platform Solution for Calculating Certified Emission Reduction Credits in Forestry Projects under the Kyoto Protocol of the UNFCCC

Type:
Names:
Creator (cre): McIntyre, Gregory, Thesis advisor (ths): Ponce-Hernandez, Raul, Thesis advisor (ths): Hurley, Richard, Degree committee member (dgc): Hircock, Brian, Degree granting institution (dgg): Trent University
Abstract:

This thesis presents an exploration of the requirements for and development of a software tool to calculate Certified Emission Reduction (CERs) credits for afforestation and reforestation projects conducted under the Clean Development Mechanism (CDM). We examine the relevant methodologies and tools to determine what is required to create a software package that can support a wide variety of projects involving a large variety of data and computations. During the requirements gathering, it was determined that the software package developed would need to support the ability to enter and edit equations at runtime. To create the software we used Java for the programming language, an H2 database to store our data, and an XML file to store our configuration settings. Through these choices, we can build a cross-platform software solution for the purpose outlined above. The end result is a versatile software tool through which users can create and customize projects to meet their unique needs as well as utilize the features provided to streamline the management of their CDM projects.

Author Keywords: Carbon Emissions, Climate Change, Forests, Java, UNFCCC, XML

2020

An Ethical Analysis of Bell's Targeted Ad Prorgram

Type:
Names:
Creator (cre): Rowe, Brendan, Thesis advisor (ths): Hickson, Michael, Thesis advisor (ths): Hurley, Richard, Degree granting institution (dgg): Trent University
Abstract:

Online behavioural advertising (OBA) is an advertising technique which relies on collected customer information and online activity to serve people with more relevant ads. On November 16th, 2013, Bell Canada launched their first OBA program via Bell Mobility: the Bell Targeted Ads Program, or BTAP. My thesis provides an ethical analysis of BTAP and shows that Bell undermined and violated customer privacy, stifled customer autonomy, and harmed customer identity. Relevant moral problems include typification, a disrespecting of customer autonomy, and identity commodification.

I show that BTAP was unethical by grounding my arguments within the moral framework of Information Ethics (IE). IE is an ethical system which focuses on the role of information in the ethical dilemmas. IE also justifies the self-constitutive theory of privacy (SCP) which argues that our information and privacy are entangled with our identities. This gives us strong reason to defend our privacy/identity within BTAP.

After making several arguments which demonstrate that BTAP was unethical, I will then turn my attention to showing how it is possible to rectify and mitigate many of BTAP's ethical problems by installing a two-stage opt-in (TSOI) which provides customers with a greater deal of autonomy, and the ability to remove themselves from BTAP.

Author Keywords: Bell Canada, Ethics, Identity, Online Behavioural Advertising, Privacy, Targeted Advertising

2017

Exploring the Scalability of Deep Learning on GPU Clusters

Type:
Names:
Creator (cre): Williams, Taylor Alan, Thesis advisor (ths): McConnell, Sabine, Degree committee member (dgc): Hurley, Richard, Degree granting institution (dgg): Trent University
Abstract:

In recent years, we have observed an unprecedented rise in popularity of AI-powered systems. They have become ubiquitous in modern life, being used by countless people every day. Many of these AI systems are powered, entirely or partially, by deep learning models. From language translation to image recognition, deep learning models are being used to build systems with unprecedented accuracy. The primary downside, is the significant time required to train the models. Fortunately, the time needed for training the models is reduced through the use of GPUs rather than CPUs. However, with model complexity ever increasing, training times even with GPUs are on the rise. One possible solution to ever-increasing training times is to use parallelization to enable the distributed training of models on GPU clusters. This thesis investigates how to utilise clusters of GPU-accelerated nodes to achieve the best scalability possible, thus minimising model training times.

Author Keywords: Compute Canada, Deep Learning, Distributed Computing, Horovod, Parallel Computing, TensorFlow

2019

Utilizing Class-Specific Thresholds Discovered by Outlier Detection

Type:
Names:
Creator (cre): Branch, Richard Arthur Conan, Thesis advisor (ths): McConnell, Sabine, Thesis advisor (ths): Hurley, Richard, Degree granting institution (dgg): Trent University
Abstract:

We investigated if the performance of selected supervised machine-learning techniques could be improved by combining univariate outlier-detection techniques and machine-learning methods. We developed a framework to discover class-specific thresholds in class probability estimates using univariate outlier detection and proposed two novel techniques to utilize these class-specific thresholds. These proposed techniques were applied to various data sets and the results were evaluated. Our experimental results suggest that some of our techniques may improve recall in the base learner. Additional results suggest that one technique may produce higher accuracy and precision than AdaBoost.M1, while another may produce higher recall. Finally, our results suggest that we can achieve higher accuracy, precision, or recall when AdaBoost.M1 fails to produce higher metric values than the base learner.

Author Keywords: AdaBoost, Boosting, Classification, Class-Specific Thresholds, Machine Learning, Outliers

2016

Machine Learning Using Topology Signatures For Associative Memory

Type:
Names:
Creator (cre): Florez, Elkin Dario, Thesis advisor (ths): McConnell, Sabine, Thesis advisor (ths): Hurley, Richard, Degree granting institution (dgg): Trent University
Abstract:

This thesis presents a technique to produce signatures from topologies generated by the Growing Neural Gas algorithm. The generated signatures have the following characteristics: The signature's memory footprint is smaller than the "real object" and it represents a point in the n x m multidimensional space. Signatures can be compared based on Euclidean distance and distances between signatures provide measurements of differences between models. Signatures can be associated with a concept and then be used as a learning step for a classification algorithm. The signatures are normalized and vectorized to be used in a multidimensional space clustering. Although the technique is generic in essence, it was tested by classifying alphabet and numerical handwritten characters and 2D figures obtaining a good accuracy and precision. It can be used for many other purposes related to shapes and abstract typologies classification and associative memory. Future work could incorporate other classifiers.

Author Keywords: Associative memory, Character recognition, Machine learning, Neural gas, Topological signatures, Unsupervised learning

2015

Self-Organizing Maps and Galaxy Evolution

Type:
Names:
Creator (cre): Beland, Jacques Alain Gerard, Thesis advisor (ths): McConnell, Sabine, Thesis advisor (ths): Irwin, Judith, Degree committee member (dgc): Abdella, Kenzu, Degree committee member (dgc): Hurley, Richard, Degree committee member (dgc): Bauer, Michael, Degree granting institution (dgg): Trent University
Abstract:

Artificial Neural Networks (ANN) have been applied to many areas of research. These techniques use a series of object attributes and can be trained to recognize different classes of objects. The Self-Organizing Map (SOM) is an unsupervised machine learning technique which has been shown to be successful in the mapping of high-dimensional data into a 2D representation referred to as a map. These maps are easier to interpret and aid in the classification of data. In this work, the existing algorithms for the SOM have been extended to generate 3D maps. The higher dimensionality of the map provides for more information to be made available to the interpretation of classifications. The effectiveness of the implementation was verified using three separate standard datasets. Results from these investigations supported the expectation that a 3D SOM would result in a more effective classifier.

The 3D SOM algorithm was then applied to an analysis of galaxy morphology classifications. It is postulated that the morphology of a galaxy relates directly to how it will evolve over time. In this work, the Spectral Energy Distribution (SED) will be used as a source for galaxy attributes. The SED data was extracted from the NASA Extragalactic Database (NED). The data was grouped into sample sets of matching frequencies and the 3D SOM application was applied as a morphological classifier. It was shown that the SOMs created were effective as an unsupervised machine learning technique to classify galaxies based solely on their SED. Morphological predictions for a number of galaxies were shown to be in agreement with classifications obtained from new observations in NED.

Author Keywords: Galaxy Morphology, Multi-wavelength, parallel, Self-Organizing Maps

2015