-
A Technical Note on the Architectural Effects on Maximum Dependency Lengths of Recurrent Neural Networks
Authors:
Jonathan S. Kent,
Michael M. Murray
Abstract:
This work proposes a methodology for determining the maximum dependency length of a recurrent neural network (RNN), and then studies the effects of architectural changes, including the number and neuron count of layers, on the maximum dependency lengths of traditional RNN, gated recurrent unit (GRU), and long-short term memory (LSTM) models.
This work proposes a methodology for determining the maximum dependency length of a recurrent neural network (RNN), and then studies the effects of architectural changes, including the number and neuron count of layers, on the maximum dependency lengths of traditional RNN, gated recurrent unit (GRU), and long-short term memory (LSTM) models.
△ Less
Submitted 19 July, 2024;
originally announced August 2024.
-
Asymmetry Analysis of Bilateral Shapes
Authors:
Kanti V. Mardia,
Xiangyu Wu,
John T. Kent,
Colin R. Goodall,
Balvinder S. Khambay
Abstract:
Many biological objects possess bilateral symmetry about a midline or midplane, up to a ``noise'' term. This paper uses landmark-based methods to measure departures from bilateral symmetry, especially for the two-group problem where one group is more asymmetric than the other. In this paper, we formulate our work in the framework of size-and-shape analysis including registration via rigid body mot…
▽ More
Many biological objects possess bilateral symmetry about a midline or midplane, up to a ``noise'' term. This paper uses landmark-based methods to measure departures from bilateral symmetry, especially for the two-group problem where one group is more asymmetric than the other. In this paper, we formulate our work in the framework of size-and-shape analysis including registration via rigid body motion. Our starting point is a vector of elementary asymmetry features defined at the individual landmark coordinates for each object. We introduce two approaches for testing. In the first, the elementary features are combined into a scalar composite asymmetry measure for each object. Then standard univariate tests can be used to compare the two groups. In the second approach, a univariate test statistic is constructed for each elementary feature. The maximum of these statistics lead to an overall test statistic to compare the two groups and we then provide a technique to extract the important features from the landmark data. Our methodology is illustrated on a pre-registered smile dataset collected to assess the success of cleft lip surgery on human subjects. The asymmetry in a group of cleft lip subjects is compared to a group of normal subjects, and statistically significant differences have been found by univariate tests in the first approach. Further, our feature extraction method leads to an anatomically plausible set of landmarks for medical applications.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
SWIFT: A Monotonic, Flux-Form Semi-Lagrangian Tracer Transport Scheme for Flow with Large Courant Numbers
Authors:
Thomas M. Bendall,
James Kent
Abstract:
Local conservation of mass and entropy are becoming increasingly desirable properties for modern numerical weather and climate models. This work presents a Flux-Form Semi-Lagrangian (FFSL) transport scheme, called SWIFT, that facilitates this conservation for tracer variables, whilst maintaining other vital properties such as preservation of a constant, monotonicity and positivity. Importantly, th…
▽ More
Local conservation of mass and entropy are becoming increasingly desirable properties for modern numerical weather and climate models. This work presents a Flux-Form Semi-Lagrangian (FFSL) transport scheme, called SWIFT, that facilitates this conservation for tracer variables, whilst maintaining other vital properties such as preservation of a constant, monotonicity and positivity. Importantly, these properties all hold for large Courant numbers and multi-dimensional flow, making the scheme appropriate for use within a dynamical core which takes large time steps.
The SWIFT scheme presented here can be seen as an evolution of the FFSL methods of Leonard et al and Lin and Rood. Two-dimensional and three-dimensional schemes consist of a splitting into a sequence of one-dimensional calculations. The new SWIFT splitting presented here allows monotonic and positivity properties from the one-dimensional calculations to be inherited by the multi-dimensional scheme. These one-dimensional calculations involve separating the mass flux into terms that correspond to integer and fractional parts of the Courant number. Key to achieving conservation is coupling the transport of tracers to the transport of the fluid density, through re-use of the discrete mass flux that was calculated from the fluid density in the transport of the tracers. This work also describes how these properties can still be attained when the tracer is vertically-staggered from the density in a Charney-Phillips grid.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
A mixed finite-element, finite-volume, semi-implicit discretisation for atmospheric dynamics: Spherical geometry
Authors:
Thomas Melvin,
Ben Shipway,
Nigel Wood,
Tommaso Benacchio,
Thomas Bendall,
Ian Boutle,
Alex Brown,
Christine Johnson,
James Kent,
Stephen Pring,
Chris Smith,
Mohamed Zerroukat,
Colin Cotter,
John Thuburn
Abstract:
The reformulation of the Met Office's dynamical core for weather and climate prediction previously described by the authors is extended to spherical domains using a cubed-sphere mesh. This paper updates the semi-implicit mixed finite-element formulation to be suitable for spherical domains. In particular the finite-volume transport scheme is extended to take account of non-uniform, non-orthogonal…
▽ More
The reformulation of the Met Office's dynamical core for weather and climate prediction previously described by the authors is extended to spherical domains using a cubed-sphere mesh. This paper updates the semi-implicit mixed finite-element formulation to be suitable for spherical domains. In particular the finite-volume transport scheme is extended to take account of non-uniform, non-orthogonal meshes and uses an advective-then-flux formulation so that increment from the transport scheme is linear in the divergence. The resulting model is then applied to a standard set of dry dynamical core tests and compared to the existing semi-implicit semi-Lagrangian dynamical core currently used in the Met Office's operational model.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Multiplexed digital holography for fluid surface profilometry
Authors:
August Geelmuyden,
Vitor S. Barroso,
Sreelekshmi C. Ajithkumar,
Anthony J. Kent,
Silke Weinfurtner
Abstract:
Digital holography (DH) has been widely used for imaging and characterization of micro and nanostructures in materials science and biology and has the potential to provide high-resolution, non-destructive measurements of fluid surfaces as well. Digital holographic setups capture the complex wavefronts of light scattered by an object or reflected from a surface, allowing for quantitative measuremen…
▽ More
Digital holography (DH) has been widely used for imaging and characterization of micro and nanostructures in materials science and biology and has the potential to provide high-resolution, non-destructive measurements of fluid surfaces as well. Digital holographic setups capture the complex wavefronts of light scattered by an object or reflected from a surface, allowing for quantitative measurements of their shape and deformation. However, their use in fluid profilometry is scarce and has not been explored in much depth. We present an alternative usage for a DH setup that can measure and monitor the surface of fluid samples. Based on DH reflectometry, our modelling shows that multiple reflections from the sample and the reference interfere and generate multiple holograms of the sample, resulting in a multiplexed image of the wavefront. The individual interferograms can be isolated in the spatial-frequency domain, and the fluid surface can be digitally reconstructed from them. We further show that this setup can be used to track changes in the surface of a fluid over time, such as during the formation and propagation of waves or evaporation of surface layers.
△ Less
Submitted 24 May, 2023;
originally announced June 2023.
-
Simulations of idealised 3D atmospheric flows on terrestrial planets using LFRic-Atmosphere
Authors:
Denis E. Sergeev,
Nathan J. Mayne,
Thomas Bendall,
Ian A. Boutle,
Alex Brown,
Iva Kavcic,
James Kent,
Krisztian Kohary,
James Manners,
Thomas Melvin,
Enrico Olivier,
Lokesh K. Ragta,
Ben J. Shipway,
Jon Wakelin,
Nigel Wood,
Mohamed Zerroukat
Abstract:
We demonstrate that LFRic-Atmosphere, a model built using the Met Office's GungHo dynamical core, is able to reproduce idealised large-scale atmospheric circulation patterns specified by several widely-used benchmark recipes. This is motivated by the rapid rate of exoplanet discovery and the ever-growing need for numerical modelling and characterisation of their atmospheres. Here we present LFRic-…
▽ More
We demonstrate that LFRic-Atmosphere, a model built using the Met Office's GungHo dynamical core, is able to reproduce idealised large-scale atmospheric circulation patterns specified by several widely-used benchmark recipes. This is motivated by the rapid rate of exoplanet discovery and the ever-growing need for numerical modelling and characterisation of their atmospheres. Here we present LFRic-Atmosphere's results for the idealised tests imitating circulation regimes commonly used in the exoplanet modelling community. The benchmarks include three analytic forcing cases: the standard Held-Suarez test, the Menou-Rauscher Earth-like test, and the Merlis-Schneider Tidally Locked Earth test. Qualitatively, LFRic-Atmosphere agrees well with other numerical models and shows excellent conservation properties in terms of total mass, angular momentum and kinetic energy. We then use LFRic-Atmosphere with a more realistic representation of physical processes (radiation, subgrid-scale mixing, convection, clouds) by configuring it for the four TRAPPIST-1 Habitable Atmosphere Intercomparison (THAI) scenarios. This is the first application of LFRic-Atmosphere to a possible climate of a confirmed terrestrial exoplanet. LFRic-Atmosphere reproduces the THAI scenarios within the spread of the existing models across a range of key climatic variables. Our work shows that LFRic-Atmosphere performs well in the seven benchmark tests for terrestrial atmospheres, justifying its use in future exoplanet climate studies.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Robust score matching for compositional data
Authors:
Janice L. Scealy,
Kassel L. Hingee,
John T. Kent,
Andrew T. A. Wood
Abstract:
The restricted polynomially-tilted pairwise interaction (RPPI) distribution gives a flexible model for compositional data. It is particularly well-suited to situations where some of the marginal distributions of the components of a composition are concentrated near zero, possibly with right skewness. This article develops a method of tractable robust estimation for the model by combining two ideas…
▽ More
The restricted polynomially-tilted pairwise interaction (RPPI) distribution gives a flexible model for compositional data. It is particularly well-suited to situations where some of the marginal distributions of the components of a composition are concentrated near zero, possibly with right skewness. This article develops a method of tractable robust estimation for the model by combining two ideas. The first idea is to use score matching estimation after an additive log-ratio transformation. The resulting estimator is automatically insensitive to zeros in the data compositions. The second idea is to incorporate suitable weights in the estimating equations. The resulting estimator is additionally resistant to outliers. These properties are confirmed in simulation studies where we further also demonstrate that our new outlier-robust estimator is efficient in high concentration settings, even in the case when there is no model contamination. An example is given using microbiome data. A user-friendly R package accompanies the article.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
Designing with Non-Finite Output Dimension via Fourier Coefficients of Neural Waveforms
Authors:
Jonathan S. Kent
Abstract:
Ordinary Deep Learning models require having the dimension of their outputs determined by a human practitioner prior to training and operation. For design tasks, this places a hard limit on the maximum complexity of any designs produced by a neural network, which is disadvantageous if a greater allowance for complexity would result in better designs. In this paper, we introduce a methodology for t…
▽ More
Ordinary Deep Learning models require having the dimension of their outputs determined by a human practitioner prior to training and operation. For design tasks, this places a hard limit on the maximum complexity of any designs produced by a neural network, which is disadvantageous if a greater allowance for complexity would result in better designs. In this paper, we introduce a methodology for taking outputs of non-finite dimension from neural networks, by learning a "neural waveform," and then taking as outputs the coefficients of its Fourier series representation. We then present experimental evidence that neural networks can learn in this setting on a toy problem.
△ Less
Submitted 17 August, 2022;
originally announced December 2022.
-
Chaos Theory and Adversarial Robustness
Authors:
Jonathan S. Kent
Abstract:
Neural networks, being susceptible to adversarial attacks, should face a strict level of scrutiny before being deployed in critical or adversarial applications. This paper uses ideas from Chaos Theory to explain, analyze, and quantify the degree to which neural networks are susceptible to or robust against adversarial attacks. To this end, we present a new metric, the "susceptibility ratio," given…
▽ More
Neural networks, being susceptible to adversarial attacks, should face a strict level of scrutiny before being deployed in critical or adversarial applications. This paper uses ideas from Chaos Theory to explain, analyze, and quantify the degree to which neural networks are susceptible to or robust against adversarial attacks. To this end, we present a new metric, the "susceptibility ratio," given by $\hat Ψ(h, θ)$, which captures how greatly a model's output will be changed by perturbations to a given input.
Our results show that susceptibility to attack grows significantly with the depth of the model, which has safety implications for the design of neural networks for production environments. We provide experimental evidence of the relationship between $\hat Ψ$ and the post-attack accuracy of classification models, as well as a discussion of its application to tasks lacking hard decision boundaries. We also demonstrate how to quickly and easily approximate the certified robustness radii for extremely large models, which until now has been computationally infeasible to calculate directly.
△ Less
Submitted 5 July, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Indecision Trees: Learning Argument-Based Reasoning under Quantified Uncertainty
Authors:
Jonathan S. Kent,
David H. Menager
Abstract:
Using Machine Learning systems in the real world can often be problematic, with inexplicable black-box models, the assumed certainty of imperfect measurements, or providing a single classification instead of a probability distribution.
This paper introduces Indecision Trees, a modification to Decision Trees which learn under uncertainty, can perform inference under uncertainty, provide a robust…
▽ More
Using Machine Learning systems in the real world can often be problematic, with inexplicable black-box models, the assumed certainty of imperfect measurements, or providing a single classification instead of a probability distribution.
This paper introduces Indecision Trees, a modification to Decision Trees which learn under uncertainty, can perform inference under uncertainty, provide a robust distribution over the possible labels, and can be disassembled into a set of logical arguments for use in other reasoning systems.
△ Less
Submitted 8 July, 2023; v1 submitted 23 June, 2022;
originally announced June 2022.
-
An Exploration of Active Learning for Affective Digital Phenotyping
Authors:
Peter Washington,
Cezmi Mutlu,
Aaron Kline,
Cathy Hou,
Kaitlyn Dunlap,
Jack Kent,
Arman Husic,
Nate Stockham,
Brianna Chrisman,
Kelley Paskov,
Jae-Yoon Jung,
Dennis P. Wall
Abstract:
Some of the most severe bottlenecks preventing widespread development of machine learning models for human behavior include a dearth of labeled training data and difficulty of acquiring high quality labels. Active learning is a paradigm for using algorithms to computationally select a useful subset of data points to label using metrics for model uncertainty and data similarity. We explore active l…
▽ More
Some of the most severe bottlenecks preventing widespread development of machine learning models for human behavior include a dearth of labeled training data and difficulty of acquiring high quality labels. Active learning is a paradigm for using algorithms to computationally select a useful subset of data points to label using metrics for model uncertainty and data similarity. We explore active learning for naturalistic computer vision emotion data, a particularly heterogeneous and complex data space due to inherently subjective labels. Using frames collected from gameplay acquired from a therapeutic smartphone game for children with autism, we run a simulation of active learning using gameplay prompts as metadata to aid in the active learning process. We find that active learning using information generated during gameplay slightly outperforms random selection of the same number of labeled frames. We next investigate a method to conduct active learning with subjective data, such as in affective computing, and where multiple crowdsourced labels can be acquired for each image. Using the Child Affective Facial Expression (CAFE) dataset, we simulate an active learning process for crowdsourcing many labels and find that prioritizing frames using the entropy of the crowdsourced label distribution results in lower categorical cross-entropy loss compared to random frame selection. Collectively, these results demonstrate pilot evaluations of two novel active learning approaches for subjective affective data collected in noisy settings.
△ Less
Submitted 6 April, 2022; v1 submitted 4 April, 2022;
originally announced April 2022.
-
Directional distributions and the half-angle principle
Authors:
John T. Kent
Abstract:
Angle halving, or alternatively the reverse operation of angle doubling, is a useful tool when studying directional distributions. It is especially useful on the circle where, in particular, it yields an identification between the wrapped Cauchy distribution and the angular central Gaussian distributions, as well as a matching of their parameterizations. The operation of angle halving can be exten…
▽ More
Angle halving, or alternatively the reverse operation of angle doubling, is a useful tool when studying directional distributions. It is especially useful on the circle where, in particular, it yields an identification between the wrapped Cauchy distribution and the angular central Gaussian distributions, as well as a matching of their parameterizations. The operation of angle halving can be extended to higher dimensions, but its effect on distributions is more complicated than on the circle. In all dimensions angle halving provides a simple way to interpret stereographic projection from the sphere to Euclidean space.
△ Less
Submitted 24 February, 2022; v1 submitted 14 February, 2022;
originally announced February 2022.
-
The Arcanum Mission: Scientific Objectives and Instruments for Neptune, Triton and KBOs
Authors:
James McKevitt,
Christina Bornberg,
Tom Dixon,
Louis Ayin-Walsh,
Jonathan Parkinson-Swift,
James Morgan,
Shayne Beegadhur,
Franco Criscola,
Carina Heinreichsberger,
Bharath Simha Reddy Pappula,
Sophie Bulla,
Kuren Patel,
Aryan Laad,
Ethan Forder,
Jaspreet Singh,
Oisín Moore,
Madalin Foghis,
Paul Wedde,
Thomas Mcdougall,
Jack Kent,
Utkarsh Raj
Abstract:
The Arcanum mission is a proposed L-class spacecraft that highlights the revolutionary approach which can now be taken to future space mission design. Using the case of the SpaceX Starship vehicle and in particular the high mass and volume characteristics of this launcher, the feasible large size of future missions, even with high delta-V transfer requirements, are analysed. A demonstrator vehicle…
▽ More
The Arcanum mission is a proposed L-class spacecraft that highlights the revolutionary approach which can now be taken to future space mission design. Using the case of the SpaceX Starship vehicle and in particular the high mass and volume characteristics of this launcher, the feasible large size of future missions, even with high delta-V transfer requirements, are analysed. A demonstrator vehicle, designed to support a large and capable science platform with multiple components, is detailed, clearly showing the range and depth of science goals that will be answerable thanks to the current revolution in super heavy-lift launch vehicles.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
DOODLER: Determining Out-Of-Distribution Likelihood from Encoder Reconstructions
Authors:
Jonathan S. Kent,
Bo Li
Abstract:
Deep Learning models possess two key traits that, in combination, make their use in the real world a risky prospect. One, they do not typically generalize well outside of the distribution for which they were trained, and two, they tend to exhibit confident behavior regardless of whether or not they are producing meaningful outputs. While Deep Learning possesses immense power to solve realistic, hi…
▽ More
Deep Learning models possess two key traits that, in combination, make their use in the real world a risky prospect. One, they do not typically generalize well outside of the distribution for which they were trained, and two, they tend to exhibit confident behavior regardless of whether or not they are producing meaningful outputs. While Deep Learning possesses immense power to solve realistic, high-dimensional problems, these traits in concert make it difficult to have confidence in their real-world applications. To overcome this difficulty, the task of Out-Of-Distribution (OOD) Detection has been defined, to determine when a model has received an input from outside of the distribution for which it is trained to operate.
This paper introduces and examines a novel methodology, DOODLER, for OOD Detection, which directly leverages the traits which result in its necessity. By training a Variational Auto-Encoder (VAE) on the same data as another Deep Learning model, the VAE learns to accurately reconstruct In-Distribution (ID) inputs, but not to reconstruct OOD inputs, meaning that its failure state can be used to perform OOD Detection. Unlike other work in the area, DOODLER requires only very weak assumptions about the existence of an OOD dataset, allowing for more realistic application. DOODLER also enables pixel-wise segmentations of input images by OOD likelihood, and experimental results show that it matches or outperforms methodologies that operate under the same constraints.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Unsupervised Learning for Target Tracking and Background Subtraction in Satellite Imagery
Authors:
Jonathan S. Kent,
Charles C. Wamsley,
Davin Flateau,
Amber Ferguson
Abstract:
This paper describes an unsupervised machine learning methodology capable of target tracking and background suppression via a novel dual-model approach. ``Jekyll`` produces a video bit-mask describing an estimate of the locations of moving objects, and ``Hyde`` outputs a pseudo-background frame to subtract from the original input image sequence. These models were trained with a custom-modified ver…
▽ More
This paper describes an unsupervised machine learning methodology capable of target tracking and background suppression via a novel dual-model approach. ``Jekyll`` produces a video bit-mask describing an estimate of the locations of moving objects, and ``Hyde`` outputs a pseudo-background frame to subtract from the original input image sequence. These models were trained with a custom-modified version of Cross Entropy Loss.
Simulated data were used to compare the performance of Jekyll and Hyde against a more traditional supervised Machine Learning approach. The results from these comparisons show that the unsupervised methods developed are competitive in output quality with supervised techniques, without the associated cost of acquiring labeled training data.
△ Less
Submitted 13 August, 2021;
originally announced September 2021.
-
Imaging swiFTly: streaming widefield Fourier Transforms for large-scale interferometry
Authors:
Peter Wortmann,
James Kent,
Bojan Nikolic
Abstract:
We describe a scalable distributed imaging algorithm framework for next-generation radio telescopes, managing the Fourier transform from apertures to sky (or vice versa) with a focus on minimising memory load, data transfers, and computation. Our algorithm uses smooth window functions to isolate the influence between specific regions of spatial-frequency and image space. This allows the distributi…
▽ More
We describe a scalable distributed imaging algorithm framework for next-generation radio telescopes, managing the Fourier transform from apertures to sky (or vice versa) with a focus on minimising memory load, data transfers, and computation. Our algorithm uses smooth window functions to isolate the influence between specific regions of spatial-frequency and image space. This allows the distribution of image data between nodes and the construction of segments of frequency space exactly when and where needed.
The developed prototype distributes terabytes of image data across many nodes, while generating visibilities at throughput and accuracy competitive with existing software. Scaling is demonstrated to be better than cubic in problem complexity (for baseline length and field of view), reducing the risk involved in growing radio astronomy processing to large telescopes like the Square Kilometre Array.
△ Less
Submitted 27 May, 2024; v1 submitted 24 August, 2021;
originally announced August 2021.
-
An L-class Multirole Observatory and Science Platform for Neptune
Authors:
James McKevitt,
Sophie Bulla,
Tom Dixon,
Franco Criscola,
Jonathan Parkinson-Swift,
Christina Bornberg,
Jaspreet Singh,
Kuren Patel,
Aryan Laad,
Ethan Forder,
Louis Ayin-Walsh,
Shayne Beegadhur,
Paul Wedde,
Bharath Simha Reddy Pappula,
Thomas McDougall,
Madalin Foghis,
Jack Kent,
James Morgan,
Utkarsh Raj,
Carina Heinreichsberger
Abstract:
A coming resurgence of super heavy-lift launch vehicles has precipitated an immense interest in the future of crewed spaceflight and even future colonisation efforts. While it is true that a bright future awaits this sector, driven by commercial ventures and the reignited interest of old space-faring nations, and the joining of new ones, little of this attention has been reserved for the science-c…
▽ More
A coming resurgence of super heavy-lift launch vehicles has precipitated an immense interest in the future of crewed spaceflight and even future colonisation efforts. While it is true that a bright future awaits this sector, driven by commercial ventures and the reignited interest of old space-faring nations, and the joining of new ones, little of this attention has been reserved for the science-centric applications of these launchers. The Arcanum mission is a proposal to use these vehicles to deliver an L-class observatory into a highly eccentric orbit around Neptune, with a wide-ranging suite of science goals and instrumentation tackling Solar System science, planetary science, Kuiper Belt Objects and exoplanet systems.
△ Less
Submitted 17 June, 2021;
originally announced June 2021.
-
Vector Symbolic Architectures as a Computing Framework for Emerging Hardware
Authors:
Denis Kleyko,
Mike Davies,
E. Paxon Frady,
Pentti Kanerva,
Spencer J. Kent,
Bruno A. Olshausen,
Evgeny Osipov,
Jan M. Rabaey,
Dmitri A. Rachkovskij,
Abbas Rahimi,
Friedrich T. Sommer
Abstract:
This article reviews recent progress in the development of the computing framework vector symbolic architectures (VSA) (also known as hyperdimensional computing). This framework is well suited for implementation in stochastic, emerging hardware, and it naturally expresses the types of cognitive operations required for artificial intelligence (AI). We demonstrate in this article that the field-like…
▽ More
This article reviews recent progress in the development of the computing framework vector symbolic architectures (VSA) (also known as hyperdimensional computing). This framework is well suited for implementation in stochastic, emerging hardware, and it naturally expresses the types of cognitive operations required for artificial intelligence (AI). We demonstrate in this article that the field-like algebraic structure of VSA offers simple but powerful operations on high-dimensional vectors that can support all data structures and manipulations relevant to modern computing. In addition, we illustrate the distinguishing feature of VSA, "computing in superposition," which sets it apart from conventional computing. It also opens the door to efficient solutions to the difficult combinatorial search problems inherent in AI applications. We sketch ways of demonstrating that VSA are computationally universal. We see them acting as a framework for computing with distributed representations that can play a role of an abstraction layer for emerging computing hardware. This article serves as a reference for computer architects by illustrating the philosophy behind VSA, techniques of distributed computing with them, and their relevance to emerging computing hardware, such as neuromorphic computing.
△ Less
Submitted 20 July, 2023; v1 submitted 9 June, 2021;
originally announced June 2021.
-
Mixture models for spherical data with applications to protein bioinformatics
Authors:
Kanti V. Mardia,
Stuart Barber,
Philippa M. Burdett,
John T. Kent,
Thomas Hamelryck
Abstract:
Finite mixture models are fitted to spherical data. Kent distributions are used for the components of the mixture because they allow considerable flexibility. Previous work on such mixtures has used an approximate maximum likelihood estimator for the parameters of a single component. However, the approximation causes problems when using the EM algorithm to estimate the parameters in a mixture mode…
▽ More
Finite mixture models are fitted to spherical data. Kent distributions are used for the components of the mixture because they allow considerable flexibility. Previous work on such mixtures has used an approximate maximum likelihood estimator for the parameters of a single component. However, the approximation causes problems when using the EM algorithm to estimate the parameters in a mixture model. Hence the exact maximum likelihood estimator is used here for the individual components. This paper is motivated by a challenging prize problem in structural bioinformatics of how proteins fold. It is known that hydrogen bonds play a key role in the folding of a protein. We explore this hydrogen bond geometry using a data set describing bonds between two amino acids in proteins. An appropriate coordinate system to represent the hydrogen bond geometry is proposed, with each bond represented as a point on a sphere. We fit mixtures of Kent distributions to different subsets of the hydrogen bond data to gain insight into how the secondary structure elements bond together, since the distribution of hydrogen bonds depends on which secondary structure elements are involved.
△ Less
Submitted 27 April, 2021;
originally announced April 2021.
-
Improved Digital Therapy for Developmental Pediatrics Using Domain-Specific Artificial Intelligence: Machine Learning Study
Authors:
Peter Washington,
Haik Kalantarian,
John Kent,
Arman Husic,
Aaron Kline,
Emilie Leblanc,
Cathy Hou,
Onur Cezmi Mutlu,
Kaitlyn Dunlap,
Yordan Penev,
Maya Varma,
Nate Tyler Stockham,
Brianna Chrisman,
Kelley Paskov,
Min Woo Sun,
Jae-Yoon Jung,
Catalin Voss,
Nick Haber,
Dennis Paul Wall
Abstract:
Background: Automated emotion classification could aid those who struggle to recognize emotions, including children with developmental behavioral conditions such as autism. However, most computer vision emotion recognition models are trained on adult emotion and therefore underperform when applied to child faces. Objective: We designed a strategy to gamify the collection and labeling of child emot…
▽ More
Background: Automated emotion classification could aid those who struggle to recognize emotions, including children with developmental behavioral conditions such as autism. However, most computer vision emotion recognition models are trained on adult emotion and therefore underperform when applied to child faces. Objective: We designed a strategy to gamify the collection and labeling of child emotion-enriched images to boost the performance of automatic child emotion recognition models to a level closer to what will be needed for digital health care approaches. Methods: We leveraged our prototype therapeutic smartphone game, GuessWhat, which was designed in large part for children with developmental and behavioral conditions, to gamify the secure collection of video data of children expressing a variety of emotions prompted by the game. Independently, we created a secure web interface to gamify the human labeling effort, called HollywoodSquares, tailored for use by any qualified labeler. We gathered and labeled 2155 videos, 39,968 emotion frames, and 106,001 labels on all images. With this drastically expanded pediatric emotion-centric database (>30 times larger than existing public pediatric emotion data sets), we trained a convolutional neural network (CNN) computer vision classifier of happy, sad, surprised, fearful, angry, disgust, and neutral expressions evoked by children. Results: The classifier achieved a 66.9% balanced accuracy and 67.4% F1-score on the entirety of the Child Affective Facial Expression (CAFE) as well as a 79.1% balanced accuracy and 78% F1-score on CAFE Subset A, a subset containing at least 60% human agreement on emotions labels. This performance is at least 10% higher than all previously developed classifiers evaluated against CAFE, the best of which reached a 56% balanced accuracy even when combining "anger" and "disgust" into a single class.
△ Less
Submitted 3 June, 2024; v1 submitted 15 December, 2020;
originally announced December 2020.
-
Irregular Metronomes as Assistive Devices to Promote Healthy Gait Patterns
Authors:
Aaron D. Likens,
Spyridon Mastorakis,
Andreas Skiadopoulos,
Jenny A. Kent,
Md Washik Al Azad,
Nick Stergiou
Abstract:
Older adults and people suffering from neurodegenerative disease often experience difficulty controlling gait during locomotion, ultimately increasing their risk of falling. To combat these effects, researchers and clinicians have used metronomes as assistive devices to improve movement timing in hopes of reducing their risk of falling. Historically, researchers in this area have relied on metrono…
▽ More
Older adults and people suffering from neurodegenerative disease often experience difficulty controlling gait during locomotion, ultimately increasing their risk of falling. To combat these effects, researchers and clinicians have used metronomes as assistive devices to improve movement timing in hopes of reducing their risk of falling. Historically, researchers in this area have relied on metronomes with isochronous interbeat intervals, which may be problematic because normal healthy gait varies considerably from one step to the next. More recently, researchers have advocated the use of irregular metronomes embedded with statistical properties found in healthy populations. In this paper, we explore the effect of both regular and irregular metronomes on many statistical properties of interstride intervals. Furthermore, we investigate how these properties react to mechanical perturbation in the form of a halted treadmill belt while walking. Our results demonstrate that metronomes that are either isochronous or random metronome break down the inherent structure of healthy gait. Metronomes with statistical properties similar to healthy gait seem to preserve those properties, despite a strong mechanical perturbation. We discuss the future development of this work in the context of networked augmented reality metronome devices.
△ Less
Submitted 24 November, 2020;
originally announced December 2020.
-
Impact of LHC vector boson production in heavy ion collisions on strange PDFs
Authors:
A. Kusina,
T. Ježo,
D. B. Clark,
P. Duwentäster,
E. Godat,
T. J. Hobbs,
J. Kent,
M. Klasen,
K. Kovařík,
F. Lyonnet,
K. F. Muzakka,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
The extraction of the strange quark parton distribution function (PDF) poses a long-standing puzzle. Measurements from neutrino-nucleus deep inelastic scattering (DIS) experiments suggest the strange quark is suppressed compared to the light sea quarks, while recent studies of W/Z boson production at the LHC imply a larger strange component at small x values. As the parton flavor determination in…
▽ More
The extraction of the strange quark parton distribution function (PDF) poses a long-standing puzzle. Measurements from neutrino-nucleus deep inelastic scattering (DIS) experiments suggest the strange quark is suppressed compared to the light sea quarks, while recent studies of W/Z boson production at the LHC imply a larger strange component at small x values. As the parton flavor determination in the proton depends on nuclear corrections, e.g. from heavy-target DIS, LHC heavy ion measurements can provide a distinct perspective to help clarify this situation. In this investigation we extend the nCTEQ15 nPDFs to study the impact of the LHC proton-lead W/Z production data on both the flavor differentiation and nuclear corrections. This complementary data set provides new insights on both the LHC W/Z proton analyses and the neutrino-nucleus DIS data. We identify these new nPDFs as nCTEQ15WZ. Our calculations are performed using a new implementation of the nCTEQ code (nCTEQ++) based on C++ which enables us to easily interface to external programs such as HOPPET, APPLgrid and MCFM. Our results indicate that, as suggested by the proton data, the small x nuclear strange sea appears larger than previously expected, even when the normalization of the W/Z data is accommodated in the fit. Extending the nCTEQ15 analysis to include LHC W/Z data represents an important step as we advance toward the next generation of nPDFs.
△ Less
Submitted 17 July, 2020;
originally announced July 2020.
-
Detection of Cosmic Structures using the Bispectrum Phase. II. First Results from Application to Cosmic Reionization Using the Hydrogen Epoch of Reionization Array
Authors:
Nithyanandan Thyagarajan,
Chris L. Carilli,
Bojan Nikolic,
James Kent,
Andrei Mesinger,
Nicholas S. Kern,
Gianni Bernardi,
Siyanda Matika,
Zara Abdurashidova,
James E. Aguirre,
Paul Alexander,
Zaki S. Ali,
Yanga Balfour,
Adam P. Beardsley,
Tashalee S. Billings,
Judd D. Bowman,
Richard F. Bradley,
Jacob Burba,
Steve Carey,
Carina Cheng,
David R. DeBoer,
Matt Dexter,
Eloy de Lera Acedo,
Joshua S. Dillon,
John Ely
, et al. (47 additional authors not shown)
Abstract:
Characterizing the epoch of reionization (EoR) at $z\gtrsim 6$ via the redshifted 21 cm line of neutral Hydrogen (HI) is critical to modern astrophysics and cosmology, and thus a key science goal of many current and planned low-frequency radio telescopes. The primary challenge to detecting this signal is the overwhelmingly bright foreground emission at these frequencies, placing stringent requirem…
▽ More
Characterizing the epoch of reionization (EoR) at $z\gtrsim 6$ via the redshifted 21 cm line of neutral Hydrogen (HI) is critical to modern astrophysics and cosmology, and thus a key science goal of many current and planned low-frequency radio telescopes. The primary challenge to detecting this signal is the overwhelmingly bright foreground emission at these frequencies, placing stringent requirements on the knowledge of the instruments and inaccuracies in analyses. Results from these experiments have largely been limited not by thermal sensitivity but by systematics, particularly caused by the inability to calibrate the instrument to high accuracy. The interferometric bispectrum phase is immune to antenna-based calibration and errors therein, and presents an independent alternative to detect the EoR HI fluctuations while largely avoiding calibration systematics. Here, we provide a demonstration of this technique on a subset of data from the Hydrogen Epoch of Reionization Array (HERA) to place approximate constraints on the brightness temperature of the intergalactic medium (IGM). From this limited data, at $z=7.7$ we infer "$1σ$" upper limits on the IGM brightness temperature to be $\le 316$ "pseudo" mK at $κ_\parallel=0.33$ "pseudo" $h$ Mpc$^{-1}$ (data-limited) and $\le 1000$ "pseudo" mK at $κ_\parallel=0.875$ "pseudo" $h$ Mpc$^{-1}$ (noise-limited). The "pseudo" units denote only an approximate and not an exact correspondence to the actual distance scales and brightness temperatures. By propagating models in parallel to the data analysis, we confirm that the dynamic range required to separate the cosmic HI signal from the foregrounds is similar to that in standard approaches, and the power spectrum of the bispectrum phase is still data-limited (at $\gtrsim 10^6$ dynamic range) indicating scope for further improvement in sensitivity as the array build-out continues.
△ Less
Submitted 2 July, 2020; v1 submitted 20 May, 2020;
originally announced May 2020.
-
Ultrafast strain-induced charge transport in semiconductor superlattices
Authors:
F. Wang,
C. L. Poyser,
M. T. Greenaway,
A. V. Akimov,
R. P. Campion,
A. J. Kent,
T. M. Fromhold,
A. G. Balanov
Abstract:
We investigate the effect of hypersonic (> 1 GHz) acoustic phonon wavepackets on electron transport in a semiconductor superlattice. Our quantum mechanical simulations demonstrate that a GHz train of picosecond deformation strain pulses propagating through a superlattice can generate current oscillations whose frequency is several times higher than that of the strain pulse train. The shape and pol…
▽ More
We investigate the effect of hypersonic (> 1 GHz) acoustic phonon wavepackets on electron transport in a semiconductor superlattice. Our quantum mechanical simulations demonstrate that a GHz train of picosecond deformation strain pulses propagating through a superlattice can generate current oscillations whose frequency is several times higher than that of the strain pulse train. The shape and polarity of the calculated current pulses agree well with experimentally measured electric signals. The calculations also explain and accurately reproduce the measured variation of the induced current pulse magnitude with the strain pulse amplitude and applied bias voltage. Our results open a route to developing acoustically-driven semiconductor superlattices as sources of millimetre and sub-millimetre electromagnetic waves.
△ Less
Submitted 26 March, 2020;
originally announced March 2020.
-
Direct Wide-Field Radio Imaging in Real-Time at High Time Resolution using Antenna Electric Fields
Authors:
James Kent,
Adam P. Beardsley,
Landman Bester,
Steve F. Gull,
Bojan Nikolic,
Jayce Dowell,
Nithyanandan Thyagarajan,
Greg B. Taylor,
Judd Bowman
Abstract:
The recent demonstration of a real-time direct imaging radio interferometry correlator represents a new capability in radio astronomy. However wide field imaging with this method is challenging since wide-field effects and array non-coplanarity degrade image quality if not compensated for. Here we present an alternative direct imaging correlation strategy using a Direct Fourier Transform (DFT), mo…
▽ More
The recent demonstration of a real-time direct imaging radio interferometry correlator represents a new capability in radio astronomy. However wide field imaging with this method is challenging since wide-field effects and array non-coplanarity degrade image quality if not compensated for. Here we present an alternative direct imaging correlation strategy using a Direct Fourier Transform (DFT), modelled as a linear operator facilitating a matrix multiplication between the DFT matrix and a vector of the electric fields from each antenna. This offers perfect correction for wide field and non-coplanarity effects. When implemented with data from the Long Wavelength Array (LWA), it offers comparable computational performance to previously demonstrated direct imaging techniques, despite having a theoretically higher floating point cost. It also has additional benefits, such as imaging sparse arrays and control over which sky co-ordinates are imaged, allowing variable pixel placement across an image. It is in practice a highly flexible and efficient method of direct radio imaging when implemented on suitable arrays. A functioning Electric Field Direct imaging architecture using the DFT is presented, alongside an exploration of techniques for wide-field imaging similar to those in visibility based imaging, and an explanation of why they do not fit well to imaging directly with the digitized electric field data. The DFT imaging method is demonstrated on real data from the LWA telescope, alongside a detailed performance analysis, as well as an exploration of its applicability to other arrays.
△ Less
Submitted 29 October, 2019; v1 submitted 9 September, 2019;
originally announced September 2019.
-
Revisiting the orbital tracking problem
Authors:
John T. Kent,
Shambo Bhattacharjee,
Weston R. Faber,
Islam I. Hussein
Abstract:
Consider a space object in an orbit about the earth. An uncertain initial state can be represented as a point cloud which can be propagated to later times by the laws of Newtonian motion. If the state of the object is represented in Cartesian earth centered inertial (Cartesian-ECI) coordinates, then even if initial uncertainty is Gaussian in this coordinate system, the distribution quickly becomes…
▽ More
Consider a space object in an orbit about the earth. An uncertain initial state can be represented as a point cloud which can be propagated to later times by the laws of Newtonian motion. If the state of the object is represented in Cartesian earth centered inertial (Cartesian-ECI) coordinates, then even if initial uncertainty is Gaussian in this coordinate system, the distribution quickly becomes non-Gaussian as the propagation time increases. Similar problems arise in other standard fixed coordinate systems in astrodynamics, e.g. Keplerian and to some extent equinoctial. To address these problems, a local "Adapted STructural (AST)'' coordinate system has been developed in which uncertainty is represented in terms of deviations from a "central state".
Given a sequence of angles-only measurements, the iterated nonlinear extended (IEKF) and unscented (IUKF) Kalman filters are often the most appropriate variants to use. In particular, they can be much more accurate than the more commonly used non-iterated versions, the extended (EKF) and unscented (UKF) Kalman filters, especially under high eccentricity. In addition, iterated Kalman filters can often be well-approximated by two new closed form filters, the observation-centered extended (OCEKF) and unscented (OCUKF) Kalman filters.
△ Less
Submitted 24 September, 2019; v1 submitted 29 August, 2019;
originally announced September 2019.
-
nCTEQ PDFs at the LHC: Vector boson production in heavy ion collisions
Authors:
The nCTEQ Collaboration,
D. B. Clark,
E. Godat,
T. J. Hobbs,
T. Ježo,
J. Kent,
C. Keppel,
M. Klasen,
K. Kovarík,
A. Kusina,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
Extraction of the strange quark PDF is a long-standing puzzle. We use the nCTEQ nPDFs with uncertainties to study the impact of the LHC W/Z production data on both the flavor differentiation and nuclear corrections; this complements the information from neutrino-DIS data. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), LHC heavy ion meas…
▽ More
Extraction of the strange quark PDF is a long-standing puzzle. We use the nCTEQ nPDFs with uncertainties to study the impact of the LHC W/Z production data on both the flavor differentiation and nuclear corrections; this complements the information from neutrino-DIS data. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), LHC heavy ion measurements can also help improve proton PDFs. We introduce a new implementation of the nCTEQ code (nCTEQ++) based on C++ which has a modular strucure and enables us to easily integrate programs such as HOPPET, APPLgrid, and MCFM. Using ApplGrids generated from MCFM, we use nCTEQ++ to perform a preliminary fit including the pPb LHC W/Z vector boson data.
△ Less
Submitted 1 September, 2019;
originally announced September 2019.
-
Observation-centered Kalman filters
Authors:
John T. Kent,
Shambo Bhattacharjee,
Weston R. Faber,
Islam I. Hussein
Abstract:
Various methods have been proposed for the nonlinear filtering problem, including the extended Kalman filter (EKF), iterated extended Kalman filter (IEKF), unscented Kalman filter (UKF) and iterated unscented Kalman filter (IUKF). In this paper two new nonlinear Kalman filters are proposed and investigated, namely the observation-centered extended Kalman filter (OCEKF) and observation-centered uns…
▽ More
Various methods have been proposed for the nonlinear filtering problem, including the extended Kalman filter (EKF), iterated extended Kalman filter (IEKF), unscented Kalman filter (UKF) and iterated unscented Kalman filter (IUKF). In this paper two new nonlinear Kalman filters are proposed and investigated, namely the observation-centered extended Kalman filter (OCEKF) and observation-centered unscented Kalman filter (OCUKF). Although the UKF and EKF are common default choices for nonlinear filtering, there are situations where they are bad choices. Examples are given where the EKF and UKF perform very poorly, and the IEKF and OCEKF perform well. In addition the IUKF and OCUKF are generally similar to the IEKF and OCEKF, and also perform well, though care is needed in the choice of tuning parameters when the observation error is small. The reasons for this behaviour are explored in detail.
△ Less
Submitted 24 September, 2019; v1 submitted 31 July, 2019;
originally announced July 2019.
-
Resonator Networks outperform optimization methods at solving high-dimensional vector factorization
Authors:
Spencer J. Kent,
E. Paxon Frady,
Friedrich T. Sommer,
Bruno A. Olshausen
Abstract:
We develop theoretical foundations of Resonator Networks, a new type of recurrent neural network introduced in Frady et al. (2020) to solve a high-dimensional vector factorization problem arising in Vector Symbolic Architectures. Given a composite vector formed by the Hadamard product between a discrete set of high-dimensional vectors, a Resonator Network can efficiently decompose the composite in…
▽ More
We develop theoretical foundations of Resonator Networks, a new type of recurrent neural network introduced in Frady et al. (2020) to solve a high-dimensional vector factorization problem arising in Vector Symbolic Architectures. Given a composite vector formed by the Hadamard product between a discrete set of high-dimensional vectors, a Resonator Network can efficiently decompose the composite into these factors. We compare the performance of Resonator Networks against optimization-based methods, including Alternating Least Squares and several gradient-based algorithms, showing that Resonator Networks are superior in several important ways. This advantage is achieved by leveraging a combination of nonlinear dynamics and "searching in superposition," by which estimates of the correct solution are formed from a weighted superposition of all possible solutions. While the alternative methods also search in superposition, the dynamics of Resonator Networks allow them to strike a more effective balance between exploring the solution space and exploiting local information to drive the network toward probable solutions. Resonator Networks are not guaranteed to converge, but within a particular regime they almost always do. In exchange for relaxing this guarantee of global convergence, Resonator Networks are dramatically more effective at finding factorizations than all alternative approaches considered.
△ Less
Submitted 14 July, 2020; v1 submitted 19 June, 2019;
originally announced June 2019.
-
A Real-Time, All-Sky, High Time Resolution, Direct Imager for the Long Wavelength Array
Authors:
James Kent,
Jayce Dowell,
Adam Beardsley,
Nithyanandan Thyagarjan,
Greg Taylor,
Judd Bowman
Abstract:
The future of radio astronomy will require instruments with large collecting areas for higher sensitivity, wide fields of view for faster survey speeds, and efficient computing and data rates relative to current capabilities. We describe the first successful deployment of the E-field Parallel Imaging Correlator (EPIC) on the LWA station in Sevilleta, New Mexico, USA (LWA-SV). EPIC is a solution to…
▽ More
The future of radio astronomy will require instruments with large collecting areas for higher sensitivity, wide fields of view for faster survey speeds, and efficient computing and data rates relative to current capabilities. We describe the first successful deployment of the E-field Parallel Imaging Correlator (EPIC) on the LWA station in Sevilleta, New Mexico, USA (LWA-SV). EPIC is a solution to the computational problem of large interferometers. By gridding and spatially Fourier transforming channelised electric fields from the antennas in real-time, EPIC removes the explicit cross multiplication of all pairs of antenna voltages to synthesize an aperture, reducing the computational scaling from $\mathcal{O}(n_a^2)$ to $\mathcal{O}(n_g \log_2 n_g)$, where $n_a$ is the number of antennas and $n_g$ is the number of grid points. Not only does this save computational costs for dense arrays but it produces very high time resolution images in real time. The GPU-based implementation uses existing LWA-SV hardware and the high performance streaming framework, Bifrost. We examine the practical details of the EPIC deployment and verify the imaging performance by detecting a meteor impact on the atmosphere using continuous all-sky imaging at 50 ms time resolution.
△ Less
Submitted 25 April, 2019;
originally announced April 2019.
-
Helix modelling through the Mardia-Holmes model framework and an extension of the Mardia-Holmes model
Authors:
Mai F Alfahad,
John T Kent,
Kanti V Mardia
Abstract:
For noisy two-dimensional data, which are approximately uniformly distributed near the circumference of an ellipse, Mardia and Holmes (1980) developed a model to fit the ellipse. In this paper we adapt their methodology to the analysis of helix data in three dimensions. If the helix axis is known, then the Mardia-Holmes model for the circular case can be fitted after projecting the helix data onto…
▽ More
For noisy two-dimensional data, which are approximately uniformly distributed near the circumference of an ellipse, Mardia and Holmes (1980) developed a model to fit the ellipse. In this paper we adapt their methodology to the analysis of helix data in three dimensions. If the helix axis is known, then the Mardia-Holmes model for the circular case can be fitted after projecting the helix data onto the plane normal to the helix axis. If the axis is unknown, an iterative algorithm has been developed to estimate the axis. The methodology is illustrated using simulated protein alpha-helices. We also give a multivariate version of the Mardia-Holmes model which will be applicable for fitting an ellipsoid and in particular a cylinder.
△ Less
Submitted 21 October, 2018;
originally announced October 2018.
-
Ultrafast insulator-to-metal transition in VO$_2$ nanostructures assisted by picosecond strain pulses
Authors:
Ia. A. Mogunov,
F. Fernández,
S. Lysenko,
A. J. Kent,
A. V. Scherbakov,
A. M. Kalashnikova,
A. V. Akimov
Abstract:
Strain engineering is a powerful technology which exploits stationary external or internal stress of specific spatial distribution for controlling the fundamental properties of condensed materials and nanostructures. This advanced technique modulates in space the carrier density and mobility, the optical absorption and, in strongly correlated systems, the phase, e.g. insulator/metal or ferromagnet…
▽ More
Strain engineering is a powerful technology which exploits stationary external or internal stress of specific spatial distribution for controlling the fundamental properties of condensed materials and nanostructures. This advanced technique modulates in space the carrier density and mobility, the optical absorption and, in strongly correlated systems, the phase, e.g. insulator/metal or ferromagnetic/paramagnetic. However, while successfully accessing nanometer length scale, strain engineering is yet to be brought down to ultrafast time scales allowing strain-assisted control of state of matter at THz frequencies. In our work we demonstrate a control of an optically-driven insulator-to-metal phase transition by a picosecond strain pulse, which paves a way to ultrafast strain engineering in nanostructures with phase transitions. This is realized by simultaneous excitation of VO$_2$ nanohillocks by a 170-fs laser and picosecond strain pulses finely timed with each other. By monitoring the transient optical reflectivity of the VO$_2$, we show that strain pulses, depending on the sign of the strain at the moment of optical excitation, increase or decrease the fraction of VO$_2$ which undergoes an ultrafast phase transition. Transient strain of moderate amplitude $\sim0.1$% applied during ultrafast photo-induced non-thermal transition changes the fraction of VO$_2$ in the laser-induced phase by $\sim1$%. By contrast, if applied after the photo-excitation when the phase transformations of the material are governed by thermal processes, transient strain of the same amplitude produces no measurable effect on the phase state.
△ Less
Submitted 13 December, 2018; v1 submitted 15 October, 2018;
originally announced October 2018.
-
PDF Flavor Determination and the nCTEQ PDFs
Authors:
nCTEQ Collaboration,
E. Godat,
D. B. Clark,
T. J. Hobbs,
T. Jezo,
J. Kent,
C. Keppel,
K. Kovarik,
A. Kusina,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
Recent LHC W/Z vector boson production data in proton-lead collisions are quite sensitive to the heavier flavors (especially the strange PDF), and this complements the information from neutrino-DIS data. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), LHC heavy ion measurements can also help improve proton PDFs. We introduce a new implem…
▽ More
Recent LHC W/Z vector boson production data in proton-lead collisions are quite sensitive to the heavier flavors (especially the strange PDF), and this complements the information from neutrino-DIS data. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), LHC heavy ion measurements can also help improve proton PDFs. We introduce a new implementation of the nCTEQ code (nCTEQ++) based on C++ which has a modular strucure and enables us to easily integrate programs such as HOPPET, APPLgrid, and MCFM. Using ApplGrids generated from MCFM, we use nCTEQ++ to perform a fit including the $pPb$ LHC W/Z vector boson data.
△ Less
Submitted 22 August, 2018;
originally announced August 2018.
-
Automatic Curation of Golf Highlights using Multimodal Excitement Features
Authors:
Michele Merler,
Dhiraj Joshi,
Quoc-Bao Nguyen,
Stephen Hammer,
John Kent,
John R. Smith,
Rogerio S. Feris
Abstract:
The production of sports highlight packages summarizing a game's most exciting moments is an essential task for broadcast media. Yet, it requires labor-intensive video editing. We propose a novel approach for auto-curating sports highlights, and use it to create a real-world system for the editorial aid of golf highlight reels. Our method fuses information from the players' reactions (action recog…
▽ More
The production of sports highlight packages summarizing a game's most exciting moments is an essential task for broadcast media. Yet, it requires labor-intensive video editing. We propose a novel approach for auto-curating sports highlights, and use it to create a real-world system for the editorial aid of golf highlight reels. Our method fuses information from the players' reactions (action recognition such as high-fives and fist pumps), spectators (crowd cheering), and commentator (tone of the voice and word analysis) to determine the most interesting moments of a game. We accurately identify the start and end frames of key shot highlights with additional metadata, such as the player's name and the hole number, allowing personalized content summarization and retrieval. In addition, we introduce new techniques for learning our classifiers with reduced manual training data annotation by exploiting the correlation of different modalities. Our work has been demonstrated at a major golf tournament, successfully extracting highlights from live video streams over four consecutive days.
△ Less
Submitted 21 July, 2017;
originally announced July 2017.
-
The Influence of Collaboration in Procurement Relationships
Authors:
Wesley S. Boyce,
Haim Mano,
John L. Kent
Abstract:
Supply Chain Management often requires independent organizations to work together to achieve shared objectives. This collaboration is necessary when coordinated actions benefit the group more than the uncoordinated efforts of individual firms. Despite the commonly reported benefits that can be gained in close relationships, recent research has indicated that collaboration attempts between purchasi…
▽ More
Supply Chain Management often requires independent organizations to work together to achieve shared objectives. This collaboration is necessary when coordinated actions benefit the group more than the uncoordinated efforts of individual firms. Despite the commonly reported benefits that can be gained in close relationships, recent research has indicated that collaboration attempts between purchasing firms and their suppliers have not been as widespread as anticipated. Using a survey of procurement professionals, this research investigates how the purchasing function utilizes collaboration in its supply chain relationships. Structural equation modeling is used to identify how information sharing, decision synchronization, incentive alignment, collaborative communication, and trust impact collaboration, as well as how collaboration impacts performance. Results from 86 survey responses indicate that firms are still not fully utilizing collaborative relationships.
△ Less
Submitted 10 October, 2016;
originally announced January 2017.
-
Score matching estimators for directional distributions
Authors:
Kanti V Mardia,
John T Kent,
Arnab K Laha
Abstract:
One of the major problems for maximum likelihood estimation in the well-established directional models is that the normalising constants can be difficult to evaluate. A new general method of "score matching estimation" is presented here on a compact oriented Riemannian manifold. Important applications include von Mises-Fisher, Bingham and joint models on the sphere and related spaces. The estimato…
▽ More
One of the major problems for maximum likelihood estimation in the well-established directional models is that the normalising constants can be difficult to evaluate. A new general method of "score matching estimation" is presented here on a compact oriented Riemannian manifold. Important applications include von Mises-Fisher, Bingham and joint models on the sphere and related spaces. The estimator is consistent and asymptotically normally distributed under mild regularity conditions. Further, it is easy to compute as a solution of a linear set of equations and requires no knowledge of the normalizing constant. Several examples are given, both analytic and numerical, to demonstrate its good performance.
△ Less
Submitted 28 April, 2016;
originally announced April 2016.
-
Manifolds of Projective Shapes
Authors:
Thomas Hotz,
Florian Kelma,
John T. Kent
Abstract:
The projective shape of a configuration of k points or "landmarks" in RP(d) consists of the information that is invariant under projective transformations and hence is reconstructable from uncalibrated camera views. Mathematically, the space of projective shapes for these k landmarks can be described as the quotient space of k copies of RP(d) modulo the action of the projective linear group PGL(d)…
▽ More
The projective shape of a configuration of k points or "landmarks" in RP(d) consists of the information that is invariant under projective transformations and hence is reconstructable from uncalibrated camera views. Mathematically, the space of projective shapes for these k landmarks can be described as the quotient space of k copies of RP(d) modulo the action of the projective linear group PGL(d). Using homogeneous coordinates, such configurations can be described as real k-times-(d+1)-dimensional matrices given up to left-multiplication of non-singular diagonal matrices, while the group PGL(d) acts as GL(d+1) from the right. The main purpose of this paper is to give a detailed examination of the topology of projective shape space, and, using matrix notation, it is shown how to derive subsets that are in a certain sense maximal, differentiable Hausdorff manifolds which can be provided with a Riemannian metric. A special subclass of the projective shapes consists of the Tyler regular shapes, for which geometrically motivated pre-shapes can be defined, thus allowing for the construction of a natural Riemannian metric.
△ Less
Submitted 5 November, 2018; v1 submitted 13 February, 2016;
originally announced February 2016.
-
The use of a common location measure in the invariant coordinate selection and projection pursuit
Authors:
Fatimah Alashwali,
John Kent
Abstract:
Invariant coordinate selection (ICS) and projection pursuit (PP) are two methods that can be used to detect clustering directions in multivariate data by optimizing criteria sensitive to non-normality. In particular, ICS finds clustering directions using a relative eigen-decomposition of two scatter matrices with different levels of robustness; PP is a one-dimensional variant of ICS. Each of the t…
▽ More
Invariant coordinate selection (ICS) and projection pursuit (PP) are two methods that can be used to detect clustering directions in multivariate data by optimizing criteria sensitive to non-normality. In particular, ICS finds clustering directions using a relative eigen-decomposition of two scatter matrices with different levels of robustness; PP is a one-dimensional variant of ICS. Each of the two scatter matrices includes an implicit or explicit choice of location. However, when different measures of location are used, ICS and PP can behave counter-intuitively. In this paper we explore this behavior in a variety of examples and propose a simple and natural solution: use the same measure of location for both scatter matrices.
△ Less
Submitted 26 March, 2015; v1 submitted 28 January, 2015;
originally announced January 2015.
-
Estimation and Testing for Covariance-Spectral Spatial-Temporal Models
Authors:
A. M. Mosammam,
J. T. Kent
Abstract:
In this paper we explore a covariance spectral modelling strategy for spatial-temporal processes which involves a spectral approach for time but a covariance approach for space.It facilitates the analysis of coherence between the temporal frequency components at different spatial sites. Stein(2005) developed a semi-parametric model within this framework.The purpose of this paper is to give a deepe…
▽ More
In this paper we explore a covariance spectral modelling strategy for spatial-temporal processes which involves a spectral approach for time but a covariance approach for space.It facilitates the analysis of coherence between the temporal frequency components at different spatial sites. Stein(2005) developed a semi-parametric model within this framework.The purpose of this paper is to give a deeper insight into the properties of his model and to develop simple and more intuitive methods of estimation and testing. An example is given using the Irish wind speed data.
△ Less
Submitted 16 September, 2014;
originally announced September 2014.
-
Hypersonic properties of monodisperse spherical mesoporous silica particles
Authors:
D. A. Eurov,
D. A. Kurdyukov,
E. Yu. Stovpiaga,
A. S. Salasyuk,
J. Jäger,
A. V. Scherbakov,
A. V. Akimov,
A. J. Kent,
D. R. Yakovlev,
M. Bayer,
V. G. Golubev
Abstract:
We use the picosecond acoustic pump-probe technique to study the elastic properties of monodispersemesoporous silica spheres filled with nickel and deposited in the form of opal-like films on silica substrates. The picosecond pump-probe optical transmission signal shows harmonic oscillations corresponding to the lower energy radial Lamb mode in the vibrational spectrum of the spheres. These oscill…
▽ More
We use the picosecond acoustic pump-probe technique to study the elastic properties of monodispersemesoporous silica spheres filled with nickel and deposited in the form of opal-like films on silica substrates. The picosecond pump-probe optical transmission signal shows harmonic oscillations corresponding to the lower energy radial Lamb mode in the vibrational spectrum of the spheres. These oscillations, with a frequency of several gigahertz last for several nanoseconds in the spheres with diameter 1050 nm, showing high homogeneity of the sphere parameters. By analysis of the oscillation spectrum of films with different sphere diameter and nickel content we obtain the elastic moduli of the mesoporous silica spheres.
△ Less
Submitted 28 April, 2014;
originally announced April 2014.
-
Comparative Assembly Hubs: Web Accessible Browsers for Comparative Genomics
Authors:
Ngan Nguyen,
Glenn Hickey,
Brian J. Raney,
Joel Armstrong,
Hiram Clawson,
Ann Zweig,
Jim Kent,
David Haussler,
Benedict Paten
Abstract:
We introduce a pipeline to easily generate collections of web accessible UCSC genome browsers interrelated by an alignment. Using the alignment, all annotations and the alignment itself can be efficiently viewed with reference to any genome in the collection, symmetrically. A new, intelligently scaled alignment display makes it simple to view all changes between the genomes at all levels of resolu…
▽ More
We introduce a pipeline to easily generate collections of web accessible UCSC genome browsers interrelated by an alignment. Using the alignment, all annotations and the alignment itself can be efficiently viewed with reference to any genome in the collection, symmetrically. A new, intelligently scaled alignment display makes it simple to view all changes between the genomes at all levels of resolution, from substitutions to complex structural rearrangements, including duplications.
△ Less
Submitted 5 November, 2013;
originally announced November 2013.
-
Discussion of "Geodesic Monte Carlo on Embedded Manifolds"
Authors:
Simon Byrne,
Mark Girolami,
Persi Diaconis,
Christof Seiler,
Susan Holmes,
Ian L. Dryden,
John T. Kent,
Marcelo Pereyra,
Babak Shahbaba,
Shiwei Lan,
Jeffrey Streets,
Daniel Simpson
Abstract:
Contributed discussion and rejoinder to "Geodesic Monte Carlo on Embedded Manifolds" (arXiv:1301.6064)
Contributed discussion and rejoinder to "Geodesic Monte Carlo on Embedded Manifolds" (arXiv:1301.6064)
△ Less
Submitted 5 November, 2013;
originally announced November 2013.
-
A new method to simulate the Bingham and related distributions in directional data analysis with applications
Authors:
John T. Kent,
Asaad M. Ganeiber,
Kanti V. Mardia
Abstract:
A new acceptance-rejection method is proposed and investigated for the Bingham distribution on the sphere using the angular central Gaussian distribution as an envelope. It is shown to have high efficiency and to be straightfoward to use. The method can also be extended to Fisher and Fisher-Bingham distributions on spheres and related manifolds.
A new acceptance-rejection method is proposed and investigated for the Bingham distribution on the sphere using the angular central Gaussian distribution as an envelope. It is shown to have high efficiency and to be straightfoward to use. The method can also be extended to Fisher and Fisher-Bingham distributions on spheres and related manifolds.
△ Less
Submitted 30 October, 2013;
originally announced October 2013.
-
Centromere reference models for human chromosomes X and Y satellite arrays
Authors:
Karen H. Miga,
Yulia Newton,
Miten Jain,
Nicolas Altemose,
Huntington F. Willard,
W. James Kent
Abstract:
The human genome remains incomplete, with multi-megabase sized gaps representing the endogenous centromeres and other heterochromatic regions. These regions are commonly enriched with long arrays of near-identical tandem repeats, known as satellite DNAs, that offer a limited number of variant sites to differentiate individual repeat copies across millions of bases. This substantial sequence homoge…
▽ More
The human genome remains incomplete, with multi-megabase sized gaps representing the endogenous centromeres and other heterochromatic regions. These regions are commonly enriched with long arrays of near-identical tandem repeats, known as satellite DNAs, that offer a limited number of variant sites to differentiate individual repeat copies across millions of bases. This substantial sequence homogeneity challenges available assembly strategies, and as a result, centromeric regions are omitted from ongoing genomic studies. To address this problem, we present a locally ordered assembly across two haploid human satellite arrays on chromosomes X and Y, resulting in an initial linear representation of 3.83 Mb of centromeric DNA within an individual genome. To further expand the utility of each centromeric reference sequence, we evaluate sites within the arrays for short-read mappability and chromosome specificity. As satellite DNAs evolve in a concerted manner, we use these centromeric assemblies to assess the extent of sequence variation among 372 individuals from distinct human populations. In doing so, we identify two ancient satellite array variants in both X and Y centromeres as determined by array length and sequence composition. This study provides an initial linear representation and comprehensive sequence characterization of a regional centromere and establishes a foundation to extend genomic characterization to these sites as well as to other repeat-rich regions within complex genomes.
△ Less
Submitted 17 September, 2013; v1 submitted 28 June, 2013;
originally announced July 2013.
-
Modulation of a surface plasmon-polariton resonance by sub-terahertz diffracted coherent phonons
Authors:
Christian Brüggemann,
Andrey V. Akimov,
Boris A. Glavin,
Vladimir I. Belotelov,
Ilya A. Akimov,
Jasmin Jäger,
Sachin Kasture,
Achanta Venu Gopal,
Arvind S. Vengurlekar,
Dmitri R. Yakovlev,
Anthony J. Kent,
Manfred Bayer
Abstract:
Coherent sub-THz phonons incident on a gold grating that is deposited on a dielectric substrate undergo diffraction and thereby induce an alteration of the surface plasmon-polariton resonance. This results in efficient high-frequency modulation (up to 110 GHz) of the structure's reflectivity for visible light in the vicinity of the plasmon-polariton resonance. High modulation efficiency is achieve…
▽ More
Coherent sub-THz phonons incident on a gold grating that is deposited on a dielectric substrate undergo diffraction and thereby induce an alteration of the surface plasmon-polariton resonance. This results in efficient high-frequency modulation (up to 110 GHz) of the structure's reflectivity for visible light in the vicinity of the plasmon-polariton resonance. High modulation efficiency is achieved by designing a periodic nanostructure which provides both plasmon-polariton and phonon resonances. Our theoretical analysis shows that the dynamical alteration of the plasmon-polariton resonance is governed by modulation of the slit widths within the grating at the frequencies of higher-order phonon resonances.
△ Less
Submitted 14 June, 2012; v1 submitted 13 June, 2012;
originally announced June 2012.
-
Shrinkage estimation with a matrix loss function
Authors:
Reman Abu-Shanab,
John T. Kent,
William E. Strawderman
Abstract:
Consider estimating the n by p matrix of means of an n by p matrix of independent normally distributed observations with constant variance, where the performance of an estimator is judged using a p by p matrix quadratic error loss function. A matrix version of the James-Stein estimator is proposed, depending on a tuning constant. It is shown to dominate the usual maximum likelihood estimator for s…
▽ More
Consider estimating the n by p matrix of means of an n by p matrix of independent normally distributed observations with constant variance, where the performance of an estimator is judged using a p by p matrix quadratic error loss function. A matrix version of the James-Stein estimator is proposed, depending on a tuning constant. It is shown to dominate the usual maximum likelihood estimator for some choices of of the tuning constant when n is greater than or equal to 3. This result also extends to other shrinkage estimators and settings.
△ Less
Submitted 18 January, 2011;
originally announced January 2011.
-
Using acoustic waves to induce high-frequency current oscillations in superlattices
Authors:
M. T. Greenaway,
A. G. Balanov,
D. Fowler,
A. J. Kent,
T. M. Fromhold
Abstract:
We show that GHz acoustic waves in semiconductor superlattices can induce THz electron dynamics that depend critically on the wave amplitude. Below a threshold amplitude, the acoustic wave drags electrons through the superlattice with a peak drift velocity overshooting that produced by a static electric field. In this regime, single electrons perform drifting orbits with THz frequency components.…
▽ More
We show that GHz acoustic waves in semiconductor superlattices can induce THz electron dynamics that depend critically on the wave amplitude. Below a threshold amplitude, the acoustic wave drags electrons through the superlattice with a peak drift velocity overshooting that produced by a static electric field. In this regime, single electrons perform drifting orbits with THz frequency components. When the wave amplitude exceeds the critical threshold, an abrupt onset of Bloch-like oscillations causes negative differential velocity. The acoustic wave also affects the collective behavior of the electrons by causing the formation of localised electron accumulation and depletion regions, which propagate through the superlattice, thereby producing self-sustained current oscillations even for very small wave amplitudes. We show that the underlying single-electron dynamics, in particular the transition between the acoustic wave dragging and Bloch oscillation regimes, strongly influence the spatial distribution of the electrons and the form of the current oscillations. In particular, the amplitude of the current oscillations depends non-monotonically on the strength of the acoustic wave, reflecting the variation of the single-electron drift velocity.
△ Less
Submitted 27 May, 2010; v1 submitted 14 March, 2008;
originally announced March 2008.
-
Resonance-like piezoelectric electron-phonon interaction in layered structures
Authors:
B. A. Glavin,
V. A. Kochelap,
T. L. Linnik,
A. J. Kent,
N. M. Stanton,
M. Henini
Abstract:
We show that mismatch of the piezoelectric parameters between layers of multiple-quantum well structures leads to modification of the electron-phonon interaction. In particular, short-wavelength phonons propagating perpendicular to the layers with wavevector close to $2πn/d$, where $d$ is the period of the structure, induce a strong smoothly-varying component of the piezo-potential. As a result,…
▽ More
We show that mismatch of the piezoelectric parameters between layers of multiple-quantum well structures leads to modification of the electron-phonon interaction. In particular, short-wavelength phonons propagating perpendicular to the layers with wavevector close to $2πn/d$, where $d$ is the period of the structure, induce a strong smoothly-varying component of the piezo-potential. As a result, they interact efficiently with 2D electrons. It is shown, that this property leads to emission of collimated quasi-monochromatic beams of high-frequency acoustic phonons from hot electrons in multiple-quantum well structures. We argue that this effect is responsible for the recently reported monochromatic transverse phonon emission from optically excited GaAs/AlAs superlattices, and provide additional experimental evidences of this.
△ Less
Submitted 10 June, 2006;
originally announced June 2006.
-
Use of coated silicon field emitters as neutralisers for fundamental physics missions in space
Authors:
K. L. Aplin,
B. J. Kent,
C. M. Collingwood,
L. Wang,
R. Stevens,
S. E. Huq,
A. Malik
Abstract:
Spacecraft neutralisers are required as part of the ion propulsion system for accurate station keeping in fundamental physics missions. This paper describes the use of thin layers of insulating materials as coatings for the gated silicon field emitter array structure used in a spacecraft neutraliser. These thin coatings are postulated to reduce power consumption and reduce overheating. The power c…
▽ More
Spacecraft neutralisers are required as part of the ion propulsion system for accurate station keeping in fundamental physics missions. This paper describes the use of thin layers of insulating materials as coatings for the gated silicon field emitter array structure used in a spacecraft neutraliser. These thin coatings are postulated to reduce power consumption and reduce overheating. The power consumption and lifetime of aluminium nitride and amorphous hydrogenated diamond-like carbon coatings have been tested by current-voltage and endurance tests. Diamond-like carbon coatings were promising, performing better in endurance tests than uncoated samples, but further work is required to characterise the coating's physical properties and its effects on field emission. The thermal conductivity of the coating material had little effect on measured sample lifetimes. Aluminium nitride had reduced power consumption compared to diamond-like carbon coated and uncoated samples. A thin (~5 nm) layer of aluminium nitride was found to be optimal, meeting European Space Agency specifications for the neutraliser engineering model.
△ Less
Submitted 7 June, 2011; v1 submitted 5 September, 2005;
originally announced September 2005.