
Lagrangian surplusection phenomena

Authors: Georgios Dimitroglou Rizell, Jonathan David Evans

Abstract: In this paper, we introduce a broad class of phenomena which appear when you intersect a given Lagrangian submanifold $K$ with a family of Lagrangian submanifolds $L_t$ (all Hamiltonian isotopic to one another). We establish that this phenomenon occurs in a particular situation, which lets us give a lower bound for the volume of any Lagrangian torus in $\mathbb{CP}^2$ which is Hamiltonian isotopic to the Chekanov torus. The rest of the paper is a discussion of why we should expect these phenomena to be very common, motivated by Oh's conjecture on the volume-minimising property of the Clifford torus and the concurrent normals conjecture in convex geometry. We pose many open questions.

Submitted 27 August, 2024; originally announced August 2024.

Comments: 22 pages, 5 figures

MSC Class: 53D12; 53D40

arXiv:2408.01285 [pdf, other]

The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models

Authors: Hannah Chen, Yangfeng Ji, David Evans

Abstract: Large language models (LLMs) are now being considered and even deployed for applications that support high-stakes decision-making, such as recruitment and clinical decisions. While several methods have been proposed for measuring bias, there remains a gap between predictions, which are what the proposed methods consider, and how they are used to make decisions. In this work, we introduce Rank-Allocational-Based Bias Index (RABBI), a model-agnostic bias measure that assesses potential allocational harms arising from biases in LLM predictions. We compare RABBI and current bias metrics on two allocation decision tasks. We evaluate their predictive validity across ten LLMs and utility for model selection. Our results reveal that commonly-used bias metrics based on average performance gap and distribution distance fail to reliably capture group disparities in allocation outcomes, whereas RABBI exhibits a strong correlation with allocation disparities. Our work highlights the need to account for how models are used in contexts with limited resource constraints.

Submitted 2 August, 2024; originally announced August 2024.

arXiv:2407.10799 [pdf, other]

The Chandra Source Catalog Release 2 Series

Authors: Ian N. Evans, Janet D. Evans, J. Rafael Martínez-Galarza, Joseph B. Miller, Francis A. Primini, Mojegan Azadi, Douglas J. Burke, Francesca M. Civano, Raffaele D'Abrusco, Giuseppina Fabbiano, Dale E. Graessle, John D. Grier, John C. Houck, Jennifer Lauer, Michael L. McCollough, Michael A. Nowak, David A. Plummer, Arnold H. Rots, Aneta Siemiginowska, Michael S. Tibbetts

Abstract: The Chandra Source Catalog (CSC) is a virtual X-ray astrophysics facility that enables both detailed individual source studies and statistical studies of large samples of X-ray sources detected in ACIS and HRC-I imaging observations obtained by the Chandra X-ray Observatory. The catalog provides carefully-curated, high-quality, and uniformly calibrated and analyzed tabulated positional, spatial, photometric, spectral, and temporal source properties, as well as science-ready X-ray data products. The latter includes multiple types of source- and field-based FITS format products that can be used as a basis for further research, significantly simplifying followup analysis of scientifically meaningful source samples. We discuss in detail the algorithms used for the CSC Release 2 Series, including CSC 2.0, which includes 317,167 unique X-ray sources on the sky identified in observations released publicly through the end of 2014, and CSC 2.1, which adds Chandra data released through the end of 2021 and expands the catalog to 407,806 sources. Besides adding more recent observations, the CSC Release 2 Series includes multiple algorithmic enhancements that provide significant improvements over earlier releases. The compact source sensitivity limit for most observations is ~5 photons over most of the field of view, which is ~2x fainter than Release 1, achieved by co-adding observations and using an optimized source detection approach. A Bayesian X-ray aperture photometry code produces robust fluxes even in crowded fields and for low count sources.
The current release, CSC 2.1, is tied to the Gaia-CRF3 astrometric reference frame for the best sky positions for catalog sources.

Submitted 15 July, 2024; originally announced July 2024.

Comments: 66 pages, 17 figures, 16 tables, accepted for publication in The Astrophysical Journal Supplement Series

arXiv:2407.04730 [pdf, other]

The OPS-SAT benchmark for detecting anomalies in satellite telemetry

Authors: Bogdan Ruszczak, Krzysztof Kotowski, David Evans, Jakub Nalepa

Abstract: Detecting anomalous events in satellite telemetry is a critical task in space operations. This task, however, is extremely time-consuming, error-prone and human dependent, thus automated data-driven anomaly detection algorithms have been emerging at a steady pace. However, there are no publicly available datasets of real satellite telemetry accompanied by the ground-truth annotations that could be used to train and verify anomaly detection supervised models. In this article, we address this research gap and introduce the AI-ready benchmark dataset (OPSSAT-AD) containing the telemetry data acquired on board OPS-SAT, a CubeSat mission operated by the European Space Agency that came to an end during the night of 22--23 May 2024 (CEST). The dataset is accompanied by the baseline results obtained using 30 supervised and unsupervised classic and deep machine learning algorithms for anomaly detection. They were trained and validated using the training-test dataset split introduced in this work, and we present a suggested set of quality metrics which should always be calculated to confront the new algorithms for anomaly detection while exploiting OPSSAT-AD. We believe that this work may become an important step toward building a fair, reproducible and objective validation procedure that can be used to quantify the capabilities of the emerging anomaly detection techniques in an unbiased and fully transparent way.

Submitted 29 June, 2024; originally announced July 2024.

Comments: 13 pages, 8 figures, 3 tables

arXiv:2406.11544 [pdf, other]

Do Parameters Reveal More than Loss for Membership Inference?

Authors: Anshuman Suri, Xiao Zhang, David Evans

Abstract: Membership inference attacks aim to infer whether an individual record was used to train a model, serving as a key tool for disclosure auditing. While such evaluations are useful to demonstrate risk, they are computationally expensive and often make strong assumptions about potential adversaries' access to models and training environments, and thus do not provide very tight bounds on leakage from potential attacks. We show how prior claims around black-box access being sufficient for optimal membership inference do not hold for most useful settings such as stochastic gradient descent, and that optimal membership inference indeed requires white-box access. We validate our findings with a new white-box inference attack IHA (Inverse Hessian Attack) that explicitly uses model parameters by taking advantage of computing inverse-Hessian vector products. Our results show that both audits and adversaries may be able to benefit from access to model parameters, and we advocate for further research into white-box methods for membership privacy auditing.

Submitted 19 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: Accepted at High-dimensional Learning Dynamics (HiLD) Workshop, ICML 2024

arXiv:2406.05447 [pdf, other]

The PLATO Mission

Authors: Heike Rauer, Conny Aerts, Juan Cabrera, Magali Deleuil, Anders Erikson, Laurent Gizon, Mariejo Goupil, Ana Heras, Jose Lorenzo-Alvarez, Filippo Marliani, Cesar Martin-Garcia, J. Miguel Mas-Hesse, Laurence O'Rourke, Hugh Osborn, Isabella Pagano, Giampaolo Piotto, Don Pollacco, Roberto Ragazzoni, Gavin Ramsay, Stéphane Udry, Thierry Appourchaux, Willy Benz, Alexis Brandeker, Manuel Güdel, Eduardo Janot-Pacheco, et al. (801 additional authors not shown)

Abstract: PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution. The payload instrument consists of 26 cameras with 12 cm aperture each. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras.
PLATO is scheduled for launch at the end of 2026. This overview therefore provides a summary of the mission to the community in preparation for the upcoming operational phases.

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2405.14368 [pdf]

Ferri-ionic Coupling in CuInP$_2$S$_6$ Nanoflakes: Polarization States and Controllable Negative Capacitance

Authors: Anna N. Morozovska, Sergei V. Kalinin, Eugene A. Eliseev, Svitlana Kopyl, Yulian M. Vysochanskii, Dean R. Evans

Abstract: We consider nanoflakes of van der Waals ferrielectric CuInP$_2$S$_6$ covered by an ionic surface charge and reveal the appearance of polar states with relatively high polarization ~5 microC/cm$^2$ and stored free charge ~10 microC/cm$^2$, which can mimic "mid-gap" states associated with a surface field-induced transfer of Cu and/or In ions in the van der Waals gap. The change in the ionic screening degree and mismatch strains induce a broad range of the transitions between paraelectric phase, antiferroelectric, ferrielectric, and ferri-ionic states in CuInP$_2$S$_6$ nanoflakes. The states' stability and/or metastability is determined by the minimum of the system free energy consisting of electrostatic energy, elastic energy, and a Landau-type four-well potential of the ferrielectric dipole polarization. The possibility to govern the transitions by strain and ionic screening can be useful for controlling the tunneling barrier in thin film devices based on CuInP$_2$S$_6$ nanoflakes. Also, we predict that the CuInP$_2$S$_6$ nanoflakes reveal features of the controllable negative capacitance effect, which make them attractive for advanced electronic devices, such as nano-capacitors and gate oxide nanomaterials with reduced heat dissipation.

Submitted 3 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: 46 pages, including 5 figures and Appendix with 6 figures. Revised version

arXiv:2405.09721 [pdf, other]

DP-RuL: Differentially-Private Rule Learning for Clinical Decision Support Systems

Authors: Josephine Lamp, Lu Feng, David Evans

Abstract: Serious privacy concerns arise with the use of patient data in rule-based clinical decision support systems (CDSS). The goal of a privacy-preserving CDSS is to learn a population ruleset from individual clients' local rulesets, while protecting the potentially sensitive information contained in the rulesets. We present the first work focused on this problem and develop a framework for learning population rulesets with local differential privacy (LDP), suitable for use within a distributed CDSS and other distributed settings. Our rule discovery protocol uses a Monte-Carlo Tree Search (MCTS) method integrated with LDP to search a rule grammar in a structured way and find rule structures clients are likely to have. Randomized response queries are sent to clients to determine promising paths to search within the rule grammar. In addition, we introduce an adaptive budget allocation method which dynamically determines how much privacy loss budget to use at each query, resulting in better privacy-utility trade-offs. We evaluate our approach using three clinical datasets and find that we are able to learn population rulesets with high coverage (breadth of rules) and clinical utility even at low privacy loss budgets.

Submitted 15 May, 2024; originally announced May 2024.
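
Illustrative note on the entry above: the abstract mentions randomized response queries under local differential privacy. The minimal Python sketch below shows only the classic binary randomized response mechanism and its debiased estimator. It is not the DP-RuL protocol. The function names, the value of epsilon, and the simulated client population are all assumptions made for illustration.

```python
import math
import random


def randomized_response(truth: bool, epsilon: float, rng: random.Random) -> bool:
    """Report the true bit with probability e^eps / (e^eps + 1), otherwise flip it.

    This satisfies epsilon-local differential privacy for a single binary answer
    (epsilon must be > 0).
    """
    p_truth = math.exp(epsilon) / (math.exp(epsilon) + 1.0)
    return truth if rng.random() < p_truth else not truth


def estimate_fraction(reports: list[bool], epsilon: float) -> float:
    """Debias the observed 'yes' rate to estimate the true fraction of 'yes' clients.

    If f is the true fraction and p the truth-telling probability, the expected
    observed rate is f*p + (1 - f)*(1 - p), which is inverted here.
    """
    p = math.exp(epsilon) / (math.exp(epsilon) + 1.0)
    observed = sum(reports) / len(reports)
    return (observed - (1.0 - p)) / (2.0 * p - 1.0)


# Hypothetical simulation: 20,000 clients, 30% of whom hold some rule structure.
rng = random.Random(0)
truths = [i < 6000 for i in range(20000)]
reports = [randomized_response(t, epsilon=1.0, rng=rng) for t in truths]
estimate = estimate_fraction(reports, epsilon=1.0)
```

In this sketch the aggregator never sees any individual's true answer, yet the debiased estimate approaches the population fraction as the number of clients grows; a smaller epsilon gives stronger privacy at the cost of a noisier estimate.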

arXiv:2405.08102 [pdf, other]

Evaluating Google's Protected Audience Protocol

Authors: Minjun Long, David Evans

Abstract: While third-party cookies have been a key component of the digital marketing ecosystem for years, they allow users to be tracked across web sites in ways that raise serious privacy concerns. Google has proposed the Privacy Sandbox initiative to enable ad targeting without third-party cookies. While there have been several studies focused on other aspects of this initiative, there has been little analysis to date as to how well the system achieves the intended goal of preventing request linking. This work focuses on analyzing linkage privacy risks for the reporting mechanisms proposed in the Protected Audience (PrAu) proposal (previously known as FLEDGE), which is intended to enable online remarketing without using third-party cookies. We summarize the overall workflow of PrAu and highlight potential privacy risks associated with its proposed design, focusing on scenarios in which adversaries attempt to link requests to different sites to the same user. We show how a realistic adversary would still be able to use the privacy-protected reporting mechanisms to link user requests and conduct mass surveillance, even with correct implementations of all the currently proposed privacy mechanisms.

Submitted 20 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.02735 [pdf, ps, other]

Tropical methods for stable Horikawa surfaces

Authors: Jonathan David Evans, Angelica Simonetti, Giancarlo Urzúa

Abstract: There are many strata in the KSBA boundary of the moduli space of octic double planes ($K^2=2$, $p_g=3$). We use methods from tropical and toric geometry to show that only three of these correspond to surfaces with at worst quotient singularities.

Submitted 4 May, 2024; originally announced May 2024.

Comments: 64 pages, 27 figures

MSC Class: 14J10; 14J17; 14J29

arXiv:2404.14466 [pdf, other]

Quantum symmetries of noncommutative tori

Authors: David E. Evans, Corey Jones

Abstract: We consider the problem of building non-invertible quantum symmetries (as characterized by actions of unitary fusion categories) on noncommutative tori. We introduce a general method to construct actions of fusion categories on inductive limit C*-algebras using finite dimensional data, and then apply it to obtain AT-actions of arbitrary Haagerup-Izumi categories on noncommutative 2-tori, of the even part of the $E_{8}$ subfactor on a noncommutative 3-torus, and of $\text{PSU}(2)_{15}$ on a noncommutative 4-torus.

Submitted 31 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14325 [pdf, other]

Adapting to time: why nature evolved a diverse set of neurons

Authors: Karim G. Habashy, Benjamin D. Evans, Dan F. M. Goodman, Jeffrey S. Bowers

Abstract: Brains have evolved a diverse set of neurons with varying morphologies, physiological properties and rich dynamics that impact their processing of temporal information. By contrast, most neural network models include a homogeneous set of units that only vary in terms of their spatial parameters (weights and biases). To investigate the importance of temporal parameters to neural function, we trained spiking neural networks on tasks of varying temporal complexity, with different subsets of parameters held constant. We find that in a tightly resource constrained setting, adapting conduction delays is essential to solve all test conditions, and indeed that it is possible to solve these tasks using only temporal parameters (delays and time constants) with weights held constant. In the most complex spatio-temporal task we studied, we found that an adaptable bursting parameter was essential. More generally, allowing for adaptation of both temporal and spatial parameters increases network robustness to noise, an important feature for both biological brains and neuromorphic computing systems. In summary, our findings highlight how rich and adaptable dynamics are key to solving temporally structured tasks at a low neural resource cost, which may be part of the reason why biological neurons vary so dramatically in their physiological properties.

Submitted 21 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: 14 pages, 6 figures

ACM Class: K.3.2; I.2.m

arXiv:2404.10486 [pdf, other]

doi 10.1051/0004-6361/202449763

Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry

Authors: Gaia Collaboration, P. Panuzzo, T. Mazeh, F. Arenou, B. Holl, E. Caffau, A. Jorissen, C. Babusiaux, P. Gavras, J. Sahlmann, U. Bastian, Ł. Wyrzykowski, L. Eyer, N. Leclerc, N. Bauchet, A. Bombrun, N. Mowlavi, G. M. Seabroke, D. Teyssier, E. Balbinot, A. Helmi, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, et al. (390 additional authors not shown)

Abstract: Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is expected to uncover many Galactic wide-binary systems containing dormant BHs, which may not have been detected before. The study of this population will provide new information on the BH-mass distribution in binaries and shed light on their formation mechanisms and progenitors. As part of the validation efforts in preparation for the fourth Gaia data release (DR4), we analysed the preliminary astrometric binary solutions, obtained by the Gaia Non-Single Star pipeline, to verify their significance and to minimise false-detection rates in high-mass-function orbital solutions. The astrometric binary solution of one source, Gaia BH3, implies the presence of a $32.70 \pm 0.82\,M_\odot$ BH in a binary system with a period of 11.6 yr. Gaia radial velocities independently validate the astrometric orbit. Broad-band photometric and spectroscopic data show that the visible component is an old, very metal-poor giant of the Galactic halo, at a distance of 590 pc. The BH in the Gaia BH3 system is more massive than any other Galactic stellar-origin BH known thus far.
The low metallicity of the star companion supports the scenario that metal-poor massive stars are progenitors of the high-mass BHs detected by gravitational-wave telescopes. The Galactic orbit of the system and its metallicity indicate that it might belong to the Sequoia halo substructure. Alternatively, and more plausibly, it could belong to the ED-2 stream, which likely originated from a globular cluster that had been disrupted by the Milky Way.

Submitted 19 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Comments: 23 pages, accepted for publication in A&A Letters. New version with small fixes

arXiv:2404.08594 [pdf, other]

Absolute dimensions of solar-type eclipsing binaries. NY Hya: A test for magnetic stellar evolution models

Authors: T. C. Hinse, O. Baştürk, J. Southworth, G. A. Feiden, J. Tregloan-Reed, V. B. Kostov, J. Livingston, E. M. Esmer, Mesut Yılmaz, Selçuk Yalçınkaya, Şeyma Torun, J. Vos, D. F. Evans, J. C. Morales, J. C. A. Wolf, E. H. Olsen, J. V. Clausen, B. E. Helt, C. T. K. Lý, O. Stahl, R. Wells, M. Herath, U. G. Jørgensen, M. Dominik, J. Skottfelt, et al. (7 additional authors not shown)

Abstract: The binary star NY Hya is a bright, detached, double-lined eclipsing system with an orbital period of just under five days with two components each nearly identical to the Sun and located in the solar neighbourhood. The objective of this study is to test and confront various stellar evolution models for solar-type stars based on accurate measurements of stellar mass and radius. We present new ground-based spectroscopic and photometric as well as high-precision space-based photometric and astrometric data from which we derive orbital as well as physical properties of the components via the method of least-squares minimisation based on a standard binary model valid for two detached components. Classic statistical techniques were invoked to test the significance of model parameters. Additional empirical evidence was compiled from the public domain; the derived system properties were compared with archival broad-band photometry data enabling a measurement of the system's spectral energy distribution that allowed an independent estimate of stellar properties. We also utilised semi-empirical calibration methods to derive atmospheric properties from Strömgren photometry and related colour indices. Data was used to confront the observed physical properties with classic and magnetic stellar evolution models.

Submitted 12 April, 2024; originally announced April 2024.

Comments: 34 pages, 19 figures, 13 tables, (accepted for publication in A&A)

arXiv:2404.05290 [pdf, other]

MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

Authors: Valerio Biscione, Dong Yin, Gaurav Malhotra, Marin Dujmovic, Milton L. Montero, Guillermo Puebla, Federico Adolfi, Rachel F. Heaton, John E. Hummel, Benjamin D. Evans, Karim Habashy, Jeffrey S. Bowers

Abstract: Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbox MindSet: Vision, consisting of a collection of image datasets and related scripts designed to test DNNs on 30 psychological findings. In all experimental conditions, the stimuli are systematically manipulated to test specific hypotheses regarding human visual perception and object recognition. In addition to providing pre-generated datasets of images, we provide code to regenerate these datasets, offering many configurable parameters which greatly extend the dataset versatility for different research contexts, and code to facilitate the testing of DNNs on these image datasets using three different methods (similarity judgments, out-of-distribution classification, and decoder method), accessible at https://github.com/MindSetVision/mindset-vision. We test ResNet-152 on each of these methods as an example of how the toolbox can be used.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.00463 [pdf, other]

Addressing Both Statistical and Causal Gender Fairness in NLP Models

Authors: Hannah Chen, Yangfeng Ji, David Evans

Abstract: Statistical fairness stipulates equivalent outcomes for every protected group, whereas causal fairness prescribes that a model makes the same prediction for an individual regardless of their protected characteristics. Counterfactual data augmentation (CDA) is effective for reducing bias in NLP models, yet models trained with CDA are often evaluated only on metrics that are closely tied to the causal fairness notion; similarly, sampling-based methods designed to promote statistical fairness are rarely evaluated for causal fairness. In this work, we evaluate both statistical and causal debiasing methods for gender bias in NLP models, and find that while such methods are effective at reducing bias as measured by the targeted metric, they do not necessarily improve results on other bias metrics. We demonstrate that combinations of statistical and causal debiasing techniques are able to reduce bias measured through both types of metrics.

Submitted 30 March, 2024; originally announced April 2024.

Comments: NAACL 2024 (Findings)

arXiv:2403.13168 [pdf]

Magnetoelectric coupling at the domain level in polycrystalline ErMnO3

Authors: J. Schultheiß, L. Puntigam, M. Winkler, S. Krohns, D. Meier, H. Das, D. M. Evans, I. Kézsmárki

Abstract: We explore the impact of a magnetic field on the ferroelectric domain pattern in polycrystalline hexagonal ErMnO3 at cryogenic temperatures. Utilizing piezoelectric force microscopy measurements at 1.65 K, we observe modifications of the topologically protected ferroelectric domain structure induced by the magnetic field. These alterations likely result from strain induced by the magnetic field, facilitated by intergranular coupling in polycrystalline multiferroics. Our findings give insights into the interplay between electric and magnetic properties at the local scale and represent a so far unexplored pathway for manipulating topologically protected ferroelectric vortex patterns in hexagonal manganites.

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2403.03519 [pdf, ps, other]

KIAS Lectures on Symplectic Aspects of Degenerations

Authors: Jonathan David Evans

Abstract: This is a series of three lectures I gave at the Korea Institute of Advanced Study in June 2019 at a workshop about "Algebraic and Symplectic Aspects of Degenerations of Complex Surfaces". I focus on the symplectic aspects, in particular on the case of cyclic quotient surface singularities. These notes have been available on a public Git repository since 2019, and I noticed that people occasionally cited them in the years since. For that reason, I decided to post them on arXiv for a more permanent record; I have made some small corrections and annotations but otherwise they are unchanged. These notes are a purely expository account of stuff I was thinking about 2016-2019, and are largely self-aggrandising.

Submitted 6 March, 2024; originally announced March 2024.

Comments: 41 pages

arXiv:2403.00063 [pdf, ps, other]

Abundances of Neutron-Capture Elements in 62 Stars in the Globular Cluster Messier 15

Authors: Jonathan Cabrera Garcia, Charli M. Sakari, Ian U. Roederer, Donavon W. Evans, Pedro Silva, Mario Mateo, Ying-Yi Song, Anthony Kremin, John I. Bailey III, Matthew G. Walker

Abstract: M15 is a globular cluster with a known spread in neutron-capture elements. This paper presents abundances of neutron-capture elements for 62 stars in M15. Spectra were obtained with the Michigan/Magellan Fiber System (M2FS) spectrograph, covering a wavelength range of ~4430-4630 Å. Spectral lines from Fe I, Fe II, Sr I, Zr II, Ba II, La II, Ce II, Nd II, Sm II, Eu II, and Dy II were measured, enabling classifications and neutron-capture abundance patterns for the stars. Of the 62 targets, 44 are found to be highly Eu-enhanced r-II stars, another 17 are moderately Eu-enhanced r-I stars, and one star is found to have an s-process signature. The neutron-capture patterns indicate that the majority of the stars are consistent with enrichment by the r-process. The 62 target stars are found to show significant star-to-star spreads in Sr, Zr, Ba, La, Ce, Nd, Sm, Eu, and Dy, but no significant spread in Fe. The neutron-capture abundances are further found to have slight correlations with sodium abundances from the literature, unlike what has been previously found; follow-up studies are needed to verify this result. The findings in this paper suggest that the Eu-enhanced stars in M15 were enhanced by the same process, that the nucleosynthetic source of this Eu pollution was the r-process, and that the r-process source occurred as the first generation of cluster stars was forming.

Submitted 29 February, 2024; originally announced March 2024.

Comments: Accepted for publication in the Astrophysical Journal

arXiv:2402.18558 [pdf, other]

Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks

Authors: Benjamin David Evans, Raphael Trumpp, Marco Caccamo, Felix Jahncke, Johannes Betz, Hendrik Willem Jordaan, Herman Arnold Engelbrecht

Abstract: The F1TENTH autonomous driving platform, consisting of 1:10-scale remote-controlled cars, has evolved into a well-established education and research platform. The many publications and real-world competitions span many domains, from classical path planning to novel learning-based algorithms. Consequently, the field is wide and disjointed, hindering direct comparison of developed methods and making it difficult to assess the state-of-the-art. Therefore, we aim to unify the field by surveying current approaches, describing common methods, and providing benchmark results to facilitate clear comparisons and establish a baseline for future work. This research aims to survey past and current work with F1TENTH vehicles in the classical and learning categories and explain the different solution approaches. We describe particle filter localisation, trajectory optimisation and tracking, model predictive contouring control, follow-the-gap, and end-to-end reinforcement learning. We provide an open-source evaluation of benchmark methods and investigate overlooked factors of control frequency and localisation accuracy for classical methods as well as reward signal and training map for learning methods. The evaluation shows that the optimisation and tracking method achieves the fastest lap times, followed by the online planning approach. Finally, our work identifies and outlines the relevant research aspects to help motivate future work in the F1TENTH domain.

Submitted 25 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: 12 pages, 18 figures. Submitted for publication

arXiv:2402.07841 [pdf, other]

Do Membership Inference Attacks Work on Large Language Models?

Authors: Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi

Abstract: Membership inference attacks (MIAs) attempt to predict whether a particular datapoint is a member of a target model's training data. Despite extensive research on traditional machine learning models, there has been limited work studying MIA on the pre-training data of large language models (LLMs). We perform a large-scale evaluation of MIAs over a suite of language models (LMs) trained on the Pile, ranging from 160M to 12B parameters. We find that MIAs barely outperform random guessing for most settings across varying LLM sizes and domains. Our further analyses reveal that this poor performance can be attributed to (1) the combination of a large dataset and few training iterations, and (2) an inherently fuzzy boundary between members and non-members. We identify specific settings where LLMs have been shown to be vulnerable to membership inference and show that the apparent success in such settings can be attributed to a distribution shift, such as when members and non-members are drawn from the seemingly identical domain but with different temporal ranges. We release our code and data as a unified benchmark package that includes all existing MIAs, supporting future work.

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2402.06797 [pdf]

doi 10.1103/PhysRevApplied.21.054035

The Influence of Chemical Strains on the Electrocaloric Response, Polarization Morphology, Tetragonality and Negative Capacitance Effect of Ferroelectric Core-Shell Nanorods and Nanowires

Authors: Anna N. Morozovska, Eugene A. Eliseev, Olha A. Kovalenko, Dean R. Evans

Abstract: Using the Landau-Ginzburg-Devonshire (LGD) approach, we propose an analytical description of the influence of chemical strains on the spontaneous polarization and electrocaloric response in ferroelectric core-shell nanorods. We postulate that the nanorod core represents a defect-free single-crystalline ferroelectric material, and that the elastic defects are accumulated in the ultra-thin shell, where they can induce tensile or compressive chemical strains. The finite element modeling (FEM) based on the LGD approach reveals transitions of domain structure morphology induced by the chemical strains in the BaTiO3 nanorods. Namely, tensile chemical strains induce and support the single-domain state in the central part of the nanorod, while the curled domain structures appear near the unscreened or partially screened ends of the rod. The vortex-like domains propagate toward the central part of the rod and fill it entirely when the rod is covered by a shell with compressive chemical strains above some critical value. The critical value depends on the nanorod sizes, aspect ratio, and screening conditions at its ends. Both analytical theory and FEM predict that the tensile chemical strains in the shell increase the nanorod polarization, lattice tetragonality, and electrocaloric response well above the values corresponding to the bulk material. The physical reason for the increase is the strong electrostriction coupling between the mismatch-type elastic strains induced in the core by the chemical strains in the shell. Comparison with the earlier XRD data confirmed an increase of tetragonality ratio in tensile-strained BaTiO3 nanorods compared to the bulk material.

Submitted 8 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

Comments: 37 pages, including 8 figures and 3 Appendices

Journal ref: Physical Review Applied 21, 054035 (2024)

arXiv:2401.17732 [pdf, other]

High-performance Racing on Unmapped Tracks using Local Maps

Authors: Benjamin David Evans, Hendrik Willem Jordaan, Herman Arnold Engelbrecht

Abstract: Map-based methods for autonomous racing estimate the vehicle's location, which is used to follow a high-level plan. While map-based optimisation methods demonstrate high-performance results, they are limited by requiring a map of the environment. In contrast, mapless methods can operate in unmapped contexts since they directly process raw sensor data (often LiDAR) to calculate commands. However, a major limitation in mapless methods is poor performance due to a lack of optimisation. In response, we propose the local map framework that uses easily extractable, low-level features to build local maps of the visible region that form the input to optimisation-based controllers. Our local map generation extracts the visible racetrack boundaries and calculates a centreline and track widths used for planning. We evaluate our method for simulated F1Tenth autonomous racing using a two-stage trajectory optimisation and tracking strategy and a model predictive controller. Our method achieves lap times that are 8.8% faster than the Follow-The-Gap method and 3.22% faster than end-to-end neural networks due to the optimisation resulting in a faster speed profile. The local map planner is 3.28% slower than global methods that have access to an entire map of the track that can be used for planning. Critically, our approach enables high-speed autonomous racing on unmapped tracks, achieving performance similar to global methods without requiring a track map.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 6 pages, 14 figures. Submitted to IV 2024

arXiv:2401.10835 [pdf]

doi 10.1016/j.matt.2024.04.024

Post-synthesis tuning of dielectric constant via ferroelectric domain wall engineering

Authors: L. Zhou, L. Puntigam, P. Lunkenheimer, E. Bourret, Z. Yan, I. Kézsmárki, D. Meier, S. Krohns, J. Schultheiß, D. M. Evans

Abstract: A promising mechanism for achieving colossal dielectric constants is to use insulating internal barrier layers, which typically form during synthesis and then remain in the material. It has recently been shown that insulating domain walls in ferroelectrics can act as such barriers. One advantage domain walls have, in comparison to stationary interfaces, is that they can be moved, offering the potential of post-synthesis control of the dielectric constant. However, to date, direct imaging of how changes in domain wall pattern cause a change in dielectric constant within a single sample has not been realized. In this work, we demonstrate that changing the domain wall density allows the engineering of the dielectric constant in hexagonal-ErMnO3 single crystals. The changes of the domain wall density are quantified via microscopy techniques, while the dielectric constant is determined via macroscopic dielectric spectroscopy measurements. The observed changes in the dielectric constant are quantitatively consistent with the observed variation in domain wall density, implying that the insulating domain walls behave as 'ideal' capacitors connected in series. Our approach to engineer the domain wall density can be readily extended to other control methods, e.g., electric fields or mechanical stresses, providing a novel degree of flexibility to in-situ tune the dielectric constant.

Submitted 19 January, 2024; originally announced January 2024.

arXiv:2311.11544 [pdf, other]

Understanding Variation in Subpopulation Susceptibility to Poisoning Attacks

Authors: Evan Rose, Fnu Suya, David Evans

Abstract: Machine learning is susceptible to poisoning attacks, in which an attacker controls a small fraction of the training data and chooses that data with the goal of inducing some behavior unintended by the model developer in the trained model. We consider a realistic setting in which the adversary with the ability to insert a limited number of data points attempts to control the model's behavior on a specific subpopulation. Inspired by previous observations on disparate effectiveness of random label-flipping attacks on different subpopulations, we investigate the properties that can impact the effectiveness of state-of-the-art poisoning attacks against different subpopulations. For a family of 2-dimensional synthetic datasets, we empirically find that dataset separability plays a dominant role in subpopulation vulnerability for less separable datasets. However, well-separated datasets exhibit more dependence on individual subpopulation properties. We further discover that a crucial subpopulation property is captured by the difference in loss on the clean dataset between the clean model and a target model that misclassifies the subpopulation, and a subpopulation is much easier to attack if the loss difference is small. This property also generalizes to high-dimensional benchmark datasets. For the Adult benchmark dataset, we show that we can find semantically-meaningful subpopulation properties that are related to the susceptibilities of a selected group of subpopulations. The results in this paper are accompanied by a fully interactive web-based visualization of subpopulation poisoning attacks found at https://uvasrg.github.io/visualizing-poisoning

Submitted 20 November, 2023; originally announced November 2023.

Comments: 18 pages, 11 figures

arXiv:2311.04272 [pdf, other]

The Future of Astronomical Data Infrastructure: Meeting Report

Authors: Michael R. Blanton, Janet D. Evans, Dara Norman, William O'Mullane, Adrian Price-Whelan, Luca Rizzi, Alberto Accomazzi, Megan Ansdell, Stephen Bailey, Paul Barrett, Steven Berukoff, Adam Bolton, Julian Borrill, Kelle Cruz, Julianne Dalcanton, Vandana Desai, Gregory P. Dubois-Felsmann, Frossie Economou, Henry Ferguson, Bryan Field, Dan Foreman-Mackey, Jaime Forero-Romero, Niall Gaffney, Kim Gillies, Matthew J. Graham , et al. (47 additional authors not shown)

Abstract: The astronomical community is grappling with the increasing volume and complexity of data produced by modern telescopes, due to difficulties in reducing, accessing, analyzing, and combining archives of data. To address this challenge, we propose the establishment of a coordinating body, an "entity," with the specific mission of enhancing the interoperability, archiving, distribution, and production of both astronomical data and software. This report is the culmination of a workshop held in February 2023 on the Future of Astronomical Data Infrastructure. Attended by 70 scientists and software professionals from ground-based and space-based missions and archives spanning the entire spectrum of astronomical research, the group deliberated on the prevailing state of software and data infrastructure in astronomy, identified pressing issues, and explored potential solutions. In this report, we describe the ecosystem of astronomical data, its existing flaws, and the many gaps, duplication, inconsistencies, barriers to access, drags on productivity, missed opportunities, and risks to the long-term integrity of essential data sets. We also highlight the successes and failures in a set of deep dives into several different illustrative components of the ecosystem, included as an appendix.

Submitted 7 November, 2023; originally announced November 2023.

Comments: 59 pages; please send comments and/or questions to foadi@googlegroups.com

arXiv:2310.20017 [pdf]

Direct imaging of spatial heterogeneities in type II superconductors

Authors: Donald M. Evans, Michele Conroy, Lukas Puntigam, Dorina Croitori, Lilian Prodan, James O. Douglas, Baptiste Gault, Vladimir Tsurkan

Abstract: Understanding the exotic properties of quantum materials, including high-temperature superconductors, remains a formidable challenge that demands direct insights into electronic conductivity. Current methodologies either capture a bulk average or near-atomically-resolved information, missing direct measurements at the critical intermediate length scales. Here, using the superconductor Fe(Se,Te) as a model system, we use low-temperature conductive atomic force microscopy (cAFM) to bridge this gap. Contrary to the uniform superconductivity anticipated from bulk assessments, cAFM uncovers micron-scale conductive intrusions within a relatively insulating matrix. Subsequent compositional mapping through atom probe tomography shows that differences in conductivity correlate with local changes in composition. cAFM, supported by advanced microscopy and microanalysis, represents a methodological breakthrough that can be used to navigate the intricate landscape of high-temperature superconductors and the broader realm of quantum materials. Such fundamental information is critical for theoretical understanding and future guided design.

Submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.18362 [pdf, ps, other]

SoK: Memorization in General-Purpose Large Language Models

Authors: Valentin Hartmann, Anshuman Suri, Vincent Bindschaedler, David Evans, Shruti Tople, Robert West

Abstract: Large Language Models (LLMs) are advancing at a remarkable pace, with myriad applications under development. Unlike most earlier machine learning models, they are no longer built for one specific application but are designed to excel in a wide range of tasks. A major part of this success is due to their huge training datasets and the unprecedented number of model parameters, which allow them to memorize large amounts of information contained in the training data. This memorization goes beyond mere language, and encompasses information only present in a few documents. This is often desirable since it is necessary for performing tasks such as question answering, and therefore an important part of learning, but also brings a whole array of issues, from privacy and security to copyright and beyond. LLMs can memorize short secrets in the training data, but can also memorize concepts like facts or writing styles that can be expressed in text in many different ways. We propose a taxonomy for memorization in LLMs that covers verbatim text, facts, ideas and algorithms, writing styles, distributional properties, and alignment goals. We describe the implications of each type of memorization - both positive and negative - for model performance, privacy, security and confidentiality, copyright, and auditing, and ways to detect and prevent memorization. We further highlight the challenges that arise from the predominant way of defining memorization with respect to model behavior instead of model weights, due to LLM-specific phenomena such as reasoning capabilities or differences between decoding algorithms. Throughout the paper, we describe potential risks and opportunities arising from memorization in LLMs that we hope will motivate new research directions.

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.17534 [pdf, other]

SoK: Pitfalls in Evaluating Black-Box Attacks

Authors: Fnu Suya, Anshuman Suri, Tingwei Zhang, Jingtao Hong, Yuan Tian, David Evans

Abstract: Numerous works study black-box attacks on image classifiers. However, these works make different assumptions on the adversary's knowledge and current literature lacks a cohesive organization centered around the threat model. To systematize knowledge in this area, we propose a taxonomy over the threat space spanning the axes of feedback granularity, the access of interactive queries, and the quality and quantity of the auxiliary data available to the attacker. Our new taxonomy provides three key insights. 1) Despite extensive literature, numerous under-explored threat spaces exist, which cannot be trivially solved by adapting techniques from well-explored settings. We demonstrate this by establishing a new state-of-the-art in the less-studied setting of access to top-k confidence scores by adapting techniques from well-explored settings of accessing the complete confidence vector, but show how it still falls short of the more restrictive setting that only obtains the prediction label, highlighting the need for more research. 2) Identifying the threat model of different attacks uncovers stronger baselines that challenge prior state-of-the-art claims. We demonstrate this by enhancing an initially weaker baseline (under interactive query access) via surrogate models, effectively overturning claims in the respective paper. 3) Our taxonomy reveals interactions between attacker knowledge that connect well to related areas, such as model inversion and extraction attacks. We discuss how advances in other areas can enable potentially stronger black-box attacks. Finally, we emphasize the need for a more realistic assessment of attack success by factoring in local attack runtime. This approach reveals the potential for certain attacks to achieve notably higher success rates and the need to evaluate attacks in diverse and harder settings, highlighting the need for better selection criteria.

Submitted 14 February, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

Comments: Accepted at SaTML 2024

arXiv:2310.08500 [pdf, other]

Progress towards ultracold Sr for the AION project -- sub-microkelvin atoms and an optical-heterodyne diagnostic tool for injection-locked laser diodes

Authors: E. Pasatembou, C. F. A. Baynham, O. Buchmüller, D. Evans, R. Hobson, L. Iannizzotto-Venezze, A. Josset

Abstract: Long-baseline atom interferometers, such as the one to be built by the AION collaboration, require ultra-cold atomic clouds. These are produced by trapping the atoms in Magneto-Optical Traps (MOTs) using high-power, narrow-linewidth lasers. We report on the laser and optical master-slave injection locked system used to address the 1S0 - 3P1 strontium transition at 689 nm, and on the trapping of strontium atoms in a narrowband MOT. We demonstrate the quality of the injection through the characterisation of the injection lock using a novel, easy-to-assemble method which uses a double pass acousto-optic modulator (AOM) to generate and detect a heterodyne beatnote. The reported system is used to produce an atomic cloud at a temperature of 812 +/- 43 nK in a narrowband red MOT.

Submitted 12 October, 2023; originally announced October 2023.

Report number: AION-REPORT/2023-10

arXiv:2310.08183 [pdf, other]

Terrestrial Very-Long-Baseline Atom Interferometry: Workshop Summary

Authors: Sven Abend, Baptiste Allard, Iván Alonso, John Antoniadis, Henrique Araujo, Gianluigi Arduini, Aidan Arnold, Tobias Aßmann, Nadja Augst, Leonardo Badurina, Antun Balaz, Hannah Banks, Michele Barone, Michele Barsanti, Angelo Bassi, Baptiste Battelier, Charles Baynham, Beaufils Quentin, Aleksandar Belic, Ankit Beniwal, Jose Bernabeu, Francesco Bertinelli, Andrea Bertoldi, Ikbal Ahamed Biswas, Diego Blas , et al. (228 additional authors not shown)

Abstract: This document presents a summary of the 2023 Terrestrial Very-Long-Baseline Atom Interferometry Workshop hosted by CERN. The workshop brought together experts from around the world to discuss the exciting developments in large-scale atom interferometer (AI) prototypes and their potential for detecting ultralight dark matter and gravitational waves. The primary objective of the workshop was to lay the groundwork for an international TVLBAI proto-collaboration. This collaboration aims to unite researchers from different institutions to strategize and secure funding for terrestrial large-scale AI projects. The ultimate goal is to create a roadmap detailing the design and technology choices for one or more km-scale detectors, which will be operational in the mid-2030s. The key sections of this report present the physics case and technical challenges, along with a comprehensive overview of the discussions at the workshop and the main conclusions.

Submitted 12 October, 2023; originally announced October 2023.

Comments: Summary of the Terrestrial Very-Long-Baseline Atom Interferometry Workshop held at CERN: https://indico.cern.ch/event/1208783/

arXiv:2310.06551 [pdf, other]

doi 10.1051/0004-6361/202347203

Gaia Focused Product Release: Sources from Service Interface Function image analysis -- Half a million new sources in omega Centauri

Authors: Gaia Collaboration, K. Weingrill, A. Mints, J. Castañeda, Z. Kostrzewa-Rutkowska, M. Davidson, F. De Angeli, J. Hernández, F. Torra, M. Ramos-Lerate, C. Babusiaux, M. Biermann, C. Crowley, D. W. Evans, L. Lindegren, J. M. Martín-Fleitas, L. Palaversa, D. Ruz Mieres, K. Tisanić, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, A. Barbier , et al. (378 additional authors not shown)

Abstract: Gaia's readout window strategy is challenged by very dense fields in the sky. Therefore, in addition to standard Gaia observations, full Sky Mapper (SM) images were recorded for nine selected regions in the sky. A new software pipeline exploits these Service Interface Function (SIF) images of crowded fields (CFs), making use of the availability of the full two-dimensional (2D) information. This new pipeline produced half a million additional Gaia sources in the region of the omega Centauri ($ω$ Cen) cluster, which are published with this Focused Product Release. We discuss the dedicated SIF CF data reduction pipeline, validate its data products, and introduce their Gaia archive table. Our aim is to improve the completeness of the Gaia source inventory in a very dense region in the sky, $ω$ Cen. An adapted version of Gaia's Source Detection and Image Parameter Determination software located sources in the 2D SIF CF images. We validated the results by comparing them to the public Gaia DR3 catalogue and external Hubble Space Telescope data. With this Focused Product Release, 526 587 new sources have been added to the Gaia catalogue in $ω$ Cen. Apart from positions and brightnesses, the additional catalogue contains parallaxes and proper motions, but no meaningful colour information. While SIF CF source parameters generally have a lower precision than nominal Gaia sources, in the cluster centre they increase the depth of the combined catalogue by three magnitudes and improve the source density by a factor of ten. This first SIF CF data publication already adds great value to the Gaia catalogue. It demonstrates what to expect for the fourth Gaia catalogue, which will contain additional sources for all nine SIF CF regions.

Submitted 8 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

Journal ref: A&A 680, A35 (2023)

arXiv:2310.06295 [pdf, other]

doi 10.1051/0004-6361/202347273

Gaia Focused Product Release: A catalogue of sources around quasars to search for strongly lensed quasars

Authors: Gaia Collaboration, A. Krone-Martins, C. Ducourant, L. Galluccio, L. Delchambre, I. Oreshina-Slezak, R. Teixeira, J. Braine, J. -F. Le Campion, F. Mignard, W. Roux, A. Blazere, L. Pegoraro, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux, A. Barbier, M. Biermann, O. L. Creevey, D. W. Evans, L. Eyer, R. Guerra, et al. (376 additional authors not shown)

Abstract: Context. Strongly lensed quasars are fundamental sources for cosmology. The Gaia space mission covers the entire sky with the unprecedented resolution of $0.18$" in the optical, making it an ideal instrument to search for gravitational lenses down to the limiting magnitude of 21. Nevertheless, the previous Gaia Data Releases are known to be incomplete for small angular separations such as those expected for most lenses. Aims. We present the Data Processing and Analysis Consortium GravLens pipeline, which was built to analyse all Gaia detections around quasars and to cluster them into sources, thus producing a catalogue of secondary sources around each quasar. We analysed the resulting catalogue to produce scores that indicate source configurations that are compatible with strongly lensed quasars. Methods. GravLens uses the DBSCAN unsupervised clustering algorithm to detect sources around quasars. The resulting catalogue of multiplets is then analysed with several methods to identify potential gravitational lenses. We developed and applied an outlier scoring method, a comparison between the average BP and RP spectra of the components, and we also used an extremely randomised tree algorithm. These methods produce scores to identify the most probable configurations and to establish a list of lens candidates. Results. We analysed the environment of 3 760 032 quasars. A total of 4 760 920 sources, including the quasars, were found within 6" of the quasar positions. This list is given in the Gaia archive. In 87% of cases, the quasar remains a single source, and in 501 385 cases neighbouring sources were detected. We propose a list of 381 lensed candidates, of which we identified 49 as the most promising. Beyond these candidates, the associated tables in this Focused Product Release allow the entire community to explore the unique Gaia data for strong lensing studies further.

Submitted 10 October, 2023; originally announced October 2023.

Comments: 35 pages, 60 figures, accepted for publication by Astronomy and Astrophysics

Journal ref: A&A 685, A130 (2024)

arXiv:2310.06051 [pdf, other]

Gaia Focused Product Release: Radial velocity time series of long-period variables

Authors: Gaia Collaboration, M. Trabucchi, N. Mowlavi, T. Lebzelter, I. Lecoeur-Taibi, M. Audard, L. Eyer, P. García-Lario, P. Gavras, B. Holl, G. Jevardat de Fombelle, K. Nienartowicz, L. Rimoldini, P. Sartoretti, R. Blomme, Y. Frémat, O. Marchal, Y. Damerdji, A. G. A. Brown, A. Guerrier, P. Panuzzo, D. Katz, G. M. Seabroke, K. Benson, et al. (382 additional authors not shown)

Abstract: The third Gaia Data Release (DR3) provided photometric time series of more than 2 million long-period variable (LPV) candidates. Anticipating the publication of full radial-velocity (RV) in DR4, this Focused Product Release (FPR) provides RV time series for a selection of LPVs with high-quality observations. We describe the production and content of the Gaia catalog of LPV RV time series, and the methods used to compute variability parameters published in the Gaia FPR. Starting from the DR3 LPVs catalog, we applied filters to construct a sample of sources with high-quality RV measurements. We modeled their RV and photometric time series to derive their periods and amplitudes, and further refined the sample by requiring compatibility between the RV period and at least one of the $G$, $G_{\rm BP}$, or $G_{\rm RP}$ photometric periods. The catalog includes RV time series and variability parameters for 9 614 sources in the magnitude range $6\lesssim G/{\rm mag}\lesssim 14$, including a flagged top-quality subsample of 6 093 stars whose RV periods are fully compatible with the values derived from the $G$, $G_{\rm BP}$, and $G_{\rm RP}$ photometric time series. The RV time series contain a mean of 24 measurements per source taken unevenly over a duration of about three years. We identify the great majority of sources (88%) as genuine LPVs, with about half of them showing a pulsation period and the other half displaying a long secondary period. The remaining 12% consists of candidate ellipsoidal binaries. Quality checks against RVs available in the literature show excellent agreement. We provide illustrative examples and cautionary remarks. The publication of RV time series for almost 10 000 LPVs constitutes, by far, the largest such database available to date in the literature. The availability of simultaneous photometric measurements gives a unique added value to the Gaia catalog (abridged).

Submitted 9 October, 2023; originally announced October 2023.

Comments: 36 pages, 38 figures

arXiv:2310.03562 [pdf, other]

doi 10.1051/0004-6361/202347371

Gaia data processing. SEAPipe: The source environment analysis pipeline

Authors: D. L. Harrison, F. van Leeuwen, P. J. Osborne, P. W. Burgess, F. De Angeli, D. W. Evans

Abstract: Aims. To describe two potential options for the Source Environment Analysis pipeline, SEAPipe, for the Gaia mission. This pipeline will enable the discovery of sources which are new to Gaia, in the sense that they were not found by the on-board detection algorithm. These additional sources (secondaries) are discoverable in the vicinity of those Gaia sources (primaries) that were found by the on-board detection. Methods. The main algorithmic steps required are described: the 2-dimensional image reconstruction of 1-dimensional transit data, the analysis of these images to find the additional sources present, and the determination of the mean positions, proper motions, parallaxes and brightness of these sources. Additionally, the Monte Carlo simulations used to characterise the performance of the pipelines are described. Results. The performance of the two options for SEAPipe, the vanilla and image-subtraction versions, is compared. Their selection functions are computed in terms of the magnitude of the secondary sources and their angular separations from their corresponding primary source. The completeness and purity of the resultant catalogue of secondary sources as found by each of the pipelines, given the expected magnitude distribution of the primary sources and the magnitude and angular separation distributions of the secondary sources, are also presented. The image-subtraction pipeline is shown to out-perform the vanilla pipeline.

Submitted 5 October, 2023; originally announced October 2023.

Comments: 13 pages, 14 figures. Accepted by A&A

Journal ref: A&A 679, A158 (2023)

arXiv:2309.10341 [pdf, other]

Theory of Nonequilibrium Symmetry-Breaking Coexistence and Active Crystallization

Authors: Daniel Evans, Ahmad K. Omar

Abstract: Crystallization is perhaps the most familiar example of a symmetry-breaking transition. In equilibrium, thermodynamic arguments result in a powerful and convenient set of criteria for determining the coexistence curves associated with these transitions. In recent years, nonequilibrium symmetry-breaking transitions have been routinely observed in a variety of natural and synthetic systems. The breaking of detailed balance, and the resulting absence of Boltzmann statistics, motivates the need for a symmetry-breaking coexistence theory that is independent of the underlying distribution of microstates. Here, we develop such a theory, relying only on mechanics, balance laws, and system symmetries. In doing so, we develop a generalized Gibbs-Duhem relation that results in nonequilibrium coexistence criteria solely in terms of bulk equations of state. We apply our framework to active crystallization, developing a complete description of the phase diagram of active Brownian hard spheres. Our predicted phase diagram quantitatively recapitulates the solid-fluid coexistence curve as well as other key features of active phase behavior, such as the liquid-gas coexistence binodal and solid-liquid-gas triple point. It is our hope that our findings offer a concrete path forward towards the development of a general theory for nonequilibrium coexistence.

Submitted 17 October, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

Comments: comments welcome

arXiv:2309.06651 [pdf, other]

ConR: Contrastive Regularizer for Deep Imbalanced Regression

Authors: Mahsa Keramati, Lili Meng, R. David Evans

Abstract: Imbalanced distributions are ubiquitous in real-world data. They create constraints on Deep Neural Networks to represent the minority labels and avoid bias towards majority labels. The extensive body of imbalanced approaches addresses categorical label spaces but fails to effectively extend to regression problems where the label space is continuous. Local and global correlations among continuous labels provide valuable insights towards effectively modelling relationships in feature space. In this work, we propose ConR, a contrastive regularizer that models global and local label similarities in feature space and prevents the features of minority samples from being collapsed into their majority neighbours. ConR discerns the disagreements between the label space and feature space and imposes a penalty on these disagreements. ConR addresses the continuous nature of label space with two main strategies in a contrastive manner: incorrect proximities are penalized proportionate to the label similarities and the correct ones are encouraged to model local similarities. ConR consolidates essential considerations into a generic, easy-to-integrate, and efficient method that effectively addresses deep imbalanced regression. Moreover, ConR is orthogonal to existing approaches and smoothly extends to uni- and multi-dimensional label spaces. Our comprehensive experiments show that ConR significantly boosts the performance of all the state-of-the-art methods on four large-scale deep imbalanced regression benchmarks. Our code is publicly available at https://github.com/BorealisAI/ConR.

Submitted 13 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

arXiv:2308.14942 [pdf]

Woolf et al.'s GWAS by subtraction is not useful for cross-generational Mendelian randomization studies

Authors: David M Evans, George Davey Smith, Gunn-Helen Moen

Abstract: Mendelian randomization (MR) is an epidemiological method that can be used to strengthen causal inference regarding the relationship between a modifiable environmental exposure and a medically relevant trait and to estimate the magnitude of this relationship. Recently, there has been considerable interest in using MR to examine potential causal relationships between parental phenotypes and outcomes amongst their offspring. In a recent issue of BMC Research Notes, Woolf et al. (2023) present a new method, GWAS by subtraction, to derive genome-wide summary statistics for paternal smoking and other paternal phenotypes with the goal that these estimates can then be used in downstream (including two sample) MR studies. Whilst a potentially useful goal, Woolf et al. (2023) focus on the wrong parameter of interest for useful genome-wide association studies (GWAS) and downstream cross-generational MR studies, and the estimator that they derive is neither efficient nor appropriate for such use.

Submitted 28 August, 2023; originally announced August 2023.

Comments: 8 pages, 0 figures

arXiv:2308.11044 [pdf]

Strain-Induced Polarization Enhancement in BaTiO$_3$ Core-Shell Nanoparticles

Authors: Eugene A. Eliseev, Anna N. Morozovska, Sergei V. Kalinin, Dean R. Evans

Abstract: Despite fascinating experimental results, the influence of defects and elastic strains on the physical state of nanosized ferroelectrics is still poorly explored theoretically. One of the unresolved theoretical problems is the analytical description of the strongly enhanced spontaneous polarization, piezoelectric response, and dielectric properties of ferroelectric oxide thin films and core-shell nanoparticles induced by elastic strains and stresses. In particular, the 10-nm quasi-spherical BaTiO$_3$ core-shell nanoparticles reveal a giant spontaneous polarization up to 130 $\mu$C/cm$^2$, where the physical origin is a large Ti off-centering. The available theoretical description cannot explain the giant spontaneous polarization observed in these spherical nanoparticles. This work analyzes polar properties of BaTiO$_3$ core-shell spherical nanoparticles using the Landau-Ginzburg-Devonshire approach, which considers the nonlinear electrostriction coupling and large Vegard strains in the shell. We reveal that a spontaneous polarization greater than 50 $\mu$C/cm$^2$ can be stable in a (10-100) nm BaTiO$_3$ core at room temperature, where a 5 nm paraelectric shell is stretched by (3-6)% due to Vegard strains, which contribute to the elastic mismatch at the core-shell interface. The polarization value 50 $\mu$C/cm$^2$ corresponds to high tetragonality ratios (1.02 - 1.04), which is further increased up to 100 $\mu$C/cm$^2$ by higher Vegard strains and/or intrinsic surface stresses leading to unphysically high tetragonality ratios (1.08 - 1.16). The nonlinear electrostriction coupling and the elastic mismatch at the core-shell interface are key physical factors of the spontaneous polarization enhancement in the core. Doping with the highly-polarized core-shell nanoparticles can be useful in optoelectronics and nonlinear optics, electric field enhancement, reduced switching voltages, catalysis, and electrocaloric nanocoolers.

Submitted 27 August, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

Comments: 34 pages, including 5 figures and 1 Appendix

arXiv:2307.06592 [pdf, ps, other]

Noncommutative crepant resolutions of $cA_n$ singularities via Fukaya categories

Authors: Jonathan David Evans, Yanki Lekili

Abstract: We compute the wrapped Fukaya category $\mathcal{W}(T^*S^1, D)$ of a cylinder relative to a divisor $D= \{p_1,\ldots, p_n\}$ of $n$ points, proving a mirror equivalence with the category of perfect complexes on a crepant resolution (over $k[t_0,\ldots, t_n]$) of the singularity $uv=t_0t_1\ldots t_n$. Upon making the base-change $t_i= f_i(x,y)$, we obtain the derived category of any crepant resolution of the $cA_{n}$ singularity given by the equation $uv= f_0\ldots f_n$. These categories inherit braid group actions via the action on $\mathcal{W}(T^*S^1,D)$ of the mapping class group of $T^*S^1$ fixing $D$. We also give a geometric model of the derived contraction algebra of a $cA_n$ singularity in terms of the relative Fukaya category of the disc.

Submitted 26 September, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

Comments: 26 pages, 8 figures. A Fukaya categorical description of the (derived) contraction algebra is added in Section 6

MSC Class: 53D37; 14J33; 16S38; 14A22

arXiv:2307.05212 [pdf, other]

Preparing for Gaia Searches for Optical Counterparts of Gravitational Wave Events during O4

Authors: Sumedha Biswas, Zuzanna Kostrzewa-Rutkowska, Peter G. Jonker, Paul Vreeswijk, Deepak Eappachen, Paul J. Groot, Simon Hodgkin, Abdullah Yoldas, Guy Rixon, Diana Harrison, M. van Leeuwen, Dafydd Evans

Abstract: The discovery of gravitational wave (GW) events and the detection of electromagnetic counterparts from GW170817 has started the era of multimessenger GW astronomy. The field has been developing rapidly and in this paper, we discuss the preparation for detecting these events with the ESA Gaia satellite, during the 4th observing run of the LIGO-Virgo-KAGRA (LVK) collaboration that started on May 24, 2023. Gaia is contributing to the search for GW counterparts by a new transient detection pipeline called GaiaX. In GaiaX, a new source appearing in the field of view of only one of the two telescopes on-board Gaia is sufficient to send out an alert on the possible detection of a new transient. Ahead of O4, an experiment was conducted over a period of about two months. During the two weeks around New Moon in this period of time, the MeerLICHT (ML) telescope located in South Africa tried (weather permitting) to observe the same region of the sky as Gaia within 10 minutes. Any GaiaX detected transient was published publicly. ML and Gaia have similar limiting magnitudes for typical seeing conditions at ML. At the end of the experiment, we had 11861 GaiaX candidate transients and 15806 ML candidate transients, which we further analysed and the results of which are presented in this paper. Finally, we discuss the possibility and capabilities of Gaia contributing to the search for electromagnetic counterparts of gravitational wave events during O4 through the GaiaX detection and alert procedure.

Submitted 21 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

Comments: 13 pages, 9 figures; Accepted for publication in MNRAS

arXiv:2307.01073 [pdf, other]

What Distributions are Robust to Indiscriminate Poisoning Attacks for Linear Learners?

Authors: Fnu Suya, Xiao Zhang, Yuan Tian, David Evans

Abstract: We study indiscriminate poisoning for linear learners where an adversary injects a few crafted examples into the training data with the goal of forcing the induced model to incur higher test error. Inspired by the observation that linear learners on some datasets are able to resist the best known attacks even without any defenses, we further investigate whether datasets can be inherently robust to indiscriminate poisoning attacks for linear learners. For theoretical Gaussian distributions, we rigorously characterize the behavior of an optimal poisoning attack, defined as the poisoning strategy that attains the maximum risk of the induced model at a given poisoning budget. Our results prove that linear learners can indeed be robust to indiscriminate poisoning if the class-wise data distributions are well-separated with low variance and the size of the constraint set containing all permissible poisoning points is also small. These findings largely explain the drastic variation in empirical attack performance of the state-of-the-art poisoning attacks on linear learners across benchmark datasets, making an important initial step towards understanding the underlying reasons some learning tasks are vulnerable to data poisoning attacks.

Submitted 9 November, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: NeurIPS 2023 camera-ready version, 39 pages

arXiv:2307.00423 [pdf, ps, other]

Spectral Sequence Computation of Higher Twisted $K$-Groups of $SU(n)$

Authors: David E. Evans, Ulrich Pennig

Abstract: Motivated by the Freed-Hopkins-Teleman theorem we study equivariant higher twists of $K$-theory for the groups $G = SU(n)$ induced by exponential functors. We compute the rationalisation of these groups for all $n$ and all non-trivial functors $F$ using the Mayer-Vietoris spectral sequence. Similar to the classical case only the $K$-theory in degree $\dim(G)$ is non-trivial and the non-vanishing group is a quotient of a localisation of the representation ring $R(G) \otimes \mathbb{Q}$ by a higher fusion ideal $J_{F,\mathbb{Q}}$. We give generators for this ideal and prove that these can be obtained as derivatives of a potential.

Submitted 1 July, 2023; originally announced July 2023.

Comments: 33 pages, one figure

MSC Class: 19L50; 19L47; 46L80

arXiv:2306.07003 [pdf, other]

High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning

Authors: Benjamin David Evans, Herman Arnold Engelbrecht, Hendrik Willem Jordaan

Abstract: The classical method of autonomous racing uses real-time localisation to follow a precalculated optimal trajectory. In contrast, end-to-end deep reinforcement learning (DRL) can train agents to race using only raw LiDAR scans. While classical methods prioritise optimization for high-performance racing, DRL approaches have focused on low-performance contexts with little consideration of the speed profile. This work addresses the problem of using end-to-end DRL agents for high-speed autonomous racing. We present trajectory-aided learning (TAL) that trains DRL agents for high-performance racing by incorporating the optimal trajectory (racing line) into the learning formulation. Our method is evaluated using the TD3 algorithm on four maps in the open-source F1Tenth simulator. The results demonstrate that our method achieves a significantly higher lap completion rate at high speeds compared to the baseline. This is due to TAL training the agent to select a feasible speed profile of slowing down in the corners and roughly tracking the optimal trajectory.

Submitted 12 June, 2023; originally announced June 2023.

Comments: 7 pages, 16 figures. Submitted for review

arXiv:2305.20060 [pdf, other]

Centralised Design and Production of the Ultra-High Vacuum and Laser-Stabilisation Systems for the AION Ultra-Cold Strontium Laboratories

Authors: B. Stray, O. Ennis, S. Hedges, S. Dey, M. Langlois, K. Bongs, S. Lellouch, M. Holynski, B. Bostwick, J. Chen, Z. Eyler, V. Gibson, T. L. Harte, M. Hsu, M. Karzazi, J. Mitchell, N. Mouelle, U. Schneider, Y. Tang, K. Tkalcec, Y. Zhi, K. Clarke, A. Vick, K. Bridges, J. Coleman, et al. (47 additional authors not shown)

Abstract: This paper outlines the centralised design and production of the Ultra-High-Vacuum sidearm and Laser-Stabilisation systems for the AION Ultra-Cold Strontium Laboratories. Commissioning data on the residual gas and steady-state pressures in the sidearm chambers, on magnetic field quality, on laser stabilisation, and on the loading rate for the 3D Magneto-Optical Trap are presented. Streamlining the design and production of the sidearm and laser stabilisation systems enabled the AION Collaboration to build and equip in parallel five state-of-the-art Ultra-Cold Strontium Laboratories within 24 months by leveraging key expertise in the collaboration. This approach could serve as a model for the development and construction of other cold atom experiments, such as atomic clock experiments and neutral atom quantum computing systems, by establishing dedicated design and production units at national laboratories.

Submitted 31 May, 2023; originally announced May 2023.

Comments: 27 pages, 21 figures

Report number: AION-REPORT/2023-03

arXiv:2305.18820 [pdf, other]

Robust Reinforcement Learning Objectives for Sequential Recommender Systems

Authors: Melissa Mozifian, Tristan Sylvain, Dave Evans, Lili Meng

Abstract: Attention-based sequential recommendation methods have shown promise in accurately capturing users' evolving interests from their past interactions. Recent research has also explored the integration of reinforcement learning (RL) into these models, in addition to generating superior user representations. By framing sequential recommendation as an RL problem with reward signals, we can develop recommender systems that incorporate direct user feedback in the form of rewards, enhancing personalization for users. Nonetheless, employing RL algorithms presents challenges, including off-policy training, expansive combinatorial action spaces, and the scarcity of datasets with sufficient reward signals. Contemporary approaches have attempted to combine RL and sequential modeling, incorporating contrastive-based objectives and negative sampling strategies for training the RL component. In this work, we further emphasize the efficacy of contrastive-based objectives paired with augmentation to address datasets with extended horizons. Additionally, we recognize the potential instability issues that may arise during the application of negative sampling. These challenges primarily stem from the data imbalance prevalent in real-world datasets, which is a common issue in offline RL contexts. Furthermore, we introduce an enhanced methodology aimed at providing a more effective solution to these challenges. Experimental results across several real datasets show our method with increased robustness and state-of-the-art performance.

Submitted 17 April, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

arXiv:2304.06929 [pdf]

Advancing Differential Privacy: Where We Are Now and Future Directions for Real-World Deployment

Authors: Rachel Cummings, Damien Desfontaines, David Evans, Roxana Geambasu, Yangsibo Huang, Matthew Jagielski, Peter Kairouz, Gautam Kamath, Sewoong Oh, Olga Ohrimenko, Nicolas Papernot, Ryan Rogers, Milan Shen, Shuang Song, Weijie Su, Andreas Terzis, Abhradeep Thakurta, Sergei Vassilvitskii, Yu-Xiang Wang, Li Xiong, Sergey Yekhanin, Da Yu, Huanyu Zhang, Wanrong Zhang

Abstract: In this article, we present a detailed review of current practices and state-of-the-art methodologies in the field of differential privacy (DP), with a focus on advancing DP's deployment in real-world applications. Key points and high-level contents of the article originated from the discussions at "Differential Privacy (DP): Challenges Towards the Next Frontier," a workshop held in July 2022 with experts from industry, academia, and the public sector seeking answers to broad questions pertaining to privacy and its implications in the design of industry-grade systems. This article aims to provide a reference point for the algorithmic and design decisions within the realm of privacy, highlighting important challenges and potential research directions. Covering a wide spectrum of topics, this article delves into the infrastructure needs for designing private systems, methods for achieving better privacy/utility trade-offs, performing privacy attacks and auditing, as well as communicating privacy with broader audiences and stakeholders.

Submitted 12 March, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

arXiv:2303.11643 [pdf, other]

Manipulating Transfer Learning for Property Inference

Authors: Yulong Tian, Fnu Suya, Anshuman Suri, Fengyuan Xu, David Evans

Abstract: Transfer learning is a popular method for tuning pretrained (upstream) models for different downstream tasks using limited data and computational resources. We study how an adversary with control over an upstream model used in transfer learning can conduct property inference attacks on a victim's tuned downstream model. For example, to infer the presence of images of a specific individual in the downstream training set. We demonstrate attacks in which an adversary can manipulate the upstream model to conduct highly effective and specific property inference attacks (AUC score $> 0.9$), without incurring significant performance loss on the main task. The main idea of the manipulation is to make the upstream model generate activations (intermediate features) with different distributions for samples with and without a target property, thus enabling the adversary to distinguish easily between downstream models trained with and without training examples that have the target property. Our code is available at https://github.com/yulongt23/Transfer-Inference.

Submitted 21 March, 2023; originally announced March 2023.

Comments: Accepted to CVPR 2023

arXiv:2303.04459 [pdf, ps, other]

doi 10.1090/bull/1799

Subfactors and Mathematical Physics

Authors: David E. Evans, Yasuyuki Kawahigashi

Abstract: This paper surveys the long-standing connections and impact between Vaughan Jones's theory of subfactors and various topics in mathematical physics, namely statistical mechanics, quantum field theory, quantum information and two-dimensional conformal field theory.

Submitted 8 March, 2023; originally announced March 2023.

Comments: 22 pages, 1 figure. To appear in an issue of the Bulletin of the AMS, dedicated to the mathematical legacy of Vaughan Jones

MSC Class: 46L37; 17B69; 18D10; 81R10; 81T05; 81T40; 82B20; 82B23

Journal ref: Bull. Amer. Math. Soc. 60 (2023), 459-482

arXiv:2303.01621 [pdf, other]

GlucoSynth: Generating Differentially-Private Synthetic Glucose Traces

Authors: Josephine Lamp, Mark Derdzinski, Christopher Hannemann, Joost van der Linden, Lu Feng, Tianhao Wang, David Evans

Abstract: We focus on the problem of generating high-quality, private synthetic glucose traces, a task generalizable to many other time series sources. Existing methods for time series data synthesis, such as those using Generative Adversarial Networks (GANs), are not able to capture the innate characteristics of glucose data and cannot provide any formal privacy guarantees without severely degrading the utility of the synthetic data. In this paper we present GlucoSynth, a novel privacy-preserving GAN framework to generate synthetic glucose traces. The core intuition behind our approach is to conserve relationships amongst motifs (glucose events) within the traces, in addition to temporal dynamics. Our framework incorporates differential privacy mechanisms to provide strong formal privacy guarantees. We provide a comprehensive evaluation on the real-world utility of the data using 1.2 million glucose traces; GlucoSynth outperforms all previous methods in its ability to generate high-quality synthetic glucose traces with strong privacy guarantees.

Submitted 31 October, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

Journal ref: Advances in Neural Information Processing Systems 36 (2023)

Showing 1–50 of 479 results for author: Evans, D