-
Multiplicity dependent $J/ψ$ and $ψ(2S)$ production at forward and backward rapidity in $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
S. Antsupov,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
E. Bannikov,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok
, et al. (276 additional authors not shown)
Abstract:
The $J/ψ$ and $ψ(2S)$ charmonium states, composed of $c\bar{c}$ quark pairs and known since the 1970s, are widely believed to serve as ideal probes to test quantum chromodynamics in high-energy hadronic interactions. However, there is not yet a complete understanding of the charmonium-production mechanism. Recent measurements of $J/ψ$ production as a function of event charged-particle multiplicity at the collision energies of both the Large Hadron Collider (LHC) and the Relativistic Heavy Ion Collider (RHIC) show enhanced $J/ψ$ production yields with increasing multiplicity. One potential explanation for this type of dependence is multiparton interactions (MPI). We carry out the first measurements of self-normalized $J/ψ$ yields and the $ψ(2S)$ to $J/ψ$ ratio at both forward and backward rapidities as a function of self-normalized charged-particle multiplicity in $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV. In addition, detailed PYTHIA studies tuned to RHIC energies were performed to investigate the MPI impacts. We find that the PHENIX data at RHIC are consistent with recent LHC measurements and can only be described by PYTHIA calculations that include MPI effects. The forward and backward $ψ(2S)$ to $J/ψ$ ratio, which serves as a unique and powerful approach to study final-state effects on charmonium production, is found to be less dependent on the charged-particle multiplicity.
Submitted 5 September, 2024;
originally announced September 2024.
-
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Authors:
Jeong Hun Yeo,
Chae Won Kim,
Hyunjun Kim,
Hyeongseop Rha,
Seunghee Han,
Wen-Huang Cheng,
Yong Man Ro
Abstract:
Lip reading aims to predict spoken language by analyzing lip movements. Despite advancements in lip reading technologies, performance degrades when models are applied to unseen speakers due to their sensitivity to variations in visual information such as lip appearances. To address this challenge, speaker-adaptive lip reading technologies have advanced by focusing on effectively adapting a lip reading model to target speakers in the visual modality. The effectiveness of adapting language information of the target speaker, such as vocabulary choice, has not been explored in previous works. Moreover, existing datasets for speaker adaptation have limited vocabulary size and pose variations, limiting the validation of previous speaker-adaptive methods in real-world scenarios. To address these issues, we propose a novel speaker-adaptive lip reading method that adapts a pre-trained model to target speakers at both vision and language levels. Specifically, we integrate prompt tuning and the LoRA approach, applying them to a pre-trained lip reading model to effectively adapt the model to target speakers. In addition, to validate its effectiveness in real-world scenarios, we introduce a new dataset, VoxLRS-SA, derived from VoxCeleb2 and LRS3. It contains a vocabulary of approximately 100K words, offers diverse pose variations, and enables the validation of adaptation methods in wild, sentence-level lip reading for the first time. Through various experiments, we demonstrate that the existing speaker-adaptive method also improves performance in the wild at the sentence level. Moreover, we show that the proposed adaptation method achieves larger improvements when applied to the target speaker than previous works.
Submitted 2 September, 2024;
originally announced September 2024.
-
Incorporating General Contact Surfaces in the Kinematics of Tendon-Driven Rolling-Contact Joint Mechanisms
Authors:
Junhyoung Ha,
Chaewon Kim,
Chunwoo Kim
Abstract:
This paper presents the first kinematic modeling of tendon-driven rolling-contact joint mechanisms with general contact surfaces subject to external loads. We derived the kinematics as a set of recursive equations and developed efficient iterative algorithms to solve for both tendon force actuation and tendon displacement actuation. The configuration predictions of the kinematics were experimentally validated using a prototype mechanism. Our MATLAB implementation of the proposed kinematics is available at https://github.com/hjhdog1/RollingJoint.
Submitted 1 September, 2024;
originally announced September 2024.
-
Learning from Negative Samples in Generative Biomedical Entity Linking
Authors:
Chanhwi Kim,
Hyunjae Kim,
Sihyeon Park,
Jiwoo Lee,
Mujeen Sung,
Jaewoo Kang
Abstract:
Generative models have become widely used in biomedical entity linking (BioEL) due to their excellent performance and efficient memory usage. However, these models are usually trained only with positive samples--entities that match the input mention's identifier--and do not explicitly learn from hard negative samples, which are entities that look similar but have different meanings. To address this limitation, we introduce ANGEL (Learning from Negative Samples in Generative Biomedical Entity Linking), the first framework that trains generative BioEL models using negative samples. Specifically, a generative model is initially trained to generate positive samples from the knowledge base for given input entities. Subsequently, both correct and incorrect outputs are gathered from the model's top-k predictions. The model is then updated to prioritize the correct predictions through direct preference optimization. Our models fine-tuned with ANGEL outperform the previous best baseline models by up to an average top-1 accuracy of 1.4% on five benchmarks. When incorporating our framework into pre-training, the performance improvement further increases to 1.7%, demonstrating its effectiveness in both the pre-training and fine-tuning stages. Our code is available at https://github.com/dmis-lab/ANGEL.
Submitted 29 August, 2024;
originally announced August 2024.
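The preference-optimization step described in the ANGEL abstract above can be illustrated with a minimal sketch. It assumes the standard direct preference optimization (DPO) objective for a single (correct, incorrect) pair; the function names, the `beta` default, and the identifier strings in the usage note are hypothetical and are not taken from the authors' repository.

```python
import math


def build_pairs(top_k, gold_id):
    """Split top-k predictions into (correct, incorrect) name pairs.

    top_k is a list of (generated_name, identifier) tuples; a prediction
    counts as correct when its identifier matches the gold identifier.
    """
    correct = [name for name, cid in top_k if cid == gold_id]
    incorrect = [name for name, cid in top_k if cid != gold_id]
    return [(c, i) for c in correct for i in incorrect]


def dpo_loss(policy_logp_pos, policy_logp_neg, ref_logp_pos, ref_logp_neg, beta=0.1):
    """Standard DPO loss for one pair, given sequence log-probabilities.

    The margin measures how much more the trained model prefers the correct
    output over the incorrect one, relative to a frozen reference model.
    """
    margin = (policy_logp_pos - ref_logp_pos) - (policy_logp_neg - ref_logp_neg)
    x = beta * margin
    # -log(sigmoid(x)) == log(1 + exp(-x)); fall back to -x when exp would overflow
    return math.log1p(math.exp(-x)) if x > -30 else -x
```

For example, `build_pairs([("aspirin", "C1"), ("aspirin-like drug", "C2")], "C1")` yields one pair, and the loss falls below `log(2)` once the model assigns the correct name a higher relative log-probability than the incorrect one.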
-
Arkenstone -- II. A model for unresolved cool clouds entrained in galactic winds in cosmological simulations
Authors:
Matthew C. Smith,
Drummond B. Fielding,
Greg L. Bryan,
Jake S. Bennett,
Chang-Goo Kim,
Eve C. Ostriker,
Rachel S. Somerville
Abstract:
Arkenstone is a new scheme that allows multiphase, stellar feedback-driven winds to be included in coarse resolution cosmological simulations. The evolution of galactic winds and their subsequent impact on the circumgalactic medium are altered by exchanges of mass, energy, momentum, and metals between their component phases. These exchanges are governed by complex, small-scale physical processes that cannot be resolved in cosmological simulations. In this second presentation paper, we describe Arkenstone's novel cloud particle approach for modelling unresolvable cool clouds entrained in hot, fast winds. This general framework allows models of the cloud-wind interaction, derived from state-of-the-art high-resolution simulations, to be applied in a large-scale context. In this work, we adopt a cloud evolution model that captures simultaneous cloud mass loss to and gain from the ambient hot phase via turbulent mixing and radiative cooling, respectively. We demonstrate the scheme using non-cosmological idealized simulations of a galaxy with a realistic circumgalactic medium component, using the Arepo code. We show that the ability of a high-specific energy wind component to perform preventative feedback may be limited by heavy loading of cool clouds coupled into it. We demonstrate that the diverging evolution of clouds of initially differing masses leads to a complex velocity field for the cool phase and a cloud mass function that varies both spatially and temporally in a non-trivial manner. These latter two phenomena can manifest in the simulation because of our choice of a Lagrangian discretisation of the cloud population, in contrast to other proposed schemes. This is a Learning the Universe publication.
Submitted 27 August, 2024;
originally announced August 2024.
-
Measurement of inclusive jet cross section and substructure in $p$$+$$p$ collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
M. Alfred,
V. Andrieux,
S. Antsupov,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
N. S. Bandara,
B. Bannier,
E. Bannikov,
K. N. Barish,
S. Bathe
, et al. (422 additional authors not shown)
Abstract:
The jet cross-section and jet-substructure observables in $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV were measured by the PHENIX Collaboration at the Relativistic Heavy Ion Collider (RHIC). Jets are reconstructed from charged-particle tracks and electromagnetic-calorimeter clusters using the anti-$k_{t}$ algorithm with a jet radius $R=0.3$ for jets with transverse momentum within $8.0<p_T<40.0$ GeV/$c$ and pseudorapidity $|η|<0.15$. Measurements include the jet cross section, as well as distributions of SoftDrop-groomed momentum fraction ($z_g$), charged-particle transverse momentum with respect to jet axis ($j_T$), and radial distributions of charged particles within jets ($r$). Also measured was the distribution of $ξ=-\ln(z)$, where $z$ is the fraction of the jet momentum carried by the charged particle. The measurements are compared to theoretical next-to-leading-order and next-to-next-to-leading-order calculations, the PYTHIA event generator, and other existing experimental results. These measurements indicate a lower particle multiplicity in jets at RHIC energies when compared to models. Also noted are implications for future jet measurements with sPHENIX at RHIC as well as at the future Electron-Ion Collider.
Submitted 20 August, 2024;
originally announced August 2024.
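As a small worked illustration of the fragmentation variable in the abstract above, the sketch below evaluates $ξ=-\ln(z)$ for one charged particle. It assumes $z$ is computed as a simple ratio of particle momentum to jet momentum; the function name and the input validation are illustrative choices, not part of the PHENIX analysis code.

```python
import math


def fragmentation_xi(particle_momentum, jet_momentum):
    """Return xi = -ln(z), with z the fraction of jet momentum carried by the particle."""
    if particle_momentum <= 0 or jet_momentum <= 0:
        raise ValueError("momenta must be positive")
    z = particle_momentum / jet_momentum
    return -math.log(z)
```

A particle carrying the whole jet momentum has $z=1$ and $ξ=0$, while soft particles with small $z$ populate large $ξ$, which is why the $ξ$ distribution is sensitive to the particle multiplicity inside jets.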
-
DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction
Authors:
Hanjun Choi,
Hyunsung Kim,
Minho Lee,
Chang-Jo Kim,
Jinsung Yoon,
Sang-Ki Ko
Abstract:
Many spatiotemporal domains handle multi-agent trajectory data, but in real-world scenarios, collected trajectory data are often partially missing due to various reasons. While existing approaches demonstrate good performance in trajectory imputation, they face challenges in capturing the complex dynamics and interactions between agents due to a lack of physical constraints that govern realistic trajectories, leading to suboptimal results. To address this issue, the paper proposes a Derivative-Based Hybrid Prediction (DBHP) framework that can effectively impute multiple agents' missing trajectories. First, a neural network equipped with Set Transformers produces a naive prediction of missing trajectories while satisfying the permutation-equivariance in terms of the order of input agents. Then, the framework makes alternative predictions leveraging velocity and acceleration information and combines all the predictions with properly determined weights to provide final imputed trajectories. In this way, our proposed framework not only accurately predicts position, velocity, and acceleration values but also enforces the physical relationship between them, eventually improving both the accuracy and naturalness of the predicted trajectories. Accordingly, the experimental results on imputing player trajectories in team sports show that our framework significantly outperforms existing imputation baselines.
Submitted 22 August, 2024; v1 submitted 20 August, 2024;
originally announced August 2024.
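The hybrid idea in the DBHP abstract above, combining a direct position prediction with alternatives derived from velocity and acceleration, can be sketched for a single missing sample in one dimension. This is only an illustration under stated assumptions: the kinematic update rules and the fixed weights below are hypothetical stand-ins, since the abstract does not specify how the alternative predictions are formed or how the weights are determined.

```python
def hybrid_impute(pred_pos, pred_vel, pred_acc, last_pos, last_vel, dt,
                  weights=(0.5, 0.3, 0.2)):
    """Blend three estimates of one missing 1-D position sample.

    weights are (direct, velocity-based, acceleration-based) and must sum to 1;
    fixed values are an assumption made for this sketch.
    """
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    # Estimate 1: the position predicted directly by the model
    direct = pred_pos
    # Estimate 2: step from the last observed position using the predicted velocity
    from_velocity = last_pos + pred_vel * dt
    # Estimate 3: constant-acceleration step from the last observed position and velocity
    from_acceleration = last_pos + last_vel * dt + 0.5 * pred_acc * dt ** 2
    w_direct, w_vel, w_acc = weights
    return w_direct * direct + w_vel * from_velocity + w_acc * from_acceleration
```

When the three estimates agree, the blend returns that common value; when the direct prediction drifts, the derivative-based terms pull the result back toward a physically plausible step.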
-
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning
Authors:
Jiajie Li,
Garrett Skinner,
Gene Yang,
Brian R Quaranto,
Steven D Schwaitzberg,
Peter C W Kim,
Jinjun Xiong
Abstract:
Multimodal large language models (LLMs) have achieved notable success across various domains, while research in the medical field has largely focused on unimodal images. Meanwhile, current general-domain multimodal models for videos still lack the capabilities to understand and engage in conversations about surgical videos. One major contributing factor is the absence of datasets in the surgical field. In this paper, we create a new dataset, Surg-QA, consisting of 102,000 surgical video-instruction pairs, the largest of its kind so far. To build such a dataset, we propose a novel two-stage question-answer generation pipeline with an LLM to learn surgical knowledge in a structured manner from publicly available surgical lecture videos. The pipeline breaks down the generation process into two stages to significantly reduce the task complexity, allowing us to use a more affordable, locally deployed open-source LLM than the premium paid LLM services. It also mitigates the risk of LLM hallucinations during question-answer generation, thereby enhancing the overall quality of the generated data. We further train LLaVA-Surg, a novel vision-language conversational assistant capable of answering open-ended questions about surgical videos, on this Surg-QA dataset, and conduct comprehensive evaluations on zero-shot surgical video question-answering tasks. We show that LLaVA-Surg significantly outperforms all previous general-domain models, demonstrating exceptional multimodal conversational skills in answering open-ended questions about surgical videos. We will release our code, model, and the instruction-tuning dataset.
Submitted 15 August, 2024;
originally announced August 2024.
-
Coupling between electrons and charge density wave fluctuation and its possible role in superconductivity
Authors:
Yeonghoon Lee,
Yeahan Sur,
Sunghun Kim,
Jaehun Cha,
Jounghoon Hyun,
Chan-young Lim,
Makoto Hashimoto,
Donghui Lu,
Younsik Kim,
Soonsang Huh,
Changyoung Kim,
Shinichiro Ideta,
Kiyohisa Tanaka,
Kee Hoon Kim,
Yeongkwan Kim
Abstract:
In most charge density wave (CDW) systems of different material classes, ranging from traditional correlated systems in low dimensions to recent topological systems with Kagome lattice, superconductivity emerges when the system is driven toward the quantum critical point (QCP) of CDW via external parameters of doping and pressure. Despite this rather universal trend, the essential hinge between CDW and superconductivity has not been established yet. Here, the evidence of coupling between electron and CDW fluctuation is reported, based on a temperature- and intercalation-dependent kink in the angle-resolved photoemission spectra of 2H-PdxTaSe2. Kinks are observed only when the system is in the CDW phase, regardless of whether a long- or short-range order is established. Notably, the coupling strength is enhanced upon long-range CDW suppression, although the coupling energy scale is reduced. Interestingly, estimation of the superconducting critical temperature by incorporating the observed coupling characteristics into McMillan's equation yields results closely resembling the known values of the superconducting dome. Our results thus highlight a compelling possibility that this new coupling mediates Cooper pairs, which provides new insights into the competing relationship not only for CDW, but also for other competing orders.
Submitted 14 August, 2024;
originally announced August 2024.
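The critical-temperature estimate mentioned in the abstract above relies on McMillan's equation. The sketch below evaluates the original McMillan form, $T_c = (Θ/1.45)\exp[-1.04(1+λ)/(λ-μ^*(1+0.62λ))]$. It is an illustration only: the abstract does not say which energy scale or Coulomb pseudopotential the authors insert, so the parameter names, the `mu_star` default, and the numbers in the usage note are assumptions.

```python
import math


def mcmillan_tc(energy_scale_k, coupling_lambda, mu_star=0.1):
    """McMillan estimate of T_c in kelvin.

    energy_scale_k: characteristic boson energy scale expressed in kelvin
                    (the Debye temperature in McMillan's original formula).
    coupling_lambda: dimensionless coupling strength.
    mu_star: Coulomb pseudopotential (0.1 is a commonly used placeholder).
    """
    denominator = coupling_lambda - mu_star * (1.0 + 0.62 * coupling_lambda)
    if denominator <= 0:
        # The formula gives no superconducting solution in this regime
        return 0.0
    exponent = -1.04 * (1.0 + coupling_lambda) / denominator
    return (energy_scale_k / 1.45) * math.exp(exponent)
```

With a hypothetical energy scale of 300 K, raising the coupling from 0.5 to 1.0 increases the estimate substantially, which shows the competition the abstract describes: a stronger coupling raises $T_c$, while a reduced energy scale lowers the prefactor.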
-
OMR: Occlusion-Aware Memory-Based Refinement for Video Lane Detection
Authors:
Dongkwon Jin,
Chang-Su Kim
Abstract:
A novel algorithm for video lane detection is proposed in this paper. First, we extract a feature map for a current frame and detect a latent mask for obstacles occluding lanes. Then, we enhance the feature map by developing an occlusion-aware memory-based refinement (OMR) module. It takes the obstacle mask and feature map from the current frame, previous output, and memory information as input, and processes them recursively in a video. Moreover, we apply a novel data augmentation scheme for training the OMR module effectively. Experimental results show that the proposed algorithm outperforms existing techniques on video lane datasets. Our codes are available at https://github.com/dongkwonjin/OMR.
Submitted 14 August, 2024;
originally announced August 2024.
-
Infant Type Ia Supernovae from the KMTNet I. Multi-Color Evolution and Populations
Authors:
Yuan Qi Ni,
Dae-Sik Moon,
Maria R. Drout,
Youngdae Lee,
Patrick Sandoval,
Jeehye Shin,
Hong Soo Park,
Sang Chul Kim,
Kyuseok Oh
Abstract:
We conduct a systematic analysis of the early multi-band light curves and colors of 19 Type Ia Supernovae (SNe) from the Korea Microlensing Telescope Network SN Program, including 16 previously unpublished events. Seven are detected $\lesssim$ 1 day since the estimated epoch of first light and the rest within $\lesssim$ 3 days. Some show excess emission within $<$ 0.5 days to $\sim$ 2 days, but most show pure power-law rises. The colors are initially diverse before $\sim$ 5 days, but converge to a similar color at $\sim$ 10 days. We identify at least three populations based on 2--5-day color evolution: (1) "early-blues" exhibit slowly-evolving colors consistent with a $\sim$ 17,000 K blackbody; (2) "early-reds" have initially blue $B-V$ and red $V-i$ colors that cannot simultaneously be fit with a blackbody -- likely due to suppression of $B$- and $i$-band flux by Fe II/III and Ca II -- and evolve more rapidly; and (3) "early-yellows" evolve blueward, consistent with thermal heating from $\sim$ 8,000 to 13,000 K. The distributions of early-blue and early-red colors are compatible with them being either distinct populations -- with early-reds comprising (60 $\pm$ 15)% of them -- or extreme ends of one continuous population; whereas the early-yellow population identified here is clearly distinct. Compared to the other populations, early-blues in our sample differ by exhibiting excess emission within 1--2 days, nearly constant peak brightness regardless of $ΔM_{15}(B)$ after standardization, and shallower Si II features. Early-blues also prefer star-forming host environments, while early-yellows and, to a lesser extent, early-reds prefer quiescent ones. These preferences appear to indicate at least two Type Ia SN production channels based on stellar population age, while early-reds and early-blues may still share a common origin.
Submitted 12 August, 2024;
originally announced August 2024.
-
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Authors:
Utkarsh Nath,
Rajeev Goel,
Eun Som Jeon,
Changhoon Kim,
Kyle Min,
Yezhou Yang,
Yingzhen Yang,
Pavan Turaga
Abstract:
To address the data scarcity associated with 3D assets, 2D-lifting techniques such as Score Distillation Sampling (SDS) have become a widely adopted practice in text-to-3D generation pipelines. However, the diffusion models used in these techniques are prone to viewpoint bias and thus lead to geometric inconsistencies such as the Janus problem. To counter this, we introduce MT3D, a text-to-3D generative model that leverages a high-fidelity 3D object to overcome viewpoint bias and explicitly infuse geometric understanding into the generation pipeline. Firstly, we employ depth maps derived from a high-quality 3D model as control signals to guarantee that the generated 2D images preserve the fundamental shape and structure, thereby reducing the inherent viewpoint bias. Next, we utilize deep geometric moments to ensure geometric consistency in the 3D representation explicitly. By incorporating geometric details from a 3D asset, MT3D enables the creation of diverse and geometrically consistent objects, thereby improving the quality and usability of our 3D representations.
Submitted 12 August, 2024;
originally announced August 2024.
-
Better Not to Propagate: Understanding Edge Uncertainty and Over-smoothing in Signed Graph Neural Networks
Authors:
Yoonhyuk Choi,
Jiho Choi,
Taewook Ko,
Chong-Kwon Kim
Abstract:
Traditional Graph Neural Networks (GNNs) rely on network homophily, which can lead to performance degradation due to over-smoothing in many real-world heterophily scenarios. Recent studies analyze the smoothing effect (separability) after message-passing (MP), depending on the expectation of node features. Regarding separability gain, they provided theoretical backgrounds on over-smoothing caused by various propagation schemes, including positive, signed, and blocked MPs. More recently, by extending these theorems, some works have suggested improvements in signed propagation under multiple classes. However, prior works assume that the error ratio of all propagation schemes is fixed, failing to investigate this phenomenon correctly. To solve this problem, we propose a novel method for estimating homophily and edge error ratio, integrated with dynamic selection between blocked and signed propagation during training. Our theoretical analysis, supported by extensive experiments, demonstrates that blocking MP can be more effective than signed propagation under high edge error ratios, improving the performance in both homophilic and heterophilic graphs.
Submitted 25 August, 2024; v1 submitted 9 August, 2024;
originally announced August 2024.
-
Transformer Explainer: Interactive Learning of Text-Generative Models
Authors:
Aeree Cho,
Grace C. Kim,
Alexander Karpekov,
Alec Helbling,
Zijie J. Wang,
Seongmin Lee,
Benjamin Hoover,
Duen Horng Chau
Abstract:
Transformers have revolutionized machine learning, yet their inner workings remain opaque to many. We present Transformer Explainer, an interactive visualization tool designed for non-experts to learn about Transformers through the GPT-2 model. Our tool helps users understand complex Transformer concepts by integrating a model overview and enabling smooth transitions across abstraction levels of mathematical operations and model structures. It runs a live GPT-2 instance locally in the user's browser, empowering users to experiment with their own input and observe in real-time how the internal components and parameters of the Transformer work together to predict the next tokens. Our tool requires no installation or special hardware, broadening the public's education access to modern generative AI techniques. Our open-sourced tool is available at https://poloclub.github.io/transformer-explainer/. A video demo is available at https://youtu.be/ECR4oAwocjs.
Submitted 8 August, 2024;
originally announced August 2024.
-
Emergent quantum disordered phase in Na$_2$Co$_2$TeO$_6$ under intermediate magnetic field along $c$ axis
Authors:
Xu-Guang Zhou,
Han Li,
Chaebin Kim,
Akira Matsuo,
Kavita Mehlawat,
Kazuki Matsui,
Zhuo Yang,
Atsuhiko Miyata,
Gang Su,
Koichi Kindo,
Je-Geun Park,
Yoshimitsu Kohama,
Wei Li,
Yasuhiro H. Matsuda
Abstract:
Identifying the exotic quantum spin liquid phase in Kitaev magnets has garnered great research interest and remains a significant challenge. In experiments, most of the proposed candidate materials exhibit an antiferromagnetic (AFM) order at low temperatures, thus the challenge becomes the search for a field-driven disordered phase that is distinct from the partially polarized paramagnetic phase after suppressing the AFM order. Recently, Na$_2$Co$_2$TeO$_6$ has been proposed as one of the prime candidates, where the Kitaev interaction is realized by the high-spin $t^{5}_{2g}e^2_g$ configuration, and spin-orbit entangled $J_{\rm eff} = 1/2$ state in a bond-edge shared honeycomb lattice. In this study, we identify an emergent intermediate disordered phase induced by an external field along the $c$-axis of the honeycomb plane. This phase is characterized through magnetization and magnetocaloric effect experiments in high magnetic fields. To explain the experimental results, we propose an effective spin model with large AFM Kitaev interaction, which yields results in good agreement with both our findings and previously reported data. We determine that the effective $K$-$J$-$Γ$-$Γ'$ model for Na$_2$Co$_2$TeO$_6$ is nearly dual to that of $α$-RuCl$_3$ under a unitary transformation. Given the insignificant fragility of the Na$_2$Co$_2$TeO$_6$ sample, further high-field experiments can be conducted to explore this intermediate-field quantum spin disordered phase.
Submitted 4 August, 2024;
originally announced August 2024.
-
The Llama 3 Herd of Models
Authors:
Abhimanyu Dubey,
Abhinav Jauhri,
Abhinav Pandey,
Abhishek Kadian,
Ahmad Al-Dahle,
Aiesha Letman,
Akhil Mathur,
Alan Schelten,
Amy Yang,
Angela Fan,
Anirudh Goyal,
Anthony Hartshorn,
Aobo Yang,
Archi Mitra,
Archie Sravankumar,
Artem Korenev,
Arthur Hinsvark,
Arun Rao,
Aston Zhang,
Aurelien Rodriguez,
Austen Gregerson,
Ava Spataru,
Baptiste Roziere,
Bethany Biron,
Binh Tang
, et al. (510 additional authors not shown)
Abstract:
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.
Submitted 15 August, 2024; v1 submitted 31 July, 2024;
originally announced July 2024.
-
Multi-Site Class-Incremental Learning with Weighted Experts in Echocardiography
Authors:
Kit M. Bransby,
Woo-jin Cho Kim,
Jorge Oliveira,
Alex Thorley,
Arian Beqiri,
Alberto Gomez,
Agisilaos Chartsias
Abstract:
Building an echocardiography view classifier that maintains performance in real-life cases requires diverse multi-site data, and frequent updates with newly available data to mitigate model drift. Simply fine-tuning on new datasets results in "catastrophic forgetting", and cannot adapt to variations of view labels between sites. Alternatively, collecting all data on a single server and re-training may not be feasible as data sharing agreements may restrict image transfer, or datasets may only become available at different times. Furthermore, time and cost associated with re-training grows with every new dataset. We propose a class-incremental learning method which learns an expert network for each dataset, and combines all expert networks with a score fusion model. The influence of "unqualified experts" is minimised by weighting each contribution with a learnt in-distribution score. These weights promote transparency as the contribution of each expert is known during inference. Instead of using the original images, we use learned features from each dataset, which are easier to share and raise fewer licensing and privacy concerns. We validate our work on six datasets from multiple sites, demonstrating significant reductions in training time while improving view classification performance.
Submitted 31 July, 2024;
originally announced July 2024.
-
Hecke Equivariance of Divisor Lifting with respect to Sesquiharmonic Maass Forms
Authors:
Daeyeol Jeon,
Soon-Yi Kang,
Chang Heon Kim
Abstract:
We investigate the properties of Hecke operators for sesquiharmonic Maass forms. We begin by proving Hecke equivariance of the divisor lifting with respect to sesquiharmonic Maass functions, which maps an integral weight meromorphic modular form to the holomorphic part of the Fourier expansion of a weight 2 sesquiharmonic Maass form. Using this Hecke equivariance, we show that the sesquiharmonic Maass functions, whose images under the hyperbolic Laplace operator are the Faber polynomials $J_n$ of the $j$-function, form a Hecke system analogous to $J_n$. By combining the Hecke equivariance of the divisor lifting with that of the Borcherds isomorphism, we extend Matsusaka's finding on the twisted traces of sesquiharmonic Maass functions.
Submitted 31 July, 2024;
originally announced July 2024.
-
Determination of $|V_{ub}|$ from simultaneous measurements of untagged $B^0\toπ^- \ell^+ ν_{\ell}$ and $B^+\toρ^0 \ell^+ν_{\ell}$ decays
Authors:
Belle II Collaboration,
I. Adachi,
L. Aggarwal,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Althubiti,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
M. Bauer,
A. Baur,
A. Beaubien
, et al. (395 additional authors not shown)
Abstract:
We present a measurement of $|V_{ub}|$ from a simultaneous study of the charmless semileptonic decays $B^0\toπ^- \ell^+ ν_{\ell}$ and $B^+\toρ^0 \ell^+ν_{\ell}$, where $\ell = e, μ$. This measurement uses a data sample of 387 million $B\overline{B}$ meson pairs recorded by the Belle II detector at the SuperKEKB electron-positron collider between 2019 and 2022. The two decays are reconstructed without identifying the partner $B$ mesons. We simultaneously measure the differential branching fractions of $B^0\toπ^- \ell^+ ν_{\ell}$ and $B^+\toρ^0 \ell^+ν_{\ell}$ decays as functions of $q^2$ (momentum transfer squared). From these, we obtain total branching fractions $B(B^0\toπ^- \ell^+ ν_{\ell}) = (1.516 \pm 0.042 (\mathrm{stat}) \pm 0.059 (\mathrm{syst})) \times 10^{-4}$ and $B(B^+\toρ^0 \ell^+ν_{\ell}) = (1.625 \pm 0.079 (\mathrm{stat}) \pm 0.180 (\mathrm{syst})) \times 10^{-4}$. By fitting the measured $B^0\toπ^- \ell^+ ν_{\ell}$ partial branching fractions as functions of $q^2$, together with constraints on the non-perturbative hadronic contribution from lattice QCD calculations, we obtain $|V_{ub}|$ = $(3.93 \pm 0.09 \pm 0.13 \pm 0.19) \times 10^{-3}$. Here, the first uncertainty is statistical, the second is systematic, and the third is theoretical.
Submitted 24 July, 2024;
originally announced July 2024.
-
CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation
Authors:
Hajin Shim,
Changhun Kim,
Eunho Yang
Abstract:
3D point clouds captured from real-world sensors frequently encompass noisy points due to various obstacles, such as occlusion, limited resolution, and variations in scale. These challenges hinder the deployment of pre-trained point cloud recognition models trained on clean point clouds, leading to significant performance degradation. While test-time adaptation (TTA) strategies have shown promising results on this issue in the 2D domain, their application to 3D point clouds remains under-explored. Among TTA methods, an input adaptation approach, which directly converts test instances to the source domain using a pre-trained diffusion model, has been proposed in the 2D domain. Despite its robust TTA performance in practical situations, naively adopting this into the 3D domain may be suboptimal due to the neglect of inherent properties of point clouds, and its prohibitive computational cost. Motivated by these limitations, we propose CloudFixer, a test-time input adaptation method tailored for 3D point clouds, employing a pre-trained diffusion model. Specifically, CloudFixer optimizes geometric transformation parameters with carefully designed objectives that leverage the geometric properties of point clouds. We also substantially improve computational efficiency by avoiding backpropagation through the diffusion model and a prohibitive generation process. Furthermore, we propose an online model adaptation strategy by aligning the original model prediction with that of the adapted input. Extensive experiments showcase the superiority of CloudFixer over various TTA baselines, excelling in handling common corruptions and natural distribution shifts across diverse real-world scenarios. Our code is available at https://github.com/shimazing/CloudFixer
Submitted 23 July, 2024;
originally announced July 2024.
-
Reinforcement Learning Optimizes Power Dispatch in Decentralized Power Grid
Authors:
Yongsun Lee,
Hoyun Choi,
Laurent Pagnier,
Cook Hyun Kim,
Jongshin Lee,
Bukyoung Jhun,
Heetae Kim,
Juergen Kurths,
B. Kahng
Abstract:
Effective frequency control in power grids has become increasingly important with the increasing demand for renewable energy sources. Here, we propose a novel strategy for resolving this challenge using graph convolutional proximal policy optimization (GC-PPO). The GC-PPO method can optimally determine how much power individual buses dispatch to reduce frequency fluctuations across a power grid. We demonstrate its efficacy in controlling disturbances by applying the GC-PPO to the power grid of the UK. The performance of GC-PPO is outstanding compared to the classical methods. This result highlights the promising role of GC-PPO in enhancing the stability and reliability of power systems by switching lines or decentralizing grid topology.
Submitted 21 July, 2024;
originally announced July 2024.
-
When Qualitative Research Meets Large Language Model: Exploring the Potential of QualiGPT as a Tool for Qualitative Coding
Authors:
He Zhang,
Chuhao Wu,
Jingyi Xie,
Fiona Rubino,
Sydney Graver,
ChanMin Kim,
John M. Carroll,
Jie Cai
Abstract:
Qualitative research, renowned for its in-depth exploration of complex phenomena, often involves time-intensive analysis, particularly during the coding stage. Existing software for qualitative evaluation frequently lacks automatic coding capabilities, user-friendliness, and cost-effectiveness. The advent of Large Language Models (LLMs) like GPT-3 and its successors marks a transformative era for enhancing qualitative analysis. This paper introduces QualiGPT, a tool developed to address the challenges associated with using ChatGPT for qualitative analysis. Through a comparative analysis of traditional manual coding and QualiGPT's performance on both simulated and real datasets, incorporating both inductive and deductive coding approaches, we demonstrate that QualiGPT significantly improves the qualitative analysis process. Our findings show that QualiGPT enhances efficiency, transparency, and accessibility in qualitative coding. The tool's performance was evaluated using inter-rater reliability (IRR) measures, with results indicating substantial agreement between human coders and QualiGPT in various coding scenarios. In addition, we also discuss the implications of integrating AI into qualitative research workflows and outline future directions for enhancing human-AI collaboration in this field.
Submitted 20 July, 2024;
originally announced July 2024.
-
Pumping Iron: How turbulent metal diffusion impacts multiphase galactic outflows
Authors:
Ulrich P. Steinwandel,
Douglas Rennehan,
Matthew E. Orr,
Drummond B. Fielding,
Chang-Goo Kim
Abstract:
Most numerical simulations of galaxy formation and evolution are unable to properly resolve the turbulent cascade at or below the resolution scale and turbulence models are required to capture the motion of eddies on those unresolved scales. In this study, we investigate the impact of turbulent metal diffusion models on multiphase outflows originating from dwarf galaxies ($M_{\rm halo} \sim 10^{10} - 10^{11}$ M$_\odot$). We use our state-of-the-art numerical model for the formation of single stars and non-equilibrium cooling and hydrogen chemistry. Our simulations are carried out at a mass resolution of $\sim$1 M$_{\odot}$, where the individual supernova explosions are resolved in terms of hot-phase generation and momentum input. We find that mass, energy, and metal loading factors are only weakly affected by the inclusion of a metal diffusion model. The metal enrichment factor at low altitude above the galactic disk is higher by around 20 per cent when the metal diffusion model is included. Specifically, we find more efficient cooling in the cold interstellar medium, as higher amounts of metals are kept in the cold dense phase. The most striking effect of the metal diffusion model is that, without metal diffusion, there is more rapid cooling in the hot phase and a reduced sound speed by a factor of two. Specifically, we find that the hot phase is more metal enriched in the case without metal diffusion leading to more rapid (over) cooling of that phase which is consistent with the higher sound speed we find in the runs with metal diffusion.
Submitted 19 July, 2024;
originally announced July 2024.
-
Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme
Authors:
Jintae Kim,
Seungwon Yang,
Seong-Gyun Jeong,
Chang-Su Kim
Abstract:
A novel algorithm for face obfuscation, called Forbes, which aims to obfuscate facial appearance recognizable by humans but preserve the identity and attributes decipherable by machines, is proposed in this paper. Forbes first applies multiple obfuscating transformations with random parameters to an image to remove the identity information distinguishable by humans. Then, it optimizes the parameters to make the transformed image decipherable by machines based on the backpropagation refinement scheme. Finally, it renders an obfuscated image by applying the transformations with the optimized parameters. Experimental results on various datasets demonstrate that Forbes achieves both human indecipherability and machine decipherability excellently. The source codes are available at https://github.com/mcljtkim/Forbes.
Submitted 19 July, 2024;
originally announced July 2024.
-
The Local Group L-Band Survey: The First Measurements of Localized Cold Neutral Medium Properties in the Low-Metallicity Dwarf Galaxy NGC 6822
Authors:
Nickolas M. Pingel,
Hongxing Chen,
Snežana Stanimirović,
Eric W. Koch,
Adam K. Leroy,
Erik Rosolowsky,
Chang-Goo Kim,
Julianne J. Dalcanton,
Fabian Walter,
Michael P. Busch,
Ryan Chown,
Jennifer Donovan Meyer,
Cosima Eibensteiner,
Deidre A. Hunter,
Sumit K. Sarbadhicary,
Elizabeth Tarantino,
Vicente Villanueva,
Thomas G. Williams
Abstract:
Measuring the properties of the cold neutral medium (CNM) in low-metallicity galaxies provides insight into heating and cooling mechanisms in early Universe-like environments. We report detections of two localized atomic neutral hydrogen (HI) absorption features in NGC 6822, a low-metallicity (0.2 Z$_{\odot}$) dwarf galaxy in the Local Group. These are the first unambiguous CNM detections in a low-metallicity dwarf galaxy outside the Magellanic Clouds. The Local Group L-Band Survey (LGLBS) enabled these detections due to its high spatial (15 pc for HI emission) and spectral (0.4 km s$^{-1}$) resolution. We introduce LGLBS and describe a custom pipeline to search for HI absorption at high angular resolution and extract associated HI emission. A detailed Gaussian decomposition and radiative transfer analysis of the NGC 6822 detections reveals five CNM components, with key properties: a mean spin temperature of 32$\pm$6 K, a mean CNM column density of 3.1$\times$10$^{20}$ cm$^{-2}$, and CNM mass fractions of 0.33 and 0.12 for the two sightlines. Stacking non-detections does not reveal low-level signals below our median optical depth sensitivity of 0.05. One detection intercepts a star-forming region, with the HI absorption profile encompassing the CO (2$-$1) emission, indicating coincident molecular gas and a depression in high-resolution HI emission. We also analyze a nearby sightline with deep, narrow HI self-absorption dips, where the background warm neutral medium is attenuated by intervening CNM. The association of CNM, CO, and H$α$ emissions suggests a close link between the colder, denser HI phase and star formation in NGC 6822.
Submitted 18 July, 2024;
originally announced July 2024.
-
Long-Term 3D Point Tracking By Cost Volume Fusion
Authors:
Hung Nguyen,
Chanho Kim,
Rigved Naukarkar,
Li Fuxin
Abstract:
Long-term point tracking is essential to understand non-rigid motion in the physical world better. Deep learning approaches have recently been incorporated into long-term point tracking, but most prior work predominantly functions in 2D. Although these methods benefit from the well-established backbones and matching frameworks, the motions they produce do not always make sense in the 3D physical world. In this paper, we propose the first deep learning framework for long-term point tracking in 3D that generalizes to new points and videos without requiring test-time fine-tuning. Our model contains a cost volume fusion module that effectively integrates multiple past appearances and motion information via a transformer architecture, significantly enhancing overall tracking performance. In terms of 3D tracking performance, our model significantly outperforms simple scene flow chaining and previous 2D point tracking methods, even if one uses ground truth depth and camera pose to backproject 2D point tracks in a synthetic scenario.
Submitted 18 July, 2024;
originally announced July 2024.
-
Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run
Authors:
Gayathri Raman,
Samuele Ronchini,
James Delaunay,
Aaron Tohuvavohu,
Jamie A. Kennea,
Tyler Parsotan,
Elena Ambrosi,
Maria Grazia Bernardini,
Sergio Campana,
Giancarlo Cusumano,
Antonino D'Ai,
Paolo D'Avanzo,
Valerio D'Elia,
Massimiliano De Pasquale,
Simone Dichiara,
Phil Evans,
Dieter Hartmann,
Paul Kuin,
Andrea Melandri,
Paul O'Brien,
Julian P. Osborne,
Kim Page,
David M. Palmer,
Boris Sbarufatti,
Gianpiero Tagliaferri
, et al. (1797 additional authors not shown)
Abstract:
We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalog (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum-likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW-BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers.
Submitted 13 July, 2024;
originally announced July 2024.
-
The Future of Learning: Large Language Models through the Lens of Students
Authors:
He Zhang,
Jingyi Xie,
Chuhao Wu,
Jie Cai,
ChanMin Kim,
John M. Carroll
Abstract:
As Large-Scale Language Models (LLMs) continue to evolve, they demonstrate significant enhancements in performance and an expansion of functionalities, impacting various domains, including education. In this study, we conducted interviews with 14 students to explore their everyday interactions with ChatGPT. Our preliminary findings reveal that students grapple with the dilemma of utilizing ChatGPT's efficiency for learning and information seeking, while simultaneously experiencing a crisis of trust and ethical concerns regarding the outcomes and broader impacts of ChatGPT. The students perceive ChatGPT as being more "human-like" compared to traditional AI. This dilemma, characterized by mixed emotions, inconsistent behaviors, and an overall positive attitude towards ChatGPT, underscores its potential for beneficial applications in education and learning. However, we argue that despite its human-like qualities, the advanced capabilities of such intelligence might lead to adverse consequences. Therefore, it's imperative to approach its application cautiously and strive to mitigate potential harms in future developments.
Submitted 17 July, 2024;
originally announced July 2024.
-
Development of MMC-based lithium molybdate cryogenic calorimeters for AMoRE-II
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
H. Bae,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
S. Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Y. M. Gavrilyuk,
A. M. Gezhaev
, et al. (84 additional authors not shown)
Abstract:
The AMoRE collaboration searches for neutrinoless double beta decay of $^{100}$Mo using molybdate scintillating crystals via low temperature thermal calorimetric detection. The early phases of the experiment, AMoRE-pilot and AMoRE-I, have demonstrated competitive discovery potential. Presently, the AMoRE-II experiment, featuring a large detector array with about 90 kg of $^{100}$Mo isotope, is under construction. This paper discusses the baseline design and characterization of the lithium molybdate cryogenic calorimeters to be used in the AMoRE-II detector modules. The results from prototype setups that incorporate new housing structures and two different crystal masses (316 g and 517 - 521 g), operated at 10 mK temperature, show energy resolutions (FWHM) of 7.55 - 8.82 keV at the 2.615 MeV $^{208}$Tl $γ$ line, and effective light detection of 0.79 - 0.96 keV/MeV. The simultaneous heat and light detection enables clear separation of alpha particles with a discrimination power of 12.37 - 19.50 at the energy region around $^6$Li(n, $α$)$^3$H with Q-value = 4.785 MeV. Promising detector performances were demonstrated at temperatures as high as 30 mK, which relaxes the temperature constraints for operating the large AMoRE-II array.
Submitted 16 July, 2024;
originally announced July 2024.
-
SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction
Authors:
Shuang Li,
Yibing Wang,
Jian Gao,
Chulhong Kim,
Seongwook Choi,
Yu Zhang,
Qian Chen,
Yao Yao,
Changhui Li
Abstract:
High-quality 3D photoacoustic imaging (PAI) reconstruction under sparse view or limited view has long been challenging. Traditional 3D iterative-based reconstruction methods suffer from both slow speed and high memory consumption. Recently, in computer graphics, the differentiable rendering has made significant progress, particularly with the rise of 3D Gaussian Splatting. Inspired by these, we introduce differentiable radiation into PAI, developing a novel reconstruction algorithm: the Sliding Ball Adaptive Growth algorithm (SlingBAG) for 3D PAI, which shows ability in high-quality 3D PAI reconstruction both under extremely sparse view and limited view.
We establish a point cloud dataset for PAI and use a unique differentiable rapid radiator based on a spherical decomposition strategy, together with a randomly initialized point cloud that is adaptively optimized according to sparse sensor data. Each point undergoes updates in 3D coordinates, initial pressure, and resolution (denoted by the radius of the ball). Points undergo adaptive growth during the iterative process, including point destroying, splitting, and duplicating along the gradient of their positions, manifesting the sliding ball effect.
Finally, our point cloud to voxel grid shader renders the final reconstruction results. Simulation and in vivo experiments demonstrate that our SlingBAG reconstruction result's SNR can be more than 40 dB under extremely sparse view, while the SNR of traditional back-projection algorithm's result is less than 20 dB. Moreover, the result of SlingBAG's structural similarity to the ground truth is significantly higher, with an SSIM value of 95.6%.
Notably, our differentiable rapid radiator can conduct forward PA simulation in homogeneous, non-viscous media substantially faster than current methods that numerically simulate the wave propagation, such as k-Wave. The dataset and all code will be open source.
Submitted 16 July, 2024;
originally announced July 2024.
-
Observation of Aerosolization-induced Morphological Changes in Viral Capsids
Authors:
Abhishek Mall,
Anna Munke,
Zhou Shen,
Parichita Mazumder,
Johan Bielecki,
Juncheng E,
Armando Estillore,
Chan Kim,
Romain Letrun,
Jannik Lübke,
Safi Rafie-Zinedine,
Adam Round,
Ekaterina Round,
Michael Rütten,
Amit K. Samanta,
Abhisakh Sarma,
Tokushi Sato,
Florian Schulz,
Carolin Seuring,
Tamme Wollweber,
Lena Worbs,
Patrik Vagovic,
Richard Bean,
Adrian P. Mancuso,
Ne-Te Duane Loh
, et al. (5 additional authors not shown)
Abstract:
Single-stranded RNA viruses co-assemble their capsid with the genome and variations in capsid structures can have significant functional relevance. In particular, viruses need to respond to a dehydrating environment to prevent genomic degradation and remain active upon rehydration. Theoretical work has predicted low-energy buckling transitions in icosahedral capsids which could protect the virus from further dehydration. However, there has been no direct experimental evidence, nor molecular mechanism, for such behaviour. Here we observe this transition using X-ray single particle imaging of MS2 bacteriophages after aerosolization. Using a combination of machine learning tools, we classify hundreds of thousands of single particle diffraction patterns to learn the structural landscape of the capsid morphology as a function of time spent in the aerosol phase. We found a previously unreported compact conformation as well as intermediate structures which suggest an incoherent buckling transition which does not preserve icosahedral symmetry. Finally, we propose a mechanism of this buckling, where a single 19-residue loop is destabilised, leading to the large observed morphology change. Our results provide experimental evidence for a mechanism by which viral capsids protect themselves from dehydration. In the process, these findings also demonstrate the power of single particle X-ray imaging and machine learning methods in studying biomolecular structural dynamics.
Submitted 16 July, 2024;
originally announced July 2024.
-
AdapTable: Test-Time Adaptation for Tabular Data via Shift-Aware Uncertainty Calibrator and Label Distribution Handler
Authors:
Changhun Kim,
Taewon Kim,
Seungyeon Woo,
June Yong Yang,
Eunho Yang
Abstract:
In real-world scenarios, tabular data often suffer from distribution shifts that threaten the performance of machine learning models. Despite its prevalence and importance, handling distribution shifts in the tabular domain remains underexplored due to the inherent challenges within the tabular data itself. In this sense, test-time adaptation (TTA) offers a promising solution by adapting models to target data without accessing source data, crucial for privacy-sensitive tabular domains. However, existing TTA methods either 1) overlook the nature of tabular distribution shifts, often involving label distribution shifts, or 2) impose architectural constraints on the model, leading to a lack of applicability. To this end, we propose AdapTable, a novel TTA framework for tabular data. AdapTable operates in two stages: 1) calibrating model predictions using a shift-aware uncertainty calibrator, and 2) adjusting these predictions to match the target label distribution with a label distribution handler. We validate the effectiveness of AdapTable through theoretical analysis and extensive experiments on various distribution shift scenarios. Our results demonstrate AdapTable's ability to handle various real-world distribution shifts, achieving up to a 16% improvement on the HELOC dataset.
Submitted 26 August, 2024; v1 submitted 15 July, 2024;
originally announced July 2024.
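The second stage described in the abstract, adjusting predictions to match a target label distribution, can be illustrated with a standard prior-shift correction. This is only a minimal sketch under stated assumptions: it is not AdapTable's label distribution handler, the function name `adjust_to_target_prior` is hypothetical, and the source and target class priors are assumed to be known or estimated elsewhere.

```python
import numpy as np

def adjust_to_target_prior(probs, source_prior, target_prior):
    """Reweight class probabilities by the ratio of target to source class priors.

    probs:        (n_samples, n_classes) predicted probabilities from the source model
    source_prior: (n_classes,) class frequencies assumed for the training data
    target_prior: (n_classes,) class frequencies assumed for the test data
    Hypothetical helper for illustration; generic prior-shift correction only.
    """
    probs = np.asarray(probs, dtype=float)
    weights = np.asarray(target_prior, dtype=float) / np.asarray(source_prior, dtype=float)
    adjusted = probs * weights
    # Renormalize each row so it is again a probability distribution.
    return adjusted / adjusted.sum(axis=1, keepdims=True)
```

With a uniform source prior, an uninformative prediction is pulled to the target prior itself, which is the behaviour one would expect from this kind of correction.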
-
Measurement of $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays at Belle II
Authors:
Belle II Collaboration,
I. Adachi,
L. Aggarwal,
H. Ahmed,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
A. Baur,
A. Beaubien,
F. Becherer
, et al. (414 additional authors not shown)
Abstract:
We report measurements of time-dependent $CP$ asymmetries in $B^0 \to K^0_S π^0 γ$ decays based on a data sample of $(388\pm6)\times10^6$ $B\bar{B}$ events collected at the $Υ(4S)$ resonance with the Belle II detector. The Belle II experiment operates at the SuperKEKB asymmetric-energy $e^+e^-$ collider. We measure decay-time distributions to determine $CP$-violating parameters $S$ and $C$. We determine these parameters for two ranges of $K^0_S π^0$ invariant mass: $m(K^0_S π^0)\in (0.8, 1.0)$ GeV/$c^2$, which is dominated by $B^0 \to K^{*0} (\to K^0_S π^0) γ$ decays, and a complementary region $m(K^0_S π^0)\in (0.6, 0.8)\cup(1.0, 1.8)$ GeV/$c^2$. Our results have improved precision as compared to previous measurements and are consistent with theory predictions.
Submitted 12 July, 2024;
originally announced July 2024.
-
Measurement of branching fractions, CP asymmetry, and isospin asymmetry for $\boldsymbol{B\rightarrowργ}$ decays using Belle and Belle II data
Authors:
Belle II Collaboration,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
A. Baur,
A. Beaubien,
F. Becherer
, et al. (385 additional authors not shown)
Abstract:
We present measurements of $B^{+}\rightarrowρ^{+}γ$ and $B^{0}\rightarrowρ^{0}γ$ decays using a combined data sample of $772 \times 10^6$ $B\overline{B}$ pairs collected by the Belle experiment and $387\times 10^6$ $B\overline{B}$ pairs collected by the Belle II experiment in $e^{+}e^{-}$ collisions at the $Υ(4S)$ resonance. After an optimized selection, a simultaneous fit to the Belle and Belle II data sets yields $114\pm 12$ $B^{+}\rightarrowρ^{+}γ$ and $99\pm 12$ $B^{0}\rightarrowρ^{0}γ$ decays. The measured branching fractions are $(13.1^{+2.0 +1.3}_{-1.9 -1.2})\times 10^{-7}$ and $(7.5\pm 1.3^{+1.0}_{-0.8})\times 10^{-7}$ for $B^{+}\rightarrowρ^{+}γ$ and $B^{0}\rightarrowρ^{0}γ$ decays, respectively, where the first uncertainty is statistical and the second is systematic. We also measure the isospin asymmetry $A_{\rm I}(B\rightarrowργ)=(10.9^{+11.2 +7.8}_{-11.7 -7.3})\%$ and the direct CP asymmetry $A_{CP}(B^{+}\rightarrowρ^{+}γ)=(-8.2\pm 15.2^{+1.6}_{-1.2})\%$.
Submitted 12 July, 2024;
originally announced July 2024.
-
Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
B. Bannier,
K. N. Barish,
B. Bassalleck,
S. Bathe
, et al. (377 additional authors not shown)
Abstract:
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability $α$, and the Lévy-scale parameter $R$ as a function of transverse mass $m_T$ and centrality. The $λ(m_T)$ parameter is constant at larger values of $m_T$, but decreases as $m_T$ decreases. The Lévy scale parameter $R(m_T)$ decreases with $m_T$ and exhibits proportionality to the length scale of the nuclear overlap region. The Lévy exponent $α(m_T)$ is independent of $m_T$ within uncertainties in each investigated centrality bin, but shows a clear centrality dependence. At all centralities, the Lévy exponent $α$ is significantly different from that of Gaussian ($α=2$) or Cauchy ($α=1$) source distributions. Comparisons to the predictions of Monte-Carlo simulations of resonance-decay chains show that in all but the most peripheral centrality class (50%-60%), the obtained results are inconsistent with the measurements, unless a significant reduction of the in-medium mass of the $η'$ meson is included. In each centrality class, the best value of the in-medium $η'$ mass is compared to the mass of the $η$ meson, as well as to several theoretical predictions that consider restoration of $U_A(1)$ symmetry in hot hadronic matter.
Submitted 11 July, 2024;
originally announced July 2024.
-
VideoMamba: Spatio-Temporal Selective State Space Model
Authors:
Jinyoung Park,
Hee-Seon Kim,
Kangwook Ko,
Minbeom Kim,
Changick Kim
Abstract:
We introduce VideoMamba, a novel adaptation of the pure Mamba architecture, specifically designed for video recognition. Unlike transformers that rely on self-attention mechanisms leading to high computational costs by quadratic complexity, VideoMamba leverages Mamba's linear complexity and selective SSM mechanism for more efficient processing. The proposed Spatio-Temporal Forward and Backward SSM allows the model to effectively capture the complex relationship between non-sequential spatial and sequential temporal information in video. Consequently, VideoMamba is not only resource-efficient but also effective in capturing long-range dependency in videos, demonstrated by competitive performance and outstanding efficiency on a variety of video understanding benchmarks. Our work highlights the potential of VideoMamba as a powerful tool for video understanding, offering a simple yet effective baseline for future research in video analysis.
Submitted 11 July, 2024;
originally announced July 2024.
-
Telescope control software and proto-model siderostat for the SDSS-V Local Volume Mapper
Authors:
Hojae Ahn,
Florian Briegel,
Jimin Han,
Mingyu Jeon,
Thomas M. Herbst,
Sumin Lee,
Woojin Park,
Sunwoo Lee,
Inhwan Jung,
Tae-Geun Ji,
Changgon Kim,
Geon Hee Kim,
Wolfgang Gaessler,
Markus Kuhlberg,
Hyun Chul Park,
Soojong Pak,
Nicholas P. Konidaris,
Niv Drory,
José R. Sánchez-Gallego,
Cynthia S. Froning,
Solange Ramirez,
Juna A. Kollmeier
Abstract:
The fifth Sloan Digital Sky Survey (SDSS-V) Local Volume Mapper (LVM) is a wide-field integral field unit (IFU) survey that uses an array of four 160 mm fixed telescopes with siderostats to minimize the number of moving parts. Each telescope observes the science field or a calibration field independently and is synchronized with the science exposure. We developed the LVM Acquisition and Guiding Package (LVMAGP), a telescope control software program optimized for LVM observations, which can simultaneously control four focusers, three K-mirrors, one fiber selector, four mounts (siderostats), and seven guide cameras. This software is built on a hierarchical architecture and the SDSS framework and provides three key sequences: autofocus, field acquisition, and autoguide. We designed and fabricated a proto-model siderostat to test the telescope pointing model and the LVMAGP software. The mirrors of the proto-model were designed as an isogrid open-back type, which reduced the weight by 46% and enabled them to reach thermal equilibrium quickly. Additionally, deflection due to bolting torque, self-gravity, and thermal deformation was simulated, and the maximum scatter of the pointing model induced by the tilt of the optomechanics was predicted to be $4'.4$, which can be compensated for by the field acquisition sequence. We performed a real sky test of LVMAGP with the proto-model siderostat and obtained field acquisition and autoguide accuracies of $0''.38$ and $1''.5$, respectively. The system met all requirements except for the autoguide specification, which will be resolved by more precise alignment among the hardware components at Las Campanas Observatory.
Submitted 11 July, 2024;
originally announced July 2024.
-
3D E-textile for Exercise Physiology and Clinical Maternal Health Monitoring
Authors:
Junyi Zhao,
Chansoo Kim,
Weilun Li,
Zichao Wen,
Zhili Xiao,
Yong Wang,
Shantanu Chakrabartty,
Chuan Wang
Abstract:
Electronic textiles (E-textiles) offer great wearing comfort and unobtrusiveness, thus holding potential for next-generation health monitoring wearables. However, the practical implementation is hampered by challenges associated with poor signal quality, substantial motion artifacts, durability for long-term usage, and non-ideal user experience. Here, we report a cost-effective E-textile system that features 3D microfiber-based electrodes for greatly increasing the surface area. The soft and fluffy conductive microfibers disperse freely and securely adhere to the skin, achieving a low impedance at the electrode-skin interface even in the absence of gel. A superhydrophobic fluorinated self-assembled monolayer was deposited on the E-textile surface to render it waterproof while retaining the electrical conductivity. Equipped with a custom-designed motion-artifact canceling wireless data recording circuit, the E-textile system could be integrated into a variety of smart garments for exercise physiology and health monitoring applications. Real-time multimodal electrophysiological signal monitoring, including electrocardiogram (ECG) and electromyography (EMG), was successfully carried out during strenuous cycling and even underwater swimming activities. Furthermore, a multi-channel E-textile was developed and implemented in clinical patient studies for simultaneous real-time monitoring of maternal ECG and uterine EMG signals, incorporating spatial-temporal potential mapping capabilities.
Submitted 10 July, 2024;
originally announced July 2024.
-
Improved limit on neutrinoless double beta decay of $^{100}$Mo from AMoRE-I
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Y. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (83 additional authors not shown)
Abstract:
AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate crystals, at the Yangyang Underground Laboratory for over two years. The exposure was 8.02 kg$\cdot$year (or 3.89 kg$_{\mathrm{^{100}Mo}}\cdot$year) and the total background rate near the Q-value was 0.025 $\pm$ 0.002 counts/keV/kg/year. We observed no indication of $0νββ$ decay and report a new lower limit of the half-life of $^{100}$Mo $0νββ$ decay as $ T^{0ν}_{1/2}>3.0\times10^{24}~\mathrm{years}$ at 90\% confidence level. The effective Majorana mass limit range is $m_{ββ}<$(210--610) meV using nuclear matrix elements estimated in the framework of different models, including the recent shell model calculations.
Submitted 8 July, 2024;
originally announced July 2024.
-
Janus Deformation of de Sitter Space and Transitions in Gravitational Algebras
Authors:
Dongsu Bak,
Chanju Kim,
Sang-Heon Yi
Abstract:
We consider a time-dependent $\mathcal{O}(1/G)$ deformation of pure de Sitter (dS) space in dS gravity coupled to a massless scalar field. It is the dS counterpart of the AdS Janus deformation and interpolates two asymptotically dS spaces in the far past and the far future with a single deformation parameter. The Penrose diagram can be elongated along the time direction indefinitely as the deformation becomes large. After studying the classical properties of the geometry such as the area theorem and the fluctuation by a matter field, we explore the algebraic structure of the field operators on the deformed spacetime. We argue that the algebra is a von Neumann factor of type II$_\infty$ for small deformations, but there occurs a transition to type I$_\infty$ as the deformation increases so that the neck region of the deformed space becomes a Lorentzian cylinder.
Submitted 9 July, 2024; v1 submitted 5 July, 2024;
originally announced July 2024.
-
NuSTAR as an Axion Helioscope
Authors:
J. Ruz,
E. Todarello,
J. K. Vogel,
M. Giannotti,
B. Grefenstette,
H. S. Hudson,
I. G. Hannah,
I. G. Irastorza,
C. S. Kim,
T. O'Shea,
M. Regis,
D. M. Smith,
M. Taoso,
J. Trujillo Bueno
Abstract:
The nature of dark matter in the Universe is still an open question in astrophysics and cosmology. Axions and axion-like particles (ALPs) offer a compelling solution, and traditionally ground-based experiments have eagerly, but to date unsuccessfully, searched for these hypothetical low-mass particles that are expected to be produced in large quantities in the strong electromagnetic fields in the interior of stars. This work offers a fresh look at axions and ALPs by leveraging their conversion into X-rays in the magnetic field of the Sun's atmosphere rather than a laboratory magnetic field. Unique data acquired with the Nuclear Spectroscopic Telescope Array (NuSTAR) during the solar minimum in 2020 allows us to set stringent limits on the coupling of axions to photons using state-of-the-art magnetic field models of the solar atmosphere. We report pioneering limits on the axion-photon coupling strength of $6.9\times 10^{-12}$ GeV$^{-1}$ at 95\% confidence level for axion masses $m_a \lesssim 2\times 10^{-7}$ eV, surpassing current ground-based searches and further probing unexplored regions of the axion-photon coupling parameter space up to axion masses of $m_a \lesssim 5\times 10^{-4}$ eV.
Submitted 4 July, 2024;
originally announced July 2024.
-
Evidence of $h_{b}(\text{2P}) \to Υ(\text{1S})η$ decay and search for $h_{b}(\text{1P,2P}) \to Υ(\text{1S})π^0$ with the Belle detector
Authors:
Belle Collaboration,
E. Kovalenko,
I. Adachi,
H. Aihara,
D. M. Asner,
T. Aushev,
R. Ayad,
V. Babu,
Sw. Banerjee,
K. Belous,
J. Bennett,
M. Bessner,
T. Bilka,
D. Biswas,
A. Bobrov,
D. Bodrov,
A. Bondar,
A. Bozek,
M. Bračko,
P. Branchini,
T. E. Browder,
A. Budano,
M. Campajola,
M. -C. Chang,
B. G. Cheon
, et al. (142 additional authors not shown)
Abstract:
We report the first evidence for the $h_{b}(\text{2P}) \to Υ(\text{1S})η$ transition with a significance of $3.5$ standard deviations. The decay branching fraction is measured to be $\mathcal{B}[h_{b}(\text{2P}) \to Υ(\text{1S})η]=(7.1 ~^{+3.7} _{-3.2}\pm 0.8)\times10^{-3}$, which is noticeably smaller than expected. We also set upper limits on $π^0$ transitions of $\mathcal{B}[h_{b}(\text{2P}) \to Υ(\text{1S})π^0] < 1.8\times10^{-3}$, and $\mathcal{B}[h_{b}(\text{1P})\to Υ(\text{1S})π^0] < 1.8\times10^{-3}$, at the $90\%$ confidence level. These results are obtained with a $131.4$~fb$^{-1}$ data sample collected near the $Υ(\text{5S})$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+e^-$ collider.
Submitted 4 July, 2024;
originally announced July 2024.
-
Efficient Shapley Values for Attributing Global Properties of Diffusion Models to Data Group
Authors:
Chris Lin,
Mingyu Lu,
Chanwoo Kim,
Su-In Lee
Abstract:
As diffusion models are deployed in real-world settings, data attribution is needed to ensure fair acknowledgment for contributors of high-quality training data and to identify sources of harmful content. Previous work focuses on identifying individual training samples important for the generation of a given image. However, instead of focusing on a given generated image, some use cases require understanding global properties of the distribution learned by a diffusion model (e.g., demographic diversity). Furthermore, training data for diffusion models are often contributed in groups rather than separately (e.g., multiple artworks from the same artist). Hence, here we tackle the problem of attributing global properties of diffusion models to groups of training data. Specifically, we develop a method to efficiently estimate Shapley values by leveraging model pruning and fine-tuning. We empirically demonstrate the utility of our method with three use cases: (i) global image quality for a DDPM trained on a CIFAR dataset, (ii) demographic diversity for an LDM trained on CelebA-HQ, and (iii) overall aesthetic quality for a Stable Diffusion model LoRA-finetuned on Post-Impressionist artworks.
Submitted 9 June, 2024;
originally announced July 2024.
-
Measurement of the integrated luminosity of data samples collected during 2019-2022 by the Belle II experiment
Authors:
The Belle II Collaboration,
I. Adachi,
L. Aggarwal,
H. Ahmed,
J. K. Ahn,
H. Aihara,
N. Akopov,
A. Aloisio,
N. Althubiti,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
M. Barrett,
J. Baudot,
A. Baur,
A. Beaubien
, et al. (382 additional authors not shown)
Abstract:
A series of data samples was collected with the Belle II detector at the SuperKEKB collider from March 2019 to June 2022. We determine the integrated luminosities of these data samples using three distinct methodologies involving Bhabha ($e^+e^- \to e^+e^-(nγ)$), digamma ($e^+e^- \to γγ(nγ)$), and dimuon ($e^+e^- \to μ^+ μ^- (nγ)$) events. The total integrated luminosity obtained with Bhabha, digamma, and dimuon events is (426.52 $\pm$ 0.03 $\pm$ 2.48)~fb$^{-1}$, (427.32 $\pm$ 0.03 $\pm$ 2.56)~fb$^{-1}$, and (424.84 $\pm$ 0.04 $\pm$ 3.88)~fb$^{-1}$, where the first uncertainties are statistical and the second are systematic. The resulting total integrated luminosity obtained from the combination of the three methods is (426.88 $\pm$ 1.93)~fb$^{-1}$.
Submitted 1 July, 2024;
originally announced July 2024.
-
Study of $χ_{bJ}(2P)\toωΥ(1S)$ at Belle
Authors:
Belle Collaboration,
Z. S. Stottler,
T. K. Pedlar,
B. G. Fulsom,
I. Adachi,
K. Adamczyk,
H. Aihara,
S. Al Said,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
Sw. Banerjee,
M. Bauer,
P. Behera,
K. Belous,
J. Bennett,
F. Bernlochner,
M. Bessner,
T. Bilka,
D. Biswas,
A. Bobrov,
D. Bodrov,
G. Bonvicini
, et al. (157 additional authors not shown)
Abstract:
We report a study of the hadronic transitions $χ_{bJ}(2P)\toωΥ(1S)$, with $ω\toπ^{+}π^{-}π^{0}$, using $28.2\times10^6~Υ(3S)$ mesons recorded by the Belle detector. We present the first evidence for the near--threshold transition $χ_{b0}(2P)\toωΥ(1S)$, the analog of the charm sector decay $χ_{c1}(3872)\toωJ/ψ$, with a branching fraction of $B\big(χ_{b0}(2P)\toωΥ(1S)\big) = \big(0.55\pm0.19\pm0.07\big)\%$. We also obtain branching fractions of $B\big(χ_{b1}(2P)\toωΥ(1S)\big) = \big(2.39{}^{+0.20}_{-0.19}\pm0.24\big)\%$ and $B\big(χ_{b2}(2P)\toωΥ(1S)\big) = \big(0.47{}^{+0.13}_{-0.12}\pm0.06\big)\%$, confirming the measurement of the $ω$ transitions of the $J=1,2~P$--wave states. The ratio for the $J=2$ to $J=1$ transitions is also measured and found to differ by 3.3 standard deviations from the expected value in the QCD multipole expansion.
Submitted 8 July, 2024; v1 submitted 30 June, 2024;
originally announced July 2024.
-
BackMix: Mitigating Shortcut Learning in Echocardiography with Minimal Supervision
Authors:
Kit Mills Bransby,
Arian Beqiri,
Woo-Jin Cho Kim,
Jorge Oliveira,
Agisilaos Chartsias,
Alberto Gomez
Abstract:
Neural networks can learn spurious correlations that lead to the correct prediction in a validation set, but generalise poorly because the predictions are right for the wrong reason. This undesired learning of naive shortcuts (Clever Hans effect) can happen for example in echocardiogram view classification when background cues (e.g. metadata) are biased towards a class and the model learns to focus on those background features instead of on the image content. We propose a simple, yet effective random background augmentation method called BackMix, which samples random backgrounds from other examples in the training set. By enforcing the background to be uncorrelated with the outcome, the model learns to focus on the data within the ultrasound sector and becomes invariant to the regions outside this. We extend our method in a semi-supervised setting, finding that the positive effects of BackMix are maintained with as few as 5% of segmentation labels. A loss weighting mechanism, wBackMix, is also proposed to increase the contribution of the augmented examples. We validate our method on both in-distribution and out-of-distribution datasets, demonstrating significant improvements in classification accuracy, region focus and generalisability. Our source code is available at: https://github.com/kitbransby/BackMix
Submitted 27 June, 2024;
originally announced June 2024.
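The random background augmentation described in the BackMix abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation (their code is at the GitHub link in the abstract): the function name `backmix`, the array layout, and the assumption that a boolean ultrasound-sector mask is available for every image are all hypothetical choices made here.

```python
import numpy as np

def backmix(images, sector_masks, rng):
    """Swap each image's background for that of a randomly chosen training example.

    images:       (N, H, W) array of frames
    sector_masks: (N, H, W) boolean array, True inside the ultrasound sector
    rng:          numpy random Generator used to pick background donors
    Hypothetical sketch; assumes masks are given and sectors roughly overlap.
    """
    n = images.shape[0]
    donors = rng.permutation(n)  # donor index for each image (may map to itself)
    # Keep the pixels inside the sector, take everything else from the donor.
    mixed = np.where(sector_masks, images, images[donors])
    return mixed, donors
```

Because the donor is drawn independently of the label, the background in the augmented batch carries no information about the class, which is the property the abstract attributes to the method.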
-
Origin of extended Main Sequence Turn Off in open cluster NGC 2355
Authors:
Jayanand Maurya,
M. R. Samal,
Louis Amard,
Yu Zhang,
Hubiao Niu,
Sang Chul Kim,
Y. C. Joshi,
B. Kumar
Abstract:
The presence of extended Main Sequence Turn-Off (eMSTO) in the open clusters has been attributed to various factors, such as spread in rotation rates, binary stars, and dust-like extinction from stellar excretion discs. We present a comprehensive analysis of the eMSTO in the open cluster NGC 2355. Using spectra from the Gaia-ESO archives, we find that the stars in the red part of the eMSTO have a higher mean v sin i value of 135.3$\pm$4.6 km s$^{-1}$ compared to the stars in the blue part that have an average v sin i equal to 81.3$\pm$5.6 km s$^{-1}$. This suggests that the eMSTO in NGC 2355 is possibly caused by the spread in rotation rates of stars. We do not find any substantial evidence of the dust-like extinction from the eMSTO stars using ultraviolet data from the Swift survey. The estimated synchronization time for low mass ratio close binaries in the blue part of the eMSTO suggests that they would be mostly slow-rotating if present. However, the stars in the blue part of the eMSTO are preferentially located in the outer region of the cluster indicating that they may lack low mass ratio close binaries. The spread in rotation rates of eMSTO stars in NGC 2355 is most likely caused by the star-disc interaction mechanism. The stars in the lower main sequence beyond the eMSTO region of NGC 2355 are slow-rotating (mean v sin i = 26.5$\pm$1.3 km s$^{-1}$) possibly due to the magnetic braking of their rotations.
Submitted 27 June, 2024;
originally announced June 2024.
-
12C+12C Reaction Rates and the Evolution of a Massive Star
Authors:
Gwangeon Seong,
Yubin Kim,
Kyujin Kwak,
Sunghoon Ahn,
Chaeyeon Park,
Kevin Insik Hahn,
Chunglee Kim
Abstract:
Carbon fusion is important for understanding the late stages in the evolution of a massive star. Astronomically interesting energy ranges for the 12C+12C reactions have, however, been poorly constrained by experiments. Theoretical studies on stellar evolution have relied on reaction rates that are extrapolated from those measured at higher energies. In this work, we update the carbon fusion reaction rates by fitting the astrophysical S-factor data obtained from direct measurements based on the Fowler, Caughlan, & Zimmerman (1975) formula. We examine the evolution of a 20 M_sun star with the updated 12C+12C reaction rates by performing simulations with the MESA (Modules for Experiments in Stellar Astrophysics) code. Between 0.5 and 1 GK, the updated reaction rates are 0.35 to 0.5 times less than the rates suggested by Caughlan and Fowler (1988). The updated rates result in an increase of the core temperature by about 7% and of the neutrino cooling by about a factor of three. Moreover, the carbon-burning lifetime is reduced by a factor of 2.7. Although the updated carbon fusion reaction rates lead to some changes in the details of the stellar evolution model, their impact seems relatively minor compared to other uncertain physical factors like convection, overshooting, rotation, and mass-loss history. The astrophysical S-factor measurements at lower energies have large errors below the Coulomb barrier. More precise measurements at lower energies for carbon burning would be useful to improve our study and to understand the evolution of a massive star.
Submitted 19 June, 2024;
originally announced June 2024.
-
TroL: Traversal of Layers for Large Language and Vision Models
Authors:
Byung-Kwan Lee,
Sangyun Chung,
Chae Won Kim,
Beomchan Park,
Yong Man Ro
Abstract:
Large language and vision models (LLVMs) have been driven by the generalization power of large language models (LLMs) and the advent of visual instruction tuning. Along with scaling them up directly, these models enable LLVMs to showcase powerful vision language (VL) performances by covering diverse tasks via natural language instructions. However, existing open-source LLVMs that perform comparably to closed-source LLVMs such as GPT-4V are often considered too large (e.g., 26B, 34B, and 110B parameters), having a larger number of layers. These large models demand costly, high-end resources for both training and inference. To address this issue, we present a new efficient LLVM family with 1.8B, 3.8B, and 7B LLM model sizes, Traversal of Layers (TroL), which enables the reuse of layers in a token-wise manner. This layer traversing technique simulates the effect of looking back and retracing the answering stream while increasing the number of forward propagation layers without physically adding more layers. We demonstrate that TroL employs a simple layer traversing approach yet efficiently outperforms the open-source LLVMs with larger model sizes and rivals the performances of the closed-source LLVMs with substantial sizes.
Submitted 19 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
An Introduction to Computational Fluctuating Hydrodynamics
Authors:
Alejandro L. Garcia,
John B. Bell,
Andrew Nonaka,
Ishan Srivastava,
Daniel Ladiges,
Changho Kim
Abstract:
These notes are an introduction to fluctuating hydrodynamics (FHD) and the formulation of numerical schemes for the resulting stochastic partial differential equations (PDEs). Fluctuating hydrodynamics was originally introduced by Landau and Lifshitz as a way to put thermal fluctuations into a continuum framework by including a stochastic forcing to each dissipative transport process (e.g., heat flux). While FHD has been useful in modeling transport and fluid dynamics at the mesoscopic scale, theoretical calculations have been feasible only with simplifying assumptions. As such there is great interest in numerical schemes for Computational Fluctuating Hydrodynamics (CFHD). There are a variety of algorithms (e.g., spectral, finite element, lattice Boltzmann) but in this introduction we focus on finite volume schemes. Accompanying these notes is a demonstration program in Python available on GitHub.
Submitted 17 June, 2024;
originally announced June 2024.