Benchmarking Sub-Genre Classification For Mainstage Dance Music

Authors: Hongzhi Shu, Xinglin Li, Hongyu Jiang, Minghao Fu, Xinyu Li

Abstract: Music classification, with a wide range of applications, is one of the most prominent tasks in music information retrieval. To address the absence of comprehensive datasets and high-performing methods in the classification of mainstage dance music, this work introduces a novel benchmark comprising a new dataset and a baseline. Our dataset extends the number of sub-genres to cover most recent mainstage live sets by top DJs worldwide in music festivals. A continuous soft labeling approach is employed to account for tracks that span multiple sub-genres, preserving the inherent sophistication. For the baseline, we developed deep learning models that outperform current state-of-the-art multimodal language models, which struggle to identify house music sub-genres, emphasizing the need for specialized models trained on fine-grained datasets. Our benchmark is applicable to scenarios such as music recommendation, DJ set curation, and interactive multimedia, for which we also provide video demos. Our code is available at https://anonymous.4open.science/r/Mainstage-EDM-Benchmark/.

Submitted 10 September, 2024; originally announced September 2024.

Comments: Submitted to ICASSP 2025

ACM Class: I.2.1

arXiv:2409.01622 [pdf]

T1-contrast Enhanced MRI Generation from Multi-parametric MRI for Glioma Patients with Latent Tumor Conditioning

Authors: Zach Eidex, Mojtaba Safari, Richard L. J. Qiu, David S. Yu, Hui-Kuo Shu, Hui Mao, Xiaofeng Yang

Abstract: Objective: Gadolinium-based contrast agents (GBCAs) are commonly used in MRI scans of patients with gliomas to enhance brain tumor characterization using T1-weighted (T1W) MRI. However, there is growing concern about GBCA toxicity. This study develops a deep-learning framework to generate T1-postcontrast (T1C) images from pre-contrast multiparametric MRI. Approach: We propose the tumor-aware vision transformer (TA-ViT) model that predicts high-quality T1C images. The predicted tumor region is significantly improved (P < .001) by conditioning the transformer layers on predicted segmentation maps through an adaptive layer norm zero mechanism. The predicted segmentation maps were generated with the multi-parametric residual (MPR) ViT model and transformed into a latent space to produce compressed, feature-rich representations. The TA-ViT model predicted T1C MRI images of 501 glioma cases. Selected patients were split into training (N=400), validation (N=50), and test (N=51) sets. Main Results: Both qualitative and quantitative results demonstrate that the TA-ViT model outperforms the benchmark MPR-ViT model. Our method produces synthetic T1C MRI with high soft tissue contrast and more accurately reconstructs both the tumor and whole brain volumes. The synthesized T1C images achieved remarkable improvements in both tumor and healthy tissue regions compared to the MPR-ViT model. For healthy tissue and tumor regions, the results were as follows: NMSE: 8.53 +/- 4.61E-4; PSNR: 31.2 +/- 2.2; NCC: 0.908 +/- 0.041 and NMSE: 1.22 +/- 1.27E-4, PSNR: 41.3 +/- 4.7, and NCC: 0.879 +/- 0.042, respectively. Significance: The proposed method generates synthetic T1C images that closely resemble real T1C images. Future development and application of this approach may enable contrast-agent-free MRI for brain tumor patients, eliminating the risk of GBCA toxicity and simplifying the MRI scan protocol.

Submitted 3 September, 2024; originally announced September 2024.

Comments: arXiv admin note: text overlap with arXiv:2407.02616

arXiv:2408.01760 [pdf, other]

Large Language Models for Equivalent Mutant Detection: How Far Are We?

Authors: Zhao Tian, Honglin Shu, Dong Wang, Xuejie Cao, Yasutaka Kamei, Junjie Chen

Abstract: Mutation testing is vital for ensuring software quality. However, the presence of equivalent mutants is known to introduce redundant cost and bias issues, hindering the effectiveness of mutation testing in practical use. Although numerous equivalent mutant detection (EMD) techniques have been proposed, they exhibit limitations due to the scarcity of training data and challenges in generalizing to unseen mutants. Recently, large language models (LLMs) have been extensively adopted in various code-related tasks and have shown superior performance by more accurately capturing program semantics. Yet the performance of LLMs in equivalent mutant detection remains largely unclear. In this paper, we conduct an empirical study on 3,302 method-level Java mutant pairs to comprehensively investigate the effectiveness and efficiency of LLMs for equivalent mutant detection. Specifically, we assess the performance of LLMs compared to existing EMD techniques, examine the various strategies of LLMs, evaluate the orthogonality between EMD techniques, and measure the time overhead of training and inference. Our findings demonstrate that LLM-based techniques significantly outperform existing techniques (i.e., the average improvement of 35.69% in terms of F1-score), with the fine-tuned code embedding strategy being the most effective. Moreover, LLM-based techniques offer an excellent balance between cost (relatively low training and inference time) and effectiveness. Based on our findings, we further discuss the impact of model size and embedding quality, and provide several promising directions for future research. This work is the first to examine LLMs in equivalent mutant detection, affirming their effectiveness and efficiency.

Submitted 3 August, 2024; originally announced August 2024.

Comments: This paper has been accepted by ISSTA'2024

arXiv:2408.00273 [pdf, other]

3D U-KAN Implementation for Multi-modal MRI Brain Tumor Segmentation

Authors: Tianze Tang, Yanbing Chen, Hai Shu

Abstract: We explore the application of U-KAN, a U-Net based network enhanced with Kolmogorov-Arnold Network (KAN) layers, for 3D brain tumor segmentation using multi-modal MRI data. We adapt the original 2D U-KAN model to the 3D task, and introduce a variant called UKAN-SE, which incorporates Squeeze-and-Excitation modules for global attention. We compare the performance of U-KAN and UKAN-SE against existing methods such as U-Net, Attention U-Net, and Swin UNETR, using the BraTS 2024 dataset. Our results show that U-KAN and UKAN-SE, with approximately 10.6 million parameters, achieve exceptional efficiency, requiring only about 1/4 of the training time of U-Net and Attention U-Net, and 1/6 that of Swin UNETR, while surpassing these models across most evaluation metrics. Notably, UKAN-SE slightly outperforms U-KAN.

Submitted 1 August, 2024; originally announced August 2024.

arXiv:2407.19992 [pdf, other]

More precise edge detections

Authors: Hao Shu, Guo-Ping Qiu

Abstract: Image edge detection (ED) is a fundamental task in computer vision. While the performance of ED algorithms has been improved greatly by introducing CNN-based models, current models still suffer from unsatisfactory precision rates, especially when only a low error tolerance distance is allowed. Therefore, model architectures for more precise predictions still need investigation. On the other hand, the unavoidably noisy training data provided by humans lead to unsatisfactory model predictions even when the inputs are edge maps themselves, which also needs improvement. In this paper, more precise ED models are presented with cascaded skipping density blocks (CSDB). Our models obtain state-of-the-art (SOTA) predictions on several datasets, especially in average precision rate (AP), which is confirmed by extensive experiments. Moreover, our models do not include down-sample operations, demonstrating that these widely adopted operations are not necessary. Also, a novel modification of data augmentation for training is employed, which allows noiseless data to be used in model training and thus improves the performance of models predicting on edge maps themselves.

Submitted 29 July, 2024; originally announced July 2024.

Comments: 11 pages

arXiv:2407.11906 [pdf, other]

SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge

Authors: Hao Ding, Tuxun Lu, Yuqian Zhang, Ruixing Liang, Hongchao Shu, Lalithkumar Seenivasan, Yonghao Long, Qi Dou, Cong Gao, Mathias Unberath

Abstract: Accurate segmentation of tools in robot-assisted surgery is critical for machine perception, as it facilitates numerous downstream tasks including augmented reality feedback. While current feed-forward neural network-based methods exhibit excellent segmentation performance under ideal conditions, these models have proven susceptible to even minor corruptions, significantly impairing the model's performance. This vulnerability is especially problematic in surgical settings where predictions might be used to inform high-stakes decisions. To better understand model behavior under non-adversarial corruptions, prior work has explored introducing artificial corruptions, like Gaussian noise or contrast perturbation to test set images, to assess model robustness. However, these corruptions are either not photo-realistic or model/task agnostic. Thus, these investigations provide limited insights into model deterioration under realistic surgical corruptions. To address this limitation, we introduce the SegSTRONG-C challenge that aims to promote the development of algorithms robust to unforeseen but plausible image corruptions of surgery, like smoke, bleeding, and low brightness. We collect and release corruption-free mock endoscopic video sequences for the challenge participants to train their algorithms and benchmark them on video sequences with photo-realistic non-adversarial corruptions for a binary robot tool segmentation task. This new benchmark will allow us to carefully study neural network robustness to non-adversarial corruptions of surgery, thus constituting an important first step towards more robust models for surgical computer vision. In this paper, we describe the data collection and annotation protocol, baseline evaluations of established segmentation models, and data augmentation-based techniques to enhance model robustness.

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.02616 [pdf]

Deep Learning Based Apparent Diffusion Coefficient Map Generation from Multi-parametric MR Images for Patients with Diffuse Gliomas

Authors: Zach Eidex, Mojtaba Safari, Jacob Wynne, Richard L. J. Qiu, Tonghe Wang, David Viar Hernandez, Hui-Kuo Shu, Hui Mao, Xiaofeng Yang

Abstract: Purpose: Apparent diffusion coefficient (ADC) maps derived from diffusion weighted (DWI) MRI provide functional measurements about the water molecules in tissues. However, DWI is time consuming and very susceptible to image artifacts, leading to inaccurate ADC measurements. This study aims to develop a deep learning framework to synthesize ADC maps from multi-parametric MR images. Methods: We proposed the multiparametric residual vision transformer model (MPR-ViT) that leverages the long-range context of ViT layers along with the precision of convolutional operators. Residual blocks throughout the network significantly increase the representational power of the model. The MPR-ViT model was applied to T1w and T2-fluid attenuated inversion recovery images of 501 glioma cases from a publicly available dataset including preprocessed ADC maps. Selected patients were divided into training (N=400), validation (N=50) and test (N=51) sets, respectively. Using the preprocessed ADC maps as ground truth, model performance was evaluated and compared against the Vision Convolutional Transformer (VCT) and residual vision transformer (ResViT) models. Results: The results are as follows using T1w + T2-FLAIR MRI as inputs: MPR-ViT - PSNR: 31.0 +/- 2.1, MSE: 0.009 +/- 0.0005, SSIM: 0.950 +/- 0.015. In addition, ablation studies showed the relative impact on performance of each input sequence. Both qualitative and quantitative results indicate that the proposed MPR-ViT model performs favorably against the ground truth data. Conclusion: We show that high-quality ADC maps can be synthesized from structural MRI using an MPR-ViT model. Our predicted images show better conformality to the ground truth volume than ResViT and VCT predictions. These high-quality synthetic ADC maps would be particularly useful for disease diagnosis and intervention, especially when ADC maps have artifacts or are unavailable.

Submitted 4 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:2311.15044

arXiv:2407.00730 [pdf, other]

D-CDLF: Decomposition of Common and Distinctive Latent Factors for Multi-view High-dimensional Data

Authors: Hai Shu

Abstract: A typical approach to the joint analysis of multiple high-dimensional data views is to decompose each view's data matrix into three parts: a low-rank common-source matrix generated by common latent factors of all data views, a low-rank distinctive-source matrix generated by distinctive latent factors of the corresponding data view, and an additive noise matrix. Existing decomposition methods often focus on the uncorrelatedness between the common latent factors and distinctive latent factors, but inadequately address the equally necessary uncorrelatedness between distinctive latent factors from different data views. We propose a novel decomposition method, called Decomposition of Common and Distinctive Latent Factors (D-CDLF), to effectively achieve both types of uncorrelatedness for two-view data. We also discuss the estimation of the D-CDLF under high-dimensional settings.

Submitted 1 August, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

Comments: This revision updates only Paragraph 1 of Section 2.1 and Remark 2 of Section 3.2 from version 1

arXiv:2406.11257 [pdf, other]

ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking

Authors: Wenshuo Li, Xinghao Chen, Han Shu, Yehui Tang, Yunhe Wang

Abstract: Large language models (LLMs) have recently attracted significant attention in the field of artificial intelligence. However, the training process of these models poses significant challenges in terms of computational and storage capacities, so compressing checkpoints has become an urgent problem. In this paper, we propose a novel Extreme Checkpoint Compression (ExCP) framework, which significantly reduces the required storage of training checkpoints while achieving nearly lossless performance. We first calculate the residuals of adjacent checkpoints to obtain the essential but sparse information for a higher compression ratio. To further exploit the redundant parameters in checkpoints, we then propose a weight-momentum joint shrinking method that utilizes another important source of information during model optimization, i.e., momentum. In particular, we exploit the information of both model and optimizer to discard as many parameters as possible while preserving critical information to ensure optimal performance. Furthermore, we utilize non-uniform quantization to further compress the storage of checkpoints. We extensively evaluate our proposed ExCP framework on several models ranging from 410M to 7B parameters and demonstrate significant storage reduction while maintaining strong performance. For instance, we achieve approximately $70\times$ compression for the Pythia-410M model, with the final performance being as accurate as the original model on various downstream tasks. Codes will be available at https://github.com/Gaffey/ExCP.

Submitted 17 June, 2024; originally announced June 2024.

Comments: ICML 2024 oral

arXiv:2406.06481 [pdf, other]

Nodewise Loreg: Nodewise $L_0$-penalized Regression for High-dimensional Sparse Precision Matrix Estimation

Authors: Hai Shu, Ziqi Chen, Yingjie Zhang, Hongtu Zhu

Abstract: We propose Nodewise Loreg, a nodewise $L_0$-penalized regression method for estimating high-dimensional sparse precision matrices. We establish its asymptotic properties, including convergence rates, support recovery, and asymptotic normality under high-dimensional sub-Gaussian settings. Notably, the Nodewise Loreg estimator is asymptotically unbiased and normally distributed, eliminating the need for debiasing required by Nodewise Lasso. We also develop a desparsified version of Nodewise Loreg, similar to the desparsified Nodewise Lasso estimator. The asymptotic variances of the undesparsified Nodewise Loreg estimator are upper bounded by those of both desparsified Nodewise Loreg and Lasso estimators for Gaussian data, potentially offering more powerful statistical inference. Extensive simulations show that the undesparsified Nodewise Loreg estimator generally outperforms the two desparsified estimators in asymptotic normal behavior. Moreover, Nodewise Loreg surpasses Nodewise Lasso, CLIME, and GLasso in most simulations in terms of matrix norm losses, support recovery, and timing performance. Application to a breast cancer gene expression dataset further demonstrates Nodewise Loreg's superiority over the three $L_1$-norm based methods.

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2405.15230 [pdf, other]

$i$REPO: $i$mplicit Reward Pairwise Difference based Empirical Preference Optimization

Authors: Long Tan Le, Han Shu, Tung-Anh Nguyen, Choong Seon Hong, Nguyen H. Tran

Abstract: While astonishingly capable, large language models (LLMs) can sometimes produce outputs that deviate from human expectations. Such deviations necessitate an alignment phase to prevent disseminating untruthful, toxic, or biased information. Traditional alignment methods based on reinforcement learning often struggle with the identified instability, whereas preference optimization methods are limited by their overfitting to pre-collected hard-label datasets. In this paper, we propose a novel LLM alignment framework named $i$REPO, which utilizes implicit Reward pairwise difference regression for Empirical Preference Optimization. In particular, $i$REPO employs self-generated datasets labelled by empirical human (or AI annotator) preference to iteratively refine the aligned policy through a novel regression-based loss function. Furthermore, we introduce an innovative algorithm backed by theoretical guarantees for achieving optimal results under ideal assumptions and providing a practical performance-gap result without such assumptions. Experimental results with Phi-2 and Mistral-7B demonstrate that $i$REPO effectively achieves self-alignment using soft-label, self-generated responses and the logit of empirical AI annotators. Furthermore, our approach surpasses preference optimization baselines in evaluations using the Language Model Evaluation Harness and Multi-turn benchmarks.

Submitted 24 May, 2024; originally announced May 2024.

Comments: Under Review

arXiv:2404.06668 [pdf]

Forecasting the Future with Future Technologies: Advancements in Large Meteorological Models

Authors: Hailong Shu, Yue Wang, Weiwei Song, Huichuang Guo, Zhen Song

Abstract: The field of meteorological forecasting has undergone a significant transformation with the integration of large models, especially those employing deep learning techniques. This paper reviews the advancements and applications of these models in weather prediction, emphasizing their role in transforming traditional forecasting methods. Models like FourCastNet, Pangu-Weather, GraphCast, ClimaX, and FengWu have made notable contributions by providing accurate, high-resolution forecasts, surpassing the capabilities of traditional Numerical Weather Prediction (NWP) models. These models utilize advanced neural network architectures, such as Convolutional Neural Networks (CNNs), Graph Neural Networks (GNNs), and Transformers, to process diverse meteorological data, enhancing predictive accuracy across various time scales and spatial resolutions. The paper addresses challenges in this domain, including data acquisition and computational demands, and explores future opportunities for model optimization and hardware advancements. It underscores the integration of artificial intelligence with conventional meteorological techniques, promising improved weather prediction accuracy and a significant contribution to addressing climate-related challenges. This synergy positions large models as pivotal in the evolving landscape of meteorological forecasting.

Submitted 9 April, 2024; originally announced April 2024.

Comments: 5 pages

arXiv:2403.14803 [pdf, other]

Transmission Benefits and Cost Allocation under Ambiguity

Authors: Han Shu, Jacob Mays

Abstract: Disputes over cost allocation can present a significant barrier to investment in shared infrastructure. While it may be desirable to allocate cost in a way that corresponds to expected benefits, investments in long-lived projects are made under conditions of substantial uncertainty. In the context of electricity transmission, uncertainty combined with the inherent complexity of power systems analysis prevents the calculation of an estimated distribution of benefits that is agreeable to all participants. To analyze aspects of the cost allocation problem, we construct a model for transmission and generation expansion planning under uncertainty, enabling the identification of transmission investments as well as the calculation of benefits for users of the network. Numerical tests confirm the potential for realized benefits at the participant level to differ significantly from ex ante estimates. Based on the model and numerical tests we discuss several issues, including 1) establishing a valid counterfactual against which to measure benefits, 2) allocating cost to new and incumbent generators vs. solely allocating to loads, 3) calculating benefits at the portfolio vs. the individual project level, 4) identifying losers in a surplus-enhancing transmission expansion, and 5) quantifying the divergence between cost allocation decisions made ex ante and benefits realized ex post.

Submitted 21 March, 2024; originally announced March 2024.

Comments: 32 pages, 7 figures, 7 tables

arXiv:2403.10004 [pdf, other]

ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images

Authors: Xiangtian Xue, Jiasong Wu, Youyong Kong, Lotfi Senhadji, Huazhong Shu

Abstract: We present a novel image editing scenario termed Text-grounded Object Generation (TOG), defined as generating a new object in a real image spatially conditioned by textual descriptions. Existing diffusion models exhibit limitations of spatial perception in complex real-world scenes, relying on additional modalities to enforce constraints, and TOG imposes heightened challenges on scene comprehension under the weak supervision of linguistic information. We propose a universal framework, ST-LDM, based on Swin-Transformer, which can be integrated into any latent diffusion model with training-free backward guidance. ST-LDM encompasses a global-perceptual autoencoder with adaptable compression scales and hierarchical visual features, in parallel with a deformable multimodal transformer to generate region-wise guidance for the subsequent denoising process. We transcend the limitation of traditional attention mechanisms that only focus on existing visual features by introducing deformable feature alignment to hierarchically refine spatial positioning fused with multi-scale visual and linguistic information. Extensive experiments demonstrate that our model enhances the localization of attention mechanisms while preserving the generative capabilities inherent to diffusion models.

Submitted 15 March, 2024; originally announced March 2024.

arXiv:2403.09128 [pdf, other]

Rethinking Referring Object Removal

Authors: Xiangtian Xue, Jiasong Wu, Youyong Kong, Lotfi Senhadji, Huazhong Shu

Abstract: Referring object removal refers to removing the specific object in an image referred to by natural language expressions and filling the missing region with reasonable semantics. To address this task, we construct ComCOCO, a synthetic dataset consisting of 136,495 referring expressions for 34,615 objects in 23,951 image pairs. Each pair contains an image with referring expressions and the ground truth after elimination. We further propose an end-to-end syntax-aware hybrid mapping network with an encoding-decoding structure. Linguistic features are hierarchically extracted at the syntactic level and fused in the downsampling process of visual features with multi-head attention. The feature-aligned pyramid network is leveraged to generate segmentation masks and replace internal pixels with region affinity learned from external semantics in high-level feature maps. Extensive experiments demonstrate that our model outperforms, by a significant margin, diffusion models and two-stage methods that process the segmentation and inpainting tasks separately.

Submitted 14 March, 2024; originally announced March 2024.

arXiv:2403.08157 [pdf]

Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks

Authors: Fuzhi Wu, Jiasong Wu, Youyong Kong, Chunfeng Yang, Guanyu Yang, Huazhong Shu, Guy Carrault, Lotfi Senhadji

Abstract: Deep learning and Convolutional Neural Networks (CNNs) have driven major transformations in diverse research areas. However, their limitations in handling low-frequency information present obstacles in certain tasks like interpreting global structures or managing smooth transition images. Despite the promising performance of transformer structures in numerous tasks, their intricate optimization complexities highlight the persistent need for refined CNN enhancements using limited resources. Responding to these complexities, we introduce a novel framework, the Multiscale Low-Frequency Memory (MLFM) Network, with the goal to harness the full potential of CNNs while keeping their complexity unchanged. The MLFM efficiently preserves low-frequency information, enhancing performance in targeted computer vision tasks. Central to our MLFM is the Low-Frequency Memory Unit (LFMU), which stores various low-frequency data and forms a parallel channel to the core network. A key advantage of MLFM is its seamless compatibility with various prevalent networks, requiring no alterations to their original core structure. Testing on ImageNet demonstrated substantial accuracy improvements in multiple 2D CNNs, including ResNet, MobileNet, EfficientNet, and ConvNeXt. Furthermore, we showcase MLFM's versatility beyond traditional image classification by successfully integrating it into image-to-image translation tasks, specifically in semantic segmentation networks like FCN and U-Net. In conclusion, our work signifies a pivotal stride in the journey of optimizing the efficacy and efficiency of CNNs with limited resources. This research builds upon the existing CNN foundations and paves the way for future advancements in computer vision. Our codes are available at https://github.com/AlphaWuSeu/MLFM.

Submitted 12 March, 2024; originally announced March 2024.

Comments: 9 pages, 10 figures, 6 tables. AAAI 2024 conference

arXiv:2402.09337 [pdf, other]

doi 10.1103/PhysRevD.109.114505

Lattice $B$-field correlators for heavy quarks

Authors: Luis Altenkort, David de la Cruz, Olaf Kaczmarek, Guy D. Moore, Hai-Tao Shu

Abstract: We analyze the color-magnetic (or "$B$") field two-point function that encodes the finite-mass correction to the heavy quark momentum diffusion coefficient. The simulations are done on fine isotropic lattices in the quenched approximation at $1.5\,T_c$, using a range of gradient flow times for noise suppression and operator renormalization. The continuum extrapolation is performed at fixed flow time followed by a second extrapolation to zero flow time. Perturbative calculations to next-to-leading order of this correlation function, matching gradient-flowed correlators to MS-bar, are used to resolve nontrivial renormalization issues. We perform a spectral reconstruction based on perturbative model fits to estimate the coefficient $κ_B$ of the finite-mass correction to the heavy quark momentum diffusion coefficient. The approach we present here yields high-precision data for the correlator with all renormalization issues incorporated at next-to-leading order, and is also applicable for actions with dynamical fermions.

Submitted 16 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

Comments: 12 pages, 9 figures

Journal ref: Phys.Rev.D 109 (2024) 11, 114505

arXiv:2402.03492 [pdf, other]

Beyond Strong labels: Weakly-supervised Learning Based on Gaussian Pseudo Labels for The Segmentation of Ellipse-like Vascular Structures in Non-contrast CTs

Authors: Qixiang Ma, Antoine Łucas, Huazhong Shu, Adrien Kaladji, Pascal Haigron

Abstract: Deep-learning-based automated segmentation of vascular structures in preoperative CT scans contributes to computer-assisted diagnosis and intervention procedures in vascular diseases. While CT angiography (CTA) is the common standard, non-contrast CT imaging is significant as a contrast-risk-free alternative, avoiding complications associated with contrast agents. However, the challenges of labor-intensive labeling and high labeling variability due to the ambiguity of vascular boundaries hinder conventional strong-label-based, fully-supervised learning in non-contrast CTs. This paper introduces a weakly-supervised framework using ellipses' topology in slices, including 1) an efficient annotation process based on predefined standards, 2) ellipse-fitting processing, 3) the generation of 2D Gaussian heatmaps serving as pseudo labels, 4) a training process through a combination of voxel reconstruction loss and distribution loss with the pseudo labels. We assess the effectiveness of the proposed method on one local and two public datasets comprising non-contrast CT scans, particularly focusing on the abdominal aorta. On the local dataset, our weakly-supervised learning approach based on pseudo labels outperforms strong-label-based fully-supervised learning (1.54\% of Dice score on average), reducing labeling time by around 82.0\%. The efficiency in generating pseudo labels allows the inclusion of label-agnostic external data in the training set, leading to an additional improvement in performance (2.74\% of Dice score on average) with a 66.3\% reduction in labeling time, where the labeling time remains considerably less than that of strong labels. On the public dataset, the pseudo labels achieve an overall improvement of 1.95\% in Dice score for 2D models and a reduction of 11.65 voxel spacings in Hausdorff distance for the 3D model.

Submitted 10 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

arXiv:2401.08040 [pdf, other]

Transport and Connection to Heavy-ion Collisions via Heavy Flavor Probes

Authors: Hai-Tao Shu

Abstract: The heavy ion experiments at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC) are going through upgrades in the next five years, shifting their focus more toward hard processes in the new runs. One of the main goals is to draw a finer image of the quark gluon plasma (QGP). The heavy flavor probes, which witness the whole history of a heavy ion collision, are particularly sensitive to the properties of the QGP formed in such collisions. The lattice results for heavy flavor probes provide transport and phenomenological models with crucial inputs to describe experimental observations like the strong suppression of the nuclear modification factor $R_{AA}$ and the non-zero azimuthal anisotropy at low $p_T$. In the last two years we have seen significant advances in the lattice QCD studies of heavy flavor probes, including the in-medium quarkonium properties, the complex static quark-antiquark potential and the heavy quark diffusion from lattice simulations at nonzero temperature. These achievements substantially deepen our understanding of the fate of quarkonium, the screening/unscreening of the complex potential and the temperature and quark mass dependence of the heavy quark diffusion in a thermal medium. In these proceedings, we review recent results and briefly discuss possible directions in these studies.

Submitted 14 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

Comments: 18 pages, 8 figures, plenary talk delivered at the 40th International Symposium on Lattice Field Theory (Lattice 2023), July 31st - August 4th, 2023, Fermilab

arXiv:2401.03766 [pdf, other]

doi 10.1007/JHEP03(2024)122

TBA equations and exact WKB analysis in deformed supersymmetric quantum mechanics

Authors: Katsushi Ito, Hongfei Shu

Abstract: We study the spectral problem in deformed supersymmetric quantum mechanics with polynomial superpotential by using the exact WKB method and the TBA equations. We apply the ODE/IM correspondence to the Schrödinger equation with an effective potential deformed by integrating out the fermions, which admits a continuous deformation parameter. We find that the TBA equations are described by the ${\mathbb Z}_4$-extended ones. For cubic superpotential corresponding to the symmetric double-well potential, the TBA system splits into the two $D_3$-type TBA equations. We investigate in detail this example based on the TBA equations and their analytic continuation as well as the massless limit. We find that the energy spectrum obtained from the exact quantization condition is in good agreement with the diagonalization approach of the Hamiltonian.

Submitted 22 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: 31 pages, 1 figure; Published version

Report number: TIT/HEP-698

Journal ref: JHEP03(2024)122

arXiv:2312.13789 [pdf, other]

TinySAM: Pushing the Envelope for Efficient Segment Anything Model

Authors: Han Shu, Wenshuo Li, Yehui Tang, Yiman Zhang, Yihao Chen, Houqiang Li, Yunhe Wang, Xinghao Chen

Abstract: Recently, the segment anything model (SAM) has shown powerful segmentation capability and has drawn great attention in computer vision fields. Many follow-up works have developed various applications based on the pretrained SAM and achieved impressive performance on downstream vision tasks. However, SAM consists of heavy architectures and requires massive computational capacity, which hinders the further application of SAM on computation-constrained edge devices. To this end, in this paper we propose a framework to obtain a tiny segment anything model (TinySAM) while maintaining the strong zero-shot performance. We first propose a full-stage knowledge distillation method with hard prompt sampling and hard mask weighting strategy to distill a lightweight student model. We also adapt the post-training quantization to the promptable segmentation task and further reduce the computational cost. Moreover, a hierarchical segmenting everything strategy is proposed to accelerate the everything inference by $2\times$ with almost no performance degradation. With all these proposed methods, our TinySAM leads to orders of magnitude computational reduction and pushes the envelope for efficient segment anything task. Extensive experiments on various zero-shot transfer tasks demonstrate the significantly advantageous performance of our TinySAM against counterpart methods. Pre-trained models and code are available at https://github.com/xinghaochen/TinySAM and https://gitee.com/mindspore/models/tree/master/research/cv/TinySAM.

Submitted 9 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

arXiv:2312.08682 [pdf, other]

High-coherence parallelization in integrated photonics

Authors: Xuguang Zhang, Zixuan Zhou, Yijun Guo, Minxue Zhuang, Warren Jin, Bitao Shen, Yujun Chen, Jiahui Huang, Zihan Tao, Ming Jin, Ruixuan Chen, Zhangfeng Ge, Zhou Fang, Ning Zhang, Yadong Liu, Pengfei Cai, Weiwei Hu, Haowen Shu, Dong Pan, John E. Bowers, Xingjun Wang, Lin Chang

Abstract: Coherent optics has profoundly impacted diverse applications ranging from communications and LiDAR to quantum computation. However, building coherent systems in integrated photonics previously came at great expense in hardware integration and energy efficiency: the lack of a power-efficient way to generate highly coherent light necessitates bulky lasers and amplifiers, while frequency and phase recovery schemes require huge digital signal processing resources. In this work, we demonstrate a high-coherence parallelization strategy that facilitates advanced integrated coherent systems at a minimum price. Using a self-injection locked microcomb to injection lock a distributed feedback laser array, we boost the microcomb power by a record high gain of up to 60 dB on chip with no degradation in coherence. This strategy enables tens of highly coherent channels with an intrinsic linewidth down to the 10 Hz level and power of more than 20 dBm. The overall electrical to optical wall-plug efficiency reaches 19%, comparable with that of the state-of-the-art semiconductor lasers. Driven by this parallel source, we demonstrate a silicon photonic communication link with an unprecedented data rate beyond 60 Tbit/s. Importantly, the high coherence we achieve reduces the coherent-related DSP consumption by 99.999% compared with the traditional III-V laser pump scheme. This work paves a way to realizing scalable, high-performance coherent integrated photonic systems, potentially benefiting numerous applications.

Submitted 14 December, 2023; originally announced December 2023.

arXiv:2311.15044 [pdf]

High-resolution 3T to 7T MRI Synthesis with a Hybrid CNN-Transformer Model

Authors: Zach Eidex, Jing Wang, Mojtaba Safari, Eric Elder, Jacob Wynne, Tonghe Wang, Hui-Kuo Shu, Hui Mao, Xiaofeng Yang

Abstract: 7 Tesla (7T) apparent diffusion coefficient (ADC) maps derived from diffusion-weighted imaging (DWI) demonstrate improved image quality and spatial resolution over 3 Tesla (3T) ADC maps. However, 7T magnetic resonance imaging (MRI) currently suffers from limited clinical availability, higher cost, and increased susceptibility to artifacts. To address these issues, we propose a hybrid CNN-transformer model to synthesize high-resolution 7T ADC maps from multi-modal 3T MRI. The Vision CNN-Transformer (VCT), composed of both Vision Transformer (ViT) blocks and convolutional layers, is proposed to produce high-resolution synthetic 7T ADC maps from 3T ADC maps and 3T T1-weighted (T1w) MRI. ViT blocks enabled global image context while convolutional layers efficiently captured fine detail. The VCT model was validated on the publicly available Human Connectome Project Young Adult dataset, comprising 3T T1w, 3T DWI, and 7T DWI brain scans. The Diffusion Imaging in Python (DIPY) library was used to compute ADC maps from the DWI scans. A total of 171 patient cases were randomly divided: 130 training cases, 20 validation cases, and 21 test cases. The synthetic ADC maps were evaluated by comparing their similarity to the ground truth volumes with the following metrics: peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and mean squared error (MSE). The results are as follows: PSNR: 27.0 ± 0.9 dB, SSIM: 0.945 ± 0.010, and MSE: (2.0 ± 0.4)E-3. Our predicted images demonstrate better spatial resolution and contrast compared to 3T MRI and prediction results made by ResViT and pix2pix. These high-quality synthetic 7T MR images could be beneficial for disease diagnosis and intervention, especially when 7T MRI scanners are unavailable.

Submitted 25 November, 2023; originally announced November 2023.
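
Note on the metrics above: a minimal sketch of how MSE and PSNR relate, assuming intensities normalized to a data range of 1.0 (the abstract does not state the normalization, and the helper names `mse`, `psnr`, and `psnr_from_mse` are illustrative, not taken from the paper's code). Under that assumption, an MSE of 2.0E-3 corresponds to roughly 27 dB, which is consistent with the reported figures, although a mean of per-case PSNRs need not equal the PSNR of the mean MSE.

```python
import math


def mse(reference, prediction):
    """Mean squared error between two equally long sequences of intensities."""
    if len(reference) != len(prediction) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((r - p) ** 2 for r, p in zip(reference, prediction)) / len(reference)


def psnr_from_mse(err, data_range=1.0):
    """PSNR in dB for a given MSE; data_range is the assumed maximum intensity."""
    if err == 0.0:
        return float("inf")  # identical images
    return 10.0 * math.log10(data_range ** 2 / err)


def psnr(reference, prediction, data_range=1.0):
    """PSNR in dB between a reference and a prediction (flattened voxels)."""
    return psnr_from_mse(mse(reference, prediction), data_range)


if __name__ == "__main__":
    # Assumed normalization: intensities in [0, 1].
    print(round(psnr_from_mse(2.0e-3), 2))
```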

arXiv:2311.01525 [pdf, other]

doi 10.1103/PhysRevLett.132.051902

Quark Mass Dependence of Heavy Quark Diffusion Coefficient from Lattice QCD

Authors: Luis Altenkort, David de la Cruz, Olaf Kaczmarek, Rasmus Larsen, Guy D. Moore, Swagato Mukherjee, Peter Petreczky, Hai-Tao Shu, Simon Stendebach

Abstract: We present the first study of the quark mass dependence of the heavy quark momentum and spatial diffusion coefficients using lattice QCD with light dynamical quarks corresponding to a pion mass of 320 MeV. We find that, for the temperature range 195 MeV $<T<$ 293 MeV, the spatial diffusion coefficients of the charm and bottom quarks are smaller than those obtained in phenomenological models that describe the $p_T$ spectra and elliptic flow of open heavy flavor hadrons.

Submitted 1 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: 15 pages, 12 figures

Journal ref: Phys. Rev. Lett. 132, 051902 (2024)

arXiv:2310.19295 [pdf, other]

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout

Authors: Huiyao Shu, Ang Wang, Ziji Shi, Hanyu Zhao, Yong Li, Lu Lu

Abstract: As deep learning models continue to increase in size, the memory requirements for training have surged. While high-level techniques like offloading, recomputation, and compression can alleviate memory pressure, they also introduce overheads. However, a memory-efficient execution plan that includes a reasonable operator execution order and tensor memory layout can significantly increase the models' memory efficiency and reduce overheads from high-level techniques. In this paper, we propose ROAM, which operates at the computation graph level to derive a memory-efficient execution plan with optimized operator order and tensor memory layout for models. We first propose sophisticated theories that carefully consider model structure and training memory load to support optimization for large complex graphs that have not been well supported in the past. An efficient tree-based algorithm is further proposed to search task divisions automatically, along with delivering high performance and effectiveness to solve the problem. Experiments show that ROAM achieves a substantial memory reduction of 35.7%, 13.3%, and 27.2% compared to PyTorch and two state-of-the-art methods and offers a remarkable 53.7x speedup. The evaluation conducted on the expansive GPT2-XL further validates ROAM's scalability.

Submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.13349 [pdf, other]

DeepFDR: A Deep Learning-based False Discovery Rate Control Method for Neuroimaging Data

Authors: Taehyo Kim, Hai Shu, Qiran Jia, Mony J. de Leon

Abstract: Voxel-based multiple testing is widely used in neuroimaging data analysis. Traditional false discovery rate (FDR) control methods often ignore the spatial dependence among the voxel-based tests and thus suffer from substantial loss of testing power. While recent spatial FDR control methods have emerged, their validity and optimality remain questionable when handling the complex spatial dependencies of the brain. Concurrently, deep learning methods have revolutionized image segmentation, a task closely related to voxel-based multiple testing. In this paper, we propose DeepFDR, a novel spatial FDR control method that leverages unsupervised deep learning-based image segmentation to address the voxel-based multiple testing problem. Numerical studies, including comprehensive simulations and Alzheimer's disease FDG-PET image analysis, demonstrate DeepFDR's superiority over existing methods. DeepFDR not only excels in FDR control and effectively diminishes the false nondiscovery rate, but also boasts exceptional computational efficiency highly suited for tackling large-scale neuroimaging data.

Submitted 10 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

Journal ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024), PMLR 238:946-954, 2024
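
Note on "traditional" FDR control: for context, the sketch below shows the classical Benjamini-Hochberg step-up procedure, which treats every test (here, every voxel) identically and ignores spatial position. This is not the DeepFDR method and is not taken from the paper; the function name `benjamini_hochberg` and the example p-values are illustrative assumptions.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans: True where the hypothesis is rejected.

    Classical step-up rule: sort the p-values, find the largest rank k with
    p_(k) <= alpha * k / m, and reject the k smallest p-values.
    """
    m = len(p_values)
    # Indices of the tests, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= alpha * rank / m:
            k_max = rank  # keep the largest rank that passes
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Hypothetical voxel-wise p-values; spatial position plays no role here.
    print(benjamini_hochberg([0.04, 0.001, 0.03, 0.5]))
```

Because the rule depends only on the sorted p-values, neighboring voxels carry no extra weight, which is the limitation the abstract attributes to traditional methods.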

arXiv:2310.08835 [pdf, other]

doi 10.1103/PhysRevApplied.21.044052

Demonstration of chronometric leveling using transportable optical clocks beyond laser coherence limit

Authors: Yi Yuan, Kaifeng Cui, Daoxin Liu, Jinbo Yuan, Jian Cao, Dehao Wang, Sijia Chao, Hualin Shu, Xueren Haung

Abstract: An optical clock network requires the establishment of optical frequency transmission links between multiple optical clocks, utilizing narrow linewidth lasers. Despite achieving link noise levels of $10^{-20}$, the final accuracy is limited by the phase noise of the clock laser. Correlation spectroscopy is developed to transmit frequency information between two optical clocks directly, enabling optical clock comparison beyond the phase noise limit of clock lasers, and significantly enhancing the measurement accuracy or shortening the measurement time. In this letter, two compact transportable $^{40}$Ca$^+$ clocks are employed to accomplish the correlation spectroscopy comparison, demonstrating a 10 cm level measurement accuracy of chronometric leveling using a mediocre clock laser with a linewidth of 200 Hz. The relative frequency instability reaches $6.0\times10^{-15}/\sqrt{τ/s}$, which is about 20 times better than the result with Rabi spectroscopy using the same clock laser. This research greatly reduces the harsh requirements on the performance of the clock laser, so that an ordinary stable laser can also be employed in the construction of an optical clock network, which is essential for field applications, especially for chronometric leveling.

Submitted 12 October, 2023; originally announced October 2023.
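
Note on the 10 cm figure: a back-of-the-envelope estimate, not taken from the paper, of the fractional frequency resolution that a 10 cm height difference implies. It assumes the standard weak-field gravitational redshift near Earth's surface with $g \approx 9.81\,\mathrm{m\,s^{-2}}$, and it assumes the quoted instability keeps averaging down as $1/\sqrt{\tau}$ with no systematic floor.

```latex
% Gravitational redshift between two clocks separated by height \Delta h:
\frac{\Delta\nu}{\nu} \approx \frac{g\,\Delta h}{c^{2}}
  = \frac{(9.81\,\mathrm{m\,s^{-2}})(0.10\,\mathrm{m})}{(2.998\times10^{8}\,\mathrm{m\,s^{-1}})^{2}}
  \approx 1.1\times10^{-17}

% Averaging time needed if \sigma(\tau) = 6.0\times10^{-15}/\sqrt{\tau/\mathrm{s}}:
\tau \approx \left(\frac{6.0\times10^{-15}}{1.1\times10^{-17}}\right)^{2}\,\mathrm{s}
  \approx 3\times10^{5}\,\mathrm{s}
```

Under these assumptions, resolving 10 cm takes on the order of a few days of averaging.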

arXiv:2310.04663 [pdf]

doi 10.1088/0256-307X/40/11/117501

Giant 2D Skyrmion Topological Hall Effect with Ultrawide Temperature Window and Low-Current Manipulation in 2D Room-Temperature Ferromagnetic Crystals

Authors: Gaojie Zhang, Qingyuan Luo, Xiaokun Wen, Hao Wu, Li Yang, Wen Jin, Luji Li, Jia Zhang, Wenfeng Zhang, Haibo Shu, Haixin Chang

Abstract: The discovery and manipulation of the topological Hall effect (THE), an abnormal magnetoelectric response mostly related to the Dzyaloshinskii-Moriya interaction (DMI), are promising for next-generation spintronic devices based on topological spin textures such as magnetic skyrmions. However, most skyrmions and THE are stabilized in a narrow temperature window either below or above room temperature with high critical current manipulation. It is still elusive and challenging to achieve large THE with both a wide temperature window up to room temperature and low critical current manipulation. Here, by using controllable, naturally-oxidized, sub-20 and sub-10 nm 2D van der Waals room-temperature ferromagnetic Fe3GaTe2-x crystals, robust 2D THE with an ultrawide temperature window ranging over three orders of magnitude from 2 to 300 K is reported, combined with giant THE of ~5.4 micro-ohm cm at 10 K and ~0.15 micro-ohm cm at 300 K, which is 1-3 orders of magnitude larger than that of all known room-temperature 2D skyrmion systems. Moreover, room-temperature current-controlled THE is also realized with a low critical current density of ~6.2*10^5 A cm^-2. First-principles calculations unveil natural oxidation-induced highly-enhanced 2D interfacial DMI responsible for the robust giant THE. This work paves the way to room-temperature, electrically-controlled 2D THE-based practical spintronic devices.

Submitted 6 October, 2023; originally announced October 2023.

Journal ref: Chinese Physics Letters 2023

arXiv:2310.00747 [pdf]

NoxTrader: LSTM-Based Stock Return Momentum Prediction for Quantitative Trading

Authors: Hsiang-Hui Liu, Han-Jay Shu, Wei-Ning Chiu

Abstract: We introduce NoxTrader, a sophisticated system designed for portfolio construction and trading execution with the primary objective of achieving profitable outcomes in the stock market, specifically aiming to generate moderate to long-term profits. The underlying learning process of NoxTrader is rooted in the assimilation of valuable insights derived from historical trading data, particularly focusing on time-series analysis due to the nature of the dataset employed. In our approach, we utilize price and volume data of the US stock market for feature engineering to generate effective features, including Return Momentum, Week Price Momentum, and Month Price Momentum. We choose the Long Short-Term Memory (LSTM) model to capture continuous price trends and implement dynamic model updates during the trading execution process, enabling the model to continuously adapt to the current market trends. Notably, we have developed a comprehensive trading backtesting system - NoxTrader, which allows us to manage portfolios based on predictive scores and utilize custom evaluation metrics to conduct a thorough assessment of our trading performance. Our rigorous feature engineering and careful selection of prediction targets enable us to generate prediction data with an impressive correlation range between 0.65 and 0.75. Finally, we monitor the dispersion of our prediction data and perform a comparative analysis against actual market data. Through the use of filtering techniques, we improved the initial -60% investment return to 325%.

Submitted 31 October, 2023; v1 submitted 1 October, 2023; originally announced October 2023.

Comments: 5 pages, 7 figures

arXiv:2309.00885 [pdf, other]

doi 10.1016/j.media.2023.102945

A Generic Fundus Image Enhancement Network Boosted by Frequency Self-supervised Representation Learning

Authors: Heng Li, Haofeng Liu, Huazhu Fu, Yanwu Xu, Hui Shu, Ke Niu, Yan Hu, Jiang Liu

Abstract: Fundus photography is prone to suffer from image quality degradation that impacts clinical examination performed by ophthalmologists or intelligent systems. Though enhancement algorithms have been developed to promote fundus observation on degraded images, high data demands and limited applicability hinder their clinical deployment. To circumvent this bottleneck, a generic fundus image enhancement network (GFE-Net) is developed in this study to robustly correct unknown fundus images without supervised or extra data. Leveraging image frequency information, self-supervised representation learning is conducted to learn robust structure-aware representations from degraded images. Then with a seamless architecture that couples representation learning and image enhancement, GFE-Net can accurately correct fundus images and meanwhile preserve retinal structures. Comprehensive experiments are implemented to demonstrate the effectiveness and advantages of GFE-Net. Compared with state-of-the-art algorithms, GFE-Net achieves superior performance in data dependency, enhancement performance, deployment efficiency, and scale generalizability. Follow-up fundus image analysis is also facilitated by GFE-Net, whose modules are respectively verified to be effective for image enhancement.

Submitted 2 September, 2023; originally announced September 2023.

Comments: Accepted by Medical Image Analysis in August, 2023

Journal ref: Medical Image Analysis, 2023, 90:102945

arXiv:2308.16677 [pdf, other]

doi 10.1007/JHEP02(2024)140

Quasinormal Modes of C-metric from SCFTs

Authors: Yang Lei, Hongfei Shu, Kilar Zhang, Rui-Dong Zhu

Abstract: We study the quasinormal modes (QNM) of the charged C-metric, which physically stands for a charged accelerating black hole, with the help of Nekrasov's partition function of 4d $\mathcal{N}=2$ superconformal field theories (SCFTs). The QNM in the charged C-metric are classified into three types: the photon-surface modes, the accelerating modes and the near-extremal modes, and it is curious how the single quantization condition proposed in arXiv:2006.06111 can reproduce all the different families. We show that the connection formula encoded in terms of Nekrasov's partition function captures all these families of QNM numerically and recovers the asymptotic behavior of the accelerating and the near-extremal modes analytically. Using the connection formulae of different 4d $\mathcal{N}=2$ SCFTs, one can solve both the radial and the angular part of the scalar perturbation equation respectively. The same algorithm can be applied to the de Sitter (dS) black holes to calculate both the dS modes and the photon-sphere modes.

Submitted 30 January, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

Comments: 32+24 pages; typos corrected and remark added in v4

Journal ref: JHEP02(2024)140

arXiv:2308.14234 [pdf, other]

An initial analysis of a strongly-lensed QSOs candidate identified by LAMOST

Authors: Y. H. Chen, M. Y. Tang, H. Shu, H. Tu

Abstract: From 2011 to 2021, LAMOST released data for a total of 76,167 quasars. We search for gravitationally lensed QSOs by limiting the coordinate differences and redshift differences of these QSOs. The name, brightness, spectrum, photometry and other information of each QSO are visually checked carefully. Special attention is paid to checking whether there are groups of galaxies, gravitationally lensed arcs, Einstein crosses, or Einstein rings near the QSOs. Through careful selection, we select LAMOST J160603.01+290050.8 (A) and LAMOST J160602.81+290048.7 (B) as a candidate and perform an initial analysis. Components A and B are 3.36 arc seconds apart and they appear blue in photometric observations. The redshift values of components A and B are 0.2\% different, their Gaia$\_$g values are 1.3\% different, and their ugriz values are 1.0\% or less different. For the spectra covering from 3,690 Å to 9,100 Å, the emission lines of C\,II, Mg, H\,$γ$, O\,III, and H\,$β$ are present for both components A and B and the ratio of flux(B) to flux(A) from LAMOST is basically a constant, around 2.2. We accidentally find a galaxy group near components A and B. If the center of dark matter in the galaxy group is at the center between components A and B, then components A and B are probably gravitationally lensed QSOs. We estimate that the Einstein mass is 1.46 $\times$ $10^{11}$ $M_{\odot}$ and the total mass of the lens is 1.34 $\times$ $10^{13}$ $M_{\odot}$. The deflection angle is 1.97 arc seconds at positions A and B and the velocity dispersion is 261\,km\,s$^{-1}$. Theoretically, this candidate could be a pair of fold images of a strong lensing system by a galaxy group, and we will investigate the possibility when the redshifts of nearby galaxies are available.

Submitted 27 August, 2023; originally announced August 2023.

Comments: 2 tables and 4 figures, accepted by RAA on August 24, 2023

arXiv:2308.12062 [pdf, other]

Rationally Correcting Impurity Levels Positions Based on Electrostatic Potential Strategy for Photocatalytic Overall Water Splitting

Authors: Dazhong Sun, Wentao Li, Anqi Shi, Wenxia Zhang, Huabing Shu, Fengfeng Chi, Bing Wang, Xiuyun Zhang, Xianghong Niu

Abstract: Doping to induce suitable impurity levels is an effective strategy to achieve highly efficient photocatalytic overall water splitting (POWS). However, to predict the position of impurity levels, it is not enough to depend only on the projected density of states of the substituted atom, as in the traditional method. Herein, taking phosphorus-doped g-C3N5 as an example, we find that the impurity atom can change the electrostatic potential gradient and polarity, and then significantly affect the spatial electron density around the substituted atom, which further adjusts the impurity level position. Based on the redox potential requirement of POWS, we not only obtain suitable impurity levels, but also expand the visible light absorption range. Simultaneously, the strengthened polarity induced by doping further improves the redox ability of photogenerated carriers. Moreover, the enhanced surface dipoles clearly promote the adsorption and subsequent splitting of water molecules. Our study provides a more comprehensive view to realize accurate regulation of impurity levels in doping engineering and gives reasonable strategies for designing an excellent catalyst for POWS.

Submitted 23 August, 2023; originally announced August 2023.

Comments: 15 pages, 7 figures, 1 table, 37 reference articles

arXiv:2308.06982 [pdf, other]

Discrete Conditional Diffusion for Reranking in Recommendation

Authors: Xiao Lin, Xiaokai Chen, Chenyang Wang, Hantao Shu, Linfeng Song, Biao Li, Peng jiang

Abstract: Reranking plays a crucial role in modern multi-stage recommender systems by rearranging the initial ranking list to model interplay between items. Considering the inherent challenges of reranking such as combinatorial searching space, some previous studies have adopted the evaluator-generator paradigm, with a generator producing feasible sequences and an evaluator selecting the best one based on estimated listwise utility. Inspired by the remarkable success of diffusion generative models, this paper explores the potential of diffusion models for generating high-quality sequences in reranking. However, we argue that it is nontrivial to take diffusion models as the generator in the context of recommendation. Firstly, diffusion models primarily operate in continuous data space, differing from the discrete data space of item permutations. Secondly, the recommendation task is different from conventional generation tasks as the purpose of recommender systems is to fulfill user interests. Lastly, real-life recommender systems require efficiency, posing challenges for the inference of diffusion models. To overcome these challenges, we propose a novel Discrete Conditional Diffusion Reranking (DCDR) framework for recommendation. DCDR extends traditional diffusion models by introducing a discrete forward process with tractable posteriors, which adds noise to item sequences through step-wise discrete operations (e.g., swapping). Additionally, DCDR incorporates a conditional reverse process that generates item sequences conditioned on expected user responses. Extensive offline experiments conducted on public datasets demonstrate that DCDR outperforms state-of-the-art reranking methods. Furthermore, DCDR has been deployed in a real-world video app with over 300 million daily active users, significantly enhancing online recommendation quality.
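
As a rough illustration of what "adding noise to item sequences through step-wise swapping" could look like, the sketch below perturbs a ranked list with random adjacent swaps. It is not the DCDR forward process from the paper: the function name `forward_swap_noise`, the parameters `num_steps` and `swap_prob`, and the choice of adjacent swaps are all assumptions made for this example.

```python
import random


def forward_swap_noise(items, num_steps, swap_prob, rng):
    """Return a noised copy of `items` (hypothetical helper, not from the paper).

    At each step, with probability `swap_prob`, one uniformly chosen
    pair of adjacent items is swapped; otherwise the sequence is left
    unchanged for that step.
    """
    seq = list(items)
    for _ in range(num_steps):
        if len(seq) > 1 and rng.random() < swap_prob:
            i = rng.randrange(len(seq) - 1)  # left index of the adjacent pair
            seq[i], seq[i + 1] = seq[i + 1], seq[i]
    return seq


rng = random.Random(0)
ranked = ["a", "b", "c", "d", "e"]
noisy = forward_swap_noise(ranked, num_steps=10, swap_prob=0.5, rng=rng)
```

Because every step is a swap or a no-op, the output is always a permutation of the input, which is the property that keeps such a process inside the discrete space of item orderings.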

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2307.16438 [pdf]

Coexistence of Superconductivity and ferromagnetism in high entropy carbide ceramics

Authors: Huchen Shu, Wei Zhong, Jiajia Feng, Hongyang Zhao, Fang Hong, Binbin Yue

Abstract: Generally, superconductivity was expected to be absent in magnetic systems, but this perception was challenged by unconventional superconductors, such as cuprates, iron-based superconductors and the recently discovered nickelates, since their superconductivity is proposed to be related to the electron-electron interaction mediated by spin fluctuations. However, the coexistence of superconductivity and magnetism is still rare in conventional superconductors. In this work, we report the coexistence of these two quantum orderings in the high entropy carbide ceramics (Mo0.2Nb0.2Ta0.2V0.2W0.2)C0.9 and (Ta0.25Ti0.25Nb0.25Zr0.25)C, which are expected to be conventional superconductors. Clear magnetic hysteresis loops were observed in these high entropy carbides, indicating a ferromagnetic ground state. A sharp superconducting transition is observed in (Mo0.2Nb0.2Ta0.2V0.2W0.2)C0.9 with a Tc of 3.4 K and an upper critical field of ~3.35 T. Meanwhile, superconductivity is suppressed to some extent and the zero-resistance state disappears in (Ta0.25Ti0.25Nb0.25Zr0.25)C, in which stronger magnetism is present. The upper critical field of (Ta0.25Ti0.25Nb0.25Zr0.25)C is only ~1.5 T, though it shows a higher transition temperature near 5.7 K. The ferromagnetism stems from the carbon vacancies which often occur during the high temperature synthesis process. This work not only demonstrates the observation of superconductivity in high entropy carbide ceramics, but also provides an alternative exotic platform to study the correlation between superconductivity and magnetism, and is of great benefit for the design of multifunctional electronic devices.

Submitted 31 July, 2023; originally announced July 2023.

Comments: 16 pages, 5 figures, 1 table. Suggestion and comments are welcome

arXiv:2307.00677 [pdf, other]

SDC-HSDD-NDSA: Structure Detecting Cluster by Hierarchical Secondary Directed Differential with Normalized Density and Self-Adaption

Authors: Hao Shu

Abstract: Density-based clustering could be the most popular clustering algorithm since it can identify clusters of arbitrary shape as long as different (high-density) clusters are separated by low-density regions. However, the requirement of the separateness of clusters by low-density regions is not trivial since a high-density region might have different structures which should be clustered into different groups. Such a situation demonstrates the main flaw of all previous density-based clustering algorithms we know of: structures in a high-density cluster could not be detected. Therefore, this paper aims to provide a density-based clustering scheme that not only has the abilities of previous ones but could also detect structures in a high-density region not separated by low-density ones. The algorithm employs secondary directed differential, hierarchy, normalized density, as well as the self-adaption coefficient, and thus is called Structure Detecting Cluster by Hierarchical Secondary Directed Differential with Normalized Density and Self-Adaption, dubbed SDC-HSDD-NDSA for short. To illustrate its effectiveness, we run the algorithm on several data sets. The results verify its validity in structure detection, robustness over noises, as well as independence of granularities, and demonstrate that it could outperform previous ones. The Python code of the paper can be found at https://github.com/Hao-B-Shu/SDC-HSDD-NDSA.

Submitted 5 July, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

Comments: 16 pages

arXiv:2306.10525 [pdf, ps, other]

Reduce dark count effects by optimizing measurements

Authors: Hao Shu

Abstract: When implementing quantum tasks practically, the imperfection of devices should be taken into account. Among these, one of the significant but unsolved problems is the dark count effect caused by single photon detectors. In this paper, we consider such an issue and define a new optimality for measurements, reflecting the robustness against dark count effects with practical detectors. Also, an optimization scheme for general measurements is provided. This research could be the first one trying to handle dark count effects based on optimizing the choice of measurements, and we believe that the problem can be reduced by the scheme.

Submitted 25 October, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

Comments: 7 pages

arXiv:2306.08723 [pdf, other]

Hippocampus Substructure Segmentation Using Morphological Vision Transformer Learning

Authors: Yang Lei, Yifu Ding, Richard L. J. Qiu, Tonghe Wang, Justin Roper, Yabo Fu, Hui-Kuo Shu, Hui Mao, Xiaofeng Yang

Abstract: Background: The hippocampus plays a crucial role in memory and cognition. Because of the associated toxicity from whole brain radiotherapy, more advanced treatment planning techniques prioritize hippocampal avoidance, which depends on an accurate segmentation of the small and complexly shaped hippocampus. Purpose: To achieve accurate segmentation of the anterior and posterior regions of the hippocampus from T1 weighted (T1w) MRI images, we developed a novel model, Hippo-Net, which uses a mutually enhanced strategy. Methods: The proposed model consists of two major parts: 1) a localization model is used to detect the volume-of-interest (VOI) of the hippocampus; 2) an end-to-end morphological vision transformer network is used to perform substructure segmentation within the hippocampus VOI. A total of 260 T1w MRI datasets were used in this study. We conducted a five-fold cross-validation on the first 200 T1w MR images and then performed a hold-out test on the remaining 60 T1w MR images with the model trained on the first 200 images. Results: In five-fold cross-validation, the DSCs were 0.900±0.029 and 0.886±0.031 for the hippocampus proper and parts of the subiculum, respectively. The MSDs were 0.426±0.115 mm and 0.401±0.100 mm for the hippocampus proper and parts of the subiculum, respectively. Conclusions: The proposed method showed great promise in automatically delineating hippocampus substructures on T1w MRI images. It may facilitate the current clinical workflow and reduce the physician effort.
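
The evaluation protocol in the Methods (five-fold cross-validation on the first 200 images, hold-out test on the remaining 60) can be sketched as index bookkeeping. This is a minimal illustration only: the function name `five_fold_with_holdout` is hypothetical, and the use of contiguous, unshuffled folds is an assumption, since the abstract does not say how folds were assigned.

```python
def five_fold_with_holdout(n_total=260, n_cv=200, n_folds=5):
    """Split image indices into cross-validation folds plus a hold-out set.

    Assumes n_cv is divisible by n_folds and that folds are contiguous
    blocks of indices (an assumption of this sketch, not of the paper).
    """
    indices = list(range(n_total))
    cv_pool, holdout = indices[:n_cv], indices[n_cv:]
    fold_size = n_cv // n_folds
    folds = []
    for k in range(n_folds):
        start, stop = k * fold_size, (k + 1) * fold_size
        val = cv_pool[start:stop]                 # validation block for fold k
        train = cv_pool[:start] + cv_pool[stop:]  # the other four blocks
        folds.append((train, val))
    return folds, holdout


folds, holdout = five_fold_with_holdout()
```

With the default arguments each fold trains on 160 images and validates on 40, and the 60 hold-out indices never appear in any fold.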

Submitted 14 June, 2023; originally announced June 2023.

arXiv:2306.06488 [pdf, other]

doi 10.1007/JHEP08(2023)172

Lattice Calculation of the Intrinsic Soft Function and the Collins-Soper Kernel

Authors: Lattice Parton Collaboration, Min-Huan Chu, Jin-Chen He, Jun Hua, Jian Liang, Xiangdong Ji, Andreas Schäfer, Hai-Tao Shu, Yushan Su, Lisa Walter, Wei Wang, Ji-Hao Wang, Yi-Bo Yang, Jun Zeng, Qi-An Zhang

Abstract: We calculate the soft function using lattice QCD in the framework of large momentum effective theory incorporating the one-loop perturbative contributions. The soft function is a crucial ingredient in the lattice determination of light cone objects using transverse-momentum-dependent (TMD) factorization. It consists of a rapidity-independent part called intrinsic soft function and a rapidity-dependent part called Collins-Soper kernel. We have adopted appropriate normalization when constructing the pseudo-scalar meson form factor that is needed in the determination of the intrinsic part and applied Fierz rearrangement to suppress the higher-twist effects. In the calculation of CS kernel we consider a CLS ensemble other than the MILC ensemble used in a previous study. We have also compared the applicability of determining the CS kernel using quasi TMDWFs and quasi TMDPDFs. As an example, the determined soft function is used to obtain the physical TMD wave functions (WFs) of pion and unpolarized iso-vector TMD parton distribution functions (PDFs) of proton.

Submitted 28 August, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

Comments: 24 pages, 19 figures, published version

Journal ref: JHEP08(2023)172

arXiv:2306.04302 [pdf]

Robust magnetism against pressure in non-superconducting samples prepared from lutetium foil and H2/N2 gas mixture

Authors: Jing Guo, Shu Cai, Dong Wang, Haiyun Shu, Liuxiang Yang, Pengyu Wang, Wentao Wang, Huanfang Tian, Huaixin Yang, Yazhou Zhou, Jinyu Zhao, Jinyu Han, Jianqi Li, Qi Wu, Yang Ding, Wenge Yang, Tao Xiang, Ho-kwang Mao, Liling Sun

Abstract: Recently, the claim of "near-ambient superconductivity" in an N-doped lutetium hydride attracted enormous follow-up investigations in the community of condensed matter physics and material sciences. But quite soon, the experimental results from different groups indicated consistently that no evidence of near-ambient superconductivity is found in the samples synthesized by the same method as the reported one, or by other alternative methods. From our extended high-pressure heat capacity and magnetic susceptibility measurements on the samples prepared with the lutetium foil and H2/N2 gas mixture, we report the finding of a magnetic transition at a temperature of about 56 K. Our results show that this magnetic phase is robust against pressure up to 4.3 GPa, which covers the critical pressure of boosting the claimed near room temperature superconductivity.

Submitted 11 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: 14 pages, 4 figures

Journal ref: CPL 40 (2023)097401

arXiv:2306.01098 [pdf, other]

doi 10.1016/j.cpc.2024.109164

SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations

Authors: Lukas Mazur, Dennis Bollweg, David A. Clarke, Luis Altenkort, Olaf Kaczmarek, Rasmus Larsen, Hai-Tao Shu, Jishnu Goswami, Philipp Scior, Hauke Sandmeyer, Marius Neumann, Henrik Dick, Sajid Ali, Jangho Kim, Christian Schmidt, Peter Petreczky, Swagato Mukherjee

Abstract: The rise of exascale supercomputers has fueled competition among GPU vendors, driving lattice QCD developers to write code that supports multiple APIs. Moreover, new developments in algorithms and physics research require frequent updates to existing software. These challenges have to be balanced against constantly changing personnel. At the same time, there is a wide range of applications for HISQ fermions in QCD studies. This situation encourages the development of software featuring a HISQ action that is flexible, high-performing, open source, easy to use, and easy to adapt. In this technical paper, we explain the design strategy, provide implementation details, list available algorithms and modules, and show key performance indicators for SIMULATeQCD, a simple multi-GPU lattice code for large-scale QCD calculations, mainly developed and used by the HotQCD collaboration. The code is publicly available on GitHub.

Submitted 3 April, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

Comments: 17 pages, 7 figures

Journal ref: Comp. Phys. Commun. 300 (2024) 109164

arXiv:2305.06907 [pdf, other]

doi 10.1007/s00601-023-01833-w

Study of quarkonium in QGP from unquenched lattice QCD

Authors: Sajid Ali, Dibyendu Bala, Olaf Kaczmarek, Hai-Tao Shu, Tristan Ueding

Abstract: This paper discusses the charmonium and bottomonium correlators in the pseudoscalar channel and the corresponding spectral reconstruction on the lattice. The absence of a transport peak in the pseudoscalar channel spectral function allows for an easier study of the in-medium modification of bound states. However, extracting spectral information from Euclidean correlators is still a numerically ill-posed problem. To constrain the spectral reconstruction, we use an ansatz motivated from perturbation theory. The perturbative model spectral function has two main contributions: a thermal part around the threshold obtained from pNRQCD and the vacuum part well above the threshold. These two regions are matched continuously, and the model spectral function is obtained by introducing parameters that control the overall thermal shift of the peak and the overall amplitude. The lattice correlator data is computed using clover-improved Wilson valence fermions on large and fine gauge field configurations generated using $N_f=2+1$ flavors Highly Improved Staggered Quark (HISQ) action with physical strange quark mass $m_s$, and slightly heavy degenerate up and down quark masses $m_l=m_s/5$ that corresponds to $m_π\simeq 320$ MeV. Our results obtained at $T=220$ MeV and $T=251$ MeV suggest that no resonance peaks are needed to describe the charmonium lattice data at these temperatures, while for bottomonium thermally broadened resonance peaks persist.

Submitted 15 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

Comments: 13 pages, 4 figures, Contribution to a special issue of Few Body Systems: Emergence and Structure of Baryons - Selected Contributions from the International Conference Baryons 2022

Journal ref: Few Body Syst. 64 (2023) 3, 52

arXiv:2304.11246 [pdf, other]

doi 10.1063/5.0156023

Nucleation of transition waves via collisions of elastic vector solitons

Authors: Hiromi Yasuda, Hang Shu, Weijian Jiao, Vincent Tournat, Jordan R. Raney

Abstract: In this work, we show that collisions of one type of nonlinear wave can lead to generation of a different kind of nonlinear wave. Specifically, we demonstrate the formation of topological solitons (or transition waves) via collisions of elastic vector solitons, another type of nonlinear wave, in a multi-stable mechanical system with coupling between translational and rotational degrees of freedom. We experimentally observe the nucleation of a phase transformation arising from colliding waves, and we numerically investigate head-on and overtaking collisions of solitary waves with vectorial properties (i.e., elastic vector solitons). Unlike KdV-type solitons, which maintain their shape despite collisions, our system shows that collisions of two vector solitons can cause nucleation of a new phase via annihilation of the vector solitons, triggering the propagation of transition waves. The propagation of these depends both on the amount of energy carried by the vector solitons and on their respective rotational directions. The observation of the initiation of transition waves with collisions of vector solitons in multistable mechanical systems serves as an example of new fundamental nonlinear wave interactions, and could also prove useful in applications involving reconfigurable structures.

Submitted 21 April, 2023; originally announced April 2023.

arXiv:2304.04684 [pdf, ps, other]

Correlation Functions in the TsT/$T{\bar T}$ Correspondence

Authors: Wei Cui, Hongfei Shu, Wei Song, Juntao Wang

Abstract: We investigate the proposed holographic duality between the TsT transformation of IIB string theory on AdS$_3\times {\cal N}$ with NS-NS flux and a single-trace $T\bar{T}$ deformation of the symmetric orbifold CFT. We present a non-perturbative calculation of two-point correlation functions using string theory and demonstrate their consistency with those of the $T\bar{T}$ deformation. The two-point correlation function of the deformed theory on the plane, written in momentum space, is obtained from that of the undeformed theory by replacing $h$ with $h+2{\tilde λ\over w} p\bar p$, where $h$ is the spacetime conformal weight, $\tilde λ$ is a deformation parameter, $p$ and $\bar p$ are the momenta, and $w$ labels the twisted sectors in the deformed symmetric product. At $w=1$, the non-perturbative result satisfies the Callan-Symanzik equation for double-trace $T\bar T$ deformed CFT derived in \cite{Cardy:2019qao}. We also perform conformal perturbations on both the worldsheet CFT and the symmetric orbifold CFT as a sanity check. The perturbative and non-perturbative matching between results on the two sides provides further evidence of the conjectured TsT/$T\bar{T}$ correspondence.

Submitted 10 May, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

Comments: 35 pages; references added

arXiv:2304.03100 [pdf]

No evidence of superconductivity in the compressed sample prepared from the lutetium foil and H2/N2 gas mixture

Authors: Shu Cai, Jing Guo, Haiyun Shu, Liuxiang Yang, Pengyu Wang, Yazhou Zhou, Jinyu Zhao, Jinyu Han, Qi Wu, Wenge Yang, Tao Xiang, Ho-kwang Mao, Liling Sun

Abstract: A material described as lutetium-hydrogen-nitrogen (Lu-H-N for short) was recently claimed to have near-ambient superconductivity [Gammon et al, Nature 615, 244, 2023]. If the results could be reproduced by other teams, it would be a major scientific breakthrough. Here, we report our results of transport and structure measurements on a material prepared using the same method as that reported by Gammon et al. Our X-ray diffraction measurements indicated that the obtained sample contained three substances: the FCC-1 phase (Fm-3m) with a lattice parameter a=5.03 Å, the FCC-2 phase (Fm-3m) with a lattice parameter a=4.755 Å and Lu metal. These two FCC phases are identical to those reported in the so-called near-ambient superconductor. However, we found that the samples had no evidence of superconductivity, through our resistance measurements in the temperature range of 300 - 4 K and pressure range of 0.9 - 3.4 GPa, and our magnetic susceptibility measurements in the pressure range of 0.8 - 3.3 GPa and temperature down to 100 K. We also used a laser heating technique to heat the sample to 1800°C and found no superconductivity in the produced dark blue samples below 6.5 GPa. In addition, the color of both samples remains dark blue in the pressure range investigated.

Submitted 6 April, 2023; originally announced April 2023.

Comments: 13 pages and 4 figures

Journal ref: Matter and Radiation at Extremes 8 (2023) 048001

arXiv:2302.12097 [pdf, other]

Phase transitions in 2D multistable mechanical metamaterials via collisions of soliton-like pulses

Authors: Weijian Jiao, Hang Shu, Vincent Tournat, Hiromi Yasuda, Jordan R. Raney

Abstract: In this work, we report observations of phase transitions in 2D multistable mechanical metamaterials that are initiated by collisions of soliton-like pulses in the metamaterial. Analogous to first-order phase transitions in crystalline solids, we experimentally and numerically observe that the multistable metamaterials support phase transitions if the new phase meets or exceeds a critical nucleus size. If this criterion is met, the new phase subsequently propagates in the form of transition waves, converting the rest of the metamaterial to the new phase. More interestingly, we observe that the critical nucleus can be formed via collisions of soliton-like pulses. Moreover, the rich direction-dependent behavior of the nonlinear pulses enables control of the location of nucleation and the spatio-temporal shape of the growing phase.

Submitted 23 February, 2023; originally announced February 2023.

arXiv:2302.11231 [pdf, other]

Drugs Resistance Analysis from Scarce Health Records via Multi-task Graph Representation

Authors: Honglin Shu, Pei Gao, Lingwei Zhu, Zheng Chen

Abstract: Clinicians prescribe antibiotics by looking at the patient's health record with an experienced eye. However, the therapy might be rendered futile if the patient has drug resistance. Determining drug resistance requires time-consuming laboratory-level testing, while applying clinicians' heuristics in an automated way is difficult due to the categorical or binary medical events that constitute health records. In this paper, we propose a novel framework for rapid clinical intervention by viewing health records as graphs whose nodes are mapped from medical events and whose edges represent correspondence between events within a given time window. A novel graph-based model is then proposed to extract informative features and yield automated drug resistance analysis from those high-dimensional and scarce graphs. The proposed method integrates multi-task learning into a common feature extracting graph encoder for simultaneous analyses of multiple drugs as well as stabilizing learning. On a massive dataset comprising over 110,000 patients with urinary tract infections, we verify that the proposed method is capable of attaining superior performance on the drug resistance prediction problem. Furthermore, automated drug recommendations resembling laboratory-level testing can also be made based on the model's resistance analysis.

Submitted 8 March, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

Comments: 12 pages, 5 figures

arXiv:2302.09961 [pdf, other]

Transverse-Momentum-Dependent Wave Functions of Pion from Lattice QCD

Authors: Min-Huan Chu, Jin-Chen He, Jun Hua, Jian Liang, Xiangdong Ji, Andreas Schafer, Hai-Tao Shu, Yushan Su, Ji-Hao Wang, Wei Wang, Yi-Bo Yang, Jun Zeng, Jian-Hui Zhang, Qi-An Zhang

Abstract: We present a first lattice QCD calculation of the transverse-momentum-dependent wave functions (TMDWFs) of the pion using large-momentum effective theory. Numerical simulations are based on one ensemble with 2+1+1 flavors of highly improved staggered quarks action with lattice spacing $a=0.121$~fm from the MILC Collaboration, and one with 2+1 flavor clover fermions and tree-level Symanzik gauge action generated by the CLS Collaboration with $a=0.098$~fm. As a key ingredient, the soft function is first obtained by incorporating the one-loop perturbative contributions and a proper normalization. Based on this and the equal-time quasi-TMDWFs simulated on the lattice, we extract the light-cone TMDWFs. The results are comparable between the two lattice ensembles and a comparison with phenomenological parametrization is made. Our studies provide a first attempt at an $ab$ $initio$ calculation of TMDWFs which will eventually lead to crucial theory inputs for making predictions for exclusive processes under QCD factorization.

Submitted 20 February, 2023; originally announced February 2023.

arXiv:2302.08501 [pdf, other]

doi 10.1103/PhysRevLett.130.231902

Heavy Quark Diffusion from 2+1 Flavor Lattice QCD with 320 MeV Pion Mass

Authors: Luis Altenkort, Olaf Kaczmarek, Rasmus Larsen, Swagato Mukherjee, Peter Petreczky, Hai-Tao Shu, Simon Stendebach

Abstract: We present the first calculations of the heavy flavor diffusion coefficient using lattice QCD with light dynamical quarks. For temperatures $195\,\mathrm{MeV}<T<352\,\mathrm{MeV}$, the heavy quark spatial diffusion coefficient is found to be significantly smaller than previous quenched lattice QCD and recent phenomenological estimates. The result implies very fast hydrodynamization of heavy quarks in the quark-gluon plasma created during ultrarelativistic heavy-ion collision experiments.

Submitted 12 July, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

Journal ref: Phys. Rev. Lett. 130, 231902 (2023)

arXiv:2302.06502 [pdf, other]

Universality of the Collins-Soper kernel in lattice calculations

Authors: Hai-Tao Shu, Maximilian Schlemmer, Tobias Sizmann, Alexey Vladimirov, Lisa Walter, Michael Engelhardt, Andreas Schäfer, Yi-Bo Yang

Abstract: The Collins-Soper (CS) kernel is a nonperturbative function that characterizes the rapidity evolution of transverse-momentum-dependent parton distribution functions (TMDPDFs) and wave functions. In this Letter, we calculate the CS kernel for pion and proton targets and for quasi-TMDPDFs of leading and next-to-leading power. The calculations are carried out on the CLS ensemble H101 with dynamical $N_f=2+1$ clover-improved Wilson fermions. Our analyses demonstrate the consistency of different lattice extractions of the CS kernel for mesons and baryons, as well as for twist-two and twist-three operators, even though lattice artifacts could be significant. This consistency corroborates the universality of the lattice-determined CS kernel and suggests that a high-precision determination of it is in reach.

Submitted 31 October, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: 10 pages, 7 figures, published version

Journal ref: 10.1103/PhysRevD.108.074519 (2023)

Showing 1–50 of 213 results for author: Shu, H