Skip to main content

Showing 1–50 of 202 results for author: Peng, T

  1. arXiv:2409.08153  [pdf, other

    eess.AS

    Dark Experience for Incremental Keyword Spotting

    Authors: Tianyi Peng, Yang Xiao

    Abstract: Spoken keyword spotting (KWS) is crucial for identifying keywords within audio inputs and is widely used in applications like Apple Siri and Google Home, particularly on edge devices. Current deep learning-based KWS systems, which are typically trained on a limited set of keywords, can suffer from performance degradation when encountering new domains, a challenge often addressed through few-shot f… ▽ More

    Submitted 12 September, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: submitted ICASSP 2025

  2. arXiv:2409.00369   

    cs.CL

    An Empirical Study on Information Extraction using Large Language Models

    Authors: Ridong Han, Chaohao Yang, Tao Peng, Prayag Tiwari, Xiang Wan, Lu Liu, Benyou Wang

    Abstract: Human-like large language models (LLMs), especially the most powerful and popular ones in OpenAI's GPT family, have proven to be very helpful for many natural language processing (NLP) related tasks. Therefore, various attempts have been made to apply LLMs to information extraction (IE), which is a fundamental NLP task that involves extracting information from unstructured plain text. To demonstra… ▽ More

    Submitted 9 September, 2024; v1 submitted 31 August, 2024; originally announced September 2024.

    Comments: This submission was intended instead as the replacement of arXiv:2305.14450 , where it now appears as arXiv:2305.14450v2

  3. arXiv:2408.15032  [pdf, other

    cs.CV cs.AI

    Mamba2MIL: State Space Duality Based Multiple Instance Learning for Computational Pathology

    Authors: Yuqi Zhang, Xiaoqian Zhang, Jiakai Wang, Yuancheng Yang, Taiying Peng, Chao Tong

    Abstract: Computational pathology (CPath) has significantly advanced the clinical practice of pathology. Despite the progress made, Multiple Instance Learning (MIL), a promising paradigm within CPath, continues to face challenges, particularly related to incomplete information utilization. Existing frameworks, such as those based on Convolutional Neural Networks (CNNs), attention, and selective scan space s… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  4. arXiv:2408.08121  [pdf

    eess.SY

    Optimizing Highway Ramp Merge Safety and Efficiency via Spatio-Temporal Cooperative Control and Vehicle-Road Coordination

    Authors: Ting Peng, Xiaoxue Xu, Yuan Li, Jie Wu, Tao Li, Xiang Dong, Yincai Cai, Peng Wu

    Abstract: In view of existing automatic driving, it is difficult to accurately and timely obtain the status and driving intention of other vehicles. The safety risk and urgency of autonomous vehicles in the absence of collision are evaluated. To ensure safety and improve road efficiency, a method of pre-compiling the spatio-temporal trajectory of vehicles is established to eliminate conflicts between vehicl… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  5. arXiv:2408.05151  [pdf

    cs.LG cs.AI eess.SP

    Meta-Learning Guided Label Noise Distillation for Robust Signal Modulation Classification

    Authors: Xiaoyang Hao, Zhixi Feng, Tongqing Peng, Shuyuan Yang

    Abstract: Automatic modulation classification (AMC) is an effective way to deal with physical layer threats of the internet of things (IoT). However, there is often label mislabeling in practice, which significantly impacts the performance and robustness of deep neural networks (DNNs). In this paper, we propose a meta-learning guided label noise distillation method for robust AMC. Specifically, a teacher-st… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 8 pages, 7 figures

    ACM Class: I.2; C.2

  6. arXiv:2408.00799  [pdf, other

    cs.IR cs.LG stat.ML

    Deep Uncertainty-Based Explore for Index Construction and Retrieval in Recommendation System

    Authors: Xin Jiang, Kaiqiang Wang, Yinlong Wang, Fengchang Lv, Taiyang Peng, Shuai Yang, Xianteng Wu, Pengye Zhang, Shuo Yuan, Yifan Zeng

    Abstract: In recommendation systems, the relevance and novelty of the final results are selected through a cascade system of Matching -> Ranking -> Strategy. The matching model serves as the starting point of the pipeline and determines the upper bound of the subsequent stages. Balancing the relevance and novelty of matching results is a crucial step in the design and optimization of recommendation systems,… ▽ More

    Submitted 5 August, 2024; v1 submitted 21 July, 2024; originally announced August 2024.

    Comments: accepted by cikm2024

  7. arXiv:2407.19867  [pdf

    eess.SY

    Design and Testing for Steel Support Axial Force Servo System

    Authors: Sana Ullah, Yonghong Zhou, Maokai Lai, Xiang Dong, Tao Li, Xiaoxue Xu, Yuan Li, Ting Peng

    Abstract: Foundation excavations are deepening, expanding, and approaching structures. Steel supports measure and manage axial force. The study regulates steel support structure power during deep excavation using a novel axial force management system for safety, efficiency, and structural integrity. Closed-loop control changes actuator output to maintain axial force based on force. In deep excavation, the s… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 6 pages,7 figures, 1 table, 2 graph, conference paper

  8. arXiv:2407.12511  [pdf, other

    cs.CV

    Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations

    Authors: Tomáš Chobola, Yu Liu, Hanyi Zhang, Julia A. Schnabel, Tingying Peng

    Abstract: Current deep learning-based low-light image enhancement methods often struggle with high-resolution images, and fail to meet the practical demands of visual perception across diverse and unseen scenarios. In this paper, we introduce a novel approach termed CoLIE, which redefines the enhancement process through mapping the 2D coordinates of an underexposed image to its illumination component, condi… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV24

  9. arXiv:2407.09328  [pdf

    physics.optics

    > 2π Phase Modulation using Exciton-Polaritons in a Two-Dimensional Superlattice

    Authors: Jason Lynch, Pawan Kumar, Chen Chen, Nicholas Trainor, Shalini Kumari, Tzu-Yu Peng, Cindy Yueli Chen, Yu-Jung Lu, Joan Redwing, Deep Jariwala

    Abstract: Active metamaterials promise to enable arbitrary, temporal control over the propagation of wavefronts of light for applications such as beam steering, optical communication modulators, and holograms. This has been done in the past using patterned silicon photonics to locally control the phase of light such that the metasurface acts as a large number of wavelets. Although phase modulation only requ… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  10. arXiv:2407.06612  [pdf

    eess.IV cs.CV cs.LG

    AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review

    Authors: Rui Jin, Derun Li, Dehui Xiang, Lei Zhang, Hailing Zhou, Fei Shi, Weifang Zhu, Jing Cai, Tao Peng, Xinjian Chen

    Abstract: Prostate cancer represents a major threat to health. Early detection is vital in reducing the mortality rate among prostate cancer patients. One approach involves using multi-modality (CT, MRI, US, etc.) computer-aided diagnosis (CAD) systems for the prostate region. However, prostate segmentation is challenging due to imperfections in the images and the prostate's complex tissue structure. The ad… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  11. arXiv:2407.03671  [pdf

    eess.SY

    Spatio-temporal cooperative control Method of Highway Ramp Merge Based on Vehicle-road Coordination

    Authors: Xiaoxue Xu, Maokai Lai, Haitao Zhang, Xiang Dong, Tao Li, Jie Wu, Yuan Li, Ting Peng

    Abstract: The merging area of highway ramps faces multiple challenges, including traffic congestion, collision risks, speed mismatches, driver behavior uncertainties, limited visibility, and bottleneck effects. However, autonomous vehicles engaging in depth coordination between vehicle and road in merging zones, by pre-planning and uploading travel trajectories, can significantly enhance the safety and effi… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  12. arXiv:2407.02282  [pdf, other

    cs.RO

    Learning Bipedal Walking on a Quadruped Robot via Adversarial Motion Priors

    Authors: Tianhu Peng, Lingfan Bao, Joseph Humphreys, Andromachi Maria Delfaki, Dimitrios Kanoulas, Chengxu Zhou

    Abstract: Previous studies have successfully demonstrated agile and robust locomotion in challenging terrains for quadrupedal robots. However, the bipedal locomotion mode for quadruped robots remains unverified. This paper explores the adaptation of a learning framework originally designed for quadrupedal robots to operate blind locomotion in biped mode. We leverage a framework that incorporates Adversarial… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 7 pages,5 figures

  13. arXiv:2406.13285  [pdf, ps, other

    math.AP math.CV

    The extremal problem for weighted combined energy and $ρ-$Nitsche type inequality

    Authors: Ting Peng, Chaochuan Wang, Xiaogao Feng

    Abstract: Let $A_1$ and $A_2$ be two circular annuli and let $ρ$ be a radial metric defined in the annuli $A_2$. We study the existence and uniqueness of the extremal problem for weighted combined energy between $A_1$ and $A_2$, and obtain that the extremal mapping is a certain radial mapping. In fact, this extremal mapping generalizes the $ρ-$harmonic mapping and satisfies equation (2.7) obtained by mean o… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 14 pages

    MSC Class: 30C70

  14. arXiv:2406.01939  [pdf, other

    cs.AI cs.DC cs.LG

    Speeding up Policy Simulation in Supply Chain RL

    Authors: Vivek Farias, Joren Gijsbrechts, Aryan Khojandi, Tianyi Peng, Andrew Zheng

    Abstract: Simulating a single trajectory of a dynamical system under some state-dependent policy is a core bottleneck in policy optimization algorithms. The many inherently serial policy evaluations that must be performed in a single simulation constitute the bulk of this bottleneck. To wit, in applying policy optimization to supply chain optimization (SCO) problems, simulating a single month of a supply ch… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2405.08621  [pdf, other

    eess.IV cs.CV

    RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content

    Authors: Tianhao Peng, Chen Feng, Duolikun Danier, Fan Zhang, Benoit Vallade, Alex Mackin, David Bull

    Abstract: With recent advances in deep learning, numerous algorithms have been developed to enhance video quality, reduce visual artifacts, and improve perceptual quality. However, little research has been reported on the quality assessment of enhanced content - the evaluation of enhancement methods is often based on quality metrics that were designed for compression applications. In this paper, we propose… ▽ More

    Submitted 5 September, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted by the ECCV 2024 AIM Advances in Image Manipulation workshop

  16. arXiv:2405.01872  [pdf, other

    cs.CV

    Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition

    Authors: Yichun Tai, Kun Yang, Tao Peng, Zhenzhen Huang, Zhijiang Zhang

    Abstract: The task of steel surface defect recognition is an industrial problem with great industry values. The data insufficiency is the major challenge in training a robust defect recognition network. Existing methods have investigated to enlarge the dataset by generating samples with generative models. However, their generation quality is still limited by the insufficiency of defect image samples. To thi… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  17. arXiv:2404.19368  [pdf, other

    cs.SE

    Exploring Multi-Lingual Bias of Large Code Models in Code Generation

    Authors: Chaozheng Wang, Zongjie Li, Cuiyun Gao, Wenxuan Wang, Ting Peng, Hailiang Huang, Yuetang Deng, Shuai Wang, Michael R. Lyu

    Abstract: Code generation aims to synthesize code and fulfill functional requirements based on natural language (NL) specifications, which can greatly improve development efficiency. In the era of large language models (LLMs), large code models (LCMs) have been recently proposed to generate source code. LCMs can generate highly feasible solutions for programming problems described in natural language. Despi… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 12 pages

  18. arXiv:2404.17070  [pdf, other

    cs.RO

    Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey

    Authors: Lingfan Bao, Joseph Humphreys, Tianhu Peng, Chengxu Zhou

    Abstract: Bipedal robots are garnering increasing global attention due to their potential applications and advancements in artificial intelligence, particularly in Deep Reinforcement Learning (DRL). While DRL has driven significant progress in bipedal locomotion, developing a comprehensive and unified framework capable of adeptly performing a wide range of tasks remains a challenge. This survey systematical… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures

  19. arXiv:2404.05022  [pdf, other

    cs.CV cs.LG

    DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology

    Authors: Valentin Koch, Sophia J. Wagner, Salome Kazeminia, Ece Sancar, Matthias Hehr, Julia Schnabel, Tingying Peng, Carsten Marr

    Abstract: In hematology, computational models offer significant potential to improve diagnostic accuracy, streamline workflows, and reduce the tedious work of analyzing single cells in peripheral blood or bone marrow smears. However, clinical adoption of computational models has been hampered by the lack of generalization due to large batch effects, small dataset sizes, and poor performance in transfer lear… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  20. arXiv:2403.19763  [pdf, other

    cs.SD cs.HC cs.MM eess.AS

    Creating Aesthetic Sonifications on the Web with SIREN

    Authors: Tristan Peng, Hongchan Choi, Jonathan Berger

    Abstract: SIREN is a flexible, extensible, and customizable web-based general-purpose interface for auditory data display (sonification). Designed as a digital audio workstation for sonification, synthesizers written in JavaScript using the Web Audio API facilitate intuitive mapping of data to auditory parameters for a wide range of purposes. This paper explores the breadth of sound synthesis techniques s… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 7 pages, 1 figure, 5 listings, submitted to the Web Audio Conference 2024

  21. arXiv:2403.07720  [pdf, other

    cs.CV cs.AI

    Multi-modal Auto-regressive Modeling via Visual Words

    Authors: Tianshuo Peng, Zuchao Li, Lefei Zhang, Hai Zhao, Ping Wang, Bo Du

    Abstract: Large Language Models (LLMs), benefiting from the auto-regressive modelling approach performed on massive unannotated texts corpora, demonstrates powerful perceptual and reasoning capabilities. However, as for extending auto-regressive modelling to multi-modal scenarios to build Large Multi-modal Models (LMMs), there lies a great difficulty that the image information is processed in the LMM as con… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  22. Structural disorder-induced topological phase transitions in quasicrystals

    Authors: Tan Peng, Yong-Chen Xiong, Chun-Bo Hua, Zheng-Rong Liu, Xiaolu Zhu, Wei Cao, Fang Lv, Yue Hou, Bin Zhou, Ziyu Wang, Rui Xiong

    Abstract: Recently, the structural disorder-induced topological phase transitions in periodic systems have attracted much attention. However, in aperiodic systems such as quasicrystalline systems, the interplay between structural disorder and band topology is still unclear. In this work, we investigate the effects of structural disorder on a quantum spin Hall insulator phase and a higher-order topological p… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 9 pages,7 figures. arXiv admin note: text overlap with arXiv:2108.04971

    Journal ref: Phys. Rev. B 109, 195301 (2024)

  23. How higher charmonia shape the puzzling data of the $e^+e^-\to ηJ/ψ$ cross section

    Authors: Tian-Cai Peng, Zi-Yue Bai, Jun-Zhang Wang, Xiang Liu

    Abstract: Recently, the BESIII collaboration performed a precise measurement of the $e^+e^-\to ηJ/ψ$ cross section. It is puzzling that the resonance parameters of the reported $Y(4230)$ show a substantial divergence from the previously measured results in both the open-charmed and hidden-charmed decay channels, and the line shape asymmetry of the data approaching 4.2 GeV also suggests that it might be diff… ▽ More

    Submitted 30 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 4 figures and 4 tables. Accepted by Phys. Rev. D. More discussions were added

    Journal ref: Phys. Rev. D 109, 094048 (2024)

  24. arXiv:2402.18189  [pdf, other

    cs.CR

    VulMCI : Code Splicing-based Pixel-row Oversampling for More Continuous Vulnerability Image Generation

    Authors: Tao Peng, Ling Gui, Yi Sun

    Abstract: In recent years, the rapid development of deep learning technology has brought new prospects to the field of vulnerability detection. Many vulnerability detection methods involve converting source code into images for detection, yet they often overlook the quality of the generated images. Due to the fact that vulnerability images lack clear and continuous contours, unlike images used in object det… ▽ More

    Submitted 16 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  25. arXiv:2402.12620  [pdf, other

    cs.CY

    Are Large Language Models (LLMs) Good Social Predictors?

    Authors: Kaiqi Yang, Hang Li, Hongzhi Wen, Tai-Quan Peng, Jiliang Tang, Hui Liu

    Abstract: The prediction has served as a crucial scientific method in modern social studies. With the recent advancement of Large Language Models (LLMs), efforts have been made to leverage LLMs to predict the human features in social life, such as presidential voting. These works suggest that LLMs are capable of generating human-like responses. However, we find that the promising performance achieved by pre… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  26. arXiv:2401.09948  [pdf, ps, other

    math.CV

    The extremal problem for weighted combined energy and the generalization of Nitsche inequality

    Authors: Xiaogao Feng, Ruyue Tang, Ting Peng

    Abstract: We consider the existence and uniqueness of a minimizer of the extremal problem for weighted combined energy between two concentric annuli and obtain that the extremal mapping is a certain radial mapping. Meanwhile, this in turn implies a Nitsche type phenomenon and we get a $\frac{1}{|w|^λ}-$Nitsche type inequality ($λ\neq1$). As an application, on the basis of the relationship between weighted c… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 15 pages

    MSC Class: 30C62

  27. arXiv:2401.08868  [pdf, other

    cs.CV

    B-Cos Aligned Transformers Learn Human-Interpretable Features

    Authors: Manuel Tran, Amal Lahiani, Yashin Dicente Cid, Melanie Boxberg, Peter Lienemann, Christian Matek, Sophia J. Wagner, Fabian J. Theis, Eldad Klaiman, Tingying Peng

    Abstract: Vision Transformers (ViTs) and Swin Transformers (Swin) are currently state-of-the-art in computational pathology. However, domain experts are still reluctant to use these models due to their lack of interpretability. This is not surprising, as critical decisions need to be transparent and understandable. The most common approach to understanding transformers is to visualize their attention. Howev… ▽ More

    Submitted 18 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at MICCAI 2023 (oral). Camera-ready available at https://doi.org/10.1007/978-3-031-43993-3_50

  28. arXiv:2401.04720  [pdf, other

    cs.CV

    Low-resource finetuning of foundation models beats state-of-the-art in histopathology

    Authors: Benedikt Roth, Valentin Koch, Sophia J. Wagner, Julia A. Schnabel, Carsten Marr, Tingying Peng

    Abstract: To handle the large scale of whole slide images in computational pathology, most approaches first tessellate the images into smaller patches, extract features from these patches, and finally aggregate the feature vectors with weakly-supervised learning. The performance of this workflow strongly depends on the quality of the extracted features. Recently, foundation models in computer vision showed… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  29. arXiv:2401.02669  [pdf, other

    cs.DC cs.AR

    Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache

    Authors: Bin Lin, Chen Zhang, Tao Peng, Hanyu Zhao, Wencong Xiao, Minmin Sun, Anmin Liu, Zhipeng Zhang, Lanbo Li, Xiafei Qiu, Shen Li, Zhigang Ji, Tao Xie, Yong Li, Wei Lin

    Abstract: Large Language Models (LLMs) demonstrate substantial potential across a diverse array of domains via request serving. However, as trends continue to push for expanding context sizes, the autoregressive nature of LLMs results in highly dynamic behavior of the attention layers, showcasing significant differences in computational characteristics and memory requirements from the non-attention layers.… ▽ More

    Submitted 4 July, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  30. arXiv:2312.12917  [pdf, other

    cs.CV cs.AI

    Sign Language Production with Latent Motion Transformer

    Authors: Pan Xie, Taiyi Peng, Yao Du, Qipeng Zhang

    Abstract: Sign Language Production (SLP) is the tough task of turning sign language into sign videos. The main goal of SLP is to create these videos using a sign gloss. In this research, we've developed a new method to make high-quality sign videos without using human poses as a middle step. Our model works in two main parts: first, it learns from a generator and the video's hidden features, and next, it us… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by WACV2024

  31. arXiv:2312.09708  [pdf, other

    cs.LG cs.AI

    GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with Relative Entropy

    Authors: Tianhao Peng, Wenjun Wu, Haitao Yuan, Zhifeng Bao, Zhao Pengrui, Xin Yu, Xuetao Lin, Yu Liang, Yanjun Pu

    Abstract: Graph neural networks (GNNs) have shown advantages in graph-based analysis tasks. However, most existing methods have the homogeneity assumption and show poor performance on heterophilic graphs, where the linked nodes have dissimilar features and different class labels, and the semantically related nodes might be multi-hop away. To address this limitation, this paper presents GraphRARE, a general… ▽ More

    Submitted 13 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures

  32. arXiv:2312.08084  [pdf, other

    cs.AI

    A Novel Energy based Model Mechanism for Multi-modal Aspect-Based Sentiment Analysis

    Authors: Tianshuo Peng, Zuchao Li, Ping Wang, Lefei Zhang, Hai Zhao

    Abstract: Multi-modal aspect-based sentiment analysis (MABSA) has recently attracted increasing attention. The span-based extraction methods, such as FSUIE, demonstrate strong performance in sentiment analysis due to their joint modeling of input sequences and target labels. However, previous methods still have certain limitations: (i) They ignore the difference in the focus of visual information between di… ▽ More

    Submitted 15 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: AAAI2024

  33. Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation

    Authors: Tianhao Peng, Ge Gao, Heming Sun, Fan Zhang, David Bull

    Abstract: In recent years, end-to-end learnt video codecs have demonstrated their potential to compete with conventional coding algorithms in term of compression efficiency. However, most learning-based video compression models are associated with high computational complexity and latency, in particular at the decoder side, which limits their deployment in practical applications. In this paper, we present a… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Report number: 2312.02605

  34. arXiv:2311.13117  [pdf

    physics.app-ph physics.chem-ph

    Bile dynamics within the biliary tract and microfluidic-based bile component detection: A review

    Authors: Tao Peng, Chenxiao Zhou, Zhexin Zhang, Yingying Liu, Xiaodong Lin, Yongqing, Yunlong Zhong, Ping Wang, Yanwei Jia

    Abstract: Bilestones are solid masses found in the gallbladder or biliary tract, which block the normal bile flow and eventually result in severe life-threatening complications. Studies have shown that bilestone formation may be related to bile flow dynamics and the concentration level of bile components. The bile flow dynamics in the biliary tract play a critical role in disclosing the mechanism of bile st… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  35. arXiv:2311.12461  [pdf, other

    eess.IV cs.CV

    HiFi-Syn: Hierarchical Granularity Discrimination for High-Fidelity Synthesis of MR Images with Structure Preservation

    Authors: Ziqi Yu, Botao Zhao, Shengjie Zhang, Xiang Chen, Jianfeng Feng, Tingying Peng, Xiao-Yong Zhang

    Abstract: Synthesizing medical images while preserving their structural information is crucial in medical research. In such scenarios, the preservation of anatomical content becomes especially important. Although recent advances have been made by incorporating instance-level information to guide translation, these methods overlook the spatial coherence of structural-level representation and the anatomical i… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  36. Can Large Language Models Capture Public Opinion about Global Warming? An Empirical Assessment of Algorithmic Fidelity and Bias

    Authors: S. Lee, T. Q. Peng, M. H. Goldberg, S. A. Rosenthal, J. E. Kotcher, E. W. Maibach, A. Leiserowitz

    Abstract: Large language models (LLMs) have demonstrated their potential in social science research by emulating human perceptions and behaviors, a concept referred to as algorithmic fidelity. This study assesses the algorithmic fidelity and bias of LLMs by utilizing two nationally representative climate change surveys. The LLMs were conditioned on demographics and/or psychological covariates to simulate su… ▽ More

    Submitted 7 February, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 34 pages, 6 figures, 1 table

    Journal ref: PLOS Climate, 3(2024), e0000429

  37. arXiv:2310.05663  [pdf, ps, other

    cond-mat.mes-hall cond-mat.supr-con

    Secondary proximity effect in a side-coupled double quantum dot structure

    Authors: Jia-Ning Wang, Yong-Chen Xiong, Wang-Huai Zhou, Tan Peng, Ziyu Wang

    Abstract: Semiconductor quantum dots in close proximity to superconductors may provoke localized bound states within the superconducting energy gap known as Yu-Shiba-Rusinov (YSR) state, which is a promising candidate for constructing Majorana zero modes and topological qubits. Side-coupled double quantum dot systems are ideal platforms revealing the secondary proximity effect. Numerical renormalization gro… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  38. arXiv:2310.04153  [pdf, other

    math.HO physics.data-an stat.OT

    Fair coins tend to land on the same side they started: Evidence from 350,757 flips

    Authors: František Bartoš, Alexandra Sarafoglou, Henrik R. Godmann, Amir Sahrani, David Klein Leunk, Pierre Y. Gui, David Voss, Kaleem Ullah, Malte J. Zoubek, Franziska Nippold, Frederik Aust, Felipe F. Vieira, Chris-Gabriel Islam, Anton J. Zoubek, Sara Shabani, Jonas Petter, Ingeborg B. Roos, Adam Finnemann, Aaron B. Lob, Madlen F. Hoffstadt, Jason Nak, Jill de Ron, Koen Derks, Karoline Huth, Sjoerd Terpstra , et al. (25 additional authors not shown)

    Abstract: Many people have flipped coins but few have stopped to ponder the statistical and physical intricacies of the process. In a preregistered study we collected $350{,}757$ coin flips to test the counterintuitive prediction from a physics model of human coin tossing developed by Diaconis, Holmes, and Montgomery (DHM; 2007). The model asserts that when people flip an ordinary coin, it tends to land on… ▽ More

    Submitted 2 June, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  39. arXiv:2310.02097  [pdf, other

    cs.CV eess.IV

    Leveraging Classic Deconvolution and Feature Extraction in Zero-Shot Image Restoration

    Authors: Tomáš Chobola, Gesine Müller, Veit Dausmann, Anton Theileis, Jan Taucher, Jan Huisken, Tingying Peng

    Abstract: Non-blind deconvolution aims to restore a sharp image from its blurred counterpart given an obtained kernel. Existing deep neural architectures are often built based on large datasets of sharp ground truth images and trained with supervision. Sharp, high quality ground truth images, however, are not always available, especially for biomedical applications. This severely hampers the applicability o… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  40. arXiv:2309.01865  [pdf, other

    eess.IV cs.AI

    BigFUSE: Global Context-Aware Image Fusion in Dual-View Light-Sheet Fluorescence Microscopy with Image Formation Prior

    Authors: Yu Liu, Gesine Muller, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng

    Abstract: Light-sheet fluorescence microscopy (LSFM), a planar illumination technique that enables high-resolution imaging of samples, experiences defocused image quality caused by light scattering when photons propagate through thick tissues. To circumvent this issue, dualview imaging is helpful. It allows various sections of the specimen to be scanned ideally by viewing the sample from opposing orientatio… ▽ More

    Submitted 3 November, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: paper in MICCAI 2023

  41. arXiv:2308.07708  [pdf, ps, other

    physics.app-ph eess.SP

    A Real-time Non-contact Localization Method for Faulty Electric Energy Storage Components using Highly Sensitive Magnetometers

    Authors: Tonghui Peng, Wei Gao, Ya Wu, Yulong Ma, Shiwu Zhang, Yinan Hu

    Abstract: With the wide application of electric energy storage component arrays, such as battery arrays, capacitor arrays, inductor arrays, their potential safety risks have gradually drawn the public attention. However, existing technologies cannot meet the needs of non-contact and real-time diagnosis for faulty components inside these massive arrays. To solve this problem, this paper proposes a new method… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  42. arXiv:2308.02038  [pdf, other

    cs.CY cs.AI

    CLGT: A Graph Transformer for Student Performance Prediction in Collaborative Learning

    Authors: Tianhao Peng, Yu Liang, Wenjun Wu, Jian Ren, Zhao Pengrui, Yanjun Pu

    Abstract: Modeling and predicting the performance of students in collaborative learning paradigms is an important task. Most of the research presented in literature regarding collaborative learning focuses on the discussion forums and social learning networks. There are only a few works that investigate how students interact with each other in team projects and how such interactions affect their academic pe… ▽ More

    Submitted 30 July, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures, conference: AAAI

  43. CliniDigest: A Case Study in Large Language Model Based Large-Scale Summarization of Clinical Trial Descriptions

    Authors: Renee D. White, Tristan Peng, Pann Sripitak, Alexander Rosenberg Johansen, Michael Snyder

    Abstract: A clinical trial is a study that evaluates new biomedical interventions. To design new trials, researchers draw inspiration from those current and completed. In 2022, there were on average more than 100 clinical trials submitted to ClinicalTrials.gov every day, with each trial having a mean of approximately 1500 words [1]. This makes it nearly impossible to keep up to date. To mitigate this issue,… ▽ More

    Submitted 31 July, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures, 3 tables, conference: ACM GoodIt 23'; Second co-author: Tristan Peng; Citation: White, Peng, et al

  44. arXiv:2307.07998  [pdf, other

    cs.CV cs.LG

    LUCYD: A Feature-Driven Richardson-Lucy Deconvolution Network

    Authors: Tomáš Chobola, Gesine Müller, Veit Dausmann, Anton Theileis, Jan Taucher, Jan Huisken, Tingying Peng

    Abstract: The process of acquiring microscopic images in life sciences often results in image degradation and corruption, characterised by the presence of noise and blur, which poses significant challenges in accurately analysing and interpreting the obtained data. This paper proposes LUCYD, a novel method for the restoration of volumetric microscopy images that combines the Richardson-Lucy deconvolution fo… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted by 26th International Conference on Medical Image Computing and Computer Assisted Intervention

  45. arXiv:2306.17088  [pdf, other

    math.OC physics.optics

    Pupil-driven quantitative differential phase contrast imaging

    Authors: Shuhe Zhang, Hao Wu, Tao Peng, Zeyu Ke, Meng Shao, Tos T. J. M. Berendschot, Jinhua Zhou

    Abstract: In this research, we reveal the inborn but hitherto ignored properties of quantitative differential phase contrast (qDPC) imaging: the phase transfer function being an edge detection filter. Inspired by this, we highlighted the duality of qDPC between optics and pattern recognition, and propose a simple and effective qDPC reconstruction algorithm, termed Pupil-Driven qDPC (pd-qDPC), to facilitate… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  46. arXiv:2306.14913  [pdf, other

    cs.CL cs.AI

    FSUIE: A Novel Fuzzy Span Mechanism for Universal Information Extraction

    Authors: Tianshuo Peng, Zuchao Li, Lefei Zhang, Bo Du, Hai Zhao

    Abstract: Universal Information Extraction (UIE) has been introduced as a unified framework for various Information Extraction (IE) tasks and has achieved widespread success. Despite this, UIE models have limitations. For example, they rely heavily on span boundaries in the data during training, which does not reflect the reality of span annotation challenges. Slight adjustments to positions can also meet r… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: ACL2023

  47. arXiv:2306.11368  [pdf, other

    cs.CV

    RoMe: Towards Large Scale Road Surface Reconstruction via Mesh Representation

    Authors: Ruohong Mei, Wei Sui, Jiaxin Zhang, Xue Qin, Gang Wang, Tao Peng, Cong Yang

    Abstract: In autonomous driving applications, accurate and efficient road surface reconstruction is paramount. This paper introduces RoMe, a novel framework designed for the robust reconstruction of large-scale road surfaces. Leveraging a unique mesh representation, RoMe ensures that the reconstructed road surfaces are accurate and seamlessly aligned with semantics. To address challenges in computational ef… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Published in: IEEE Transactions on Intelligent Vehicles

  48. arXiv:2305.14450  [pdf, other

    cs.CL

    An Empirical Study on Information Extraction using Large Language Models

    Authors: Ridong Han, Chaohao Yang, Tao Peng, Prayag Tiwari, Xiang Wan, Lu Liu, Benyou Wang

    Abstract: Human-like large language models (LLMs), especially the most powerful and popular ones in OpenAI's GPT family, have proven to be very helpful for many natural language processing (NLP) related tasks. Therefore, various attempts have been made to apply LLMs to information extraction (IE), which is a fundamental NLP task that involves extracting information from unstructured plain text. To demonstra… ▽ More

    Submitted 10 September, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 52 pages, Version 2.0; This article has an original arxiv version entitled "Is Information Extraction Solved by ChatGPT? An Analysis of Performance, Evaluation Criteria, Robustness and Errors'', whose url link is arXiv:2305.14450v1

  49. arXiv:2305.14243  [pdf, other

    cs.AI cs.CV

    Training Transitive and Commutative Multimodal Transformers with LoReTTa

    Authors: Manuel Tran, Yashin Dicente Cid, Amal Lahiani, Fabian J. Theis, Tingying Peng, Eldad Klaiman

    Abstract: Training multimodal foundation models is challenging due to the limited availability of multimodal datasets. While many public datasets pair images with text, few combine images with audio or text with audio. Even rarer are datasets that align all three modalities at once. Critical domains such as healthcare, infrastructure, or transportation are particularly affected by missing modalities. This m… ▽ More

    Submitted 16 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023 (poster). Camera-ready version

  50. arXiv:2305.02542  [pdf, other

    stat.ME cs.LG stat.AP stat.ML

    Correcting for Interference in Experiments: A Case Study at Douyin

    Authors: Vivek F. Farias, Hao Li, Tianyi Peng, Xinyuyang Ren, Huawei Zhang, Andrew Zheng

    Abstract: Interference is a ubiquitous problem in experiments conducted on two-sided content marketplaces, such as Douyin (China's analog of TikTok). In many cases, creators are the natural unit of experimentation, but creators interfere with each other through competition for viewers' limited time and attention. "Naive" estimators currently used in practice simply ignore the interference, but in doing so i… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.