subscribe to arXiv mailings

Insights from the exact analytical solution of periodically driven transverse field Ising chain

Abstract: We derive an exact analytical expression, at stroboscopic intervals, for the time-dependent wave function of a class of integrable quantum many-body systems, driven by the periodic delta-kick protocol. To investigate long-time dynamics, we use the wave-function to obtain an exact analytical expression for the expectation value of defect density, magnetization, residual energy, fidelity, and correl… ▽ More We derive an exact analytical expression, at stroboscopic intervals, for the time-dependent wave function of a class of integrable quantum many-body systems, driven by the periodic delta-kick protocol. To investigate long-time dynamics, we use the wave-function to obtain an exact analytical expression for the expectation value of defect density, magnetization, residual energy, fidelity, and correlation function after the $n$th drive cycle. Periodically driven integrable closed quantum systems absorb energy, and the long-time universal dynamics are described by the periodic generalized Gibbs ensemble(GGE). We demonstrate that the expectation values of all observables are divided into two parts: one highly oscillatory term that depends on the drive cycle $n$, and the rest of the terms are independent of it. Typically, the $n$-independent part constitutes the saturation at large $n$ and periodic GGE. The contribution from the highly oscillatory term vanishes in large $n$. △ Less

Submitted 13 September, 2024; originally announced September 2024.

Comments: 11 pages, 15 figures

arXiv:2409.08541 [pdf, other]

Role of material-dependent properties in THz field-derivative-torque-induced nonlinear magnetization dynamics

Authors: Arpita Dutta, Pratyay Mukherjee, Swosti P. Sarangi, Somasree Bhattacharjee, Shovon Pal, Ritwik Mondal

Abstract: The traditional Landau-Lifshitz-Gilbert (LLG) equation has often delineated the linear and nonlinear magnetization dynamics, even at ultrashort timescales e.g., femtoseconds. In contrast, several other non-relativistic and relativistic spin torques have been reported as an extension of the LLG spin dynamics. Here, we explore the contribution of the relativistic field-derivative torque (FDT) in the… ▽ More The traditional Landau-Lifshitz-Gilbert (LLG) equation has often delineated the linear and nonlinear magnetization dynamics, even at ultrashort timescales e.g., femtoseconds. In contrast, several other non-relativistic and relativistic spin torques have been reported as an extension of the LLG spin dynamics. Here, we explore the contribution of the relativistic field-derivative torque (FDT) in the nonlinear THz magnetization dynamics response applied to ferrimagnets with high Gilbert damping and exchange magnon frequency. Our findings suggest that the FDT plays a significant role in magnetization dynamics in both linear and nonlinear regimes, bridging the gap between the traditional LLG spin dynamics and experimental observations. We find that the coherent THz magnon excitation amplitude is enhanced with the field-derivative torque. Furthermore, a phase shift in the magnon oscillation is induced by the FDT term. This phase shift is almost 90 for the antiferromagnet, while it is almost zero for the ferrimagnet under our investigation. Analyzing the dual THz excitation and their FDT, we find that the nonlinear signals can not be distinctly observed without the FDT terms. However, the inclusion of the FDT terms produces distinct nonlinear signals which matches extremely well with the previously reported experimental results. △ Less

Submitted 13 September, 2024; originally announced September 2024.

Comments: 6 figures

arXiv:2409.03092 [pdf, other]

Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning

Authors: Amit Dutta, Thinh T. Doan

Abstract: We study local stochastic gradient descent methods for solving federated optimization over a network of agents communicating indirectly through a centralized coordinator. We are interested in the Byzantine setting where there is a subset of $f$ malicious agents that could observe the entire network and send arbitrary values to the coordinator to disrupt the performance of other non-faulty agents.… ▽ More We study local stochastic gradient descent methods for solving federated optimization over a network of agents communicating indirectly through a centralized coordinator. We are interested in the Byzantine setting where there is a subset of $f$ malicious agents that could observe the entire network and send arbitrary values to the coordinator to disrupt the performance of other non-faulty agents. The objective of the non-faulty agents is to collaboratively compute the optimizer of their respective local functions under the presence of Byzantine agents. In this setting, prior works show that the local stochastic gradient descent method can only return an approximate of the desired solutions due to the impacts of Byzantine agents. Whether this method can find an exact solution remains an open question. In this paper, we will address this open question by proposing a new variant of the local stochastic gradient descent method. Under similar conditions that are considered in the existing works, we will show that the proposed method converges exactly to the desired solutions. We will provide theoretical results to characterize the convergence properties of our method, in particular, the proposed method convergences at an optimal rate $\mathcal{O}(1/k)$ in both strongly convex and non-convex settings, where $k$ is the number of iterations. Finally, we will present a number of simulations to illustrate our theoretical results. △ Less

Submitted 4 September, 2024; originally announced September 2024.

arXiv:2408.06458 [pdf, other]

Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models

Authors: Yen-Che Hsiao, Abhishek Dutta

Abstract: We propose a novel in-context learning algorithm for building autonomous decision-making language agents. The language agent continuously attempts to solve the same task by self-correcting each time the task fails. Our selected language agent demonstrates the ability to solve tasks in a text-based game environment. Our results show that the gemma-2-9b-it language model, using our proposed method,… ▽ More We propose a novel in-context learning algorithm for building autonomous decision-making language agents. The language agent continuously attempts to solve the same task by self-correcting each time the task fails. Our selected language agent demonstrates the ability to solve tasks in a text-based game environment. Our results show that the gemma-2-9b-it language model, using our proposed method, can successfully complete two of six tasks that failed in the first attempt. This highlights the effectiveness of our approach in enhancing the problem-solving capabilities of a single language model through self-correction, paving the way for more advanced autonomous agents. The code is publicly available at https://github.com/YenCheHsiao/AutonomousLLMAgentwithAdaptingPlanning. △ Less

Submitted 12 August, 2024; originally announced August 2024.

arXiv:2408.05510 [pdf, other]

Experimental observation of relativistic field-derivative torque in nonlinear THz response of magnetization dynamics

Authors: Arpita Dutta, Christian Tzschaschel, Debankit Priyadarshi, Kouki Mikuni, Takuya Satoh, Ritwik Mondal, Shovon Pal

Abstract: Understanding the complete light-spin interactions in magnetic systems is the key to manipulating the magnetization using optical means at ultrafast timescales. The selective addressing of spins by terahertz (THz) electromagnetic fields via Zeeman torque is, by far, one of the most successful ultrafast means of controlling magnetic excitations. Here we show that this traditional Zeeman torque on t… ▽ More Understanding the complete light-spin interactions in magnetic systems is the key to manipulating the magnetization using optical means at ultrafast timescales. The selective addressing of spins by terahertz (THz) electromagnetic fields via Zeeman torque is, by far, one of the most successful ultrafast means of controlling magnetic excitations. Here we show that this traditional Zeeman torque on the spins is not sufficient, rather an additional relativistic field-derivative torque is essential to realize the observed magnetization dynamics. We accomplish this by exploring the ultrafast nonlinear magnetization dynamics of rare-earth, Bi-doped iron garnet when excited by two co-propagating THz pulses. By non-thermal optical pump-probe technique, we, first, find the collective exchange resonance mode between rare-earth and transition metal sublattices at 0.48 THz. We further explore the magnetization dynamics via a rather direct and efficient THz time-domain spectroscopic means. We find that the observed nonlinear trace of the magnetic response cannot be mapped to the magnetization precession induced by the Zeeman torque, while the Zeeman torque supplemented by an additional field-derivative torque follows the experimental evidences. This breakthrough not only enhances our comprehension of ultra-relativistic effects but also paves the way for the development of novel technologies harnessing light-induced control over magnetic systems. △ Less

Submitted 10 August, 2024; originally announced August 2024.

Comments: 5 figures

arXiv:2408.01408 [pdf, other]

Derivation of Back-propagation for Graph Convolutional Networks using Matrix Calculus and its Application to Explainable Artificial Intelligence

Authors: Yen-Che Hsiao, Rongting Yue, Abhishek Dutta

Abstract: This paper provides a comprehensive and detailed derivation of the backpropagation algorithm for graph convolutional neural networks using matrix calculus. The derivation is extended to include arbitrary element-wise activation functions and an arbitrary number of layers. The study addresses two fundamental problems, namely node classification and link prediction. To validate our method, we compar… ▽ More This paper provides a comprehensive and detailed derivation of the backpropagation algorithm for graph convolutional neural networks using matrix calculus. The derivation is extended to include arbitrary element-wise activation functions and an arbitrary number of layers. The study addresses two fundamental problems, namely node classification and link prediction. To validate our method, we compare it with reverse-mode automatic differentiation. The experimental results demonstrate that the median sum of squared errors of the updated weight matrices, when comparing our method to the approach using reverse-mode automatic differentiation, falls within the range of $10^{-18}$ to $10^{-14}$. These outcomes are obtained from conducting experiments on a five-layer graph convolutional network, applied to a node classification problem on Zachary's karate club social network and a link prediction problem on a drug-drug interaction network. Finally, we show how the derived closed-form solution can facilitate the development of explainable AI and sensitivity analysis. △ Less

Submitted 2 August, 2024; originally announced August 2024.

arXiv:2408.01374 [pdf, other]

Hybrid Coordinate Descent for Efficient Neural Network Learning Using Line Search and Gradient Descent

Authors: Yen-Che Hsiao, Abhishek Dutta

Abstract: This paper presents a novel coordinate descent algorithm leveraging a combination of one-directional line search and gradient information for parameter updates for a squared error loss function. Each parameter undergoes updates determined by either the line search or gradient method, contingent upon whether the modulus of the gradient of the loss with respect to that parameter surpasses a predefin… ▽ More This paper presents a novel coordinate descent algorithm leveraging a combination of one-directional line search and gradient information for parameter updates for a squared error loss function. Each parameter undergoes updates determined by either the line search or gradient method, contingent upon whether the modulus of the gradient of the loss with respect to that parameter surpasses a predefined threshold. Notably, a larger threshold value enhances algorithmic efficiency. Despite the potentially slower nature of the line search method relative to gradient descent, its parallelizability facilitates computational time reduction. Experimental validation conducted on a 2-layer Rectified Linear Unit network with synthetic data elucidates the impact of hyperparameters on convergence rates and computational efficiency. △ Less

Submitted 2 August, 2024; originally announced August 2024.

arXiv:2407.09141 [pdf, other]

Accuracy is Not All You Need

Authors: Abhinav Dutta, Sanjeev Krishnan, Nipun Kwatra, Ramachandran Ramjee

Abstract: When Large Language Models (LLMs) are compressed using techniques such as quantization, the predominant way to demonstrate the validity of such techniques is by measuring the model's accuracy on various benchmarks.If the accuracies of the baseline model and the compressed model are close, it is assumed that there was negligible degradation in quality.However, even when the accuracy of baseline and… ▽ More When Large Language Models (LLMs) are compressed using techniques such as quantization, the predominant way to demonstrate the validity of such techniques is by measuring the model's accuracy on various benchmarks.If the accuracies of the baseline model and the compressed model are close, it is assumed that there was negligible degradation in quality.However, even when the accuracy of baseline and compressed model are similar, we observe the phenomenon of flips, wherein answers change from correct to incorrect and vice versa in proportion.We conduct a detailed study of metrics across multiple compression techniques, models and datasets, demonstrating that the behavior of compressed models as visible to end-users is often significantly different from the baseline model, even when accuracy is similar.We further evaluate compressed models qualitatively and quantitatively using MT-Bench and show that compressed models are significantly worse than baseline models in this free-form generative task.Thus, we argue that compression techniques should also be evaluated using distance metrics.We propose two such metrics, KL-Divergence and flips, and show that they are well correlated. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.03549 [pdf, other]

POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation

Authors: Arindam Dutta, Rohit Lal, Yash Garg, Calvin-Khang Ta, Dripta S. Raychaudhuri, Hannah Dela Cruz, Amit K. Roy-Chowdhury

Abstract: Existing algorithms for human body part segmentation have shown promising results on challenging datasets, primarily relying on end-to-end supervision. However, these algorithms exhibit severe performance drops in the face of domain shifts, leading to inaccurate segmentation masks. To tackle this issue, we introduce POSTURE: \underline{Po}se Guided Un\underline{s}upervised Domain Adap\underline{t}… ▽ More Existing algorithms for human body part segmentation have shown promising results on challenging datasets, primarily relying on end-to-end supervision. However, these algorithms exhibit severe performance drops in the face of domain shifts, leading to inaccurate segmentation masks. To tackle this issue, we introduce POSTURE: \underline{Po}se Guided Un\underline{s}upervised Domain Adap\underline{t}ation for H\underline{u}man Body Pa\underline{r}t S\underline{e}gmentation - an innovative pseudo-labelling approach designed to improve segmentation performance on the unlabeled target data. Distinct from conventional domain adaptive methods for general semantic segmentation, POSTURE stands out by considering the underlying structure of the human body and uses anatomical guidance from pose keypoints to drive the adaptation process. This strong inductive prior translates to impressive performance improvements, averaging 8\% over existing state-of-the-art domain adaptive semantic segmentation methods across three benchmark datasets. Furthermore, the inherent flexibility of our proposed approach facilitates seamless extension to source-free settings (SF-POSTURE), effectively mitigating potential privacy and computational concerns, with negligible drop in performance. △ Less

Submitted 22 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

arXiv:2407.02238 [pdf, other]

MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations

Authors: Akash Dutta, Ali Jannesari

Abstract: One of the primary areas of interest in High Performance Computing is the improvement of performance of parallel workloads. Nowadays, compilable source code-based optimization tasks that employ deep learning often exploit LLVM Intermediate Representations (IRs) for extracting features from source code. Most such works target specific tasks, or are designed with a pre-defined set of heuristics. So… ▽ More One of the primary areas of interest in High Performance Computing is the improvement of performance of parallel workloads. Nowadays, compilable source code-based optimization tasks that employ deep learning often exploit LLVM Intermediate Representations (IRs) for extracting features from source code. Most such works target specific tasks, or are designed with a pre-defined set of heuristics. So far, pre-trained models are rare in this domain, but the possibilities have been widely discussed. Especially approaches mimicking large-language models (LLMs) have been proposed. But these have prohibitively large training costs. In this paper, we propose MIREncoder, a M}ulti-modal IR-based Auto-Encoder that can be pre-trained to generate a learned embedding space to be used for downstream tasks by machine learning-based approaches. A multi-modal approach enables us to better extract features from compilable programs. It allows us to better model code syntax, semantics and structure. For code-based performance optimizations, these features are very important while making optimization decisions. A pre-trained model/embedding implicitly enables the usage of transfer learning, and helps move away from task-specific trained models. Additionally, a pre-trained model used for downstream performance optimization should itself have reduced overhead, and be easily usable. These considerations have led us to propose a modeling approach that i) understands code semantics and structure, ii) enables use of transfer learning, and iii) is small and simple enough to be easily re-purposed or reused even with low resource availability. Our evaluations will show that our proposed approach can outperform the state of the art while reducing overhead. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 12 pages, 6 figures, 9 tables, PACT '24 conference

arXiv:2406.19391 [pdf, other]

Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Authors: Ali Khaleghi Rahimian, Manish Kumar Govind, Subhajit Maity, Dominick Reilly, Christian Kümmerle, Srijan Das, Aritra Dutta

Abstract: Visual perception tasks are predominantly solved by Vision Transformer (ViT) architectures, which, despite their effectiveness, encounter a computational bottleneck due to the quadratic complexity of computing self-attention. This inefficiency is largely due to the self-attention heads capturing redundant token interactions, reflecting inherent redundancy within visual data. Many works have aimed… ▽ More Visual perception tasks are predominantly solved by Vision Transformer (ViT) architectures, which, despite their effectiveness, encounter a computational bottleneck due to the quadratic complexity of computing self-attention. This inefficiency is largely due to the self-attention heads capturing redundant token interactions, reflecting inherent redundancy within visual data. Many works have aimed to reduce the computational complexity of self-attention in ViTs, leading to the development of efficient and sparse transformer architectures. In this paper, viewing through the efficiency lens, we realized that introducing any sparse self-attention strategy in ViTs can keep the computational overhead low. However, these strategies are sub-optimal as they often fail to capture fine-grained visual details. This observation leads us to propose a general, efficient, sparse architecture, named Fibottention, for approximating self-attention with superlinear complexity that is built upon Fibonacci sequences. The key strategies in Fibottention include: it excludes proximate tokens to reduce redundancy, employs structured sparsity by design to decrease computational demands, and incorporates inception-like diversity across attention heads. This diversity ensures the capture of complementary information through non-overlapping token interactions, optimizing both performance and resource utilization in ViTs for visual representation learning. We embed our Fibottention mechanism into multiple state-of-the-art transformer architectures dedicated to visual tasks. Leveraging only 2-6% of the elements in the self-attention heads, Fibottention in conjunction with ViT and its variants, consistently achieves significant performance boosts compared to standard ViTs in nine datasets across three domains $\unicode{x2013}$ image classification, video understanding, and robot learning tasks. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: The code is publicly available at https://github.com/Charlotte-CharMLab/Fibottention

arXiv:2406.13881 [pdf, other]

Static Generation of Efficient OpenMP Offload Data Mappings

Authors: Luke Marzen, Akash Dutta, Ali Jannesari

Abstract: Increasing heterogeneity in HPC architectures and compiler advancements have led to OpenMP being frequently used to enable computations on heterogeneous devices. However, the efficient movement of data on heterogeneous computing platforms is crucial for achieving high utilization. Programmers must explicitly map data between the host and connected accelerator devices to achieve efficient data move… ▽ More Increasing heterogeneity in HPC architectures and compiler advancements have led to OpenMP being frequently used to enable computations on heterogeneous devices. However, the efficient movement of data on heterogeneous computing platforms is crucial for achieving high utilization. Programmers must explicitly map data between the host and connected accelerator devices to achieve efficient data movement. Ensuring efficient data transfer requires programmers to reason about complex data flow. This can be a laborious and error-prone process since the programmer must keep a mental model of data validity and lifetime spanning multiple data environments. We present a static analysis tool, OMPDart (OpenMP Data Reduction Tool), for OpenMP programs that models data dependencies between host and device regions and applies source code transformations to achieve efficient data transfer. Our evaluations on nine HPC benchmarks demonstrate that OMPDart is capable of generating effective data mapping constructs that substantially reduce data transfer between host and device. △ Less

Submitted 7 September, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

Comments: 12 pages, accepted to the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC24)

arXiv:2406.12069 [pdf, other]

Satyrn: A Platform for Analytics Augmented Generation

Authors: Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J. Hammond

Abstract: Large language models (LLMs) are capable of producing documents, and retrieval augmented generation (RAG) has shown itself to be a powerful method for improving accuracy without sacrificing fluency. However, not all information can be retrieved from text. We propose an approach that uses the analysis of structured data to generate fact sets that are used to guide generation in much the same way th… ▽ More Large language models (LLMs) are capable of producing documents, and retrieval augmented generation (RAG) has shown itself to be a powerful method for improving accuracy without sacrificing fluency. However, not all information can be retrieved from text. We propose an approach that uses the analysis of structured data to generate fact sets that are used to guide generation in much the same way that retrieved documents are used in RAG. This analytics augmented generation (AAG) approach supports the ability to utilize standard analytic techniques to generate facts that are then converted to text and passed to an LLM. We present a neurosymbolic platform, Satyrn that leverages AAG to produce accurate, fluent, and coherent reports grounded in large scale databases. In our experiments, we find that Satyrn generates reports in which over 86% accurate claims while maintaining high levels of fluency and coherence, even when using smaller language models such as Mistral-7B, as compared to GPT-4 Code Interpreter in which just 57% of claims are accurate. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.02697 [pdf, other]

doi 10.1142/S0218271824500342

On the role of closed timelike curves and confinement structure around Kerr-Newman singularity

Authors: Ayanendu Dutta, Dhritimalya Roy, Subenoy Chakraborty

Abstract: In this study, the particle motion around the naked singularity and black hole of Kerr-Newman spacetime is investigated with a special attention on the closed timelike orbits. It is found that both in the naked singularity (NS) and in black hole (BH), the singularity is concealed by causality violating regions, and the Cauchy surface consistently resides inside the inner horizon in non-extremal bl… ▽ More In this study, the particle motion around the naked singularity and black hole of Kerr-Newman spacetime is investigated with a special attention on the closed timelike orbits. It is found that both in the naked singularity (NS) and in black hole (BH), the singularity is concealed by causality violating regions, and the Cauchy surface consistently resides inside the inner horizon in non-extremal black holes. For neutral particles and particles with an identical charge to the source, only particles with positive angular momentum are permitted to traverse the closed timelike curves. Conversely, for particles with the opposite charge to the source, the strong Coulomb attraction draws all particles inside the Cauchy surface, allowing them to be present in the closed timelike curves irrespective of their angular momentum. However, in both the NS and BH (both extremal and non-extremal), test particles are confined at a considerable distance from the singular point such that there always exists an empty region surrounding the singularity which prevents particles from interacting with it. The radius of the empty surface that depends on the source parameters and the particle characteristics, is investigated with an accurate expression. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 17 pages, 9 figures, accepted in IJMPD

Journal ref: Int. J. Mod. Phys. D (2024)

arXiv:2405.20070 [pdf]

doi 10.1038/s41598-024-58935-6

Pick-up and assembling of chemically sensitive van der Waals heterostructures using dry cryogenic exfoliation

Authors: Vilas Patil, Sanat Ghosh, Amit Basu, Kuldeep, Achintya Dutta, Khushabu Agrawal, Neha Bhatia, Amit Shah, Digambar A. Jangade, Ruta Kulkarni, A. Thamizhavel, Mandar M. Deshmukh

Abstract: Assembling atomic layers of van der Waals materials (vdW) combines the physics of two materials, offering opportunities for novel functional devices. Realization of this has been possible because of advancements in nanofabrication processes which often involve chemical processing of the materials under study; this can be detrimental to device performance. To address this issue, we have developed a… ▽ More Assembling atomic layers of van der Waals materials (vdW) combines the physics of two materials, offering opportunities for novel functional devices. Realization of this has been possible because of advancements in nanofabrication processes which often involve chemical processing of the materials under study; this can be detrimental to device performance. To address this issue, we have developed a modified micro-manipulator setup for cryogenic exfoliation, pick up, and transfer of vdW materials to assemble heterostructures. We use the glass transition of a polymer PDMS to cleave a flake into two, followed by its pick-up and drop to form pristine twisted junctions. To demonstrate the potential of the technique, we fabricated twisted heterostructure of Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$ (BSCCO), a van der Waals high-temperature cuprate superconductor. We also employed this method to re-exfoliate NbSe$_2$ and make twisted heterostructure. Transport measurements of the fabricated devices indicate the high quality of the artificial twisted interface. In addition, we extend this cryogenic exfoliation method for other vdW materials, offering an effective way of assembling heterostructures and twisted junctions with pristine interfaces. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Journal ref: Scientific Reports 14, Article number: 11097 (2024)

arXiv:2405.17038 [pdf, other]

Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction

Authors: Chiara Fumelli, Anirvan Dutta, Mohsen Kaboli

Abstract: Motivated by the growing interest in enhancing intuitive physical Human-Machine Interaction (HRI/HVI), this study aims to propose a robust tactile hand gesture recognition system. We performed a comprehensive evaluation of different hand gesture recognition approaches for a large area tactile sensing interface (touch interface) constructed from conductive textiles. Our evaluation encompassed tradi… ▽ More Motivated by the growing interest in enhancing intuitive physical Human-Machine Interaction (HRI/HVI), this study aims to propose a robust tactile hand gesture recognition system. We performed a comprehensive evaluation of different hand gesture recognition approaches for a large area tactile sensing interface (touch interface) constructed from conductive textiles. Our evaluation encompassed traditional feature engineering methods, as well as contemporary deep learning techniques capable of real-time interpretation of a range of hand gestures, accommodating variations in hand sizes, movement velocities, applied pressure levels, and interaction points. Our extensive analysis of the various methods makes a significant contribution to tactile-based gesture recognition in the field of human-machine interaction. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.14987 [pdf, other]

Simultaneous quantum identity authentication scheme utilizing entanglement swapping with secret key preservation

Authors: Arindam Dutta, Anirban Pathak

Abstract: Unconditional security in quantum key distribution (QKD) relies on authenticating the identities of users involved in key distribution. While classical identity authentication schemes were initially utilized in QKD implementations, concerns regarding their vulnerability have prompted the exploration of quantum identity authentication (QIA) protocols. In this study, we introduce a new protocol for… ▽ More Unconditional security in quantum key distribution (QKD) relies on authenticating the identities of users involved in key distribution. While classical identity authentication schemes were initially utilized in QKD implementations, concerns regarding their vulnerability have prompted the exploration of quantum identity authentication (QIA) protocols. In this study, we introduce a new protocol for QIA, derived from the concept of controlled secure direct quantum communication. Our proposed scheme facilitates simultaneous authentication between two users, Alice and Bob, leveraging Bell states with the assistance of a third party, Charlie. Through rigorous security analysis, we demonstrate that the proposed protocol withstands various known attacks, including impersonation, intercept and resend and impersonated fraudulent attacks. Additionally, we establish the relevance of the proposed protocol by comparing it with the existing protocols of similar type. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: A new scheme for quantum identity authentication is proposed

arXiv:2405.12634 [pdf, other]

Visuo-Tactile based Predictive Cross Modal Perception for Object Exploration in Robotics

Authors: Anirvan Dutta, Etienne Burdet, Mohsen Kaboli

Abstract: Autonomously exploring the unknown physical properties of novel objects such as stiffness, mass, center of mass, friction coefficient, and shape is crucial for autonomous robotic systems operating continuously in unstructured environments. We introduce a novel visuo-tactile based predictive cross-modal perception framework where initial visual observations (shape) aid in obtaining an initial prior… ▽ More Autonomously exploring the unknown physical properties of novel objects such as stiffness, mass, center of mass, friction coefficient, and shape is crucial for autonomous robotic systems operating continuously in unstructured environments. We introduce a novel visuo-tactile based predictive cross-modal perception framework where initial visual observations (shape) aid in obtaining an initial prior over the object properties (mass). The initial prior improves the efficiency of the object property estimation, which is autonomously inferred via interactive non-prehensile pushing and using a dual filtering approach. The inferred properties are then used to enhance the predictive capability of the cross-modal function efficiently by using a human-inspired `surprise' formulation. We evaluated our proposed framework in the real-robotic scenario, demonstrating superior performance. △ Less

Submitted 23 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: Accepted at IEEE International Symposium on Robotic and Sensors Environments 2024

arXiv:2405.12212 [pdf, other]

Forced Measurement of Astronomical Sources at Low Signal to Noise

Authors: Anirban Dutta, John R. Peterson, Glenn Sembroski

Abstract: We propose a modified moment matching algorithm to avoid catastrophic failures for sources with a low signal to noise ratio (SNR). The proposed modifications include a method to eliminate non-physical negative pixel values and a forced single iteration with an initial guess derived from co-add measurements when iterative methods are unstable. We correct for all biases in measurements introduced by… ▽ More We propose a modified moment matching algorithm to avoid catastrophic failures for sources with a low signal to noise ratio (SNR). The proposed modifications include a method to eliminate non-physical negative pixel values and a forced single iteration with an initial guess derived from co-add measurements when iterative methods are unstable. We correct for all biases in measurements introduced by the method. We find that the proposed modifications allow the algorithm to avoid catastrophic failures in nearly 100\% of the cases, especially at low signal to noise ratio. Additionally, with a reasonable guess from co-add measurements, the algorithm measures the flux, centroid, size, shape and ellipticity with bias statistically consistent with zero. We show the proposed method allows us to measure sources seven times fainter than traditional methods when applied to images obtained from WIYN-ODI. We also present a scheme to find uncertainties in measurements when using the new method to measure astronomical sources. △ Less

Submitted 26 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: Accepted for publication AJ

arXiv:2405.08085 [pdf]

Four-component relativistic third-order algebraic diagrammatic construction theory for electron detachment, attachment, electronic excitation problem and calculation of first order transition properties

Authors: Sudipta Chakraborty, Tamoghna Mukhopadhyay, Malaya K. Nayak, Achintya Kumar Dutta

Abstract: An efficient third-order algebraic diagrammatic construction (ADC) theory has been implemented to calculate ionisation potential, electron attachment and excitation energy (IP/EA/EE-ADC(3)) in a four-component relativistic framework. We have used polarisation propagator formulation for third-order perturbation theory to access the excitation energies (EE), and for IP/EA, a single-particle propagat… ▽ More An efficient third-order algebraic diagrammatic construction (ADC) theory has been implemented to calculate ionisation potential, electron attachment and excitation energy (IP/EA/EE-ADC(3)) in a four-component relativistic framework. We have used polarisation propagator formulation for third-order perturbation theory to access the excitation energies (EE), and for IP/EA, a single-particle propagator has been used based on a non-Dyson formulation. The benchmarking calculations have been performed on various types of systems to test the accuracy of the four component ADC(3) scheme for the computation of IP, EA and EE. We have applied our IP-ADC(3) to demonstrate the computation of splitting in the IP states for halogen monoxides (XO, X = Cl, Br, I ) due to spin-orbital coupling in the 2^Π ground state and compared it with experimental results. Next, we have studied the effect of relativity and the size of the basis set on the electron attachment calculations of halogen atoms (F, Cl, Br, I and At) using EA-ADC(3). As our next step, we have shown the efficiency of four component ADC(3) in computing excitation energies of triiodide ion and compared with relativistic equation of motion coupled cluster with singles and doubles (EOM-CCSD), intermediate Hamiltonian Fock space coupled cluster (IHFS-CC) and other EOM-CCSD schemes in which spin-orbit coupling is incorporated with different degrees of approximation. Finally, we have also investigated the excitation energies and transition dipole moments for the four excited states of Xe atom and compared them with our recent four-component EOM-CCSD implementation and relativistic finite field Fock space coupled cluster results, along with the experimental estimates. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.05625 [pdf, other]

doi 10.1016/j.newast.2024.102248

Does Dynamical Wormhole Evolve From Emergent Scenario?

Authors: Dhritimalya Roy, Ayanendu Dutta, Bikram Ghosh, Subenoy Chakraborty

Abstract: In the present work we analyse a dynamical wormhole solution with two fluids system (one isotropic and homogeneous and the other being inhomogeneous and anisotropic in nature) as the matter at the throat. We choose two different forms of Equation of State(EoS) and investigate two solutions of the wormhole geometry. The properties to ensure existence and traversability has been analysed. Also, the… ▽ More In the present work we analyse a dynamical wormhole solution with two fluids system (one isotropic and homogeneous and the other being inhomogeneous and anisotropic in nature) as the matter at the throat. We choose two different forms of Equation of State(EoS) and investigate two solutions of the wormhole geometry. The properties to ensure existence and traversability has been analysed. Also, the model of the dynamic wormhole has been examined for a possibility of the Emergent Universe(EU) model in cosmological context. Finally, for the dynamical wormholes so obtained, Null Energy Condition(NEC) has been examined near the throat. △ Less

Submitted 9 May, 2024; originally announced May 2024.

Journal ref: New Astronomy, Volume 111, October 2024, 102248

arXiv:2405.03555 [pdf, other]

A Comprehensive Overview and Survey of O-RAN: Exploring Slicing-aware Architecture, Deployment Options, and Use Cases

Authors: Khurshid Alam, Mohammad Asif Habibi, Matthias Tammen, Dennis Krummacker, Walid Saad, Marco Di Renzo, Tommaso Melodia, Xavier Costa-Pérez, Mérouane Debbah, Ashutosh Dutta, Hans D. Schotten

Abstract: Open-radio access network (O-RAN) seeks to establish principles of openness, programmability, automation, intelligence, and hardware-software disaggregation with interoperable interfaces. It advocates for multi-vendorism and multi-stakeholderism within a cloudified and virtualized wireless infrastructure, aimed at enhancing the deployment, operation, and maintenance of RAN architecture. This enhan… ▽ More Open-radio access network (O-RAN) seeks to establish principles of openness, programmability, automation, intelligence, and hardware-software disaggregation with interoperable interfaces. It advocates for multi-vendorism and multi-stakeholderism within a cloudified and virtualized wireless infrastructure, aimed at enhancing the deployment, operation, and maintenance of RAN architecture. This enhancement promises increased flexibility, performance optimization, service innovation, energy efficiency, and cost efficiency in fifth-generation (5G), sixth-generation (6G), and future networks. One of the key features of the O-RAN architecture is its support for network slicing, which entails interaction with other slicing domains within a mobile network, notably the transport network (TN) domain and the core network (CN) domain, to realize end-to-end (E2E) network slicing. The study of this feature requires exploring the stances and contributions of diverse standards development organizations (SDOs). In this context, we note that despite the ongoing industrial deployments and standardization efforts, the research and standardization communities have yet to comprehensively address network slicing in O-RAN. To address this gap, this survey paper provides a comprehensive exploration of network slicing in O-RAN through an in-depth review of specification documents from O-RAN Alliance and research papers from leading industry and academic institutions. The paper commences with an overview of the ongoing standardization efforts and open-source contributions associated with O-RAN, subsequently delving into the latest O-RAN architecture with an emphasis on its slicing aspects. Further, the paper explores deployment scenarios for network slicing within O-RAN, examining options for the deployment and orchestration of O-RAN and TN network slice subnets... △ Less

Submitted 8 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

Comments: 45 pages, 12 figures, 4 tables, submitted to the IEEE for possible publication

arXiv:2404.12467 [pdf, other]

Towards Multi-modal Transformers in Federated Learning

Authors: Guangyu Sun, Matias Mendieta, Aritra Dutta, Xin Li, Chen Chen

Abstract: Multi-modal transformers mark significant progress in different domains, but siloed high-quality data hinders their further improvement. To remedy this, federated learning (FL) has emerged as a promising privacy-preserving paradigm for training models without direct access to the raw data held by different clients. Despite its potential, a considerable research direction regarding the unpaired uni… ▽ More Multi-modal transformers mark significant progress in different domains, but siloed high-quality data hinders their further improvement. To remedy this, federated learning (FL) has emerged as a promising privacy-preserving paradigm for training models without direct access to the raw data held by different clients. Despite its potential, a considerable research direction regarding the unpaired uni-modal clients and the transformer architecture in FL remains unexplored. To fill this gap, this paper explores a transfer multi-modal federated learning (MFL) scenario within the vision-language domain, where clients possess data of various modalities distributed across different datasets. We systematically evaluate the performance of existing methods when a transformer architecture is utilized and introduce a novel framework called Federated modality complementary and collaboration (FedCola) by addressing the in-modality and cross-modality gaps among clients. Through extensive experiments across various FL settings, FedCola demonstrates superior performance over previous approaches, offering new perspectives on future federated training of multi-modal transformers. △ Less

Submitted 16 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: 2024 European Conference on Computer Vision (ECCV)

arXiv:2404.11984 [pdf, other]

doi 10.1016/j.newast.2024.102236

Particle motion around traversable wormholes: Possibility of closed timelike geodesics

Authors: Ayanendu Dutta, Dhritimalya Roy, Subenoy Chakraborty

Abstract: The present work investigates the general wormhole solution in Einstein gravity with an exponential shape function around an ultrastatic and a finite redshift geometry. The geodesic motion around the wormholes is studied in which the deflection angle of the orbiting photon sphere is found to be negative after a certain region, indicating the presence of repulsive effect of gravity in both the ultr… ▽ More The present work investigates the general wormhole solution in Einstein gravity with an exponential shape function around an ultrastatic and a finite redshift geometry. The geodesic motion around the wormholes is studied in which the deflection angle of the orbiting photon sphere is found to be negative after a certain region, indicating the presence of repulsive effect of gravity in both the ultrastatic and finite redshift wormholes. Various unbounded and bounded timelike trajectories are presented on the wormhole embedding diagrams, in which some of the bound orbits involve intersection points that may lead to causality violating geodesics. Another class of closed timelike geodesics are obtained in the unstable circular trajectory that appeared at the wormhole throat. Finally, the trajectories are classified in terms of the family of CTG orbits. △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: 12 pages, 7 figures, 1 table. Accepted in New Astronomy

Journal ref: New Astron. 111, 102236 (2024)

arXiv:2404.05869 [pdf]

Spin-free exact two-component linear response coupled cluster theory for estimation of frequency-dependent second-order property

Authors: Sudipta Chakraborty, Tamoghna Mukhopadhyay, Achintya Kumar Dutta

Abstract: We have presented the theory, implementation, and benchmark results for the one-electronic variant of spin-free exact two-component (SFX2C1e) linear response coupled cluster (LRCCSD) theory for static and dynamic polarizabilities of atoms and molecules in the spin-adapted formulation. The resolution of identity (RI) approximation for two-electron integrals has been used to reduce the computational… ▽ More We have presented the theory, implementation, and benchmark results for the one-electronic variant of spin-free exact two-component (SFX2C1e) linear response coupled cluster (LRCCSD) theory for static and dynamic polarizabilities of atoms and molecules in the spin-adapted formulation. The resolution of identity (RI) approximation for two-electron integrals has been used to reduce the computational cost of the calculation and has been shown to have a negligible effect on accuracy. The calculated static and dynamic polarizability values agree very well with the more expensive X2C-LRCCSD and experimental results. Our calculated results show that accurate predictions of polarizabilities of atoms and molecules containing heavy atoms require the use of a large basis set containing an adequate number of diffuse functions, in addition to accounting for electron correlation and relativistic effects. △ Less

Submitted 30 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.02035 [pdf, other]

doi 10.1093/mnras/stae1345

Ram pressure stripping in clusters: Gravity can bind the ISM but not the CGM

Authors: Ritali Ghosh, Alankar Dutta, Prateek Sharma

Abstract: We explore the survival of a galaxy's circumgalactic medium (CGM) as it experiences ram pressure stripping (RPS) moving through the intracluster medium (ICM). For a satellite galaxy, the CGM is often assumed to be entirely stripped/evaporated, an assumption that may not always be justified. We carry out 3D-hydrodynamic simulations of the interstellar and circumgalactic media (ISM+CGM) of a galaxy… ▽ More We explore the survival of a galaxy's circumgalactic medium (CGM) as it experiences ram pressure stripping (RPS) moving through the intracluster medium (ICM). For a satellite galaxy, the CGM is often assumed to be entirely stripped/evaporated, an assumption that may not always be justified. We carry out 3D-hydrodynamic simulations of the interstellar and circumgalactic media (ISM+CGM) of a galaxy like JO201 moving through the ICM. The CGM can survive long at cluster outskirts ($\gtrsim2 \rm \ Gyr$) but at smaller cluster-centric distances, 90\% of the CGM mass is lost within $\sim 500$ Myr. The gravitational restoring force on the CGM is mostly negligible and the CGM-ICM interaction is analogous to \textit{`cloud-wind interaction'}. The CGM stripping timescale does not depend on the ram pressure but on the CGM to ICM density contrast $χ$. Two distinct regimes emerge for CGM stripping: the $χ>1$ regime, which is the well-known \textit{`cloud crushing'} problem, and the $χ<1$ regime, which we refer to as the (relatively unexplored) \textit{`bubble drag'} problem. The first pericentric passage near the cluster core can rapidly -- over a crossing time $t_{\rm drag} \sim R/v_{\rm rel}$ -- strip the CGM in the \textit{bubble drag} regime. The ISM stripping criterion unlike the CGM criterion, still depends on the ram pressure $ρ_{\rm ICM} v_{\rm rel}^2$. The stripped tails of satellites contain contributions from both the disk and the CGM. The X-ray plume in M89 in the Virgo cluster and a lack of it in the nearby M90 might be attributed to their orbital histories. M90 has likely undergone stripping in the bubble drag regime due to a pericentric passage close to the cluster center. △ Less

Submitted 11 August, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 23 pages, 15 figures; journal-accepted version; a short video description of the paper is here: https://youtu.be/yssXeoE6JV4?si=etoZuLLT1btLzo6_

Journal ref: MNRAS, vol 531, 3445-3467 (2024)

arXiv:2404.00412 [pdf, other]

SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

Authors: Ayan Banerjee, Nityanand Mathur, Josep Lladós, Umapada Pal, Anjan Dutta

Abstract: Generating VectorArt from text prompts is a challenging vision task, requiring diverse yet realistic depictions of the seen as well as unseen entities. However, existing research has been mostly limited to the generation of single objects, rather than comprehensive scenes comprising multiple elements. In response, this work introduces SVGCraft, a novel end-to-end framework for the creation of vect… ▽ More Generating VectorArt from text prompts is a challenging vision task, requiring diverse yet realistic depictions of the seen as well as unseen entities. However, existing research has been mostly limited to the generation of single objects, rather than comprehensive scenes comprising multiple elements. In response, this work introduces SVGCraft, a novel end-to-end framework for the creation of vector graphics depicting entire scenes from textual descriptions. Utilizing a pre-trained LLM for layout generation from text prompts, this framework introduces a technique for producing masked latents in specified bounding boxes for accurate object placement. It introduces a fusion mechanism for integrating attention maps and employs a diffusion U-Net for coherent composition, speeding up the drawing process. The resulting SVG is optimized using a pre-trained encoder and LPIPS loss with opacity modulation to maximize similarity. Additionally, this work explores the potential of primitive shapes in facilitating canvas completion in constrained environments. Through both qualitative and quantitative assessments, SVGCraft is demonstrated to surpass prior works in abstraction, recognizability, and detail, as evidenced by its performance metrics (CLIP-T: 0.4563, Cosine Similarity: 0.6342, Confusion: 0.66, Aesthetic: 6.7832). The code will be available at https://github.com/ayanban011/SVGCraft. △ Less

Submitted 30 March, 2024; originally announced April 2024.

arXiv:2403.19863 [pdf, other]

DeNetDM: Debiasing by Network Depth Modulation

Authors: Silpa Vadakkeeveetil Sreelatha, Adarsh Kappiyath, Anjan Dutta

Abstract: When neural networks are trained on biased datasets, they tend to inadvertently learn spurious correlations, leading to challenges in achieving strong generalization and robustness. Current approaches to address such biases typically involve utilizing bias annotations, reweighting based on pseudo-bias labels, or enhancing diversity within bias-conflicting data points through augmentation technique… ▽ More When neural networks are trained on biased datasets, they tend to inadvertently learn spurious correlations, leading to challenges in achieving strong generalization and robustness. Current approaches to address such biases typically involve utilizing bias annotations, reweighting based on pseudo-bias labels, or enhancing diversity within bias-conflicting data points through augmentation techniques. We introduce DeNetDM, a novel debiasing method based on the observation that shallow neural networks prioritize learning core attributes, while deeper ones emphasize biases when tasked with acquiring distinct information. Using a training paradigm derived from Product of Experts, we create both biased and debiased branches with deep and shallow architectures and then distill knowledge to produce the target debiased model. Extensive experiments and analyses demonstrate that our approach outperforms current debiasing techniques, achieving a notable improvement of around 5% in three datasets, encompassing both synthetic and real-world data. Remarkably, DeNetDM accomplishes this without requiring annotations pertaining to bias labels or bias types, while still delivering performance on par with supervised counterparts. Furthermore, our approach effectively harnesses the diversity of bias-conflicting points within the data, surpassing previous methods and obviating the need for explicit augmentation-based methods to enhance the diversity of such bias-conflicting points. The source code will be available upon acceptance. △ Less

Submitted 28 March, 2024; originally announced March 2024.

Comments: 23 pages including supplementary

arXiv:2403.17799 [pdf, other]

doi 10.1051/0004-6361/202449303

Discovery and timing of ten new millisecond pulsars in the globular cluster Terzan 5

Authors: P. V. Padmanabh, S. M. Ransom, P. C. C. Freire, A. Ridolfi, J. D. Taylor, C. Choza, C. J. Clark, F. Abbate, M. Bailes, E. D. Barr, S. Buchner, M. Burgay, M. E. DeCesar, W. Chen, A. Corongiu, D. J. Champion, A. Dutta, M. Geyer, J. W. T. Hessels, M. Kramer, A. Possenti, I. H. Stairs, B. W. Stappers, V. Venkatraman Krishnan, L. Vleeschower , et al. (1 additional authors not shown)

Abstract: We report the discovery of ten new pulsars in the globular cluster Terzan 5 as part of the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed Terzan 5 at L-band (856--1712 MHz) with the MeerKAT radio telescope for four hours on two epochs, and performed acceleration searches of 45 out of 288 tied-array beams covering the core of the cluster. We obtained phase-connected… ▽ More We report the discovery of ten new pulsars in the globular cluster Terzan 5 as part of the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed Terzan 5 at L-band (856--1712 MHz) with the MeerKAT radio telescope for four hours on two epochs, and performed acceleration searches of 45 out of 288 tied-array beams covering the core of the cluster. We obtained phase-connected timing solutions for nine discoveries, covering nearly two decades of archival observations from the Green Bank Telescope for all but one. Highlights include PSR J1748$-$2446ao which is an eccentric ($e = 0.32$) wide-orbit (orbital period $P_{\rm b} = 57.55$ d) system. We were able to measure the rate of advance of periastron ($\dotω$) for this system allowing us to determine a total mass of $3.17 \pm \, 0.02\, \rm M_{\odot}$. With a minimum companion mass ($M_{\rm c}$) of $\sim 0.8\, \rm M_{\odot}$, PSR J1748$-$2446ao is a candidate double neutron star (DNS) system. If confirmed to be a DNS, it would be the fastest spinning pulsar ($P = 2.27$ ms) and the longest orbital period measured for any known DNS system. PSR J1748$-$2446ap has the second highest eccentricity for any recycled pulsar ($e \sim 0.905$) and for this system we can measure the total mass ($1.997 \pm 0.006\, \rm M_{\odot}$) and also estimate the individual pulsar and companion masses. PSR J1748$-$2446ar is an eclipsing redback (minimum $M_{\rm c} \sim 0.34\, \rm M_{\odot}$) system whose properties confirm it to be the counterpart to a previously published source identified in radio and X-ray imaging. With these discoveries, the total number of confirmed pulsars in Terzan 5 is 49, the highest for any globular cluster so far. These discoveries further enhance the rich set of pulsars known in Terzan 5 and provide scope for a deeper understanding of binary stellar evolution, cluster dynamics and ensemble population studies. △ Less

Submitted 19 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

Comments: 23 pages, 11 figures, 5 tables, published in A&A

Journal ref: A&A 686, A166 (2024)

arXiv:2403.15562 [pdf, ps, other]

Self-Consistent Atmosphere Representation and Interaction in Photon Monte Carlo Simulations

Authors: J. R. Peterson, G. Sembroski, A. Dutta, C. Remacaldo

Abstract: We present a self-consistent representation of the atmosphere and implement the interactions of light with the atmosphere using a photon Monte Carlo approach. We compile global climate distributions based on historical data, self-consistent vertical profiles of thermodynamic quantities, spatial models of cloud variation and cover, and global distributions of four kinds of aerosols. We then impleme… ▽ More We present a self-consistent representation of the atmosphere and implement the interactions of light with the atmosphere using a photon Monte Carlo approach. We compile global climate distributions based on historical data, self-consistent vertical profiles of thermodynamic quantities, spatial models of cloud variation and cover, and global distributions of four kinds of aerosols. We then implement refraction, Rayleigh scattering, molecular interactions, Tyndall-Mie scattering to all photons emitted from astronomical sources and various background components using physics first principles. This results in emergent image properties that include: differential astrometry and elliptical point spread functions predicted completely to the horizon, arcminute-scale spatial-dependent photometry variations at 20 mmag for short exposures, excess background spatial variations at 0.2% due the atmosphere, and a point spread function wing due to water droplets. We reproduce the well-known correlations in image characteristics: correlations in altitude with absolute photometry (overall transmission) and relative photometry (spectrally-dependent transmission), anti-correlations of altitude with differential astrometry (non-ideal astrometric patterns) and background levels, and an anti-correlation in absolute photometry with cloud depth. However, we also find further subtle correlations including an anti-correlation of temperature with background and differential astrometry, a correlation of temperature with absolute and relative photometry, an anti-correlation of absolute photometry with humidity, a correlation of humidity with Lunar background, a significant correlation of PSF wing with cloud depth, an anti-correlation of background with cloud depth, and a correlation of lunar background with cloud depth. △ Less

Submitted 22 March, 2024; originally announced March 2024.

Comments: 32 pages, 26 figures, ApJ Accepted

Journal ref: ApJ 964 124 (2024)

arXiv:2403.06309 [pdf, other]

doi 10.1088/1402-4896/ad635f

Use of Nash equilibrium in finding game theoretic robust security bound on quantum bit error rate

Authors: Arindam Dutta, Anirban Pathak

Abstract: Nash equilibrium is employed to find a game theoretic robust security bound on quantum bit error rate (QBER) for DL04 protocol which is a scheme for quantum secure direct communication that has been experimentally realized recently. The receiver, sender and eavesdropper (Eve) are considered to be quantum players (players having the capability to perform quantum operations). Specifically, Eve is co… ▽ More Nash equilibrium is employed to find a game theoretic robust security bound on quantum bit error rate (QBER) for DL04 protocol which is a scheme for quantum secure direct communication that has been experimentally realized recently. The receiver, sender and eavesdropper (Eve) are considered to be quantum players (players having the capability to perform quantum operations). Specifically, Eve is considered to have the capability of performing quantum attacks (e.g., Wójcik's original attack, Wójcik's symmetrized attack and Pavičić attack) and classical intercept and resend attack. Game theoretic analysis of the security of DL04 protocol in the above scenario is performed by considering several game scenarios. The analysis revealed the absence of a Pareto optimal Nash equilibrium point within these game scenarios. Consequently, mixed strategy Nash equilibrium points are identified and employed to establish both upper and lower bounds for QBER. Further, the vulnerability of the DL04 protocol to Pavičić attack in the message mode is established. In addition, it is observed that the quantum attacks performed by Eve are more powerful than the classical attack, as the QBER value and the probability of detecting Eve's presence are found to be lower in quantum attacks compared to classical ones. △ Less

Submitted 10 July, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: Quantum Game, Quantum Secure Direct Communication (QSDC), Nash Equilibrium, Secure QBER Bound Estimation

Journal ref: Phys. Scr. 99 095106 (2024)

arXiv:2403.05435 [pdf, other]

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Authors: Anindya Mondal, Sauradip Nag, Xiatian Zhu, Anjan Dutta

Abstract: Object counting is pivotal for understanding the composition of scenes. Previously, this task was dominated by class-specific methods, which have gradually evolved into more adaptable class-agnostic strategies. However, these strategies come with their own set of limitations, such as the need for manual exemplar input and multiple passes for multiple categories, resulting in significant inefficien… ▽ More Object counting is pivotal for understanding the composition of scenes. Previously, this task was dominated by class-specific methods, which have gradually evolved into more adaptable class-agnostic strategies. However, these strategies come with their own set of limitations, such as the need for manual exemplar input and multiple passes for multiple categories, resulting in significant inefficiencies. This paper introduces a more practical approach enabling simultaneous counting of multiple object categories using an open-vocabulary framework. Our solution, OmniCount, stands out by using semantic and geometric insights (priors) from pre-trained models to count multiple categories of objects as specified by users, all without additional training. OmniCount distinguishes itself by generating precise object masks and leveraging varied interactive prompts via the Segment Anything Model for efficient counting. To evaluate OmniCount, we created the OmniCount-191 benchmark, a first-of-its-kind dataset with multi-label object counts, including points, bounding boxes, and VQA annotations. Our comprehensive evaluation in OmniCount-191, alongside other leading benchmarks, demonstrates OmniCount's exceptional performance, significantly outpacing existing solutions. △ Less

Submitted 20 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2403.05022 [pdf, other]

Effective Fault Localization using Probabilistic and Grouping Approach

Authors: Saksham Sahai Srivastava, Arpita Dutta, Rajib Mall

Abstract: Context: Fault localization (FL) is the key activity while debugging a program. Any improvement to this activity leads to significant improvement in total software development cost. There is an internal linkage between the program spectrum and test execution result. Conditional probability in statistics captures the probability of occurring one event in relationship to one or more other events. Ob… ▽ More Context: Fault localization (FL) is the key activity while debugging a program. Any improvement to this activity leads to significant improvement in total software development cost. There is an internal linkage between the program spectrum and test execution result. Conditional probability in statistics captures the probability of occurring one event in relationship to one or more other events. Objectives: The aim of this paper is to use the conception of conditional probability to design an effective fault localization technique. Methods: In the paper, we present a fault localization technique that derives the association between statement coverage information and test case execution result using condition probability statistics. This association with the failed test case result shows the fault containing the probability of that specific statement. Subsequently, we use a grouping method to refine the obtained statement ranking sequence for better fault localization. Results: We evaluated the effectiveness of proposed method over eleven open-source data sets. Our obtained results show that on average, the proposed CGFL method is 24.56% more effective than other contemporary fault localization methods such as D*, Tarantula, Ochiai, Crosstab, BPNN, RBFNN, DNN, and CNN. Conclusion: We devised an effective fault localization technique by combining the conditional probabilistic method with failed test case execution-based approach. Our experimental evaluation shows our proposed method outperforms the existing fault localization techniques. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2402.16159 [pdf, other]

DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others have become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which… ▽ More With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others have become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which serves as a foundation for developing specialized functionalities. However, due to the specific nature of each domain, different technical terminologies and specialized languages, expert annotation of available data becomes expensive and challenging. In light of these challenges, this paper proposes a novel named entity recognition (NER) technique specifically tailored for the open-source software systems. Our approach aims to address the scarcity of annotated software data by employing a comprehensive two-step distantly supervised annotation process. This process strategically leverages language heuristics, unique lookup tables, external knowledge sources, and an active learning approach. By harnessing these powerful techniques, we not only enhance model performance but also effectively mitigate the limitations associated with cost and the scarcity of expert annotators. It is noteworthy that our model significantly outperforms the state-of-the-art LLMs by a substantial margin. We also show the effectiveness of NER in the downstream task of relation extraction. △ Less

Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

Comments: Accepted at ECML-PKDD 2024 (Long Paper)

arXiv:2402.11682 [pdf, other]

Learning Conditional Invariances through Non-Commutativity

Authors: Abhra Chaudhuri, Serban Georgescu, Anjan Dutta

Abstract: Invariance learning algorithms that conditionally filter out domain-specific random variables as distractors, do so based only on the data semantics, and not the target domain under evaluation. We show that a provably optimal and sample-efficient way of learning conditional invariances is by relaxing the invariance criterion to be non-commutatively directed towards the target domain. Under domain… ▽ More Invariance learning algorithms that conditionally filter out domain-specific random variables as distractors, do so based only on the data semantics, and not the target domain under evaluation. We show that a provably optimal and sample-efficient way of learning conditional invariances is by relaxing the invariance criterion to be non-commutatively directed towards the target domain. Under domain asymmetry, i.e., when the target domain contains semantically relevant information absent in the source, the risk of the encoder $\varphi^*$ that is optimal on average across domains is strictly lower-bounded by the risk of the target-specific optimal encoder $Φ^*_τ$. We prove that non-commutativity steers the optimization towards $Φ^*_τ$ instead of $\varphi^*$, bringing the $\mathcal{H}$-divergence between domains down to zero, leading to a stricter bound on the target risk. Both our theory and experiments demonstrate that non-commutative invariance (NCI) can leverage source domain samples to meet the sample complexity needs of learning $Φ^*_τ$, surpassing SOTA invariance learning algorithms for domain adaptation, at times by over $2\%$, approaching the performance of an oracle. Implementation is available at https://github.com/abhrac/nci. △ Less

Submitted 18 February, 2024; originally announced February 2024.

Comments: International Conference on Learning Representations (ICLR) 2024

arXiv:2402.07261 [pdf, other]

doi 10.1109/GLOBECOM54140.2023.10437604

A Novel Technique to Parameterize Congestion Control in 6TiSCH IIoT Networks

Authors: Kushal Chakraborty, Aritra Kumar Dutta, Mohammad Avesh Hussain, Syed Raafay Mohiuddin, Nikumani Choudhury, Rakesh Matam, Mithun Mukherjee

Abstract: The Industrial Internet of Things (IIoT) refers to the use of interconnected smart devices, sensors, and other technologies to create a network of intelligent systems that can monitor and manage industrial processes. 6TiSCH (IPv6 over the Time Slotted Channel Hopping mode of IEEE 802.15.4e) as an enabling technology facilitates low-power and low-latency communication between IoT devices in industr… ▽ More The Industrial Internet of Things (IIoT) refers to the use of interconnected smart devices, sensors, and other technologies to create a network of intelligent systems that can monitor and manage industrial processes. 6TiSCH (IPv6 over the Time Slotted Channel Hopping mode of IEEE 802.15.4e) as an enabling technology facilitates low-power and low-latency communication between IoT devices in industrial environments. The Routing Protocol for Low power and lossy networks (RPL), which is used as the de-facto routing protocol for 6TiSCH networks is observed to suffer from several limitations, especially during congestion in the network. Therefore, there is an immediate need for some modifications to the RPL to deal with this problem. Under traffic load which keeps on changing continuously at different instants of time, the proposed mechanism aims at finding the appropriate parent for a node that can forward the packet to the destination through the least congested path with minimal packet loss. This facilitates congestion management under dynamic traffic loads. For this, a new metric for routing using the concept of exponential weighting has been proposed, which takes the number of packets present in the queue of the node into account when choosing the parent at a particular instance of time. Additionally, the paper proposes a parent selection and swapping mechanism for congested networks. Performance evaluations are carried out in order to validate the proposed work. The results show an improvement in the performance of RPL under heavy and dynamic traffic loads. △ Less

Submitted 11 February, 2024; originally announced February 2024.

Comments: The paper has been submitted, accepted, and presented at the 2023 IEEE Global Communications Conference: Next-Generation Networking and Internet, with plans for publication. It was delivered during the IEEE Global Communications Conference held on December 6th, 2023, in Kuala Lumpur, Malaysia

arXiv:2402.02018 [pdf, other]

The Landscape and Challenges of HPC Research and LLMs

Authors: Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari

Abstract: Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breach… ▽ More Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breaching exascale performance levels. In this paper, we posit that adapting and utilizing such language model-based techniques for tasks in high-performance computing (HPC) would be very beneficial. This study presents our reasoning behind the aforementioned position and highlights how existing ideas can be improved and adapted for HPC tasks. △ Less

Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

arXiv:2401.16428 [pdf, other]

A Comparative Investigation into the Operation of an Optimal Control Problem: The Maximal Stretch

Authors: Anurag Dutta, K. Lakshmanan, John Harshith, A. Ramamoorthy

Abstract: Mathematical Selection is a method in which we select a particular choice from a set of such. It have always been an interesting field of study for mathematicians. Combinatorial optimisation is the practice of selecting the best constituent from a collection of prospective possibilities according to some particular characterization. In simple cases, an optimal process problem encompasses identifyi… ▽ More Mathematical Selection is a method in which we select a particular choice from a set of such. It have always been an interesting field of study for mathematicians. Combinatorial optimisation is the practice of selecting the best constituent from a collection of prospective possibilities according to some particular characterization. In simple cases, an optimal process problem encompasses identifying components out of a finite arrangement and establishing the function's significance in possible to lessen or achieve maximum with a functional purpose. To extrapolate optimisation theory, it employs a wide range of mathematical concepts. Optimisation, when applied to a variety of different types of optimization algorithms, necessitates determining the best consequences of the specific predetermined characteristic in a particular circumstance. In this work, we will be working on one similar problem - The Maximal Stretch Problem with computational rigour. Beginning with the Problem Statement itself, we will be developing numerous step - by - step algorithms to solve the problem, and will finally pose a comparison between them on the basis of their Computational Complexity. The article entails around the Brute Force Solution, A Recursive Approach to deal with the problem, and finally a Dynamically Programmed Approach for the same. △ Less

Submitted 18 January, 2024; originally announced January 2024.

arXiv:2401.13471 [pdf, other]

doi 10.1103/PhysRevE.109.064143

Ordering kinetics in the active Ising model

Authors: Sayam Bandyopadhyay, Swarnajit Chatterjee, Aditya Kumar Dutta, Mintu Karmakar, Heiko Rieger, Raja Paul

Abstract: We undertake a numerical study of the ordering kinetics in the two-dimensional ($2d$) active Ising model (AIM), a discrete flocking model with a conserved density field coupled to a non-conserved magnetization field. We find that for a quench into the liquid-gas coexistence region and in the ordered liquid region, the characteristic length scale of both the density and magnetization domains follow… ▽ More We undertake a numerical study of the ordering kinetics in the two-dimensional ($2d$) active Ising model (AIM), a discrete flocking model with a conserved density field coupled to a non-conserved magnetization field. We find that for a quench into the liquid-gas coexistence region and in the ordered liquid region, the characteristic length scale of both the density and magnetization domains follows the Lifshitz-Cahn-Allen (LCA) growth law: $R(t) \sim t^{1/2}$, consistent with the growth law of passive systems with scalar order parameter and non-conserved dynamics. The system morphology is analyzed with the two-point correlation function and its Fourier transform, the structure factor, which conforms to the well-known Porod's law, a manifestation of the coarsening of compact domains with smooth boundaries. We also find the domain growth exponent unaffected by different noise strengths and self-propulsion velocities of the active particles. However, transverse diffusion is found to play the most significant role in the growth kinetics of the AIM. We extract the same growth exponent by solving the hydrodynamic equations of the AIM. △ Less

Submitted 20 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

Journal ref: Physical Review E 109, 064143 (2024)

arXiv:2401.12671 [pdf, other]

Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d… ▽ More In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external domain knowledge to LLMs, offers significant improvements. This paper introduces a novel framework that combines graph-driven context retrieval in conjunction to knowledge graphs based enhancement, honing the proficiency of LLMs, especially in domain specific community question answering platforms like AskUbuntu, Unix, and ServerFault. We conduct experiments on various LLMs with different parameter sizes to evaluate their ability to ground knowledge and determine factual accuracy in answers to open-ended questions. Our methodology GraphContextGen consistently outperforms dominant text-based retrieval systems, demonstrating its robustness and adaptability to a larger number of use cases. This advancement highlights the importance of pairing context rich data retrieval with LLMs, offering a renewed approach to knowledge sourcing and generation in AI systems. We also show that, due to rich contextual data retrieval, the crucial entities, along with the generated answer, remain factually coherent with the gold answer. △ Less

Submitted 5 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

arXiv:2401.09872 [pdf, other]

doi 10.1126/science.adg3005

A pulsar in a binary with a compact object in the mass gap between neutron stars and black holes

Authors: Ewan D. Barr, Arunima Dutta, Paulo C. C. Freire, Mario Cadelano, Tasha Gautam, Michael Kramer, Cristina Pallanca, Scott M. Ransom, Alessandro Ridolfi, Benjamin W. Stappers, Thomas M. Tauris, Vivek Venkatraman Krishnan, Norbert Wex, Matthew Bailes, Jan Behrend, Sarah Buchner, Marta Burgay, Weiwei Chen, David J. Champion, C. -H. Rosie Chen, Alessandro Corongiu, Marisa Geyer, Y. P. Men, Prajwal V. Padmanabh, Andrea Possenti

Abstract: Among the compact objects observed in gravitational wave merger events a few have masses in the gap between the most massive neutron stars (NSs) and least massive black holes (BHs) known. Their nature and the formation of their merging binaries are not well understood. We report on pulsar timing observations using the Karoo Array Telescope (MeerKAT) of PSR J0514-4002E, an eccentric binary millisec… ▽ More Among the compact objects observed in gravitational wave merger events a few have masses in the gap between the most massive neutron stars (NSs) and least massive black holes (BHs) known. Their nature and the formation of their merging binaries are not well understood. We report on pulsar timing observations using the Karoo Array Telescope (MeerKAT) of PSR J0514-4002E, an eccentric binary millisecond pulsar in the globular cluster NGC 1851 with a total binary mass of $3.887 \pm 0.004$ solar masses. The companion to the pulsar is a compact object and its mass (between $2.09$ and $2.71$ solar masses, 95% confidence interval) is in the mass gap, so it either is a very massive NS or a low-mass BH. We propose the companion was formed by a merger between two earlier NSs. △ Less

Submitted 18 January, 2024; originally announced January 2024.

Comments: 41 pages, 13 figures, 3 tables, to be published in Science

arXiv:2401.07107 [pdf, other]

SN 2020udy: A new piece of the homogeneous bright group in the diverse Iax subclass

Authors: Mridweeka Singh, Devendra K. Sahu, Barnabas Barna, Anjasha Gangopadhyay, Raya Dastidar, Rishabh Singh Teja, Kuntal Misra, D. Andrew Howell, Xiaofeng Wang, Jun Mo, Shengyu Yan, Daichi Hiramatsu, Craig Pellegrino, G. C. Anupama, Arti Joshi, K. Azalee Bostroem, Jamison Burke, Curtis McCully, Rama Subramanian V, Gaici Li, Gaobo Xi, Xin Li, Zhitong Li, Shubham Srivastav, Hyobin Im , et al. (1 additional authors not shown)

Abstract: We present optical observations and analysis of a bright type Iax SN~2020udy hosted by NGC 0812. The light curve evolution of SN~2020udy is similar to other bright Iax SNe. Analytical modeling of the quasi bolometric light curves of SN 2020udy suggests that 0.08$\pm$0.01 M$_{\odot}$ of $^{56}$Ni would have been synthesized during the explosion. Spectral features of SN 2020udy are similar to the br… ▽ More We present optical observations and analysis of a bright type Iax SN~2020udy hosted by NGC 0812. The light curve evolution of SN~2020udy is similar to other bright Iax SNe. Analytical modeling of the quasi bolometric light curves of SN 2020udy suggests that 0.08$\pm$0.01 M$_{\odot}$ of $^{56}$Ni would have been synthesized during the explosion. Spectral features of SN 2020udy are similar to the bright members of type Iax class showing weak Si {\sc II} line. The late-time spectral sequence is mostly dominated by Iron Group Elements (IGEs) with broad emission lines. Abundance tomography modeling of the spectral time series of SN~2020udy using TARDIS indicates stratification in the outer ejecta, however, to confirm this, spectral modeling at a very early phase is required. After maximum light, uniform mixing of chemical elements is sufficient to explain the spectral evolution. Unlike the case of normal type Ia SNe, the photospheric approximation remains robust until +100 days, requiring an additional continuum source. Overall, the observational features of SN 2020udy are consistent with the deflagration of a Carbon-Oxygen white dwarf. △ Less

Submitted 13 January, 2024; originally announced January 2024.

Comments: 18 pages, 17 figures, 3 tables, Accepted for publication in ApJ

arXiv:2401.04794 [pdf]

A Reduced Cost Four-Component Relativistic Unitary Coupled Cluster Method for Molecules

Authors: Kamal Majee, Tamoghna Mukhopadhyay, Malaya K. Nayak, Achintya Kumar Dutta

Abstract: We present a four-component relativistic unitary coupled cluster method for molecules. We have used commutator-based non-perturbative approximation using the ''Bernoulli expansion'' to derive an approximation to the relativistic unitary coupled cluster method. The performance of the full quadratic unitary coupled-cluster singles and doubles method \left ( qUCCSD \right ), as well as a perturbative… ▽ More We present a four-component relativistic unitary coupled cluster method for molecules. We have used commutator-based non-perturbative approximation using the ''Bernoulli expansion'' to derive an approximation to the relativistic unitary coupled cluster method. The performance of the full quadratic unitary coupled-cluster singles and doubles method \left ( qUCCSD \right ), as well as a perturbative approximation variant \left ( UCC3 \right ), has been reported for both energies and properties. It can be seen that both methods give results comparable to those of the standard relativistic coupled cluster method. The qUCCSD method shows better agreement with experimental results due to better inclusion of the relaxation effects. A natural spinor-based scheme to reduce the computation cost of relativistic UCC3 and qUCCSD methods has been discussed. △ Less

Submitted 5 March, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

Comments: 34 pages, 10 figures, 4 tables

arXiv:2312.17437 [pdf]

doi 10.1063/5.0167547

Bilayer Vanadium Dioxide Thin Film with Elevated Transition Temperatures and High Resistance Switching

Authors: Achintya Dutta, Ashok P, Amit Verma

Abstract: Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire… ▽ More Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire substrate. To quantify the impact of the VO$_2$ buffer layer, a single-layer VO$_2$ film of the same thickness was also fabricated. The bilayer VO$_{2-x}$/VO$_2$ films wherein the top VO$_{2-x}$ film was under-oxidized demonstrated an elevation in TIMT reaching ~97 $^\circ$C, one of the highest reported to date for VO$_2$ films and is achieved in a doping-free manner. Our results also reveal a one-order increase in resistance switching, with the optimum bilayer VO$_2$/VO$_2$ film exhibiting ~3.6 orders of switching from 25 $^\circ$C to 110 $^\circ$C, compared to the optimum single-layer VO$_2$ reference film. This is accompanied by a one-order decrease in the on-state resistance in its metallic phase. The elevation in TIMT, coupled with increased strain extracted from the XRD characterization of the bilayer film, suggests the possibility of compressive strain along the c-axis. These VO$_{2-x}$/VO$_2$ films also demonstrate a significant change in the slope of their resistance vs temperature curves contrary to the conventional smooth transition. This feature was ascribed to the rutile/monoclinic quasi-heterostructure formed due to the top VO$_{2-x}$ film having a reduced TIMT. Our findings carry significant implications for both the lucid fabrication of VO$_2$ thin film devices as well as the study of phase transitions in correlated oxides. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Journal ref: Journal of Applied Physics 134, no. 14 (2023)

arXiv:2312.17010 [pdf]

Robust Multi-Modal Image Stitching for Improved Scene Understanding

Authors: Aritra Dutta, G Suseela, Asmita Sood

Abstract: Multi-modal image stitching can be a difficult feat. That's why, in this paper, we've devised a unique and comprehensive image-stitching pipeline that taps into OpenCV's stitching module. Our approach integrates feature-based matching, transformation estimation, and blending techniques to bring about panoramic views that are of top-tier quality - irrespective of lighting, scale or orientation diff… ▽ More Multi-modal image stitching can be a difficult feat. That's why, in this paper, we've devised a unique and comprehensive image-stitching pipeline that taps into OpenCV's stitching module. Our approach integrates feature-based matching, transformation estimation, and blending techniques to bring about panoramic views that are of top-tier quality - irrespective of lighting, scale or orientation differences between images. We've put our pipeline to the test with a varied dataset and found that it's very effective in enhancing scene understanding and finding real-world applications. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Comments: 8 pages, 11 figures

arXiv:2312.16879 [pdf, other]

Relativistic equation-of-motion coupled-cluster theory analysis of black-body radiation shift in the clock transition of Zn I

Authors: Somesh Chamoli, Anmol Mishra, Richa Sharma Kesarkar, B. K. Sahoo, Achintya Kumar Dutta

Abstract: We have employed equation-of-motion coupled-cluster (EOM-CC) method in the four-component relativistic theory framework to understand roles of electron correlation effects in the $\textit{ab initio}$ estimations of electric dipole polarizabilities ($α$) of the states engaged in the clock transition ($^{1}$S$_{0}$$\rightarrow$$^{3}$P$_{0}$) of the zinc atom. Roles of basis size, inclusion of higher… ▽ More We have employed equation-of-motion coupled-cluster (EOM-CC) method in the four-component relativistic theory framework to understand roles of electron correlation effects in the $\textit{ab initio}$ estimations of electric dipole polarizabilities ($α$) of the states engaged in the clock transition ($^{1}$S$_{0}$$\rightarrow$$^{3}$P$_{0}$) of the zinc atom. Roles of basis size, inclusion of higher-level excitations, and higher-order relativistic effects in the evaluation of both excitation energies of a few low-lying excited states and $α$ are analyzed systematically. Our EOM-CC values are compared with the earlier reported theoretical and experimental results. This demonstrates the capability of the EOM-CC method to ascertain the preciseness of the black-body radiation shift in a clock transition, which holds paramount importance for optical clock-based experiments. △ Less

Submitted 15 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

Comments: 3 Figures, 11 pages

arXiv:2312.16221 [pdf, other]

STRIDE: Single-video based Temporally Continuous Occlusion Robust 3D Pose Estimation

Authors: Rohit Lal, Saketh Bachu, Yash Garg, Arindam Dutta, Calvin-Khang Ta, Dripta S. Raychaudhuri, Hannah Dela Cruz, M. Salman Asif, Amit K. Roy-Chowdhury

Abstract: The capability to accurately estimate 3D human poses is crucial for diverse fields such as action recognition, gait recognition, and virtual/augmented reality. However, a persistent and significant challenge within this field is the accurate prediction of human poses under conditions of severe occlusion. Traditional image-based estimators struggle with heavy occlusions due to a lack of temporal co… ▽ More The capability to accurately estimate 3D human poses is crucial for diverse fields such as action recognition, gait recognition, and virtual/augmented reality. However, a persistent and significant challenge within this field is the accurate prediction of human poses under conditions of severe occlusion. Traditional image-based estimators struggle with heavy occlusions due to a lack of temporal context, resulting in inconsistent predictions. While video-based models benefit from processing temporal data, they encounter limitations when faced with prolonged occlusions that extend over multiple frames. This challenge arises because these models struggle to generalize beyond their training datasets, and the variety of occlusions is hard to capture in the training data. Addressing these challenges, we propose STRIDE (Single-video based TempoRally contInuous occlusion Robust 3D Pose Estimation), a novel Test-Time Training (TTT) approach to fit a human motion prior for each video. This approach specifically handles occlusions that were not encountered during the model's training. By employing STRIDE, we can refine a sequence of noisy initial pose estimates into accurate, temporally coherent poses during test time, effectively overcoming the limitations of prior methods. Our framework demonstrates flexibility by being model-agnostic, allowing us to use any off-the-shelf 3D pose estimation method for improving robustness and temporal consistency. We validate STRIDE's efficacy through comprehensive experiments on challenging datasets like Occluded Human3.6M, Human3.6M, and OCMotion, where it not only outperforms existing single-image and video-based pose estimation models but also showcases superior handling of substantial occlusions, achieving fast, robust, accurate, and temporally consistent 3D pose estimates. △ Less

Submitted 13 March, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.15488 [pdf, other]

The Zeta ($ζ$) Notation for Complex Asymptotes

Authors: Anurag Dutta, K. Lakshmanan, John Harshith, A. Ramamoorthy, C. Pradeep, Pijush Kanti Kumar

Abstract: Time Complexity is an important metric to compare algorithms based on their cardinality. The commonly used, trivial notations to qualify the same are the Big-Oh, Big-Omega, Big-Theta, Small-Oh, and Small-Omega Notations. All of them, consider time a part of the real entity, i.e., Time coincides with the horizontal axis in the argand plane. But what if the Time rather than completely coinciding wit… ▽ More Time Complexity is an important metric to compare algorithms based on their cardinality. The commonly used, trivial notations to qualify the same are the Big-Oh, Big-Omega, Big-Theta, Small-Oh, and Small-Omega Notations. All of them, consider time a part of the real entity, i.e., Time coincides with the horizontal axis in the argand plane. But what if the Time rather than completely coinciding with the real axis of the argand plane, makes some angle with it? We are trying to focus on the case when the Time Complexity will have both real and imaginary components. For Instance, if $T\left(n\right)=\ n\log{n}$, the existing asymptomatic notations are capable of handling that in real time But, if we come across a problem where, $T\left(n\right)=\ n\log{n}+i\cdot n^2$, where, $i=\sqrt[2]{-1}$, the existing asymptomatic notations will not be able to catch up. To mitigate the same, in this research, we would consider proposing the Zeta Notation ($ζ$), which would qualify Time in both the Real and Imaginary Axis, as per the Argand Plane. △ Less

Submitted 1 February, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.13589 [pdf, other]

doi 10.1103/PhysRevB.109.L121114

The anomalous Floquet Anderson insulator in a continuously driven optical lattice

Authors: Arijit Dutta, Efe Sen, Jun-Hui Zheng, Monika Aidelsburger, Walter Hofstetter

Abstract: The anomalous Floquet Anderson insulator (AFAI) has been theoretically predicted in step-wise periodically driven models, but its stability under more general driving protocols hasn't been determined. We show that adding disorder to the anomalous Floquet topological insulator realized with a continuous driving protocol in the experiment by K. Wintersperger et. al., Nat. Phys. $\textbf{16}$, 1058 (… ▽ More The anomalous Floquet Anderson insulator (AFAI) has been theoretically predicted in step-wise periodically driven models, but its stability under more general driving protocols hasn't been determined. We show that adding disorder to the anomalous Floquet topological insulator realized with a continuous driving protocol in the experiment by K. Wintersperger et. al., Nat. Phys. $\textbf{16}$, 1058 (2020), supports an AFAI phase, where, for a range of disorder strengths, all the time averaged bulk states become localized, while the pumped charge in a Laughlin pump setup remains quantized. △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: 5 pages, 4 figures in the main text; 6 pages, 4 figures in the supplement. Comments are welcome

Journal ref: Phys. Rev. B 109, L121114 (2024)

arXiv:2312.10189 [pdf, other]

Resilient Federated Learning under Byzantine Attack in Distributed Nonconvex Optimization with 2-f Redundancy

Authors: Amit Dutta, Thinh T. Doan, Jeffrey H. Reed

Abstract: We study the problem of Byzantine fault tolerance in a distributed optimization setting, where there is a group of $N$ agents communicating with a trusted centralized coordinator. Among these agents, there is a subset of $f$ agents that may not follow a prescribed algorithm and may share arbitrarily incorrect information with the coordinator. The goal is to find the optimizer of the aggregate cost… ▽ More We study the problem of Byzantine fault tolerance in a distributed optimization setting, where there is a group of $N$ agents communicating with a trusted centralized coordinator. Among these agents, there is a subset of $f$ agents that may not follow a prescribed algorithm and may share arbitrarily incorrect information with the coordinator. The goal is to find the optimizer of the aggregate cost functions of the honest agents. We will be interested in studying the local gradient descent method, also known as federated learning, to solve this problem. However, this method often returns an approximate value of the underlying optimal solution in the Byzantine setting. Recent work showed that by incorporating the so-called comparative elimination (CE) filter at the coordinator, one can provably mitigate the detrimental impact of Byzantine agents and precisely compute the true optimizer in the convex setting. The focus of the present work is to provide theoretical results to show the convergence of local gradient methods with the CE filter in a nonconvex setting. We will also provide a number of numerical simulations to support our theoretical results. △ Less

Submitted 15 December, 2023; originally announced December 2023.

Showing 1–50 of 453 results for author: Dutta, A