-
TCNFormer: Temporal Convolutional Network Former for Short-Term Wind Speed Forecasting
Authors:
Abid Hasan Zim,
Aquib Iqbal,
Asad Malik,
Zhicheng Dong,
Hanzhou Wu
Abstract:
Global environmental challenges and rising energy demands have led to extensive exploration of wind energy technologies. Accurate wind speed forecasting (WSF) is crucial for optimizing wind energy capture and ensuring system stability. However, predicting wind speed remains challenging due to its inherent randomness, fluctuation, and unpredictability. This study proposes the Temporal Convolutional Network Former (TCNFormer) for short-term (12-hour) wind speed forecasting. The TCNFormer integrates the Temporal Convolutional Network (TCN) and transformer encoder to capture the spatio-temporal features of wind speed. The transformer encoder consists of two distinct attention mechanisms: causal temporal multi-head self-attention (CT-MSA) and temporal external attention (TEA). CT-MSA ensures that the output of a step derives only from previous steps, i.e., causality. Locality is also introduced to improve efficiency. TEA explores potential relationships between different sample sequences in wind speed data. This study utilizes wind speed data from the NASA Prediction of Worldwide Energy Resources (NASA POWER) of Patenga Sea Beach, Chittagong, Bangladesh (latitude 22.2352° N, longitude 91.7914° E) over a year (six seasons). The findings indicate that the TCNFormer outperforms state-of-the-art models in prediction accuracy. The proposed TCNFormer presents a promising method for spatio-temporal WSF and may achieve desirable performance in real-world applications of wind power systems.
Submitted 27 August, 2024;
originally announced August 2024.
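The causality and locality constraints described for CT-MSA in the TCNFormer abstract can be illustrated with an attention mask. This is only a minimal sketch, not the authors' implementation: it assumes that "locality" means a fixed sliding window and that each step may attend to itself as well as to earlier steps. The names `causal_local_mask`, `masked_softmax`, and `window` are hypothetical.

```python
import math

def causal_local_mask(seq_len, window):
    """mask[i][j] is True when step i may attend to step j.

    Assumption: step i sees itself and at most `window - 1` earlier steps.
    """
    return [[(j <= i) and (i - j < window) for j in range(seq_len)]
            for i in range(seq_len)]

def masked_softmax(scores, mask):
    """Row-wise softmax over allowed positions; masked positions get weight 0."""
    weights = []
    for row, allowed in zip(scores, mask):
        peak = max(s for s, a in zip(row, allowed) if a)
        exps = [math.exp(s - peak) if a else 0.0 for s, a in zip(row, allowed)]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights

mask = causal_local_mask(seq_len=5, window=3)
scores = [[0.0] * 5 for _ in range(5)]  # uniform scores, purely illustrative
weights = masked_softmax(scores, mask)
```

With uniform scores, each row spreads its weight evenly over the allowed positions, so no weight ever falls on a future step or on a step outside the window.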
-
EAViT: External Attention Vision Transformer for Audio Classification
Authors:
Aquib Iqbal,
Abid Hasan Zim,
Md Asaduzzaman Tonmoy,
Limengnan Zhou,
Asad Malik,
Minoru Kuribayashi
Abstract:
This paper presents the External Attention Vision Transformer (EAViT) model, a novel approach designed to enhance audio classification accuracy. As digital audio resources proliferate, the demand for precise and efficient audio classification systems has intensified, driven by the need for improved recommendation systems and user personalization in various applications, including music streaming platforms and environmental sound recognition. Accurate audio classification is crucial for organizing vast audio libraries into coherent categories, enabling users to find and interact with their preferred audio content more effectively. In this study, we utilize the GTZAN dataset, which comprises 1,000 music excerpts spanning ten diverse genres. Each 30-second audio clip is segmented into 3-second excerpts to enhance dataset robustness and mitigate overfitting risks, allowing for more granular feature analysis. The EAViT model integrates multi-head external attention (MEA) mechanisms into the Vision Transformer (ViT) framework, effectively capturing long-range dependencies and potential correlations between samples. This external attention (EA) mechanism employs learnable memory units that enhance the network's capacity to process complex audio features efficiently. The study demonstrates that EAViT achieves a remarkable overall accuracy of 93.99%, surpassing state-of-the-art models.
Submitted 23 August, 2024;
originally announced August 2024.
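The segmentation step in the EAViT abstract (each 30-second clip cut into 3-second excerpts) can be sketched as below. This is an illustration under stated assumptions, not the authors' preprocessing code: the excerpts are assumed to be non-overlapping, the sampling rate of 22050 Hz is an assumed value, and `segment_clip` is a hypothetical helper.

```python
def segment_clip(samples, sample_rate, segment_seconds=3):
    """Split one clip into consecutive, non-overlapping excerpts."""
    seg_len = sample_rate * segment_seconds
    n_segments = len(samples) // seg_len  # drop any incomplete tail
    return [samples[k * seg_len:(k + 1) * seg_len] for k in range(n_segments)]

SAMPLE_RATE = 22050  # assumed sampling rate in Hz
clip = [0.0] * (30 * SAMPLE_RATE)  # placeholder for one 30-second clip
excerpts = segment_clip(clip, SAMPLE_RATE)
```

Under these assumptions each clip yields ten excerpts, so the 1,000 clips mentioned in the abstract would give 10,000 training examples.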
-
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming
Authors:
Ahmed Qazi,
Taha Razzaq,
Asim Iqbal
Abstract:
We introduce a multimodal vision framework for precision livestock farming, harnessing the power of GroundingDINO, HQSAM, and ViTPose models. This integrated suite enables comprehensive behavioral analytics from video data without invasive animal tagging. GroundingDINO generates accurate bounding boxes around livestock, while HQSAM segments individual animals within these boxes. ViTPose estimates key body points, facilitating posture and movement analysis. Demonstrated on a sheep dataset with grazing, running, sitting, standing, and walking activities, our framework extracts invaluable insights: activity and grazing patterns, interaction dynamics, and detailed postural evaluations. Applicable across species and video resolutions, this framework revolutionizes non-invasive livestock monitoring for activity detection, counting, health assessments, and posture analyses. It empowers data-driven farm management, optimizing animal welfare and productivity through AI-powered behavioral understanding.
Submitted 14 June, 2024;
originally announced June 2024.
-
Relations between nonsmooth vector variational inequalities and nonsmooth vector optimization problems on Hadamard manifold in terms of bifunction
Authors:
Nagendra Singh,
Akhlad Iqbal,
Shahid Ali
Abstract:
In this paper, we discuss the concepts of bifunction and geodesic convexity for vector valued functions on Hadamard manifold. The Hadamard manifold is a particular type of Riemannian manifold with non-positive sectional curvature. Using bifunction, we introduce a definition of generalized geodesic convexity in the context of the Hadamard manifold. To support the definition, we construct a non-trivial example that demonstrates the property of geodesic convexity on Hadamard manifold. Additionally, we define the geodesic $h$-convexity, geodesic $h$-pseudoconvexity and geodesic $h$-quasiconvexity for vector valued function using bifunction and study their several properties. Furthermore, we demonstrate the uniqueness of the solution for nonsmooth vector variational inequality problem (NVVIP) and prove the characterization property for the solution of NVVIP and the Minty type NVVIP (MNVVIP) on Hadamard manifold in terms of bifunction. Afterward, we consider a nonsmooth vector optimization problem (NVOP) and investigate the relationships among the solutions of NVOP, NVVIP, and MNVVIP.
Submitted 24 May, 2024;
originally announced May 2024.
-
On relationships between vector variational inequalities and optimization problems using convexificators on Hadamard manifold
Authors:
Nagendra Singh,
Akhlad Iqbal,
Shahid Ali
Abstract:
An important concept of convexificators has been extended to Hadamard manifolds in this paper. The mean value theorem for convexificators on the Hadamard manifold has also been derived. Monotonicity of the bounded convexificators has been discussed and an important characterization for the bounded convexificators to be $\partial_{*}^{*}$-geodesic convexity has been derived. Furthermore, a vector variational inequalities problem using convexificators on Hadamard manifold has been considered. In addition, the necessary and sufficient conditions for vector optimization problems in terms of Stampacchia and Minty type partial vector variational inequality problem ($\partial_{*}^{*}$-VVIP) have been derived.
Submitted 24 May, 2024;
originally announced May 2024.
-
A Large-Scale Empirical Study of COVID-19 Contact Tracing Mobile App Reviews
Authors:
Sifat Ishmam Parisa,
Md Awsaf Alam Anindya,
Anindya Iqbal,
Gias Uddin
Abstract:
Since the beginning of 2020, the novel coronavirus has swept across the globe. Given the prevalence of smartphones everywhere, many countries across continents also developed COVID-19 contact tracing apps that users can install to get a warning of potential contacts with infected people. Unlike regular apps that undergo detailed requirement analysis, carefully designed development, and rigorous testing, contact tracing apps were deployed after rapid development. Therefore, such apps may not reach expectations for all end users. Users share their opinions and experience of the usage of the apps in the app store. This paper aims to understand the types of topics users discuss in the reviews of the COVID-19 contact tracing apps across the continents by analyzing the app reviews. We collected all the reviews of 35 COVID-19 contact tracing apps developed by 34 countries across the globe. We group the app reviews into the following geographical regions: Asia, Europe, North America, Latin America, Africa, Middle East, and Australasia (Australia and NZ). We run topic modeling on the app reviews of each region. We analyze the produced topics and their evolution over time by categorizing them into hierarchies and computing the ratings of reviews related to the topics. While privacy could be a concern with such apps, we only find privacy-related topics in Australasia, North America, and Middle East. Topics related to usability and performance of the apps are prevalent across all regions. Users frequently complained about the lack of features, the user interface, and the negative impact of such apps on their mobile batteries. Still, we also find that many users praised the apps because they helped them stay aware of the potential danger of getting infected. The findings of this study are expected to help app developers utilize their resources to address the reported issues in a prioritized way.
Submitted 28 April, 2024;
originally announced April 2024.
-
On Test Sequence Generation using Multi-Objective Particle Swarm Optimization
Authors:
Zain Iqbal,
Kashif Zafar,
Aden Iqbal,
Ayesha Khan
Abstract:
Software testing is an important and essential part of the software development life cycle and accounts for almost one-third of system development costs. In the software industry, testing costs can account for about 35% to 40% of the total cost of a software project. Therefore, providing efficient ways to test software is critical to reduce cost, time, and effort. Black-box testing and White-box testing are two essential components of software testing. Black-box testing focuses on the software's functionality, while White-box testing examines its internal structure. These tests contribute significantly to ensuring program coverage, which remains one of the main goals of the software testing paradigm. One of the main problems in this area is the identification of appropriate paths for program coverage, which are referred to as test sequences. Creating an automated and effective test sequence is a challenging task in the software testing process. In the proposed methodology, the challenge of "test sequence generation" is considered a multi-objective optimization problem that includes the Oracle cost and the path, both of which are optimized in a symmetrical manner to achieve optimal software testing. Multi-Objective Particle Swarm Optimization (MOPSO) is used to represent the test sequences with the highest priority and the lowest Oracle cost as optimal. The performance of the implemented approach is compared with the Multi-Objective Firefly Algorithm (MOFA) for generating test sequences. The MOPSO-based solution outperforms the MOFA-based approach and simultaneously provides the optimal solution for both objectives.
Submitted 9 April, 2024;
originally announced April 2024.
-
A Mixed Method Study of DevOps Challenges
Authors:
Minaoar Hossain Tanzil,
Masud Sarker,
Gias Uddin,
Anindya Iqbal
Abstract:
Context: DevOps practices combine software development and IT operations. There is a growing number of DevOps-related posts on the popular online developer forum Stack Overflow (SO). While previous research analyzed SO posts related to build/release engineering, we are aware of no research that specifically focused on DevOps-related discussions. Objective: To learn the challenges developers face while using the currently available DevOps tools and techniques, along with the organizational challenges in DevOps practices. Method: We conduct an empirical study by applying topic modeling to 174K SO posts that contain DevOps discussions. We then validate and extend the empirical study findings with a survey of 21 professional DevOps practitioners. Results: We find that: (1) There are 23 DevOps topics grouped into four categories: Cloud & CI/CD Tools, Infrastructure as Code, Container & Orchestration, and Quality Assurance. (2) The topic category Cloud & CI/CD Tools contains the highest number of topics (10), which cover 48.6% of all questions in our dataset, followed by the category Infrastructure as Code (28.9%). (3) File management is the most popular topic, followed by Jenkins Pipeline, while infrastructural Exception Handling and Jenkins Distributed Architecture are the most difficult topics (with the fewest accepted answers). (4) In the survey, developers mention that hands-on experience is required before current DevOps tools can be considered easy. They raised the need for better documentation and learning resources to learn the rapidly changing DevOps tools and techniques. Practitioners also emphasized formal training by organizations for DevOps skill development. Conclusion: Architects and managers can use the findings of this research to adopt appropriate DevOps technologies, and organizations can design tool-specific or process-specific DevOps training programs.
Submitted 25 March, 2024;
originally announced March 2024.
-
HT-LIP Model based Robust Control of Quadrupedal Robot Locomotion under Unknown Vertical Ground Motion
Authors:
Amir Iqbal,
Sushant Veer,
Christopher Niezrecki,
Yan Gu
Abstract:
This paper presents a hierarchical control framework that enables robust quadrupedal locomotion on a dynamic rigid surface (DRS) with general and unknown vertical motions. The key novelty of the framework lies in its higher layer, which is a discrete-time, provably stabilizing footstep controller. The basis of the footstep controller is a new hybrid, time-varying, linear inverted pendulum (HT-LIP) model that is low-dimensional and accurately captures the essential robot dynamics during DRS locomotion. A new set of sufficient stability conditions are then derived to directly guide the controller design for ensuring the asymptotic stability of the HT-LIP model under general, unknown, vertical DRS motions. Further, the footstep controller is cast as a computationally efficient quadratic program that incorporates the proposed HT-LIP model and stability conditions. The middle layer takes the desired footstep locations generated by the higher layer as input to produce kinematically feasible full-body reference trajectories, which are then accurately tracked by a lower-layer torque controller. Hardware experiments on a Unitree Go1 quadrupedal robot confirm the robustness of the proposed framework under various unknown, aperiodic, vertical DRS motions and uncertainties (e.g., slippery and uneven surfaces, solid and liquid loads, and sudden pushes).
Submitted 24 March, 2024;
originally announced March 2024.
-
BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech
Authors:
Mir Sayeed Mohammad,
Azizul Zahid,
Md Asif Iqbal
Abstract:
Automatic speech recognition (ASR) converts the human voice into readily understandable and categorized text or words. Although Bengali is one of the most widely spoken languages in the world, there have been very few studies on Bengali ASR, particularly on Bangladeshi-accented Bengali. In this study, audio recordings of spoken digits (0-9) from university students were used to create a Bengali speech digits dataset that may be employed to train artificial neural networks for voice-based digital input systems. This paper also compares the Bengali digit recognition accuracy of several Convolutional Neural Networks (CNNs) using spectrograms and shows that a test accuracy of 98.23% is achievable using parameter-efficient models such as SqueezeNet on our dataset.
Submitted 20 March, 2024;
originally announced March 2024.
-
Privacy-Preserving Collaborative Split Learning Framework for Smart Grid Load Forecasting
Authors:
Asif Iqbal,
Prosanta Gope,
Biplab Sikdar
Abstract:
Accurate load forecasting is crucial for energy management, infrastructure planning, and demand-supply balancing. Smart meter data availability has led to the demand for sensor-based load forecasting. Conventional ML allows training a single global model using data from multiple smart meters requiring data transfer to a central server, raising concerns for network requirements, privacy, and security. We propose a split learning-based framework for load forecasting to alleviate this issue. We split a deep neural network model into two parts, one for each Grid Station (GS) responsible for an entire neighbourhood's smart meters and the other for the Service Provider (SP). Instead of sharing their data, client smart meters use their respective GSs' model split for forward pass and only share their activations with the GS. Under this framework, each GS is responsible for training a personalized model split for their respective neighbourhoods, whereas the SP can train a single global or personalized model for each GS. Experiments show that the proposed models match or exceed a centrally trained model's performance and generalize well. Privacy is analyzed by assessing information leakage between data and shared activations of the GS model split. Additionally, differential privacy enhances local data privacy while examining its impact on performance. A transformer model is used as our base learner.
Submitted 12 March, 2024; v1 submitted 3 March, 2024;
originally announced March 2024.
-
CHEX-MATE: A LOFAR pilot X-ray–radio study on five radio halo clusters
Authors:
M. Balboni,
F. Gastaldello,
A. Bonafede,
A. Botteon,
I. Bartalucci,
H. Bourdin,
G. Brunetti,
R. Cassano,
S. De Grandi,
F. De Luca,
S. Ettori,
S. Ghizzardi,
M. Gitti,
A. Iqbal,
M. Johnston-Hollitt,
L. Lovisari,
P. Mazzotta,
S. Molendi,
E. Pointecouteau,
G. W. Pratt,
G. Riva,
M. Rossetti,
H. Rottgering,
M. Sereno,
R. J. van Weeren
, et al. (2 additional authors not shown)
Abstract:
The connection between the thermal and non-thermal properties in galaxy clusters hosting radio halos seems fairly well established. However, a comprehensive analysis of such a connection has been made only for integrated quantities (e.g. $L_X - P_{radio}$ relation). In recent years, new-generation radio telescopes have made it possible, for the first time, to study the non-thermal properties of galaxy clusters on a spatially resolved basis. Here, we perform a pilot study to investigate these properties on five targets, by combining X-ray data from the CHEX-MATE project with the second data release from the LOFAR Two-metre Sky Survey. We find a strong correlation ($r_s \sim 0.7$) with a slope less than unity between the radio and X-ray surface brightness. We also report differences in the spatially resolved properties of the radio emission in clusters which show different levels of dynamical disturbance. In particular, less perturbed clusters (according to X-ray parameters) show peaked radio profiles in the centre, with a flattening in the outer regions, while the three dynamically disturbed clusters have steeper profiles in the outer regions. We fit a model to the radio emission in the context of turbulent re-acceleration with a constant ratio between thermal and non-thermal particle energy densities and a magnetic field profile linked to the thermal gas density as $B(r) \propto n_{th}^{0.5}$. We found that this simple model cannot reproduce the behaviour of the observed radio emission.
Submitted 1 March, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
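The correlation coefficient $r_s$ quoted in the abstract above is conventionally a Spearman rank correlation. The sketch below shows how such a coefficient is computed. It is a minimal illustration, not the paper's analysis code: it assumes there are no tied values, and `ranks` and `spearman` are hypothetical helper names.

```python
def ranks(values):
    """Rank values from 1 (smallest) to n (largest); assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def spearman(x, y):
    """Spearman rank correlation via the no-ties formula 1 - 6*sum(d^2)/(n(n^2-1))."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n * n - 1))

# Made-up values standing in for radio and X-ray surface brightness.
radio = [1.0, 2.0, 3.0, 4.0, 5.0]
xray = [2.0, 1.0, 4.0, 3.0, 5.0]
r_s = spearman(radio, xray)
```

Because the coefficient depends only on ranks, it measures how monotonic the relation is and is insensitive to the slope of the relation itself.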
-
CHEX-MATE: Robust reconstruction of temperature profiles in galaxy clusters with XMM-Newton
Authors:
M. Rossetti,
D. Eckert,
F. Gastaldello,
E. Rasia,
G. W. Pratt,
S. Ettori,
S. Molendi,
M. Arnaud,
M. Balboni,
I. Bartalucci,
R. M. Batalha,
S. Borgani,
H. Bourdin,
S. De Grandi,
F. De Luca,
M. De Petris,
W. Forman,
M. Gaspari,
S. Ghizzardi,
A. Iqbal,
S. Kay,
L. Lovisari,
B. J. Maughan,
P. Mazzotta,
E. Pointecouteau
, et al. (3 additional authors not shown)
Abstract:
The "Cluster HEritage project with XMM-Newton: Mass Assembly and Thermodynamics at the Endpoint of structure formation" (CHEX-MATE) is a multi-year Heritage program to obtain homogeneous XMM-Newton observations of a representative sample of 118 galaxy clusters. The observations are tuned to reconstruct the distribution of the main thermodynamic quantities of the ICM up to $R_{500}$ and to obtain individual mass measurements, via the hydrostatic-equilibrium equation, with a precision of 15-20%. Temperature profiles are a necessary ingredient for the scientific goals of the project and it is thus crucial to derive the best possible temperature measurements from our data. This is why we have built a new pipeline for spectral extraction and analysis of XMM-Newton data, based on a new physically motivated background model and on a Bayesian approach with Markov Chain Monte Carlo (MCMC) methods, that we present in this paper for the first time. We applied this new method to a subset of 30 galaxy clusters representative of the CHEX-MATE sample and show that we can obtain reliable temperature measurements up to regions where the source intensity is as low as 20% of the background, keeping systematic errors below 10%. We compare the median profile of our sample and the best fit slope at large radii with literature results and we find a good agreement with other measurements based on XMM-Newton data. Conversely, when we exclude from our analysis the most contaminated regions, where the source intensity is below 20% of the background, we find significantly flatter profiles, in agreement with predictions from numerical simulations and independent measurements with a combination of Sunyaev-Zeldovich and X-ray imaging data.
Submitted 28 February, 2024;
originally announced February 2024.
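For reference, the hydrostatic-equilibrium mass estimate mentioned in the abstract above is commonly written in the form below. This is the standard textbook expression, assuming spherical symmetry and an ideal gas, and is not necessarily the exact form used in the paper. The symbols are introduced here for illustration: $T(r)$ is the gas temperature, $n_g(r)$ the gas density, $\mu$ the mean molecular weight, and $m_p$ the proton mass.

```latex
M(<r) = -\frac{k_B\, T(r)\, r}{G\, \mu\, m_p}
        \left[ \frac{d \ln n_g}{d \ln r} + \frac{d \ln T}{d \ln r} \right]
```

The temperature enters both as a prefactor and through its logarithmic slope, so an error in the temperature profile carries over to the mass estimate.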
-
Generalizability Under Sensor Failure: Tokenization + Transformers Enable More Robust Latent Spaces
Authors:
Geeling Chau,
Yujin An,
Ahamed Raffey Iqbal,
Soon-Jo Chung,
Yisong Yue,
Sabera Talukder
Abstract:
A major goal in neuroscience is to discover neural data representations that generalize. This goal is challenged by variability along recording sessions (e.g. environment), subjects (e.g. varying neural structures), and sensors (e.g. sensor noise), among others. Recent work has begun to address generalization across sessions and subjects, but few study robustness to sensor failure which is highly prevalent in neuroscience experiments. In order to address these generalizability dimensions we first collect our own electroencephalography dataset with numerous sessions, subjects, and sensors, then study two time series models: EEGNet (Lawhern et al., 2018) and TOTEM (Talukder et al., 2024). EEGNet is a widely used convolutional neural network, while TOTEM is a discrete time series tokenizer and transformer model. We find that TOTEM outperforms or matches EEGNet across all generalizability cases. Finally through analysis of TOTEM's latent codebook we observe that tokenization enables generalization.
Submitted 19 March, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Bayesian variable selection in sample selection models using spike-and-slab priors
Authors:
Adam Iqbal,
Emmanuel Ogundimu,
F. Javier Rubio
Abstract:
Sample selection models represent a common methodology for correcting bias induced by data missing not at random. It is well known that these models are not empirically identifiable without exclusion restrictions. In other words, some variables predictive of missingness do not affect the outcome model of interest. The drive to establish this requirement often leads to the inclusion of irrelevant variables in the model. A recent proposal uses adaptive LASSO to circumvent this problem, but its performance depends on the so-called covariance assumption, which can be violated in small to moderate samples. Additionally, there are no tools yet for post-selection inference for this model. To address these challenges, we propose two families of spike-and-slab priors to conduct Bayesian variable selection in sample selection models. These prior structures allow for constructing a Gibbs sampler with tractable conditionals, which is scalable to the dimensions of practical interest. We illustrate the performance of the proposed methodology through a simulation study and present a comparison against adaptive LASSO and stepwise selection. We also provide two applications using publicly available real data. An implementation and code to reproduce the results in this paper can be found at https://github.com/adam-iqbal/selection-spike-slab
Submitted 13 December, 2023; v1 submitted 6 December, 2023;
originally announced December 2023.
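The idea behind a spike-and-slab prior can be illustrated by drawing coefficients from one. The abstract above does not specify the two prior families the paper proposes, so this sketch assumes the simplest generic variant: a point-mass spike at zero and a zero-mean Gaussian slab. The names `sample_spike_and_slab`, `inclusion_prob`, and `slab_sd` are hypothetical.

```python
import random

def sample_spike_and_slab(n_coef, inclusion_prob, slab_sd, rng):
    """Draw (included, beta) pairs from a point-mass-spike / Gaussian-slab prior.

    Assumption: the spike is an exact zero and the slab is N(0, slab_sd^2).
    """
    draws = []
    for _ in range(n_coef):
        included = rng.random() < inclusion_prob
        beta = rng.gauss(0.0, slab_sd) if included else 0.0
        draws.append((included, beta))
    return draws

rng = random.Random(0)  # seeded only to make the illustration repeatable
draws = sample_spike_and_slab(n_coef=10, inclusion_prob=0.3, slab_sd=2.0, rng=rng)
```

In a full variable-selection procedure, the posterior probability of each inclusion indicator, rather than these prior draws, indicates which variables belong in the model.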
-
Theoretical investigation of slow gain recovery of quantum cascade lasers observed in pump-probe experiment
Authors:
Mrinmoy Kundu,
Aroni Ghosh,
Abdullah Jubair Bin Iqbal,
Muhammad Anisuzzaman Talukder
Abstract:
Time-resolved spectroscopy-based pump-probe experiments performed on quantum cascade lasers (QCLs) exhibit an initial fast gain recovery followed by a slow tail such that the equilibrium gain is not recovered in a cavity round-trip time. This ultra-slow gain recovery or non-recovered gain cannot be explained by only the intersubband carrier dynamics of QCLs. This work shows that the Fabry-Perot cavity dynamics and localized intersubband electron heating of QCLs are essential to the ultra-slow gain recovery and the non-recovered gain. We developed a comprehensive model, coupling cavity dynamics to the intersubband electrons' thermal evolution. We employ a four-level coupled Maxwell-Bloch model that considers temperature-dependent scattering and transport mechanisms in calculating the gain recovery dynamics. If an intense pump pulse electrically pumped close to the threshold propagates in the forward direction after being coupled into the cavity, the reflected pump pulse will significantly deplete the gain medium while propagating in the backward direction. Additionally, we show that the intersubband electrons sustain a localized high temperature even after the pump pulse has left, which affects the overall carrier dynamics and leads to an ultra-slow gain recovery process. At near-perfect reflectivity, we observe a gain depletion of 4% for a 2 mm QCL. We further demonstrate that an additional 10% gain depletion of the probe pulse is seen at a steady state when the laser is pumped at 1.6 times the threshold compared to the case where the hot electron effect is not considered.
Submitted 24 November, 2023;
originally announced November 2023.
-
Evolution of X-ray galaxy Cluster Properties in a Representative Sample (EXCPReS). Optimal binning for temperature profile extraction
Authors:
C. M. H. Chen,
M. Arnaud,
E. Pointecouteau,
G. W. Pratt,
A. Iqbal
Abstract:
We present XMM-Newton observations of a representative X-ray selected sample of 31 galaxy clusters at moderate redshift $(0.4<z<0.6)$, spanning the mass range $10^{14} < M_{\textrm 500} < 10^{15}$~M$_\odot$. This sample, EXCPRES (Evolution of X-ray galaxy Cluster Properties in a Representative Sample), is used to test and validate a new method to produce optimally-binned cluster X-ray temperature profiles. The method uses a dynamic programming algorithm, based on partitioning of the soft-band X-ray surface brightness profile, to obtain a binning scheme that optimally fulfils a given signal-to-noise threshold criterion out to large radius. From the resulting optimally-binned EXCPRES temperature profiles, and combining with those from the local REXCESS sample, we provide a generic scaling relation between the relative error on the temperature and the [0.3-2] keV surface brightness signal-to-noise ratio, and its dependence on temperature and redshift. We derive an average scaled 3D temperature profile for the sample. Comparing to the average scaled 3D temperature profiles from REXCESS, we find no evidence for evolution of the average profile shape within the redshift range that we probe.
Submitted 10 June, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
The Karush-Kuhn-Tucker Optimality Conditions for Multi-Objective Interval-Valued Optimization Problem on Hadamard Manifolds
Authors:
Hilal Ahmad Bhat,
Akhlad Iqbal,
Izhar Ahmad
Abstract:
The KKT optimality conditions for multi-objective interval-valued optimization problems on Hadamard manifolds are studied in this paper. Several concepts of Pareto optimal solutions, considered under LU and CW ordering on the class of all closed intervals in $\mathbb{R}$, are given. The KKT conditions are presented under the notions of convexity, pseudo-convexity and generalized Hukuhara difference. We show, with the help of an example, that the results obtained in this paper for solving multi-objective interval-valued optimization problems on Hadamard spaces are more general than the existing ones on Euclidean spaces. The main results are supported by examples.
Submitted 26 August, 2024; v1 submitted 17 September, 2023;
originally announced November 2023.
-
LogShield: A Transformer-based APT Detection System Leveraging Self-Attention
Authors:
Sihat Afnan,
Mushtari Sadia,
Shahrear Iqbal,
Anindya Iqbal
Abstract:
Cyber attacks are often identified using system and network logs. There have been significant prior works that utilize provenance graphs and ML techniques to detect attacks, specifically advanced persistent threats (APTs), which are very difficult to detect. Lately, there have been studies where transformer-based language models are being used to detect various types of attacks from system logs. However, no such attempts have been made in the case of APTs. In addition, existing state-of-the-art techniques that use system provenance graphs lack a data processing framework generalized across datasets for optimal performance. For mitigating this limitation as well as exploring the effectiveness of transformer-based language models, this paper proposes LogShield, a framework designed to detect APT attack patterns leveraging the power of self-attention in transformers. We incorporate customized embedding layers to effectively capture the context of event sequences derived from provenance graphs. While acknowledging the computational overhead associated with training transformer networks, our framework surpasses existing LSTM and language models regarding APT detection. We integrated the model parameters and training procedure from the RoBERTa model and conducted extensive experiments on well-known APT datasets (DARPA OpTC and DARPA TC E3). Our framework achieved superior F1 scores of 98% and 95% on the two datasets respectively, surpassing the F1 scores of 96% and 94% obtained by LSTM models. Our findings suggest that LogShield's performance benefits from larger datasets and demonstrates its potential for generalization across diverse domains. These findings contribute to the advancement of APT attack detection methods and underscore the significance of transformer-based architectures in addressing security challenges in computer systems.
Submitted 9 November, 2023;
originally announced November 2023.
-
Optimality Conditions for Interval-Valued Optimization Problems on Riemannian Manifolds Under a Total Order Relation
Authors:
Hilal Ahmad Bhat,
Akhlad Iqbal,
Mahwash Aftab
Abstract:
This article explores fundamental properties of convex interval-valued functions defined on Riemannian manifolds. The study employs generalized Hukuhara directional differentiability to derive KKT-type optimality conditions for an interval-valued optimization problem on Riemannian manifolds. Based on the type of functions involved in the optimization problem, we consider the following cases:
1. objective function as well as constraints are real-valued;
2. objective function is interval-valued, and constraints are real-valued;
3. objective function as well as constraints are interval-valued.
The whole theory is justified with the help of examples. The order relation that we use throughout the paper is a total order relation defined on the collection of all closed and bounded intervals in $\mathbb{R}$.
Submitted 6 September, 2024; v1 submitted 17 September, 2023;
originally announced September 2023.
-
CHEX-MATE: A non-parametric deep learning technique to deproject and deconvolve galaxy cluster X-ray temperature profiles
Authors:
A. Iqbal,
G. W. Pratt,
J. Bobin,
M. Arnaud,
E. Rasia,
M. Rossetti,
R. T. Duffy,
I. Bartalucci,
H. Bourdin,
F. De Luca,
M. De Petris,
M. Donahue,
D. Eckert,
S. Ettori,
A. Ferragamo,
M. Gaspari,
F. Gastaldello,
R. Gavazzi,
S. Ghizzardi,
L. Lovisari,
P. Mazzotta,
B. J. Maughan,
E. Pointecouteau,
M. Sereno
Abstract:
Temperature profiles of the hot galaxy cluster intracluster medium (ICM) have a complex non-linear structure that traditional parametric modelling may fail to fully approximate. For this study, we made use of neural networks, for the first time, to construct a data-driven non-parametric model of ICM temperature profiles. A new deconvolution algorithm was then introduced to uncover the true (3D) temperature profiles from the observed projected (2D) temperature profiles. An auto-encoder-inspired neural network was first trained by learning a non-linear interpolatory scheme to build the underlying model of 3D temperature profiles in the radial range of [0.02-2] R$_{500}$, using a sparse set of hydrodynamical simulations from the THREE HUNDRED PROJECT. A deconvolution algorithm using a learning-based regularisation scheme was then developed. The model was tested using high and low resolution input temperature profiles, such as those expected from simulations and observations, respectively. We find that the proposed deconvolution and deprojection algorithm is robust with respect to the quality of the data, the morphology of the cluster, and the deprojection scheme used. The algorithm can recover unbiased 3D radial temperature profiles with a precision of around 5% over most of the fitting range. We apply the method to the first sample of temperature profiles obtained with XMM-Newton for the CHEX-MATE project and compare it to parametric deprojection and deconvolution techniques. Our work sets the stage for future studies that focus on the deconvolution of the thermal profiles (temperature, density, pressure) of the ICM and the dark matter profiles in galaxy clusters, using deep learning techniques in conjunction with X-ray, Sunyaev-Zel'dovich (SZ) and optical datasets.
Submitted 9 November, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Contrastive Learning for API Aspect Analysis
Authors:
G. M. Shahariar,
Tahmid Hasan,
Anindya Iqbal,
Gias Uddin
Abstract:
We present a novel approach - CLAA - for API aspect detection in API reviews that utilizes transformer models trained with a supervised contrastive loss objective function. We evaluate CLAA using performance and impact analysis. For performance analysis, we utilized a benchmark dataset on developer discussions collected from Stack Overflow and compared the results to those obtained using state-of-the-art transformer models. Our experiments show that contrastive learning can significantly improve the performance of transformer models in detecting aspects such as Performance, Security, Usability, and Documentation. For impact analysis, we performed an empirical study and a developer study. On 200 randomly selected and manually labeled online reviews, CLAA achieved 92% accuracy while the SOTA baseline achieved 81.5%. According to our developer study involving 10 participants, the use of 'Stack Overflow + CLAA' resulted in increased accuracy and confidence during API selection. Replication package: https://github.com/disa-lab/Contrastive-Learning-API-Aspect-ASE2023
△ Less
Submitted 14 August, 2023; v1 submitted 31 July, 2023;
originally announced July 2023.
-
A Novel DDPM-based Ensemble Approach for Energy Theft Detection in Smart Grids
Authors:
Xun Yuan,
Yang Yang,
Asif Iqbal,
Prosanta Gope,
Biplab Sikdar
Abstract:
Energy theft, characterized by manipulating energy consumption readings to reduce payments, poses a dual threat, causing financial losses for grid operators and undermining the performance of smart grids. Effective Energy Theft Detection (ETD) methods become crucial in mitigating these risks by identifying such fraudulent activities in their early stages. However, the majority of current ETD methods rely on supervised learning, which is hindered by the difficulty of labelling data and the risk of overfitting known attacks. To address these challenges, several unsupervised ETD methods have been proposed, focusing on learning the normal patterns from honest users, specifically the reconstruction of input. However, our investigation reveals a limitation in current unsupervised ETD methods, as they can only detect anomalous behaviours in users exhibiting regular patterns. Users with high-variance behaviours pose a challenge to these methods. In response, this paper introduces a Denoising Diffusion Probabilistic Model (DDPM)-based ETD approach. This innovative approach demonstrates impressive ETD performance on high-variance smart grid data by incorporating additional attributes correlated with energy consumption. The proposed methods improve the average ETD performance on high-variance smart grid data from below 0.5 to over 0.9 w.r.t. AUC. On the other hand, our experimental findings indicate that while the state-of-the-art ETD methods based on reconstruction error can identify ETD attacks for the majority of users, they prove ineffective in detecting attacks for certain users. To address this, we propose a novel ensemble approach that considers both reconstruction error and forecasting error, enhancing the robustness of the ETD methodology. The proposed ensemble method improves the average ETD performance on the stealthiest attacks from nearly 0 to 0.5 w.r.t. 5%-TPR.
Submitted 13 January, 2024; v1 submitted 30 July, 2023;
originally announced July 2023.
-
CHEX-MATE: CLUster Multi-Probes in Three Dimensions (CLUMP-3D), I. Gas Analysis Method using X-ray and Sunyaev-Zel'dovich Effect Data
Authors:
Junhan Kim,
Jack Sayers,
Mauro Sereno,
Iacopo Bartalucci,
Loris Chappuis,
Sabrina De Grandi,
Federico De Luca,
Marco De Petris,
Megan E. Donahue,
Dominique Eckert,
Stefano Ettori,
Massimo Gaspari,
Fabio Gastaldello,
Raphael Gavazzi,
Adriana Gavidia,
Simona Ghizzardi,
Asif Iqbal,
Scott Kay,
Lorenzo Lovisari,
Ben J. Maughan,
Pasquale Mazzotta,
Nobuhiro Okabe,
Etienne Pointecouteau,
Gabriel W. Pratt,
Mariachiara Rossetti
, et al. (1 additional author not shown)
Abstract:
Galaxy clusters are the products of structure formation through myriad physical processes that affect their growth and evolution throughout cosmic history. As a result, the matter distribution within galaxy clusters, or their shape, is influenced by cosmology and astrophysical processes, in particular the accretion of new material due to gravity. We introduce an analysis method to investigate the 3D triaxial shapes of galaxy clusters from the Cluster HEritage project with XMM-Newton -- Mass Assembly and Thermodynamics at the Endpoint of structure formation (CHEX-MATE). In this work, the first paper of a CHEX-MATE triaxial analysis series, we focus on utilizing X-ray data from XMM and Sunyaev-Zel'dovich (SZ) effect maps from Planck and ACT to obtain a three dimensional triaxial description of the intracluster medium (ICM) gas. We present the forward modeling formalism of our technique, which projects a triaxial ellipsoidal model for the gas density and pressure to compare directly with the observed two dimensional distributions in X-rays and the SZ effect. A Markov chain Monte Carlo is used to estimate the posterior distributions of the model parameters. Using mock X-ray and SZ observations of a smooth model, we demonstrate that the method can reliably recover the true parameter values. In addition, we apply the analysis to reconstruct the gas shape from the observed data of one CHEX-MATE galaxy cluster, Abell 1689, to illustrate the technique. The inferred parameters are in agreement with previous analyses for that cluster, and our results indicate that the geometrical properties, including the axial ratios of the ICM distribution, are constrained to within a few percent. With much better precision than previous studies, we thus further establish that Abell 1689 is significantly elongated along the line of sight, resulting in its exceptional gravitational lensing properties.
Submitted 21 March, 2024; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Towards Automated Classification of Code Review Feedback to Support Analytics
Authors:
Asif Kamal Turzo,
Fahim Faysal,
Ovi Poddar,
Jaydeb Sarker,
Anindya Iqbal,
Amiangshu Bosu
Abstract:
Background: As improving code review (CR) effectiveness is a priority for many software development organizations, projects have deployed CR analytics platforms to identify potential improvement areas. The number of issues identified, which is a crucial metric to measure CR effectiveness, can be misleading if all issues are placed in the same bin. Therefore, a finer-grained classification of issues identified during CRs can provide actionable insights to improve CR effectiveness. Although a recent work by Fregnan et al. proposed automated models to classify CR-induced changes, we have noticed two potential improvement areas -- i) classifying comments that do not induce changes and ii) using deep neural networks (DNN) in conjunction with code context to improve performances. Aims: This study aims to develop an automated CR comment classifier that leverages DNN models to achieve a more reliable performance than Fregnan et al. Method: Using a manually labeled dataset of 1,828 CR comments, we trained and evaluated supervised learning-based DNN models leveraging code context, comment text, and a set of code metrics to classify CR comments into one of the five high-level categories proposed by Turzo and Bosu. Results: Based on our 10-fold cross-validation-based evaluations of multiple combinations of tokenization approaches, we found a model using CodeBERT achieving the best accuracy of 59.3%. Our approach outperforms Fregnan et al.'s approach by achieving 18.7% higher accuracy. Conclusion: Besides facilitating improved CR analytics, our proposed model can be useful for developers in prioritizing code review feedback and selecting reviewers.
Submitted 7 July, 2023;
originally announced July 2023.
-
CHEX-MATE: Constraining the origin of the scatter in galaxy cluster radial X-ray surface brightness profiles
Authors:
I. Bartalucci,
S. Molendi,
E. Rasia,
G. W. Pratt,
M. Arnaud,
M. Rossetti,
F. Gastaldello,
D. Eckert,
M. Balboni,
S. Borgani,
H. Bourdin,
M. G. Campitiello,
S. De Grandi,
M. De Petris,
R. T. Duffy,
S. Ettori,
A. Ferragamo,
M. Gaspari,
R. Gavazzi,
S. Ghizzardi,
A. Iqbal,
S. T. Kay,
L. Lovisari,
P. Mazzotta,
B. J. Maughan
, et al. (3 additional authors not shown)
Abstract:
We investigate the statistical properties and the origin of the scatter within the spatially resolved surface brightness profiles of the CHEX-MATE sample, formed by 118 galaxy clusters selected via the SZ effect. These objects have been drawn from the Planck SZ catalogue and cover a wide range of masses, M$_{500}=[2-15] \times 10^{14} $M$_{\odot}$, and redshift, z=[0.05,0.6]. We derived the surface brightness and emission measure profiles and determined the statistical properties of the full sample. We found that there is a critical scale, R$\sim 0.4 R_{500}$, within which morphologically relaxed and disturbed object profiles diverge. The median of each sub-sample differs by a factor of $\sim 10$ at $0.05\,R_{500}$. There are no significant differences between mass- and redshift-selected sub-samples once proper scaling is applied. We compare CHEX-MATE with a sample of 115 clusters drawn from The Three Hundred suite of cosmological simulations. We found that simulated emission measure profiles are systematically steeper than those of observations. For the first time, the simulations were used to break down the components causing the scatter between the profiles. We investigated the behaviour of the scatter due to object-by-object variation. We found that the high scatter, approximately 110%, at $R<0.4R_{500}$ is due to a genuine difference between the distribution of the gas in the core. The intermediate scale, $R_{500} =[0.4-0.8]$, is characterised by the minimum value of the scatter on the order of 0.56, indicating a region where cluster profiles are the closest to the self-similar regime. Larger scales are characterised by increasing scatter due to the complex spatial distribution of the gas. Also for the first time, we verify that the scatter due to projection effects is smaller than the scatter due to genuine object-by-object variation in all the considered scales. [abridged]
Submitted 4 May, 2023;
originally announced May 2023.
-
Imaging Light-Induced Migration of Dislocations in Halide Perovskites with 3D Nanoscale Strain Mapping
Authors:
Kieran W. P. Orr,
Jiecheng Diao,
Muhammad Naufal Lintangpradipto,
Darren J. Batey,
Affan N. Iqbal,
Simon Kahmann,
Kyle Frohna,
Milos Dubajic,
Szymon J. Zelewski,
Alice E. Dearle,
Thomas A. Selby,
Peng Li,
Tiarnan A. S. Doherty,
Stephan Hofmann,
Osman M. Bakr,
Ian K. Robinson,
Samuel D. Stranks
Abstract:
In recent years, halide perovskite materials have been used to make high performance solar cell and light-emitting devices. However, material defects still limit device performance and stability. Here, we use synchrotron-based Bragg Coherent Diffraction Imaging to visualise nanoscale strain fields, such as those local to defects, in halide perovskite microcrystals. We find significant strain heterogeneity within MAPbBr$_{3}$ (MA = CH$_{3}$NH$_{3}^{+}$) crystals in spite of their high optoelectronic quality, and identify both $\langle$100$\rangle$ and $\langle$110$\rangle$ edge dislocations through analysis of their local strain fields. By imaging these defects and strain fields in situ under continuous illumination, we uncover dramatic light-induced dislocation migration across hundreds of nanometres. Further, by selectively studying crystals that are damaged by the X-ray beam, we correlate large dislocation densities and increased nanoscale strains with material degradation and substantially altered optoelectronic properties assessed using photoluminescence microscopy measurements. Our results demonstrate the dynamic nature of extended defects and strain in halide perovskites and their direct impact on device performance and operational stability.
Submitted 19 April, 2023;
originally announced April 2023.
-
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering
Authors:
Rishov Paul,
Md. Mohib Hossain,
Mohammed Latif Siddiq,
Masum Hasan,
Anindya Iqbal,
Joanna C. S. Santos
Abstract:
Sequence-to-sequence models have been used to transform erroneous programs into correct ones when trained with a large enough dataset. Some recent studies also demonstrated strong empirical evidence that code review could improve the program repair further. Large language models, trained with Natural Language (NL) and Programming Language (PL), can contain inherent knowledge of both. In this study, we investigate if this inherent knowledge of PL and NL can be utilized to improve automated program repair. We applied PLBART and CodeT5, two state-of-the-art language models that are pre-trained with both PL and NL, on two such natural language-based program repair datasets and found that the pre-trained language models fine-tuned with datasets containing both code review and subsequent code changes notably outperformed each of the previous models. With the advent of code generative models like Codex and GPT-3.5-Turbo, we also performed zero-shot and few-shot learning-based prompt engineering to assess their performance on these datasets. However, the practical application of using LLMs in the context of automated program repair is still a long way off based on our manual analysis of the repaired code generated by the learning models.
Submitted 21 July, 2023; v1 submitted 16 April, 2023;
originally announced April 2023.
-
Resolving game theoretical dilemmas with quantum states
Authors:
Azhar Iqbal,
James M. Chappell,
Claudia Szabo,
Derek Abbott
Abstract:
We present a new framework for creating a quantum version of a classical game, based on Fine's theorem. This theorem shows that for a given set of marginals, a system of Bell's inequalities constitutes both necessary and sufficient conditions for the existence of the corresponding joint probability distribution. Using Fine's theorem, we re-express both the player payoffs and their strategies in terms of a set of marginals, thus paving the way for the consideration of sets of marginals -- corresponding to entangled quantum states -- for which no corresponding joint probability distribution may exist. By harnessing quantum states and employing Positive Operator-Valued Measures (POVMs), we then consider particular quantum states that can potentially resolve dilemmas inherent in classical games.
Submitted 2 November, 2023; v1 submitted 7 April, 2023;
originally announced April 2023.
-
Strongly geodesic preinvexity and Strongly Invariant η-Monotonicity on Riemannian Manifolds and its Application
Authors:
Akhlad Iqbal,
Askar Hussain,
Hilal Ahmad Bhat
Abstract:
In this paper, we present strongly geodesic preinvexity on Riemannian manifolds (RM) and strongly η-invexity of order m on RM. Furthermore, we define strongly invariant η-monotonicity of order m on RM. Under Condition C, an important characterization of these functions is studied. We construct several non-trivial examples in support of these definitions. Afterwards, an important and significant characterization of strict η-minimizers (η-minimizers) of order m for MOP and a solution to the variational-like inequality problem (VVLIP) are derived.
Submitted 6 February, 2023;
originally announced February 2023.
-
Non-linear programming problem for semi strongly $E$-preinvexity
Authors:
Akhlad Iqbal,
Askar Hussain
Abstract:
In this article, we present semi strongly $E$-preinvexity and semi strongly $E$-invexity. To demonstrate the existence of these functions, certain nontrivial examples have been developed. Several significant relationships and characterizations of these functions on strongly $E$-invex sets are discussed. Furthermore, we consider a non-linear programming problem for semi strongly $E$-preinvex functions and investigate relationships between the set of optimal solutions and these functions.
Submitted 18 January, 2023;
originally announced January 2023.
-
Quasi Strongly $E$-preinvexity and its Relationships with Nonlinear Programming
Authors:
Akhlad Iqbal,
Askar Hussain
Abstract:
In this paper, we extend the class of strongly $E$-preinvex and strongly $E$-invex functions to quasi strongly $E$-preinvex, quasi strongly $E$-invex and pseudo strongly $E$-invex functions. Some suitable nontrivial examples have been constructed in support of our definitions. Several interesting properties and relationships of these functions are discussed. Furthermore, to show the application of our results, we consider a nonlinear programming problem and show that the local minimum point is also a strict global minimum.
Submitted 18 January, 2023;
originally announced January 2023.
-
Real-Time Walking Pattern Generation of Quadrupedal Dynamic-Surface Locomotion based on a Linear Time-Varying Pendulum Model
Authors:
Amir Iqbal,
Sushant Veer,
Yan Gu
Abstract:
This study introduces an analytically tractable and computationally efficient model of the legged robot dynamics associated with locomotion on a dynamic rigid surface (DRS), and develops a real-time motion planner based on the proposed model and its analytical solution. This study first theoretically extends the classical linear inverted pendulum (LIP) model from legged locomotion on a static surface to DRS locomotion, by relaxing the LIP's underlying assumption that the surface is static. The resulting model, which we call "DRS-LIP", is explicitly time-varying. After converting the DRS-LIP into Mathieu's equation, an approximate analytical solution of the DRS-LIP is obtained, which is reasonably accurate with a low computational cost. Furthermore, to illustrate the practical uses of the analytical results, they are exploited to develop a hierarchical motion planner that efficiently generates physically feasible trajectories for DRS locomotion. Finally, the effectiveness of the proposed theoretical results and motion planner is demonstrated both through PyBullet simulations and experimentally on a Laikago quadrupedal robot that walks on a rocking treadmill. The videos of simulations and hardware experiments are available at https://youtu.be/u2Q_u2pR99c.
Submitted 8 January, 2023;
originally announced January 2023.
-
Effective, Practical PON Monitoring Beyond the Splitter
Authors:
Neil Parkin,
Sophie Minoughan,
Md Asif Iqbal
Abstract:
Monitoring beyond the splitter in a PON is costly due to the need for additional hardware. A non-standard monitoring wavelength can reduce cost and increase the visibility of customers to 97% on a C+ GPON
Submitted 9 December, 2022;
originally announced December 2022.
-
Generalized Hukuhara directional differentiability of interval-valued functions on Riemannian manifolds
Authors:
Hilal Ahmad Bhat,
Akhlad Iqbal
Abstract:
In this paper, we show that generalized Hukuhara directional differentiability of an interval-valued function (IVF) defined on Riemannian manifolds is not equivalent to the directional differentiability of its center and half-width functions, and hence not to that of its endpoint functions. This contrasts with S.-L. Chen's assertion that the equivalence holds in terms of the endpoint functions of an IVF defined on a Hadamard manifold. Additionally, the paper addresses some other inaccuracies which arise when assuming the convexity of a function at a single point in its domain. In light of these arguments, the paper presents some basic results that relate to both the convexity and directional differentiability of an IVF.
Submitted 18 August, 2024; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Exploring diffuse radio emission in galaxy clusters and groups with the uGMRT and the SKA
Authors:
Surajit Paul,
Ruta Kale,
Abhirup Datta,
Aritra Basu,
Sharanya Sur,
Viral Parekh,
Prateek Gupta,
Swarna Chatterjee,
Sameer Salunkhe,
Asif Iqbal,
Mamta Pandey-Pommier,
Ramij Raja,
Majidul Rahaman,
Somak Raychaudhury,
Biman B. Nath,
Subhabrata Majumdar
Abstract:
Diffuse radio emission has been detected in a considerable number of galaxy clusters and groups, revealing the presence of pervasive cosmic magnetic fields, and of relativistic particles in the large-scale structure (LSS) of the Universe. Since cluster radio emission is faint and steep-spectrum, its observations are largely limited by the instrument sensitivity and frequency of observation, leading to a dearth of information, more so for lower-mass systems. The unprecedented sensitivity of recently commissioned low-frequency radio telescope arrays, aided by the development of advanced calibration and imaging techniques, has helped in achieving unparalleled image quality. At the same time, the development of sophisticated numerical simulations and the availability of supercomputing facilities have paved the way for high-resolution numerical modeling of radio emission, and the structure of the cosmic magnetic fields in LSS, leading to predictions matching the capabilities of observational facilities. In view of this rapidly evolving scenario in modeling and observations, in this review, we summarise the role of the new telescope arrays and the development of advanced imaging techniques and discuss the detections of various kinds of cluster radio sources. In particular, we discuss observations of the cosmic web in the form of supercluster filaments, studies of emission in poor clusters and groups of galaxies, and of ultra-steep spectrum sources. We also review the current theoretical understanding of various diffuse cluster radio sources and the associated magnetic field and polarization. As the statistics of detections improve along with our theoretical understanding, we update the source classification schemes based on their intrinsic properties. We conclude by summarising the role of the upgraded GMRT and our expectations from the upcoming Square Kilometre Array (SKA) observatories.
Submitted 2 November, 2022;
originally announced November 2022.
-
Developer Discussion Topics on the Adoption and Barriers of Low Code Software Development Platforms
Authors:
Md Abdullah Al Alamin,
Gias Uddin,
Sanjay Malakar,
Sadia Afroz,
Tameem Bin Haider,
Anindya Iqbal
Abstract:
Low-code software development (LCSD) is an emerging approach to democratize application development for software practitioners from diverse backgrounds. LCSD platforms promote rapid application development with a drag-and-drop interface and minimal programming by hand. As it is a relatively new paradigm, it is vital to study developers' difficulties when adopting LCSD platforms. Software engineers frequently use the online developer forum Stack Overflow (SO) to seek assistance with technical issues. We observe a growing body of LCSD-related posts in SO. This paper presents an empirical study of around 33K SO posts containing discussions of 38 popular LCSD platforms. We use Topic Modeling to determine the topics discussed in those posts. Additionally, we examine how these topics are spread across the various phases of the agile software development life cycle (SDLC) and which part of LCSD is the most popular and challenging. Our study offers several interesting findings. First, we find 40 LCSD topics that we group into five categories: Application Customization, Database and File Management, Platform Adoption, Platform Maintenance, and Third-party API Integration. Second, while the Application Customization (30%) and Data Storage (25%) topic categories are the most common, inquiries relating to several other categories (e.g., the Platform Adoption topic category) have gained considerable attention in recent years. Third, all topic categories are evolving rapidly, especially during the Covid-19 pandemic. The findings of this study have implications for all three LCSD stakeholders: LCSD platform vendors, LCSD developers/practitioners, and researchers and educators. Researchers and LCSD platform vendors can collaborate to improve different aspects of LCSD, such as better tutorial-based documentation, testing, and DevOps support.
Submitted 2 September, 2022;
originally announced September 2022.
-
A Vision Transformer-Based Approach to Bearing Fault Classification via Vibration Signals
Authors:
Abid Hasan Zim,
Aeyan Ashraf,
Aquib Iqbal,
Asad Malik,
Minoru Kuribayashi
Abstract:
Rolling bearings are the most crucial components of rotating machinery. Identifying defective bearings in a timely manner may prevent the malfunction of an entire machinery system. The mechanical condition monitoring field has entered the big data phase as a result of the fast advancement of machine parts. When working with large amounts of data, the manual feature extraction approach has the drawback of being inefficient and inaccurate. Data-driven methods such as deep learning have been successfully used in recent years for intelligent mechanical fault detection. Convolutional neural networks (CNNs) were mostly used in earlier research to detect and identify bearing faults. The CNN model, however, has trouble handling fault-time information, which leads to poor classification results. In this study, bearing defects have been classified using a state-of-the-art Vision Transformer (ViT). Bearing defects were classified using Case Western Reserve University (CWRU) bearing failure laboratory experimental data. The research took into account 13 distinct kinds of defects under 0-load situations in addition to normal bearing conditions. Using the short-time Fourier transform (STFT), the vibration signals were converted into 2D time-frequency images. The 2D time-frequency images are used as input to the ViT. The model achieved an overall accuracy of 98.8%.
Submitted 20 September, 2022; v1 submitted 15 August, 2022;
originally announced August 2022.
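The STFT step mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `stft_magnitude`, the Hann window, the frame length of 256, the hop of 128, the 12 kHz sampling rate, and the synthetic 1 kHz tone are all assumptions chosen for illustration, and no CWRU data is used.

```python
import numpy as np

def stft_magnitude(signal, frame_len=256, hop=128):
    """Return a (freq_bins, frames) magnitude spectrogram.

    Hypothetical helper: slices the signal into overlapping frames,
    applies a Hann window, and takes the real FFT of each frame.
    """
    signal = np.asarray(signal, dtype=float)
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # rfft along each frame, then transpose so rows are frequency bins
    return np.abs(np.fft.rfft(frames, axis=1)).T

# Synthetic stand-in for a vibration signal (assumed values, not measured data)
fs = 12_000                       # assumed sampling rate in Hz
t = np.arange(fs) / fs            # one second of samples
spec = stft_magnitude(np.sin(2 * np.pi * 1000 * t))

# Frequency bin carrying the most energy, averaged over all frames
peak_bin = int(spec.mean(axis=1).argmax())
peak_hz = peak_bin * fs / 256
```

The resulting 2D array is the kind of time-frequency image that could be rendered and passed to an image classifier; how the images are scaled or coloured before classification is not stated in the abstract.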
-
Heating of the intracluster medium by buoyant bubbles and sound waves
Authors:
Asif Iqbal,
Subhabrata Majumdar,
Biman B. Nath,
Suparna Roychowdhury
Abstract:
Active galactic nuclei (AGN) powered by the central Super-Massive Black Holes (SMBHs) play a major role in modifying the thermal properties of the intracluster medium (ICM). In this work, we implement two AGN heating models: (i) by buoyant cavities rising through stratified ICM (effervescent model) and (ii) by viscous and conductive dissipation of sound waves (acoustic model). Our aim is to determine whether these heating models are consistent with ICM observables and if one is preferred over the other. We assume an initial entropy profile of ICM that is expected from the purely gravitational infall of the gas in the potential of the dark matter halo. We then incorporate heating, radiative cooling, and thermal conduction to study the evolution of ICM over the age of the clusters. Our results are: (i) Both the heating processes can produce comparable thermal profiles of the ICM with some tuning of relevant parameters. (ii) Thermal conduction is crucially important, even at the level of 10% of the Spitzer values, in transferring the injected energy beyond the central regions, and without which the temperature/entropy profiles are unrealistically high. (iii) The required injected AGN power scales with cluster mass as $M_{\rm vir}^{1.5}$ for both models. (iv) The required AGN luminosity is comparable with the observed radio jet power, reinforcing the idea that AGNs are the dominant heating source in clusters. (v) Finally, we estimate that the fraction of the total AGN luminosity available as the AGN mechanical luminosity at $0.02r_{500}$ is less than 0.05%.
Submitted 3 November, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
Are You Misinformed? A Study of Covid-Related Fake News in Bengali on Facebook
Authors:
Protik Bose Pranto,
Syed Zami-Ul-Haque Navid,
Protik Dey,
Gias Uddin,
Anindya Iqbal
Abstract:
Our opinions and views of life can be shaped by how we perceive the opinions of others on social media like Facebook. This dependence has increased during the COVID-19 pandemic, when we have fewer means to connect with others. However, fake news related to COVID-19 has become a significant problem on Facebook. Bengali is the seventh most spoken language worldwide, yet we are aware of no previous research that studied the prevalence of COVID-19 related fake news in Bengali on Facebook. In this paper, we develop machine learning models to detect fake news in Bengali automatically. The best performing model is BERT, with an F1-score of 0.97. We apply BERT to all Facebook Bengali posts related to COVID-19. We find 10 topics in the COVID-19 Bengali fake news grouped into three categories: system (e.g., medical system), belief (e.g., religious rituals), and social (e.g., scientific awareness).
Submitted 22 March, 2022;
originally announced March 2022.
-
DRS-LIP: Linear Inverted Pendulum Model for Legged Locomotion on Dynamic Rigid Surfaces
Authors:
Amir Iqbal,
Sushant Veer,
Yan Gu
Abstract:
Legged robot locomotion on a dynamic rigid surface (i.e., a rigid surface moving in the inertial frame) involves complex full-order dynamics that are high-dimensional, nonlinear, and time-varying. Towards deriving an analytically tractable dynamic model, this study theoretically extends the reduced-order linear inverted pendulum (LIP) model from legged locomotion on a stationary surface to locomotion on a dynamic rigid surface (DRS). The resulting model is herein termed DRS-LIP. Furthermore, this study introduces an approximate analytical solution of the proposed DRS-LIP that is computationally efficient with high accuracy. To illustrate the practical uses of the analytical results, they are used to develop a hierarchical planning framework that efficiently generates physically feasible trajectories for DRS locomotion. The effectiveness of the proposed theoretical results and motion planner is demonstrated both through simulations and experimentally on a Laikago quadrupedal robot that walks on a rocking treadmill.
Submitted 31 January, 2022;
originally announced February 2022.
-
Equating the area sums of alternative sectors in a circle
Authors:
Azhar Iqbal,
Derek Abbott
Abstract:
We determine the conditions resulting from equating the area sums of alternative sectors in a circle generated by four, two, and three straight lines, respectively, that connect opposite points on its circumference while passing through a point that is arbitrarily placed within the circle.
Submitted 24 October, 2021;
originally announced October 2021.
-
Extended Capture Point and Optimization-based Control for Quadrupedal Robot Walking on Dynamic Rigid Surfaces
Authors:
Amir Iqbal,
Yan Gu
Abstract:
Stabilizing legged robot locomotion on a dynamic rigid surface (DRS) (i.e., rigid surface that moves in the inertial frame) is a complex planning and control problem. The complexity arises due to the hybrid nonlinear walking dynamics subject to explicitly time-varying holonomic constraints caused by the surface movement. The first main contribution of this study is the extension of the capture point from walking on a static surface to locomotion on a DRS as well as the use of the resulting capture point for online motion planning. The second main contribution is a quadratic-programming (QP) based feedback controller design that explicitly considers the DRS movement. The stability and robustness of the proposed control approach are validated through simulations of a quadrupedal robot walking on a DRS with a rocking motion. The simulation results also demonstrate the improved walking performance compared with our previous approach based on offline planning and input-output linearizing control that does not explicitly guarantee the feasibility of ground contact constraints.
Submitted 10 September, 2021;
originally announced September 2021.
-
A Survey-Based Qualitative Study to Characterize Expectations of Software Developers from Five Stakeholders
Authors:
Khalid Hasan,
Partho Chakraborty,
Rifat Shahriyar,
Anindya Iqbal,
Gias Uddin
Abstract:
Background: Studies on developer productivity and well-being find that the perceptions of productivity in a software team can be a socio-technical problem. Intuitively, problems and challenges can be better handled by managing expectations in software teams. Aim: Our goal is to understand whether the expectations of software developers vary towards diverse stakeholders in software teams. Method: We surveyed 181 professional software developers to understand their expectations from five different stakeholders: (1) organizations, (2) managers, (3) peers, (4) new hires, and (5) government and educational institutions. The five stakeholders are determined by conducting semi-formal interviews of software developers. We ask open-ended survey questions and analyze the responses using open coding. Results: We observed 18 multi-faceted expectation types. While some expectations are more specific to a stakeholder, other expectations are cross-cutting. For example, developers expect work benefits from their organizations, but expect the adoption of standard software engineering (SE) practices from their organizations, peers, and new hires. Conclusion: Out of the 18 categories, three categories are related to career growth. This observation supports previous research that happiness cannot be assured by simply offering more money or a promotion. Among the most frequent responses, we find expectations from educational institutions to offer relevant teaching and from governments to improve job stability, which indicate the increasingly important roles of these organizations in helping software developers. This observation can be especially true during the COVID-19 pandemic.
Submitted 20 July, 2021;
originally announced July 2021.
-
Local Nanoscale Defective Phase Impurities Are the Sites of Degradation in Halide Perovskite Devices
Authors:
Stuart Macpherson,
Tiarnan A. S. Doherty,
Andrew J. Winchester,
Sofiia Kosar,
Duncan N. Johnstone,
Yu-Hsien Chiang,
Krzystof Galkowski,
Miguel Anaya,
Kyle Frohna,
Affan N. Iqbal,
Bart Roose,
Zahra Andaji-Garmaroudi,
Paul A. Midgley,
Keshav M. Dani,
Samuel D. Stranks
Abstract:
Halide perovskites excel in the pursuit of highly efficient thin film photovoltaics, with power conversion efficiencies reaching 25.5% in single junction and 29.5% in tandem halide perovskite/silicon solar cell configurations. Operational stability of perovskite solar cells remains a barrier to their commercialisation, yet a fundamental understanding of degradation processes, including the specific sites at which failure mechanisms occur, is lacking. Recently, we reported that performance-limiting deep sub-bandgap states appear in nanoscale clusters at particular grain boundaries in state-of-the-art $Cs_{0.05}FA_{0.78}MA_{0.17}Pb(I_{0.83}Br_{0.17})_{3}$ (MA=methylammonium, FA=formamidinium) perovskite films. Here, we combine multimodal microscopy to show that these very nanoscale defect clusters, which go otherwise undetected with bulk measurements, are sites at which degradation seeds. We use photoemission electron microscopy to visualise trap clusters and observe that these specific sites grow in defect density over time under illumination, leading to local reductions in performance parameters. Scanning electron diffraction measurements reveal concomitant structural changes at phase impurities associated with trap clusters, with rapid conversion to metallic lead through iodine depletion, eventually resulting in pinhole formation. By contrast, illumination in the presence of oxygen reduces defect densities and reverses performance degradation at these local clusters, where phase impurities instead convert to amorphous and electronically benign lead oxide. Our work shows that the trapping of charge carriers at sites associated with phase impurities, itself reducing performance, catalyses redox reactions that compromise device longevity. Importantly, we reveal that both performance losses and intrinsic degradation can be mitigated by eliminating these defective clusters.
Submitted 20 July, 2021;
originally announced July 2021.
-
Two-player quantum games: When player strategies are via directional choices
Authors:
Azhar Iqbal,
Derek Abbott
Abstract:
We propose a scheme for a quantum game based on performing an EPR type experiment and in which each player's spatial directional choices are considered as their strategies. A classical mixed-strategy game is recovered by restricting the players' choices to specific spatial trajectories. We show that for players' directional choices for which the Bell-CHSH inequality is violated, the players' payoffs in the quantum game have no mapping within the classical mixed-strategy game. The scheme provides a more direct link between classical and quantum games.
Submitted 24 April, 2022; v1 submitted 2 July, 2021;
originally announced July 2021.
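The Bell-CHSH condition referred to in the abstract above can be checked numerically with a short sketch. It is not the paper's game scheme: it assumes a spin-singlet pair measured along coplanar directions, for which the standard quantum prediction of the correlation is -cos of the angle between the two directions, and the function names and angle values are illustrative choices.

```python
import math

def correlation(theta_a, theta_b):
    """Singlet-state correlation for coplanar measurement angles (assumed setup)."""
    return -math.cos(theta_a - theta_b)

def chsh(a, a_prime, b, b_prime):
    """CHSH combination S; local hidden-variable models require |S| <= 2."""
    return (correlation(a, b) - correlation(a, b_prime)
            + correlation(a_prime, b) + correlation(a_prime, b_prime))

# Directional choices that maximize the violation for the singlet state
s_quantum = chsh(0.0, math.pi / 2, math.pi / 4, 3 * math.pi / 4)

# All four directions aligned: the inequality is saturated but not violated
s_aligned = chsh(0.0, 0.0, 0.0, 0.0)

violated = abs(s_quantum) > 2
```

With the first set of directions, |S| reaches 2√2, the largest value quantum mechanics allows; with all directions aligned, |S| equals 2, so the inequality holds.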
-
CoDesc: A Large Code-Description Parallel Dataset
Authors:
Masum Hasan,
Tanveer Muttaqueen,
Abdullah Al Ishtiaq,
Kazi Sajeed Mehrab,
Md. Mahim Anjum Haque,
Tahmid Hasan,
Wasi Uddin Ahmad,
Anindya Iqbal,
Rifat Shahriyar
Abstract:
Translation between natural language and source code can help software development by enabling developers to comprehend, ideate, search, and write computer programs in natural language. Despite growing interest from the industry and the research community, this task is often difficult due to the lack of large standard datasets suitable for training deep neural models, standard noise removal methods, and evaluation benchmarks. This leaves researchers to collect new small-scale datasets, resulting in inconsistencies across published works. In this study, we present CoDesc -- a large parallel dataset composed of 4.2 million Java methods and natural language descriptions. With extensive analysis, we identify and remove prevailing noise patterns from the dataset. We demonstrate the proficiency of CoDesc in two complementary tasks for code-description pairs: code summarization and code search. We show that the dataset helps improve code search by up to 22% and achieves the new state-of-the-art in code summarization. Furthermore, we show CoDesc's effectiveness in a pre-training and fine-tuning setup, opening possibilities in building pretrained language models for Java. To facilitate future research, we release the dataset, a data processing tool, and a benchmark at https://github.com/csebuetnlp/CoDesc.
Submitted 29 May, 2021;
originally announced May 2021.
-
Toplayer-Dependent Crystallographic Orientation Imaging in the Bilayer Two-Dimensional Materials with Transverse Shear Microscopy
Authors:
Sabir Hussain,
Rui Xu,
Kunqi Xu,
Le Lei,
Shuya Xing,
Jianfeng Guo,
Haoyu Dong,
Adeel Liaqat,
Rashid Iqbal,
Muhammad Ahsan Iqbal,
Shangzhi Gu,
Feiyue Cao,
Yan Jun Li,
Yasuhiro Sugawara,
Fei Pang,
Wei Ji,
Liming Xie,
Shanshan Chen,
Zhihai Cheng
Abstract:
Nanocontact properties of two-dimensional (2D) materials are closely dependent on their unique nanomechanical systems, such as the number of atomic layers and the supporting substrate. Here, we report a direct observation of toplayer-dependent crystallographic orientation imaging of 2D materials with transverse shear microscopy (TSM). Three typical nanomechanical systems, MoS2 on amorphous SiO2/Si, graphene on amorphous SiO2/Si, and MoS2 on crystallized Al2O3, have been investigated in detail. This experimental observation reveals that puckering behaviour mainly occurs on the top layer of 2D materials, which is attributed to its direct contact adhesion with the AFM tip. Furthermore, the results of crystallographic orientation imaging of MoS2/SiO2/Si and MoS2/Al2O3 indicate that the underlying crystalline substrates contribute almost nothing to the puckering effect of 2D materials. Our work directly reveals the top-layer-dependent puckering properties of 2D materials and demonstrates the general applications of TSM in bilayer 2D systems.
Submitted 6 May, 2021;
originally announced May 2021.
-
How do developers discuss and support new programming languages in technical Q&A site? An empirical study of Go, Swift, and Rust in Stack Overflow
Authors:
Partha Chakraborty,
Rifat Shahriyar,
Anindya Iqbal,
Gias Uddin
Abstract:
New programming languages (e.g., Swift, Go, Rust) are being introduced to give developers a better opportunity to make software development robust and easy. At the early stage, a programming language is likely to have resource constraints that encourage developers to seek help frequently from experienced peers active on Q&A sites such as Stack Overflow (SO). In this study, we have formally studied the discussions on three popular new languages introduced after the inception of SO (2008) and matched those with the relevant activities in GitHub whenever appropriate. For that purpose, we have mined 41,782,536 questions and answers from SO, and information on 7,846 issues and 660,965 repositories from GitHub. Initially, the development of new languages is relatively slow compared to mature languages (e.g., C, C++, Java). The expected outcome of this study is to reveal the difficulties and challenges faced by the developers working with these languages so that appropriate measures can be taken to expedite the generation of relevant resources. We have used the LDA method on SO's questions and answers to identify different topics of new languages. We have extracted several features of the answer pattern of the new languages from SO to study their characteristics. These attributes were used to identify difficult topics. We explored the background of developers who are contributing to these languages. We have created a model by combining Stack Overflow data with issue, repository, and user data from GitHub. Finally, we have used that model to identify factors that affect language evolution. We believe that the outcome of our study is likely to help the owners/sponsors of these languages to design better features and documentation. It will also help software developers or students to prepare themselves to work on these languages in an informed way.
Submitted 4 May, 2021;
originally announced May 2021.
-
BERT2Code: Can Pretrained Language Models be Leveraged for Code Search?
Authors:
Abdullah Al Ishtiaq,
Masum Hasan,
Md. Mahim Anjum Haque,
Kazi Sajeed Mehrab,
Tanveer Muttaqueen,
Tahmid Hasan,
Anindya Iqbal,
Rifat Shahriyar
Abstract:
Millions of repetitive code snippets are submitted to code repositories every day. Searching these large codebases using simple natural language queries would allow programmers to ideate, prototype, and develop more easily and quickly. Although the existing methods have shown good performance in searching code when the natural language description contains keywords from the code, they are still far behind in searching code based on the semantic meaning of the natural language query and the semantic structure of the code. In recent years, both natural language and programming language research communities have created techniques to embed them in vector spaces. In this work, we leverage the efficacy of these embedding models using a simple, lightweight 2-layer neural network in the task of semantic code search. We show that our model learns the inherent relationship between the embedding spaces and further probe into the scope of improvement by empirically analyzing the embedding methods. In this analysis, we show that the quality of the code embedding model is the bottleneck for our model's performance, and discuss future directions of study in this area.
Submitted 16 April, 2021;
originally announced April 2021.