-
Stability and fate of hierarchical triples comprising a central massive body and a tight binary in eccentric orbits
Authors:
Toshinori Hayashi,
Alessandro A. Trani,
Yasushi Suto
Abstract:
We explore the stability and fate of gravitational triple systems comprising a central massive body and a tight binary of less massive pairs. Our present purpose is twofold: (1) to improve the Hill-type stability criterion for the binary in those systems, and (2) to examine the fate of the triple systems after the binary break-up, with particular attention to the effects of the eccentricities of the inner and outer orbits. We perform direct Newtonian N-body simulations over much longer integration times than previous studies, which is essential to determine the eventual fate of those systems statistically in a reliable fashion. We obtain an empirical fitting formula of the binary stability boundary that incorporates effects of the inner and outer eccentricities, the mutual inclination of the inner and outer orbits, and the mass ratios of the three bodies. We also find that those triple systems are stable for a much longer timescale after the binary break-up, and that their final fates (ejection of the outer body, merger to the central massive body, and collision of two less massive bodies) are very sensitive to the initial outer eccentricity.
Submitted 10 September, 2024;
originally announced September 2024.
-
Féeton dark matter above the $e^-e^+$ threshold
Authors:
Tatsuya Hayashi,
Shigeki Matsumoto,
Yuki Watanabe,
Tsutomu T. Yanagida
Abstract:
The new gauge boson introduced in the minimal extension of the standard model (SM) by gauging the U(1)$_{\rm B-L}$ symmetry plays the role of dark matter when the U(1)$_{\rm B-L}$ gauge coupling is highly suppressed. This dark matter, named the Féeton dark matter, is known to be efficiently created in the early universe by inflationary fluctuations with minimal gravity coupling, hence the framework, the gauged U(1)$_{\rm B-L}$ extended SM + inflation, solves the four major problems of the SM: neutrino masses/mixings, dark matter, baryon asymmetry of the universe, and the initial condition of the universe (inflation). We comprehensively study the phenomenology of the dark matter when it is heavier than the $e^- e^+$ threshold, namely twice the electron mass, considering the threshold effect on the dark matter decay into $e^- e^+$. The viable parameter region is found only in the threshold region, while the branching fraction of the decay into $e^- e^+$ (i.e., the $e^- e^+$ signal) never vanishes even at the threshold due to the effect. As a result, the pure U(1)$_{\rm B-L}$ extension without the kinetic mixing between the U(1)$_{\rm B-L}$ and hyper-charge gauge bosons has already been excluded by the present observation of the 511\,keV photon from the galactic center. So, the Féeton dark matter requires a non-zero kinetic mixing to be a viable dark matter candidate and will be efficiently explored by future MeV-gamma ray telescopes thanks to the non-vanishing decay process into $e^- e^+$.
Submitted 22 August, 2024;
originally announced August 2024.
-
Extraction of Research Objectives, Machine Learning Model Names, and Dataset Names from Academic Papers and Analysis of Their Interrelationships Using LLM and Network Analysis
Authors:
S. Nishio,
H. Nonaka,
N. Tsuchiya,
A. Migita,
Y. Banno,
T. Hayashi,
H. Sakaji,
T. Sakumoto,
K. Watabe
Abstract:
Machine learning is widely utilized across various industries. Identifying the appropriate machine learning models and datasets for specific tasks is crucial for the effective industrial application of machine learning. However, this requires expertise in both machine learning and the relevant domain, leading to a high learning cost. Therefore, research focused on extracting combinations of tasks, machine learning models, and datasets from academic papers is critically important, as it can facilitate the automatic recommendation of suitable methods. Conventional information extraction methods from academic papers have been limited to identifying machine learning models and other entities as named entities. To address this issue, this study proposes a methodology for extracting tasks, machine learning methods, and dataset names from scientific papers and analyzing the relationships among these pieces of information using an LLM, an embedding model, and network clustering. The proposed method's expression extraction performance, when using Llama3, achieves an F-score exceeding 0.8 across various categories, confirming its practical utility. Benchmarking results on financial domain papers have demonstrated the effectiveness of this method, providing insights into the use of the latest datasets, including those related to ESG (Environmental, Social, and Governance) data.
Submitted 21 August, 2024;
originally announced August 2024.
-
Status of Xtend telescope onboard X-Ray Imaging and Spectroscopy Mission (XRISM)
Authors:
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takashi Okajima,
Hirofumi Noda,
Hiroyuki Uchida,
Hiromasa Suzuki,
Shogo Benjamin Kobayashi,
Tomokage Yoneyama,
Kouichi Hagino,
Kumiko Nobukawa,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Hironori Matsumoto,
Takeshi Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Hirokazu Odaka,
Takayoshi Kohmura,
Kazutaka Yamaoka,
Manabu Ishida,
Yoshitomo Maeda,
Takayuki Hayashi
, et al. (38 additional authors not shown)
Abstract:
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is realized by the combination of the SXI and XMA with a focal length of 5.6 m. The SXI employs four P-channel, back-illuminated type CCDs with a thick depletion layer of 200 $μ$m. The four CCD chips are arranged in a 2$\times$2 grid and cooled down to $-110$ $^{\circ}$C with a single-stage Stirling cooler. Before the launch of XRISM, we conducted a month-long spacecraft thermal vacuum test. The performance verification of the SXI was successfully carried out over the course of multiple thermal cycles of the spacecraft. About a month after the launch of XRISM, the SXI was carefully activated and the soundness of its functionality was checked by a step-by-step process. Commissioning observations followed the initial operation. We here present pre- and post-launch results verifying the Xtend performance. All the in-orbit performances are consistent with those measured on the ground and satisfy the mission requirement. Extensive calibration studies are ongoing.
Submitted 28 June, 2024;
originally announced June 2024.
-
Initial operations of the Soft X-ray Imager onboard XRISM
Authors:
Hiromasa Suzuki,
Tomokage Yoneyama,
Shogo B. Kobayashi,
Hirofumi Noda,
Hiroyuki Uchida,
Kumiko K. Nobukawa,
Kouichi Hagino,
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Yoshiaki Kanemaru,
Yoshinori Otsuka,
Haruhiko Yokosu,
Wakana Yonemaru,
Hanako Nakano,
Kazuhiro Ichikawa,
Reo Takemoto,
Tsukasa Matsushima,
Marina Yoshimoto,
Mio Aoyagi,
Kohei Shima
, et al. (30 additional authors not shown)
Abstract:
XRISM (X-Ray Imaging and Spectroscopy Mission) is an astronomical satellite with the capability of high-resolution spectroscopy with the X-ray microcalorimeter, Resolve, and wide field-of-view imaging with the CCD camera, Xtend. The Xtend consists of the mirror assembly (XMA: X-ray Mirror Assembly) and detector (SXI: Soft X-ray Imager). The components of SXI include CCDs, analog and digital electronics, and a mechanical cooler. After the successful launch on September 6th, 2023 (UT) and subsequent critical operations, the mission instruments were turned on and set up. The CCDs have been kept at the designed operating temperature of $-110^\circ$C after the electronics and cooling system were successfully set up. During the initial operation phase, which continued for more than a month after the critical operations, we verified the observation procedure, stability of the cooling system, all the observation options with different imaging areas and/or timing resolutions, and operations for protection against the South Atlantic Anomaly. We optimized the operation procedure and observation parameters including the cooler settings, imaging areas for the specific modes with higher timing resolutions, and event selection algorithm. We summarize our policy and procedure of the initial operations for SXI. We also report on a couple of issues we faced during the initial operations and lessons learned from them.
Submitted 28 June, 2024;
originally announced June 2024.
-
The Japanese Vision for the Black Hole Explorer Mission
Authors:
Kazunori Akiyama,
Kotaro Niinuma,
Kazuhiro Hada,
Akihiro Doi,
Yoshiaki Hagiwara,
Aya E. Higuchi,
Mareki Honma,
Tomohisa Kawashima,
Dimitar Kolev,
Shoko Koyama,
Sho Masui,
Ken Ohsuga,
Hidetoshi Sano,
Hideki Takami,
Yuh Tsunetoe,
Yoshinori Uzawa,
Takuya Akahori,
Yuto Akiyama,
Peter Galison,
Takayuki J. Hayashi,
Tomoya Hirota,
Makoto Inoue,
Yuhei Iwata,
Michael D. Johnson,
Motoki Kino
, et al. (21 additional authors not shown)
Abstract:
The Black Hole Explorer (BHEX) is a next-generation space very long baseline interferometry (VLBI) mission concept that will extend the ground-based millimeter/submillimeter arrays into space. The mission, closely aligned with the science priorities of the Japanese VLBI community, involves an active engagement of this community in the development of the mission, resulting in the formation of the Black Hole Explorer Japan Consortium. Here we present the current Japanese vision for the mission, ranging from scientific objectives to instrumentation. The Consortium anticipates a wide range of scientific investigations, from diverse black hole physics and astrophysics studied through the primary VLBI mode, to the molecular universe explored via a potential single-dish observation mode in the previously unexplored 50-70\,GHz band that would make BHEX the highest-sensitivity explorer ever of molecular oxygen. A potential major contribution for the onboard instrument involves supplying essential elements for its high-sensitivity dual-band receiving system, which includes a broadband 300\,GHz SIS mixer and a space-certified multi-stage 4.5K cryocooler akin to those used in the Hitomi and XRISM satellites by the Japan Aerospace Exploration Agency. Additionally, the Consortium explores enhancing and supporting BHEX operations through the use of millimeter/submillimeter facilities developed by the National Astronomical Observatory of Japan, coupled with a network of laser communication stations operated by the National Institute of Information and Communication Technology.
Submitted 13 June, 2024;
originally announced June 2024.
-
Cross-sectional shape analysis for risk assessment and prognosis of patients with true lumen narrowing after type-A aortic dissection surgery
Authors:
J V Ramana Reddy,
Toshitaka Watanabe,
Taro Hayashi,
Hiroshi Suito
Abstract:
Background: For acute type-A aortic dissection (ATAAD) surgery, early post-surgery assessment is crucially important for effective treatment plans, underscoring the need for a framework to identify the risk level of aortic dissection cases. We examined true-lumen narrowing during follow-up examinations, collected morphological data 14 days (early stages) after surgery, and assessed patient risk levels over 2.8 years.
Purpose: To establish an implementable framework supported by mathematical techniques to predict the risk of aortic dissection patients experiencing true-lumen narrowing after ATAAD surgery.
Materials and Methods: This retrospective study analyzed CT data from 21 ATAAD patients. Forty uniformly distributed cross-sectional shapes (CSSs) are derived from each lumen to account for gradual changes in shape. We introduced the form factor (FF) to assess CSS morphology. Linear discriminant analysis (LDA) is used for the risk classification of aortic dissection patients. Leave-one-patient-out cross-validation (LOPO-CV) is used for risk prediction.
Results: For this investigation, we examined data of 21 ATAAD patients categorized into high-risk, medium-risk, and low-risk cases based on clinical observations of the range of true-lumen narrowing. Our risk classification machine-learning (ML) model preserving the model's generalizability. The model's predictions reliably identified low-risk patients, thereby potentially reducing hospital visits. It also demonstrated proficiency in accurately predicting the risk for all high-risk patients.
Conclusion: The suggested method anticipates the risk linked to aortic enlargement in patients with a narrowing true lumen in the early stage following ATAAD surgery, thereby aiding follow-up doctors in enhancing patient care.
Submitted 7 June, 2024;
originally announced June 2024.
-
Manipulation of Belief Aggregation Rules
Authors:
Christopher P. Chambers,
Federico Echenique,
Takashi Hayashi
Abstract:
This paper studies manipulation of belief aggregation rules in the setting where the society first collects individuals' probabilistic opinions and then solves a public portfolio choice problem with common utility based on the aggregate belief.
First, we show that belief reporting in Nash equilibrium under the linear opinion pool and log utility is identified as the profile of state-contingent wealth shares in parimutuel equilibrium with risk-neutral preference.
Then we characterize belief aggregation rules which are Nash-implementable. We provide a necessary and essentially sufficient condition for implementability, which is independent of the common risk attitude.
Submitted 2 May, 2024;
originally announced May 2024.
-
Critical Review for One-class Classification: recent advances and the reality behind them
Authors:
Toshitaka Hayashi,
Dalibor Cimr,
Hamido Fujita,
Richard Cimler
Abstract:
This paper offers a comprehensive review of one-class classification (OCC), examining the technologies and methodologies employed in its implementation. It delves into various approaches utilized for OCC across diverse data types, such as feature data, image, video, time series, and others. Through a systematic review, this paper synthesizes prominent strategies used in OCC from its inception to its current advancements, with a particular emphasis on the promising application. Moreover, the article criticizes the state-of-the-art (SOTA) image anomaly detection (AD) algorithms dominating one-class experiments. These algorithms include outlier exposure (binary classification) and pretrained model (multi-class classification), conflicting with the fundamental concept of learning from one class. Our investigation reveals that the top nine algorithms for one-class CIFAR10 benchmark are not OCC. We argue that binary/multi-class classification algorithms should not be compared with OCC.
Submitted 27 April, 2024;
originally announced April 2024.
-
A Giant Metrewave Radio Telescope Survey of Radio-loud Broad Absorption Line Quasars
Authors:
Takayuki J. Hayashi,
Akihiro Doi,
Hiroshi Nagai
Abstract:
A substantial fraction of quasars display broad absorption lines (BALs) in their rest-frame ultraviolet spectra. While the origin of BALs is thought to be related to the accretion disc wind, it remains unclear whether the observed ratio of BAL to non-BAL quasars is due to orientation. We conducted observations of 48 BAL quasars and the same number of non-BAL quasars at 322 MHz using the Giant Metrewave Radio Telescope. Combined with previous flux measurements ranging from MHz to GHz frequencies, we compared continuum radio spectra between the two quasar groups. These data offer insights into low-frequency radio properties that have been difficult to investigate with previous observations only at GHz frequencies. Our results show that $73\pm13$ per cent of the BAL quasars exhibit steep or peaked spectra, a higher proportion than the $44 \pm 14$ per cent observed in the non-BAL quasars. In contrast, there are no discernible differences between the two quasar groups in the radio luminosity, peak frequency, and spectral index distributions of sources with steep or peaked spectra and sources with flat or inverted spectra. Generally, as the jet axis and line of sight become closer to parallel, quasars exhibit flat or inverted spectra rather than steep or peaked ones. Therefore, these results suggest that BAL quasars are more frequently observed farther from the jet axis than non-BAL quasars. However, given that a certain proportion of BAL quasars exhibit flat or inverted spectra, more than the simple orientation scenario is required to elucidate the radio properties of BAL quasars.
Submitted 11 April, 2024;
originally announced April 2024.
-
VLBI Detection of an Active Radio Source Potentially Driving 100-kpc Scale Emission in the Ultraluminous Infrared Galaxy IRAS F01004$-$2237
Authors:
Takayuki J. Hayashi,
Yoshiaki Hagiwara,
Masatoshi Imanishi
Abstract:
The nearby ultraluminous infrared galaxy (ULIRG) IRAS F01004$-$2237 exhibits 100-kpc scale continuum emission at radio wavelengths. The absence of extended X-ray emission in IRAS F01004$-$2237 has suggested an active galactic nucleus (AGN) origin for the extended radio emission, whose properties and role in merging systems still need to be better understood. We present the results of multi-frequency observations of IRAS F01004$-$2237 conducted by the Very Long Baseline Array at 2.3 and 8.4 GHz. Compact 8.4-GHz continuum emission was detected on a 1-pc scale in the nuclear region with an intrinsic brightness temperature of $10^{8.1}$ K, suggesting that the radio source originates from an AGN, potentially driving the extended emission. In contrast, no significant emission was observed at 2.3 GHz, indicating the presence of low-frequency absorption. This absorption cannot be attributed solely to synchrotron self-absorption; alternatively, free-free absorption due to thermal plasma is mainly at work in the spectrum. From combined perspectives, including mid-infrared and X-ray data, the AGN is obscured in a dense environment. The kinetic power of the nonthermal jet, as inferred from the extended emission, can play a more important role in dispersing the surrounding medium than the thermal outflow in IRAS F01004$-$2237. These findings hint that jet activities in ULIRGs may contribute to AGN feedback during galaxy evolution induced by merger events.
Submitted 21 May, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
GAPS contributions to the 38th International Cosmic Ray Conference (Nagoya 2023)
Authors:
T. Aramaki,
M. Boezio,
S. E. Boggs,
V. Bonvicini,
G. Bridges,
D. Campana,
W. W. Craig,
P. von Doetinchem,
E. Everson,
L. Fabris,
S. Feldman,
H. Fuke,
F. Gahbauer,
C. Gerrity,
L. Ghislotti,
C. J. Hailey,
T. Hayashi,
A. Kawachi,
M. Kozai,
P. Lazzaroni,
M. Law,
A. Lenni,
A. Lowell,
M. Manghisoni,
N. Marcelli
, et al. (33 additional authors not shown)
Abstract:
Compilation of papers presented by the GAPS Collaboration at the 38th International Cosmic Ray Conference (ICRC), held July 26 through August 3, 2023 in Nagoya, Japan.
Submitted 16 October, 2023;
originally announced October 2023.
-
Classification of irreducible representations of affine group superschemes and the division superalgebras of their endomorphisms
Authors:
Takuma Hayashi
Abstract:
In this paper, we classify irreducible representations of affine group superschemes over fields $F$ of characteristic not two in terms of those over a separable closure $F^{\mathrm{sep}}$ and their Galois twists. We also compute the division superalgebras of their endomorphisms mainly when they are central. Finally, we give numerical conclusions for quasi-reductive algebraic supergroups under certain conditions, based on Shibata's Borel--Weil theory for split quasi-reductive algebraic supergroups.
Submitted 20 June, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Valley splitting by extended zone effective mass approximation incorporating strain in silicon
Authors:
Jinichiro Noborisaka,
Toshiaki Hayashi,
Akira Fujiwara,
Katsuhiko Nishiguchi
Abstract:
Silicon metal-oxide-semiconductor field effect transistors (MOSFETs) fabricated on a SIMOX (001) substrate, which is a kind of silicon on insulator (SOI) substrate, that is annealed at high temperature for a long time are known to exhibit large valley splitting, but the origin of this splitting has long been unknown. Extended zone effective-mass approximation (EMA) predicts that strain significantly affects valley splitting. In this study, we analyzed valley splitting based on this theory and found that the shear strain along <110> of approximately 5% near the buried oxide (BOX) interface is a promising source for large valley splitting.
Submitted 10 September, 2023;
originally announced September 2023.
-
Constraining the binarity of black hole candidates: a proof-of-concept study of Gaia BH1 and Gaia BH2
Authors:
Toshinori Hayashi,
Yasushi Suto,
Alessandro A. Trani
Abstract:
Nearly a hundred binary black holes (BBHs) have been discovered with gravitational-wave signals emitted at their merging events. Thus, it is quite natural to expect that significantly more abundant BBHs with wider separations remain undetected in the universe, or even in our Galaxy. We consider the possibility that star-BH binary candidates may indeed host an inner BBH, instead of a single BH. We present a detailed feasibility study of constraining the binarity of the two currently available targets, Gaia BH1 and Gaia BH2. Specifically, we examine three types of radial velocity (RV) modulations of a tertiary star in star-BBH triple systems: short-term RV modulations induced by the inner BBH, long-term RV modulations induced by the nodal precession, and long-term RV modulations induced by the von Zeipel-Kozai-Lidov oscillations. Direct three-body simulations combined with approximate analytic models reveal that the Gaia BH1 system may exhibit observable signatures of the hidden inner BBH if it exists at all. The methodology that we examine here is quite generic, and is expected to be readily applicable to future star-BH binary candidates in a straightforward manner.
Submitted 29 August, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Gravitational Redshift Detection from the Magnetic White Dwarf Harbored in RX J1712.6-2414
Authors:
Takayuki Hayashi,
Hideyuki Mori,
Koji Mukai,
Yukikatsu Terada,
Manabu Ishida
Abstract:
Gravitational redshift is a fundamental parameter that allows us to determine the mass-to-radius ratio of compact stellar objects, such as black holes, neutron stars, and white dwarfs (WDs). In the X-ray spectra of the close binary system, RX J1712.6$-$2414, obtained from the Chandra High-Energy Transmission Grating observation, we detected significant redshifts for characteristic X-rays emitted from hydrogen-like magnesium, silicon ($ΔE/E_{\rm rest} \sim 7 \times 10^{-4}$), and sulfur ($ΔE/E_{\rm rest} \sim 15 \times 10^{-4}$) ions, which exceed the instrumental absolute energy accuracy ($ΔE/E_{\rm rest} \sim 3.3 \times 10^{-4}$). Considering some possible factors, such as Doppler shifts associated with the plasma flow, systemic velocity, and optical depth, we concluded that the major contributor to the observed redshift is the gravitational redshift of the WD harbored in the binary system, which is the first gravitational redshift detection from a magnetic WD. Moreover, the gravitational redshift provides us with a new method of the WD mass measurement by invoking the plasma-flow theory with strong magnetic fields in close binaries. Despite the large uncertainty, our new method estimated the WD mass to be $M_{\rm WD}> 0.9\,M_{\odot}$.
Submitted 28 April, 2023;
originally announced May 2023.
-
Fabrication of a 64-Pixel TES Microcalorimeter Array with Iron Absorbers Uniquely Designed for 14.4-keV Solar Axion Search
Authors:
Yuta Yagi,
Tasuku Hayashi,
Keita Tanaka,
Rikuta Miyagawa,
Ryo Ota,
Noriko Y. Yamasaki,
Kazuhisa Mitsuda,
Nao Yoshida,
Mikiko Saito,
Takayuki Homma
Abstract:
If a hypothetical elementary particle called an axion exists, to solve the strong CP problem, a 57Fe nucleus in the solar core could emit a 14.4-keV monochromatic axion through the M1 transition. If such axions are once more transformed into photons by a 57Fe absorber, a transition edge sensor (TES) X-ray microcalorimeter should be able to detect them efficiently. We have designed and fabricated a dedicated 64-pixel TES array with iron absorbers for the solar axion search. In order to decrease the effect of iron magnetization on spectroscopic performance, the iron absorber is placed next to the TES while maintaining a certain distance. A gold thermal transfer strap connects them. We have accomplished the electroplating of gold straps with high thermal conductivity. The residual resistivity ratio (RRR) was over 23, more than eight times higher than that of a previous evaporated strap. In addition, we successfully electroplated pure-iron films of more than a few micrometers in thickness for absorbers and fabricated a 64-pixel TES calorimeter structure.
Submitted 19 April, 2023;
originally announced April 2023.
-
Performance of TES X-Ray Microcalorimeters Designed for 14.4-keV Solar Axion Search
Authors:
Yuta Yagi,
Ryohei Konno,
Tasuku Hayashi,
Keita Tanaka,
Noriko Y. Yamasaki,
Kazuhisa Mitsuda,
Rumi Sato,
Mikiko Saito,
Takayuki Homma,
Yoshiki Nishida,
Shohei Mori,
Naoko Iyomoto,
Toru Hara
Abstract:
A 57Fe nucleus in the solar core could emit a 14.4-keV monochromatic axion through the M1 transition if a hypothetical elementary particle, axion, exists to solve the strong CP problem. Transition edge sensor (TES) X-ray microcalorimeters can detect such axions very efficiently if they are again converted into photons by a 57Fe absorber. We have designed and produced a dedicated TES array with 57Fe absorbers for the solar axion search. The iron absorber is set next to the TES, keeping a certain distance to reduce the iron-magnetization effect on the spectroscopic performance. A gold thermal transfer strap connects them. A sample pixel irradiated from a 55Fe source detected 698 pulses. In contrast to thermal simulations, we consider that the pulses include either events produced in an iron absorber or gold strap at a fraction dependent on the absorption rate of each material. Furthermore, photons deposited on the iron absorber are detected through the strap as intended. The identification of all events still needs to be completed. However, we successfully operated the TES with the unique design under iron magnetization for the first time.
Submitted 17 April, 2023; v1 submitted 14 April, 2023;
originally announced April 2023.
-
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
Authors:
Brian Yan,
Jiatong Shi,
Yun Tang,
Hirofumi Inaguma,
Yifan Peng,
Siddharth Dalmia,
Peter Polák,
Patrick Fernandes,
Dan Berrebbi,
Tomoki Hayashi,
Xiaohui Zhang,
Zhaoheng Ni,
Moto Hira,
Soumi Maiti,
Juan Pino,
Shinji Watanabe
Abstract:
ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitated by the broadening interests of the spoken language translation community. ESPnet-ST-v2 supports 1) offline speech-to-text translation (ST), 2) simultaneous speech-to-text translation (SST), and 3) offline speech-to-speech translation (S2ST) -- each task is supported with a wide variety of approaches, differentiating ESPnet-ST-v2 from other open source spoken language translation toolkits. This toolkit offers state-of-the-art architectures such as transducers, hybrid CTC/attention, multi-decoders with searchable intermediates, time-synchronous blockwise CTC/attention, Translatotron models, and direct discrete unit models. In this paper, we describe the overall design, example models for each task, and performance benchmarking behind ESPnet-ST-v2, which is publicly available at https://github.com/espnet/espnet.
Submitted 6 July, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Xtend, the Soft X-ray Imaging Telescope for the X-ray Imaging and Spectroscopy Mission (XRISM)
Authors:
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takashi Okajima,
Hirofumi Noda,
Takaaki Tanaka,
Hiroyuki Uchida,
Kouichi Hagino,
Shogo Benjamin Kobayashi,
Hiromasa Suzuki,
Tessei Yoshida,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Kumiko Nobukawa,
Tomokage Yoneyama,
Hironori Matsumoto,
Takeshi Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Manabu Ishida,
Yoshitomo Maeda,
Takayuki Hayashi,
Keisuke Tamura,
Rozenn Boissay-Malaquin
, et al. (30 additional authors not shown)
Abstract:
Xtend is a soft X-ray imaging telescope developed for the X-Ray Imaging and Spectroscopy Mission (XRISM). XRISM is scheduled to be launched in the Japanese fiscal year 2022. Xtend consists of the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. The SXI uses the P-channel, back-illuminated type CCD with an imaging area size of 31 mm on a side. The four CCD chips are arranged in a 2$\times$2 grid and can be cooled down to $-120$ $^{\circ}$C with a single-stage Stirling cooler. The XMA nests thin aluminum foils coated with gold in a confocal way with an outer diameter of 45~cm. A pre-collimator is installed in front of the X-ray mirror for the reduction of stray light. Combining the SXI and XMA with a focal length of 5.6 m, a field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is realized. We have completed the fabrication of the flight model of both SXI and XMA. The performance verification has been successfully conducted in a series of sub-system level tests. We also carried out on-ground calibration measurements, and the data analysis is ongoing.
Submitted 13 March, 2023;
originally announced March 2023.
-
Algebraic approach to contraction families
Authors:
Takuma Hayashi
Abstract:
In this paper, we give a purely algebraic approach to the contraction group scheme predicted by Bernstein--Higson--Subag and constructed by Barbasch--Higson--Subag. We also compare quotient schemes of contraction group schemes with other related schemes, equipped with actions of contraction group schemes in the cases of symmetric and $θ$-stable parabolic subgroups.
Submitted 22 April, 2024; v1 submitted 21 February, 2023;
originally announced February 2023.
-
Extraction of Constituent Factors of Digestion Efficiency in Information Transfer by Media Composed of Texts and Images
Authors:
Koike Hiroaki,
Teruaki Hayashi
Abstract:
The development and spread of information and communication technologies have increased and diversified information. However, the increase in the volume and the selection of information does not necessarily promote understanding. In addition, conventional evaluations of information transfer have focused only on the arrival of information to the receivers. They need to sufficiently take into account the receivers' understanding of the information after it has been acquired, which is the original purpose of the evaluation. In this study, we propose the concept of "information digestion," which refers to the receivers' correct understanding of the acquired information, its contents, and its purpose. In the experiment, we proposed an evaluation model of information digestibility using hierarchical factor analysis and extracted factors that constitute digestibility by four types of media.
Submitted 17 February, 2023;
originally announced February 2023.
-
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study
Authors:
Massa Baali,
Tomoki Hayashi,
Hamdy Mubarak,
Soumi Maiti,
Shinji Watanabe,
Wassim El-Hajj,
Ahmed Ali
Abstract:
Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human-like speech. In contrast, low-resource languages, including Arabic, have very limited TTS systems due to the lack of resources. We propose a fully unsupervised method for building TTS, including automatic data selection and pre-training/fine-tuning strategies for TTS training, using broadcast news as a case study. We show how careful selection of data, even in smaller amounts, can improve the efficiency of a TTS system in generating more natural speech than a system trained on a bigger dataset. We propose different approaches for: 1) data: we applied automatic annotations using DNSMOS, automatic vowelization, and automatic speech recognition (ASR) for fixing transcription errors; 2) model: we used transfer learning from a high-resource language in the TTS model and fine-tuned it with one hour of broadcast recording, then used this model to guide a FastSpeech2-based Conformer model for duration. Our objective evaluation shows 3.9% character error rate (CER), while the ground truth has 1.3% CER. As for the subjective evaluation, where 1 is bad and 5 is excellent, our FastSpeech2-based Conformer model achieved a mean opinion score (MOS) of 4.4 for intelligibility and 4.2 for naturalness, where many annotators recognized the voice of the broadcaster, which proves the effectiveness of our proposed unsupervised method.
Submitted 26 January, 2023; v1 submitted 22 January, 2023;
originally announced January 2023.
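The character error rate (CER) quoted in the abstract above is conventionally the character-level edit distance between a reference transcript and a hypothesis, divided by the reference length. The Python sketch below shows that standard definition; the function name is hypothetical, and the paper's actual scoring pipeline (text normalization, the ASR system used) is not described here and may differ.

```python
def character_error_rate(reference, hypothesis):
    """Levenshtein distance between two strings divided by the
    reference length. Hypothetical helper for illustration only."""
    if not reference:
        raise ValueError("reference must be non-empty")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j characters of the hypothesis.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = prev[j - 1] + (0 if ref_char == hyp_char else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(reference)
```

Because insertions count as errors, this ratio can exceed 1 when the hypothesis is much longer than the reference.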
-
A Modelling Framework for Regression with Collinearity
Authors:
Takeaki Kariya,
Hiroshi Kurata,
Takaki Hayashi
Abstract:
This study addresses a fundamental, yet overlooked, gap between standard theory and empirical modelling practices in the OLS regression model $\boldsymbol{y}=\boldsymbol{Xβ}+\boldsymbol{u}$ with collinearity. In fact, while an estimated model in practice is desired to have stability and efficiency in its "individual OLS estimates", $\boldsymbol{y}$ itself has no capacity to identify and control the collinearity in $\boldsymbol{X}$ and hence no theory including model selection process (MSP) would fill this gap unless $\boldsymbol{X}$ is controlled in view of sampling theory. In this paper, first introducing a new concept of "empirically effective modelling" (EEM), we propose our EEM methodology (EEM-M) as an integrated process of two MSPs with data $(\boldsymbol{y^o,X})$ given. The first MSP uses $\boldsymbol{X}$ only, called the XMSP, and pre-selects a class $\scr{D}$ of models with individually inefficiency-controlled and collinearity-controlled OLS estimates, where the corresponding two controlling variables are chosen from the predictive standard error of each estimate. Next, defining an inefficiency-collinearity risk index for each model, a partial ordering is introduced onto the set of models to compare without using $\boldsymbol{y^o}$, where the better-ness and admissibility of models are discussed. The second MSP is a commonly used MSP that uses $(\boldsymbol{y^o,X})$, and evaluates total model performance as a whole by criteria such as AIC, BIC, etc. to select an optimal model from $\scr{D}$. Third, to materialize the XMSP, two algorithms are proposed.
Submitted 25 June, 2023; v1 submitted 8 January, 2023;
originally announced January 2023.
-
ASCENT - A balloon-borne hard X-ray imaging spectroscopy telescope using transition edge sensor microcalorimeter detectors
Authors:
Fabian Kislat,
Daniel Becker,
Douglas Bennett,
Adrika Dasgupta,
Joseph Fowler,
Christopher L. Fryer,
Johnathon Gard,
Ephraim Gau,
Danielle Gurgew,
Keon Harmon,
Takayuki Hayashi,
Scott Heatwole,
Md Arman Hossen,
Henric Krawczynski,
R. James Lanzi,
Jason Legere,
John A. B. Mates,
Mark McConnell,
Johanna Nagy,
Takashi Okajima,
Toshiki Sato,
Daniel Schmidt,
Sean Spooner,
Daniel Swetz,
Keisuke Tamura
, et al. (4 additional authors not shown)
Abstract:
Core collapse supernovae are thought to be one of the main sources in the galaxy of elements heavier than iron. Understanding the origin of the elements is thus tightly linked to our understanding of the explosion mechanism of supernovae and supernova nucleosynthesis. X-ray and gamma-ray observations of young supernova remnants, combined with improved theoretical modeling, have resulted in enormous improvements in our knowledge of these events. The isotope ${}^{44}$Ti is one of the most sensitive probes of the innermost regions of the core collapse engine, and its spatial and velocity distribution are key observables. Hard X-ray imaging spectroscopy with the Nuclear Spectroscopic Telescope Array (NuSTAR) has provided new insights into the structure of the supernova remnant Cassiopeia A (Cas A), establishing the convective nature of the supernova engine. However, many questions about the details of this engine remain. We present here the concept for a balloon-borne follow-up mission called ASCENT (A SuperConducting ENergetic x-ray Telescope). ASCENT uses transition edge sensor gamma-ray microcalorimeter detectors with a demonstrated 55 eV Full Width Half Maximum (FWHM) energy resolution at 97 keV. This 8--16-fold improvement in energy resolution over NuSTAR will allow high resolution imaging and spectroscopy of the ${}^{44}$Ti emission. This will allow a detailed reconstruction of gamma-ray line redshifts, widths, and shapes, allowing us to address questions such as: What is the source of the neutron star "kicks"? What is the dominant production pathway for ${}^{44}$Ti? Is the engine of Cas A unique?
Submitted 4 January, 2023;
originally announced January 2023.
-
Finite abelian groups of K3 surfaces with smooth quotient
Authors:
Taro Hayashi
Abstract:
The quotient space of a $K3$ surface by a finite group is an Enriques surface or a rational surface if it is smooth. Finite groups for which the quotient space is an Enriques surface are known. In this paper, by analyzing effective divisors on smooth rational surfaces, we will study finite groups which act faithfully on $K3$ surfaces such that the quotient space is smooth. In particular, we will completely determine effective divisors on Hirzebruch surfaces such that there is a finite Abelian cover from a $K3$ surface to a Hirzebruch surface such that the branch divisor is that effective divisor. Furthermore, we will decide the Galois group and give the way to construct that Abelian cover from an effective divisor on a Hirzebruch surface. Subsequently, we study the same theme for Enriques surfaces.
Submitted 30 December, 2022;
originally announced January 2023.
-
Gaussian Process Classification Bandits
Authors:
Tatsuya Hayashi,
Naoki Ito,
Koji Tabata,
Atsuyoshi Nakamura,
Katsumasa Fujita,
Yoshinori Harada,
Tamiki Komatsuzaki
Abstract:
Classification bandits are multi-armed bandit problems whose task is to classify a given set of arms into either positive or negative class depending on whether the rate of the arms with the expected reward of at least h is not less than w for given thresholds h and w. We study a special classification bandit problem in which arms correspond to points x in d-dimensional real space with expected rewards f(x) which are generated according to a Gaussian process prior. We develop a framework algorithm for the problem using various arm selection policies and propose policies called FCB and FTSV. We show a smaller sample complexity upper bound for FCB than that for the existing algorithm of the level set estimation, in which whether f(x) is at least h or not must be decided for every arm's x. Arm selection policies depending on an estimated rate of arms with rewards of at least h are also proposed and shown to improve empirical sample complexity. According to our experimental results, the rate-estimation versions of FCB and FTSV, together with that of the popular active learning policy that selects the point with the maximum variance, outperform other policies for synthetic functions, and the version of FTSV is also the best performer for our real-world dataset.
Submitted 26 December, 2022;
originally announced December 2022.
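The classification rule stated in the abstract above (a set of arms is positive when the rate of arms with expected reward of at least h is not less than w) can be sketched in Python as follows. This is only the target decision rule, evaluated under the simplifying assumption that the expected rewards are already known; it is not the FCB or FTSV policy, which must reach the decision by sampling arms. The function name is hypothetical.

```python
def classify_arms(expected_rewards, h, w):
    """Return True (positive class) when the fraction of arms whose
    expected reward is at least h is not less than w.
    Hypothetical helper; assumes expected rewards are known."""
    if not expected_rewards:
        raise ValueError("need at least one arm")
    good = sum(1 for reward in expected_rewards if reward >= h)
    rate = good / len(expected_rewards)
    return rate >= w
```

With four arms of expected reward 0.2, 0.6, 0.9, and 0.7 and thresholds h = 0.5 and w = 0.5, three of the four arms reach h, so the rate is 0.75 and the set is classified as positive.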
-
X-ray and optical spectroscopic study of a gamma Cassiopeiae analog source pi Aquarii
Authors:
Masahiro Tsujimoto,
Takayuki Hayashi,
Kumiko Morihana,
Yuki Moritani
Abstract:
Gamma Cas analog sources are a subset of Be stars that emit intense and hard X-ray emission. Two competing ideas for their X-ray production mechanism are (a) the magnetic activities of the Be star and its disk and (b) the accretion from the Be star to an unidentified compact object. Among such sources, Pi Aqr plays a pivotal role as it is one of the only two spectroscopic binaries observed for many orbital cycles and one of the three sources with X-ray brightness sufficient for detailed X-ray spectroscopy. Bjorkman et al. (2002) estimated the secondary mass > 2.0 Mo with optical spectroscopy, which would argue against the compact object being a white dwarf (WD). However, their dynamical mass solution is inconsistent with an evolutionary solution and their radial velocity measurement is inconsistent with later work by Naze et al. (2019). We revisit this issue by adding a new data set with the NuSTAR X-ray observatory and the HIDES echelle spectrograph. We found that the radial velocity amplitude is consistent with Naze et al. (2019), which is only a half of that claimed by Bjorkman et al. (2002). Fixing the radial velocity amplitude of the primary, the secondary mass is estimated as < 1.4 Mo over an assumed range of the primary mass and the inclination angle. We further constrained the inclination angle and the secondary mass independently by fitting the X-ray spectra with a non-magnetic or magnetic accreting WD model under the assumption that the secondary is indeed a WD. The two results match well. We thus argue that the possibility of the secondary being a WD should not be excluded for pi Aqr.
Submitted 19 November, 2022;
originally announced November 2022.
-
ESPnet-ONNX: Bridging a Gap Between Research and Production
Authors:
Masao Someki,
Yosuke Higuchi,
Tomoki Hayashi,
Shinji Watanabe
Abstract:
In the field of deep learning, researchers often focus on inventing novel neural network models and improving benchmarks. In contrast, application developers are interested in making models suitable for actual products, which involves optimizing a model for faster inference and adapting a model to various platforms (e.g., C++ and Python). In this work, to fill the gap between the two, we establish an effective procedure for optimizing a PyTorch-based research-oriented model for deployment, taking ESPnet, a widely used toolkit for speech processing, as an instance. We introduce different techniques to ESPnet, including converting a model into an ONNX format, fusing nodes in a graph, and quantizing parameters, which lead to approximately 1.3-2$\times$ speedup in various tasks (i.e., ASR, TTS, speech translation, and spoken language understanding) while keeping its performance without any additional training. Our ESPnet-ONNX will be publicly available at https://github.com/espnet/espnet_onnx
Submitted 14 November, 2022; v1 submitted 20 September, 2022;
originally announced September 2022.
-
Lagrange vs. Lyapunov stability of hierarchical triple systems: dependence on the mutual inclination between inner and outer orbits
Authors:
Toshinori Hayashi,
Alessandro A. Trani,
Yasushi Suto
Abstract:
While there have been many studies examining the stability of hierarchical triple systems, the meaning of ``stability'' is somewhat vague and has been interpreted differently in the previous literature. The present paper focuses on ``Lagrange stability'', which roughly refers to the stability against the escape of a body from the system, or ``disruption'' of the triple system, in contrast to ``Lyapunov-like stability'' that is related to the chaotic nature of the system dynamics. We compute the evolution of triple systems using direct $N$-body simulations up to $10^7 P_\mathrm{out}$, which is significantly longer than previous studies (with $P_\mathrm{out}$ being the initial orbital period of the outer body). We obtain the resulting disruption timescale $T_\mathrm{d}$ as a function of the triple orbital parameters with particular attention to the dependence on the mutual inclination between the inner and outer orbits, $i_\mathrm{mut}$. By doing so, we have clarified explicitly the difference between Lagrange and Lyapunov stabilities in astronomical triples. Furthermore, we find that the von Zeipel-Kozai-Lidov oscillations significantly destabilize inclined triples (roughly with $60^\circ < i_\mathrm{mut} < 150^\circ$) relative to those with $i_\mathrm{mut}=0^\circ$. On the other hand, retrograde triples with $i_\mathrm{mut}>160^\circ$ become strongly stabilized with much longer disruption timescales. We show the sensitivity of the normalized disruption timescale $T_\mathrm{d}/P_\mathrm{out}$ to the orbital parameters of the triple system. The resulting $T_\mathrm{d}/P_\mathrm{out}$ distribution is practically more useful in a broad range of astronomical applications than the stability criterion based on the Lyapunov divergence.
Submitted 16 December, 2022; v1 submitted 18 September, 2022;
originally announced September 2022.
-
Dynamical disruption timescales and chaotic behavior of hierarchical triple systems
Authors:
Toshinori Hayashi,
Alessandro A. Trani,
Yasushi Suto
Abstract:
We examine the stability of hierarchical triple systems using direct $N$-body simulations without adopting a secular perturbation approximation. We estimate their disruption timescales in addition to the mere stable/unstable criterion, with particular attention to the mutual inclination between the inner and outer orbits. First, we improve the fit to the dynamical stability criterion by \citet{Mardling1999,Mardling2001} widely adopted in the previous literature. In particular, we find that the stability boundary is very sensitive to the mutual inclination; coplanar retrograde triples and orthogonal triples are much more stable and unstable, respectively, than coplanar prograde triples. Next, we estimate the disruption timescales of triples satisfying the stability condition up to $10^9$ times the inner orbital period. The timescales follow the scaling predicted by \citet{Mushkin2020}, especially at high $e_\mathrm{out}$ where their random walk model is most valid. We obtain an improved empirical fit to the disruption timescales, which indicates that the coplanar retrograde triples are significantly more stable than the previous prediction. We furthermore find that the dependence on the mutual inclination can be explained by the energy transfer model based on a parabolic encounter approximation. We also show that the disruption timescales of triples are highly sensitive to tiny changes in the initial parameters, reflecting the genuine chaotic nature of the dynamics of those systems.
Submitted 5 September, 2022; v1 submitted 26 July, 2022;
originally announced July 2022.
-
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Authors:
Wen-Chin Huang,
Shu-Wen Yang,
Tomoki Hayashi,
Tomoki Toda
Abstract:
We present a large-scale comparative study of self-supervised speech representation (S3R)-based voice conversion (VC). In the context of recognition-synthesis VC, S3Rs are attractive owing to their potential to replace expensive supervised representations such as phonetic posteriorgrams (PPGs), which are commonly adopted by state-of-the-art VC systems. Using S3PRL-VC, an open-source VC software we previously developed, we provide a series of in-depth objective and subjective analyses under three VC settings: intra-/cross-lingual any-to-one (A2O) and any-to-any (A2A) VC, using the voice conversion challenge 2020 (VCC2020) dataset. We investigated S3R-based VC in various aspects, including model type, multilinguality, and supervision. We also studied the effect of a post-discretization process with k-means clustering and showed how it improves performance in the A2A setting. Finally, the comparison with state-of-the-art VC systems demonstrates the competitiveness of S3R-based VC and also sheds light on possible directions for improvement.
Submitted 9 July, 2022;
originally announced July 2022.
-
Sensitivity of the GAPS Experiment to Low-energy Cosmic-ray Antiprotons
Authors:
Field Rogers,
Tsuguo Aramaki,
Mirko Boezio,
Steven Boggs,
Valter Bonvicini,
Gabriel Bridges,
Donatella Campana,
William W. Craig,
Philip von Doetinchem,
Eric Everson,
Lorenzo Fabris,
Sydney Feldman,
Hideyuki Fuke,
Florian Gahbauer,
Cory Gerrity,
Charles J. Hailey,
Takeru Hayashi,
Akiko Kawachi,
Masayoshi Kozai,
Alex Lenni,
Alexander Lowell,
Massimo Manghisoni,
Nadir Marcelli,
Brent Mochizuki,
Isaac Mognet
, et al. (28 additional authors not shown)
Abstract:
The General Antiparticle Spectrometer (GAPS) is an upcoming balloon mission to measure low-energy cosmic-ray antinuclei during at least three ~35-day Antarctic flights. With its large geometric acceptance and novel exotic atom-based particle identification, GAPS will detect ~500 cosmic antiprotons per flight and produce a precision cosmic antiproton spectrum in the kinetic energy range of ~0.07-0.21 GeV/n at the top of the atmosphere. With these high statistics extending to lower energies than any previous experiment, and with complementary sources of experimental uncertainty compared to traditional magnetic spectrometers, the GAPS antiproton measurement will be sensitive to dark matter, primordial black holes, and cosmic ray propagation. The antiproton measurement will also validate the GAPS antinucleus identification technique for the antideuteron and antihelium rare-event searches. This analysis demonstrates the GAPS sensitivity to cosmic-ray antiprotons using a full instrument simulation and event reconstruction, and including solar and atmospheric effects.
Submitted 5 November, 2022; v1 submitted 26 June, 2022;
originally announced June 2022.
-
Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure
Authors:
Ibuki Kuroyanagi,
Tomoki Hayashi,
Kazuya Takeda,
Tomoki Toda
Abstract:
Anomalous sound detection systems must detect unknown, atypical sounds using only normal audio data. Conventional methods use the serial method, a combination of outlier exposure (OE), which classifies normal and pseudo-anomalous data and obtains embedding, and inlier modeling (IM), which models the probability distribution of the embedding. Although the serial method shows high performance due to the powerful feature extraction of OE and the robustness of IM, OE still does not work well when the normal and pseudo-anomalous data are too similar or too different. To explicitly distinguish these data, the proposed method uses multi-task learning of two binary cross-entropies when training OE. The first is a loss that classifies which product of the target machine the sound is emitted from, which deals with the case where the normal data and the pseudo-anomalous data are too similar. The second is a loss that identifies whether the sound is emitted from the target machine or not, which deals with the case where the normal data and the pseudo-anomalous data are too different. We perform our experiments with the DCASE 2021 Task 2 dataset. Our proposed single-model method outperforms the top-ranked method, which combines multiple models, by 2.1% in AUC.
Submitted 13 June, 2022;
originally announced June 2022.
-
Filtrations on the globalization of twisted D-modules over Dedekind schemes
Authors:
Takuma Hayashi
Abstract:
Fabian Januszewski and the author established the theory of twisted D-modules over general base schemes. In this short note, we construct a $K$-invariant positive exhaustive filtration on the globalization of the twisted D-module on a smooth quasi-compact $K$-scheme over a Dedekind scheme $S$ obtained by the direct image of a $K$-equivariant twisted integrable connection along a $K$-equivariant closed immersion from a smooth proper $K$-scheme $Y$ with $K$ a smooth $S$-affine group scheme, whose $p$th associated graded $\mathcal{O}_S$-module is locally free of finite rank for every integer $p$. In particular, the $k$-module of its global sections is projective if $S$ is affine with coordinate ring $k$.
Submitted 22 April, 2024; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Authors:
Jiatong Shi,
Shuai Guo,
Tao Qian,
Nan Huo,
Tomoki Hayashi,
Yuning Wu,
Frank Xu,
Xuankai Chang,
Huazhe Li,
Peter Wu,
Shinji Watanabe,
Qin Jin
Abstract:
This paper introduces a new open-source platform named Muskits for end-to-end music processing, which mainly focuses on end-to-end singing voice synthesis (E2E-SVS). Muskits supports state-of-the-art SVS models, including RNN SVS, transformer SVS, and XiaoiceSing. The design of Muskits follows the style of widely-used speech processing toolkits, ESPnet and Kaldi, for data preprocessing, training, and recipe pipelines. To the best of our knowledge, this toolkit is the first platform that allows a fair and highly-reproducible comparison between several published works in SVS. In addition, we also demonstrate several advanced usages based on the toolkit functionalities, including multilingual training and transfer learning. This paper describes the major framework of Muskits, its functionalities, and experimental results in single-singer, multi-singer, multilingual, and transfer learning scenarios. The toolkit is publicly available at https://github.com/SJTMusicTeam/Muskits.
Submitted 2 July, 2022; v1 submitted 9 May, 2022;
originally announced May 2022.
-
Learning General Inventory Management Policy for Large Supply Chain Network
Authors:
Soh Kumabe,
Shinya Shiroshita,
Takanori Hayashi,
Shirou Maruyama
Abstract:
Inventory management in warehouses directly affects profits made by manufacturers. Particularly, large manufacturers produce a very large variety of products that are handled by a significantly large number of retailers. In such a case, the computational complexity of classical inventory management algorithms is inordinately large. In recent years, learning-based approaches have become popular for addressing such problems. However, previous studies have not addressed systems where both the number of products and the number of retailers are large. This study proposes a reinforcement learning-based warehouse inventory management algorithm that can be used for supply chain systems where both the number of products and the number of retailers are large. To solve the computational problem of handling large systems, we provide a means of approximate simulation of the system in the training phase. Our experiments on both real and artificial data demonstrate that our algorithm with approximated simulation can successfully handle large supply chain networks.
Submitted 28 April, 2022;
originally announced April 2022.
-
Acoustic Event Detection with Classifier Chains
Authors:
Tatsuya Komatsu,
Shinji Watanabe,
Koichi Miyazaki,
Tomoki Hayashi
Abstract:
This paper proposes acoustic event detection (AED) with classifier chains, a new classifier based on the probabilistic chain rule. The proposed AED with classifier chains consists of a gated recurrent unit and performs iterative binary detection of each event one by one. In each iteration, the event's activity is estimated and used to condition the next output based on the probabilistic chain rule to form classifier chains. Therefore, the proposed method can handle the interdependence among events upon classification, while conventional AED methods that use multiple binary classifiers with a linear layer and sigmoid function assume conditional independence. In experiments with a real-recording dataset, the proposed method demonstrates superior AED performance, with a relative 14.80% improvement over a convolutional recurrent neural network baseline system with multiple binary classifiers.
Submitted 17 February, 2022;
originally announced February 2022.
-
On Conjugacy of Subalgebras in Graph $C^*$-Algebras. II
Authors:
Tomohiro Hayashi,
Jeong Hee Hong,
Wojciech Szymański
Abstract:
We apply a method inspired by Popa's intertwining-by-bimodules technique to investigate inner conjugacy of MASAs in graph $C^*$-algebras. First we give a new proof of non-inner conjugacy of the diagonal MASA ${\mathcal D}_E$ to its non-trivial image under a quasi-free automorphism, where $E$ is a finite transitive graph. Changing graphs representing the algebras, this result applies to some non-quasi-free automorphisms as well. Then we exhibit a large class of MASAs in the Cuntz algebra ${\mathcal O}_n$ that are not inner conjugate to the diagonal ${\mathcal D}_n$.
Submitted 15 November, 2022; v1 submitted 20 January, 2022;
originally announced January 2022.
-
Pattern formation of elliptic particles by two-body interactions: a model for dynamics of endothelial cells in angiogenesis
Authors:
Tatsuya Hayashi,
Fumitaka Yura,
Jun Mada,
Hiroki Kurihara,
Tetsuji Tokihiro
Abstract:
A two-dimensional mathematical model for the dynamics of endothelial cells in angiogenesis is investigated. Angiogenesis is a morphogenic process in which new blood vessels emerge from an existing vascular network. Recently a one-dimensional discrete dynamical model has been proposed to reproduce elongation, bifurcation, and cell motility such as cell-mixing during angiogenesis on the assumption of a simple two-body interaction between endothelial cells. The present model is its two-dimensional extension, where endothelial cells are represented as ellipses with three two-body interactions: repulsion due to the excluded volume effect, attraction through pseudopodia, and rotation by contact. We show that the oblateness of the ellipses and the magnitude of contact rotation significantly affect the shape of the created vascular patterns and the elongation of branches.
Submitted 9 January, 2022;
originally announced January 2022.
-
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Authors:
Jing Shi,
Xuankai Chang,
Tomoki Hayashi,
Yen-Ju Lu,
Shinji Watanabe,
Bo Xu
Abstract:
Deep learning based models have significantly improved the performance of speech separation with input mixtures like the cocktail party. Prominent methods (e.g., frequency-domain and time-domain speech separation) usually build regression models to predict the ground-truth speech from the mixture, using the masking-based design and the signal-level loss criterion (e.g., MSE or SI-SNR). This study demonstrates, for the first time, that the synthesis-based approach can also perform well on this problem, with great flexibility and strong potential. Specifically, we propose a novel speech separation/enhancement model based on the recognition of discrete symbols, and convert the paradigm of the speech separation/enhancement related tasks from regression to classification. By utilizing the synthesis model with the input of discrete symbols, after the prediction of discrete symbol sequence, each target speech could be re-synthesized. Evaluation results based on the WSJ0-2mix and VCTK-noisy corpora in various settings show that our proposed method can steadily synthesize the separated speech with high speech quality and without any interference, which is difficult to avoid in regression-based methods. In addition, with negligible loss of listening quality, the speaker conversion of enhanced/separated speech could be easily realized through our method.
Submitted 9 January, 2022; v1 submitted 17 December, 2021;
originally announced December 2021.
-
Vacuum decay in the Lorentzian path integral
Authors:
Takumi Hayashi,
Kohei Kamada,
Naritaka Oshita,
Jun'ichi Yokoyama
Abstract:
We apply the Lorentzian path integral to the decay of a false vacuum and estimate the false-vacuum decay rate. To make the Lorentzian path integral convergent, the deformation of an integral contour is performed by following the Picard-Lefschetz theory. We show that the nucleation rate of a critical bubble, for which the corresponding bounce action is extremized, has the same exponent as the Euclidean approach. We also extend our computation to the nucleation of a bubble larger or smaller than the critical one to which the Euclidean formalism is not applicable.
Submitted 14 January, 2022; v1 submitted 16 December, 2021;
originally announced December 2021.
-
ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment
Authors:
Robin Karlsson,
Tomoki Hayashi,
Keisuke Fujii,
Alexander Carballo,
Kento Ohtani,
Kazuya Takeda
Abstract:
Recent self-supervised models have demonstrated equal or better performance than supervised methods, opening the way for AI systems to learn visual representations from practically unlimited data. However, these methods are typically classification-based and thus ineffective for learning high-resolution feature maps that preserve precise spatial information. This work introduces superpixels to improve self-supervised learning of dense semantically rich visual concept embeddings. Decomposing images into a small set of visually coherent regions reduces the computational complexity by $\mathcal{O}(1000)$ while preserving detail. We experimentally show that contrasting over regions improves the effectiveness of contrastive learning methods, extends their applicability to high-resolution images, and improves overclustering performance; that superpixels are better than grids; and that regional masking improves performance. The expressiveness of our dense embeddings is demonstrated by improving the SOTA unsupervised semantic segmentation benchmark on Cityscapes, and for convolutional models on COCO.
Submitted 7 October, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
$\mathrm{SO}(3)$-homogeneous decomposition of the flag scheme of $\mathrm{SL}_3$ over $\mathbb{Z}\left[1/2\right]$
Authors:
Takuma Hayashi
Abstract:
In this paper, we give $\mathbb{Z}\left[1/2\right]$-forms of $\mathrm{SO}(3,\mathbb{C})$-orbits in the flag variety of $\mathrm{SL}_3(\mathbb{C})$. We also prove that they give a $\mathbb{Z}\left[1/2\right]$-form of the $\mathrm{SO}(3,\mathbb{C})$-orbit decomposition of the flag variety of $\mathrm{SL}_3(\mathbb{C})$.
Submitted 14 February, 2024; v1 submitted 15 November, 2021;
originally announced November 2021.
-
Feature Concepts for Data Federative Innovations
Authors:
Yukio Ohsawa,
Sae Kondo,
Teruaki Hayashi
Abstract:
A feature concept, the essence of the data-federative innovation process, is presented as a model of the concept to be acquired from data. A feature concept may be a simple feature, such as a single variable, but is more likely to be a conceptual illustration of the abstract information to be obtained from the data. For example, trees and clusters are feature concepts for decision tree learning and clustering, respectively. Useful feature concepts for satisfying the requirements of users of data have been elicited so far via creative communication among stakeholders in the market of data. In this short paper, such creative communication is reviewed, showing a couple of applications, for example, change explanation in markets and earthquakes, and highlighting the feature concepts elicited in these cases.
Submitted 5 November, 2021;
originally announced November 2021.
-
ESPnet2-TTS: Extending the Edge of TTS Research
Authors:
Tomoki Hayashi,
Ryuichi Yamamoto,
Takenori Yoshimura,
Peter Wu,
Jiatong Shi,
Takaaki Saeki,
Yooncheol Ju,
Yusuke Yasuda,
Shinnosuke Takamichi,
Shinji Watanabe
Abstract:
This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS) toolkit. ESPnet2-TTS extends our earlier version, ESPnet-TTS, by adding many new features, including: on-the-fly flexible pre-processing, joint training with neural vocoders, and state-of-the-art TTS models with extensions like full-band E2E text-to-waveform modeling, which simplify the training pipeline and further enhance TTS performance. The unified design of our recipes enables users to quickly reproduce state-of-the-art E2E-TTS results. We also provide many pre-trained models in a unified Python interface for inference, offering a quick means for users to generate baseline samples and build demos. Experimental evaluations with English and Japanese corpora demonstrate that our provided models synthesize utterances comparable to ground-truth ones, achieving state-of-the-art TTS performance. The toolkit is available online at https://github.com/espnet/espnet.
Submitted 14 October, 2021;
originally announced October 2021.
-
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
Authors:
Wen-Chin Huang,
Shu-Wen Yang,
Tomoki Hayashi,
Hung-Yi Lee,
Shinji Watanabe,
Tomoki Toda
Abstract:
This paper introduces S3PRL-VC, an open-source voice conversion (VC) framework based on the S3PRL toolkit. In the context of recognition-synthesis VC, self-supervised speech representation (S3R) is valuable in its potential to replace the expensive supervised representation adopted by state-of-the-art VC systems. Moreover, we claim that VC is a good probing task for S3R analysis. In this work, we provide a series of in-depth analyses by benchmarking on the two tasks in VCC2020, namely intra-/cross-lingual any-to-one (A2O) VC, as well as an any-to-any (A2A) setting. We also provide comparisons not only between different S3Rs but also with top systems in VCC2020 that use supervised representations. Systematic objective and subjective evaluations were conducted, and we show that S3R is comparable with VCC2020 top systems in the A2O setting in terms of similarity, and achieves state-of-the-art performance in S3R-based A2A VC. We believe the extensive analysis, as well as the toolkit itself, contributes to not only the S3R community but also the VC community. The codebase is now open-sourced.
Submitted 12 October, 2021;
originally announced October 2021.
-
Collaborative Problem Solving on a Data Platform Kaggle
Authors:
Teruaki Hayashi,
Takumi Shimizu,
Yoshiaki Fukami
Abstract:
Data exchange across different domains has gained much attention as a way of creating new businesses and improving the value of existing services. The data exchange ecosystem is developed by platform services that facilitate data and knowledge exchange and offer co-creation environments for organizations to promote their problem-solving. In this study, we investigate Kaggle, a data analysis competition platform, and discuss the characteristics of data and the ecosystem that contribute to collaborative problem-solving by analyzing the datasets, users, and their relationships.
Submitted 25 July, 2021;
originally announced July 2021.
-
Breaking the degeneracy in magnetic cataclysmic variable X-ray spectral modeling using X-ray light curves
Authors:
Diogo Belloni,
Claudia V. Rodrigues,
Matthias R. Schreiber,
Manuel Castro,
Joaquim E. R. Costa,
Takayuki Hayashi,
Isabel J. Lima,
Gerardo J. M. Luna,
Murilo Martins,
Alexandre S. Oliveira,
Steven G. Parsons,
Karleyne M. G. Silva,
Paulo E. Stecchini,
Teresa J. Stuchi,
Monica Zorotovic
Abstract:
We present an analysis of mock X-ray spectra and light curves of magnetic cataclysmic variables using an upgraded version of the 3D CYCLOPS code. This 3D representation of the accretion flow allows us to properly model total and partial occultation of the post-shock region by the white dwarf as well as the modulation of the X-ray light curves due to the phase-dependent extinction of the pre-shock region. We carried out detailed post-shock region modeling in a four-dimensional parameter space by varying the white dwarf mass and magnetic field strength as well as the magnetosphere radius and the specific accretion rate. To calculate the post-shock region temperature and density profiles, we assumed equipartition between ions and electrons, took into account the white dwarf gravitational potential, the finite size of the magnetosphere and a dipole-like magnetic field geometry, and considered cooling by both bremsstrahlung and cyclotron radiative processes. By investigating the impact of the parameters on the resulting X-ray continuum spectra, we show that there is an inevitable degeneracy in the four-dimensional parameter space investigated here, which compromises X-ray continuum spectral fitting strategies and can lead to incorrect parameter estimates. However, the inclusion of X-ray light curves in different energy ranges can break this degeneracy, and it therefore remains, in principle, possible to use X-ray data to derive fundamental parameters of magnetic cataclysmic variables, which represents an essential step toward understanding their formation and evolution.
Submitted 22 July, 2021;
originally announced July 2021.
-
On Prosody Modeling for ASR+TTS based Voice Conversion
Authors:
Wen-Chin Huang,
Tomoki Hayashi,
Xinjian Li,
Shinji Watanabe,
Tomoki Toda
Abstract:
In voice conversion (VC), an approach showing promising results in the latest voice conversion challenge (VCC) 2020 is to first use an automatic speech recognition (ASR) model to transcribe the source speech into the underlying linguistic contents; these are then used as input by a text-to-speech (TTS) system to generate the converted speech. Such a paradigm, referred to as ASR+TTS, overlooks the modeling of prosody, which plays an important role in speech naturalness and conversion similarity. Although some researchers have considered transferring prosodic clues from the source speech, there arises a speaker mismatch during training and conversion. To address this issue, in this work, we propose to directly predict prosody from the linguistic representation in a target-speaker-dependent manner, referred to as target text prediction (TTP). We evaluate both methods on the VCC2020 benchmark and consider different linguistic representations. The results demonstrate the effectiveness of TTP in both objective and subjective evaluations.
Submitted 20 July, 2021;
originally announced July 2021.