scispace - formally typeset
Search or ask a question
Author

Reitaro Tokumasu

Bio: Reitaro Tokumasu is an academic researcher from IBM. The author has contributed to research in topics: Muscle disorder & Genome. The author has an hindex of 2, co-authored 8 publications receiving 19 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: WfG was able to scour large volumes of data from scientific studies and databases to analyze in-house clinical genome sequencing results and demonstrated the potential for application to clinical practice; however, it is necessary to train WfG in clinical trial settings.
Abstract: Background: Oncologists increasingly rely on clinical genome sequencing to pursue effective, molecularly targeted therapies. This study assesses the validity and utility of the artificial intelligence Watson for Genomics (WfG) for analyzing clinical sequencing results. Methods: This study identified patients with solid tumors who participated in in-house genome sequencing projects at a single cancer specialty hospital between April 2013 and October 2016. Targeted genome sequencing results of these patients’ tumors, previously analyzed by multidisciplinary specialists at the hospital, were reanalyzed by WfG. This study measures the concordance between the two evaluations. Results: In 198 patients, in-house genome sequencing detected 785 gene mutations, 40 amplifications, and 22 fusions after eliminating single nucleotide polymorphisms. Breast cancer (n = 40) was the most frequent diagnosis in this analysis, followed by gastric cancer (n=31), and lung cancer (n =30). Frequently detected single nucleotide variants were found in TP53 (n = 107), BRCA2 (n =24), and NOTCH2 (n = 23). MYC (n = 10) was the most frequently detected gene amplification, followed by ERBB2 (n = 9) and CCND1 (n = 6). Concordant pathogenic classifications (i.e., pathogenic, benign, or variant of unknown significance) between in-house specialists and WfG included 705 mutations (89.8%; 95% CI, 87.5%–91.8%), 39 amplifications (97.5%; 95% CI, 86.8%–99.9%), and 17 fusions (77.3%; 95% CI, 54.6%–92.2%). After about 12 months, reanalysis using a more recent version of WfG demonstrated a better concordance rate of 94.5% (95% CI, 92.7%–96.0%) for gene mutations. Across the 249 gene alterations determined to be pathogenic by both methods, including mutations, amplifications, and fusions, WfG covered 84.6% (88 of 104) of all targeted therapies that experts proposed and offered an additional 225 therapeutic options. Conclusions: WfG was able to scour large volumes of data from scientific studies and databases to analyze in-house clinical genome sequencing results and demonstrated the potential for application to clinical practice; however, we must train WfG in clinical trial settings.

14 citations

Posted ContentDOI
02 Mar 2021-medRxiv
TL;DR: In this paper, the authors analyzed 30493 genomes sampled in Japan were analyzed to understand the strains, heterogeneity and temporal evolution of different SARS-CoV-2 strains and identified 12 discrete strains with a substantial number of cases with most strains possessing the spike (S) D614G and nucleocapsid (N) 203_204delinsKR mutations.
Abstract: COVID-19 caused by SARS-CoV-2 was first identified in Japan on January 15th, 2020, soon after the pandemic originated in Wuhan, China. Subsequently, Japan experienced three distinct waves of the outbreak in the span of a year and has been attributed to new exogenous strains and evolving existing strains. Japan engaged very early on in tracking different COVID-19 strains and have sequenced approximately 5% of all confirmed cases. While Japan has enforced stringent airport surveillance on cross-border travelers and returnees, some carriers appear to have advanced through the quarantine stations undetected. In this study 30493 genomes sampled in Japan were analyzed to understand the strains, heterogeneity and temporal evolution of different SARS-CoV-2 strains. We identified 12 discrete strains with a substantial number of cases with most strains possessing the spike (S) D614G and nucleocapsid (N) 203_204delinsKR mutations. 155 distinct strains have been introduced into Japan and 39 of them were introduced after strict quarantine policy was implemented. In particular, the B.1.1.7 strain, that emerged in the United Kingdom (UK) in September 2020, has been circulating in Japan since late 2020 after eluding cross-border quarantine stations. Similarly, the B.1.351 strain dubbed the South African variant, P.1 Brazilian strain and R.1 strain with the spike E484K mutation have been detected in Japan. At least 14 exogenous B.1.1.7 sub-strains have been independently introduced in Japan as of late March 2021, and these strains carry mutations that give selective advantage including N501Y, H69_V70del, and E484K that confer increased transmissibility, reduced efficacy to vaccines and possible increased virulence. Furthermore, various strains, which harbor multiple variants in the PCR primers and the probe developed by National Institute of Infectious Disease (NIID), are emerging. It is imperative that the quarantine policy be revised, cross-border surveillance reinforced, and new public health measures implemented to mitigate further transmission of this deadly disease and to identify strains that may engender resistance to vaccines.

8 citations

Proceedings ArticleDOI
01 Apr 2019
TL;DR: A new classifier is developed, which is called the aggregated classifier, that improves the classification accuracy of existing classifiers to deal with histopathological images and suggests that causative genes may possibly be precisely predicted only by hematoxylin-eosin stained images without genetic testing.
Abstract: Muscle histopathology is the one of the most important diagnostic methods in the diagnosis of muscle diseases [1]. Nevertheless, it is a highly specialized field and only a limited number of experts are available in the world. Not surprisingly, significant number of cases are undiagnosed in underserved areas. We therefore intended to establish a computer-aided diagnostic system which should be helpful in this domain. This study aims to develop a multi-class classifier with deep learning for computer-aided muscle histopathological diagnosis. We chose five genetic muscle disease categories (dystrophinopathy, limb-girdle muscular dystrophy 2A (LGMD2A), limb-girdle muscular dystrophy 2B (LGMD2B), Ullrich congenital muscular dystrophy (UCMD), and Fukuyama-type congenital muscular dystrophy (FCMD)) as targets and aimed to distinguish between these diseases. We developed a new classifier, which we call the aggregated classifier, that improves the classification accuracy of existing classifiers to deal with histopathological images. The classifier achieved better classification accuracy than not only the existing classifier but also eight physicians who have been trained for muscle histopathology for variable duration. This result suggests that causative genes may possibly be precisely predicted only by hematoxylin-eosin (H&E) stained images without genetic testing.

3 citations

Journal ArticleDOI
TL;DR: In this article, a deep convolutional neural network (CNN) was used to classify muscle biopsy samples for classifying and diagnosing muscle diseases, including idiopathic inflammatory myopathies and hereditary muscle diseases.

3 citations

Proceedings ArticleDOI
16 Jun 2020
TL;DR: The proposed feature extraction method for prediction model for at the early stage of diabetic kidney disease (DKD) progression uses with hierarchical clustering that can estimate a suitable interval for grouping inputted sequences.
Abstract: In this paper, we propose feature extraction method for prediction model for at the early stage of diabetic kidney disease (DKD) progression DKD needs continuous treatment; however, a hospital visit interval of a patient at the early stage of DKD is normally from one month to three months, and this is not a short time period Therefore it makes difficult to apply sophisticated approaches such as using convolutional neural networks because of the data limitation The propose method uses with hierarchical clustering that can estimate a suitable interval for grouping inputted sequences We evaluate the proposed method with a real-EMR dataset that consists of 30,810 patient records and conclude that the proposed method outperforms the baseline methods derived from related work

2 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: In this article, the trajectory of SARS-CoV2 variants circulating in a major metropolitan area, documents B.1.7 as the major cause of new cases in Houston, TX, and heralds the arrival of B.617 variants in the metroplex.
Abstract: Certain genetic variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are of substantial concern because they may be more transmissible or detrimentally alter the pandemic course and disease features in individual patients. SARS-CoV-2 genome sequences from 12,476 patients in the Houston Methodist health care system diagnosed from January 1 through May 31, 2021 are reported here. Prevalence of the B.1.1.7 (Alpha) variant increased rapidly and caused 63% to 90% of new cases in the latter half of May. Eleven B.1.1.7 genomes had an E484K replacement in spike protein, a change also identified in other SARS-CoV-2 lineages. Compared with non-B.1.1.7-infected patients, individuals with B.1.1.7 had a significantly lower cycle threshold (a proxy for higher virus load) and significantly higher hospitalization rate. Other variants [eg, B.1.429 and B.1.427 (Epsilon), P.1 (Gamma), P.2 (Zeta), and R.1] also increased rapidly, although the magnitude was less than that in B.1.1.7. Twenty-two patients infected with B.1.617.1 (Kappa) or B.1.617.2 (Delta) variants had a high rate of hospitalization. Breakthrough cases (n = 207) in fully vaccinated patients were caused by a heterogeneous array of virus genotypes, including many not currently designated variants of interest or concern. In the aggregate, this study delineates the trajectory of SARS-CoV-2 variants circulating in a major metropolitan area, documents B.1.1.7 as the major cause of new cases in Houston, TX, and heralds the arrival of B.1.617 variants in the metroplex.

22 citations

Journal ArticleDOI
TL;DR: In this article, the authors propose six recommendations to improve AI projects in the biomedical space, especially clinical health care, and to facilitate communication between AI scientists and medical doctors: (1) Relevant and well-defined clinical question first; (2) Right data (i.e., representative and of good quality); (3) Ratio between number of patients and their variables should fit the AI method; (4) Relationship between data and ground truth should be as direct and causal as possible; (5) Regulatory ready; enabling validation; and (6) Right AI method.
Abstract: The idea of artificial intelligence (AI) has a long history. It turned out, however, that reaching intelligence at human levels is more complicated than originally anticipated. Currently, we are experiencing a renewed interest in AI, fueled by an enormous increase in computing power and an even larger increase in data, in combination with improved AI technologies like deep learning. Healthcare is considered the next domain to be revolutionized by artificial intelligence. While AI approaches are excellently suited to develop certain algorithms, for biomedical applications there are specific challenges. We propose six recommendations-the 6Rs-to improve AI projects in the biomedical space, especially clinical health care, and to facilitate communication between AI scientists and medical doctors: (1) Relevant and well-defined clinical question first; (2) Right data (ie, representative and of good quality); (3) Ratio between number of patients and their variables should fit the AI method; (4) Relationship between data and ground truth should be as direct and causal as possible; (5) Regulatory ready; enabling validation; and (6) Right AI method.

21 citations

Journal ArticleDOI
TL;DR: In this article, the transition of viral lineage in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was investigated by stationary genome analysis in Yamanashi, Japan.

16 citations

Journal ArticleDOI
TL;DR: It is expected that AI will soon be introduced in the field of endoscopic diagnosis and management of gastric cancer as a supportive tool for physicians, thus improving the quality of medical care.
Abstract: Image recognition using artificial intelligence (AI) has progressed significantly due to innovative technologies such as machine learning and deep learning. In the field of gastric cancer (GC) management, research on AI-based diagnosis such as anatomical classification of endoscopic images, diagnosis of Helicobacter pylori infection, and detection and qualitative diagnosis of GC is being conducted, and an accuracy equivalent to that of physicians has been reported. It is expected that AI will soon be introduced in the field of endoscopic diagnosis and management of gastric cancer as a supportive tool for physicians, thus improving the quality of medical care.

15 citations

Journal ArticleDOI
23 Jun 2021
TL;DR: In this article, an artificial intelligence-assisted computational method, the digital drug-assignment (DDA) system, was developed to prioritize potential molecularly targeted agents (MTA) for each cancer patient based on the complex individual molecular profile of their tumor.
Abstract: Precision oncology is currently based on pairing molecularly targeted agents (MTA) to predefined single driver genes or biomarkers. Each tumor harbors a combination of a large number of potential genetic alterations of multiple driver genes in a complex system that limits the potential of this approach. We have developed an artificial intelligence (AI)-assisted computational method, the digital drug-assignment (DDA) system, to prioritize potential MTAs for each cancer patient based on the complex individual molecular profile of their tumor. We analyzed the clinical benefit of the DDA system on the molecular and clinical outcome data of patients treated in the SHIVA01 precision oncology clinical trial with MTAs matched to individual genetic alterations or biomarkers of their tumor. We found that the DDA score assigned to MTAs was significantly higher in patients experiencing disease control than in patients with progressive disease (1523 versus 580, P = 0.037). The median PFS was also significantly longer in patients receiving MTAs with high (1000+ <) than with low (<0) DDA scores (3.95 versus 1.95 months, P = 0.044). Our results indicate that AI-based systems, like DDA, are promising new tools for oncologists to improve the clinical benefit of precision oncology.

13 citations