Home
/
Authors
/
Kohji Okamura

Author

Kohji Okamura

Other affiliations: Ochanomizu University, The Centre for Applied Genomics, University of Toronto

Bio: Kohji Okamura is an academic researcher from University of Tokyo. The author has contributed to research in topics: DNA methylation & CpG site. The author has an hindex of 20, co-authored 67 publications receiving 5836 citations. Previous affiliations of Kohji Okamura include Ochanomizu University & The Centre for Applied Genomics.

Topics: DNA methylation, CpG site, Gene, Genomic imprinting, Promoter ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2011
2010
2008
2007
2006
2005
2004
2000
1997
1995

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Global variation in copy number in the human genome

[...]

Richard Redon¹, Shumpei Ishikawa², Karen R. Fitch³, Lars Feuk⁴, George H. Perry⁵, T. Daniel Andrews¹, Heike Fiegler¹, Michael H. Shapero³, Andrew R. Carson⁴, Wenwei Chen³, Eun Kyung Cho⁶, Stephanie Dallaire⁶, Jennifer L. Freeman⁶, Juan R. González⁷, Mònica Gratacòs⁷, Jing Huang³, Dimitrios Kalaitzopoulos¹, Daisuke Komura², Jeffrey R. MacDonald⁴, Christian R. Marshall⁴, Rui Mei³, Lyndal Montgomery¹, Keunihiro Nishimura², Kohji Okamura⁴, Fan Shen³, Martin J. Somerville⁸, Joelle Tchinda⁶, Armand Valsesia¹, Cara Woodwark¹, Fengtang Yang¹, Junjun Zhang⁴, Tatiana Zerjal¹, Jane Zhang³, Lluís Armengol⁷, Donald F. Conrad⁹, Xavier Estivill⁷, Chris Tyler-Smith¹, Nigel P. Carter¹, Hiroyuki Aburatani², Charles Lee⁶, Keith W. Jones³, Stephen W. Scherer⁴, Matthew E. Hurles¹ - Show less +39 more•Institutions (9)

Wellcome Trust Sanger Institute¹, University of Tokyo², Thermo Fisher Scientific³, University of Toronto⁴, Brigham and Women's Hospital⁵, Harvard University⁶, Pompeu Fabra University⁷, University of Alberta⁸, University of Chicago⁹

23 Nov 2006-Nature

TL;DR: A first-generation CNV map of the human genome is constructed through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia, underscoring the importance of CNV in genetic diversity and evolution and the utility of this resource for genetic disease studies.

...read moreread less

Abstract: Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.

...read moreread less

4,275 citations

Journal Article•DOI•

Targeted DNA demethylation in vivo using dCas9-peptide repeat and scFv-TET1 catalytic domain fusions

[...]

Sumiyo Morita¹, Hirofumi Noguchi², Takuro Horii¹, Kazuhiko Nakabayashi, Mika Kimura¹, Kohji Okamura, Atsuhiko Sakai², Hideyuki Nakashima², Kenichiro Hata, Kinichi Nakashima², Izuho Hatada¹ - Show less +7 more•Institutions (2)

Gunma University¹, Kyushu University²

01 Oct 2016-Nature Biotechnology

TL;DR: Targeted demethylation of CpGs in regulatory regions and dem methylation-dependent 1.7- to 50-fold upregulation of associated genes both in cell culture (embryonic stem cells, cancer cell lines, primary neural precursor cells) and in vivo in mouse fetuses are demonstrated.

...read moreread less

Abstract: Despite the importance of DNA methylation in health and disease, technologies to readily manipulate methylation of specific sequences for functional analysis and therapeutic purposes are lacking. Here we adapt the previously described dCas9-SunTag for efficient, targeted demethylation of specific DNA loci. The original SunTag consists of ten copies of the GCN4 peptide separated by 5-amino-acid linkers. To achieve efficient recruitment of an anti-GCN4 scFv fused to the ten-eleven (TET) 1 hydroxylase, which induces demethylation, we changed the linker length to 22 amino acids. The system attains demethylation efficiencies >50% in seven out of nine loci tested. Four of these seven loci showed demethylation of >90%. We demonstrate targeted demethylation of CpGs in regulatory regions and demethylation-dependent 1.7- to 50-fold upregulation of associated genes both in cell culture (embryonic stem cells, cancer cell lines, primary neural precursor cells) and in vivo in mouse fetuses.

...read moreread less

374 citations

Journal Article•DOI•

Genome-wide parent-of-origin DNA methylation analysis reveals the intricacies of human imprinting and suggests a germline methylation-independent mechanism of establishment

[...]

Franck Court, Chiharu Tayama, Valeria Romanelli, Alex Martin-Trujillo, Isabel Iglesias-Platas, Kohji Okamura, Naoko Sugahara, Carlos Simón¹, Harry Moore², Julie V. Harness³, Hans S. Keirstead³, Jose V. Sanchez-Mut, Eisuke Kaneki⁴, Pablo Lapunzina⁵, Hidenobu Soejima⁶, Norio Wake⁴, Manel Esteller⁷, Manel Esteller⁸, Tsutomu Ogata⁹, Kenichiro Hata, Kazuhiko Nakabayashi, David Monk - Show less +18 more•Institutions (9)

University of Valencia¹, University of Sheffield², University of California, Irvine³, Kyushu University⁴, Autonomous University of Madrid⁵, Saga University⁶, Catalan Institution for Research and Advanced Studies⁷, University of Barcelona⁸, Hamamatsu University School of Medicine⁹

01 Apr 2014-Genome Research

TL;DR: Pl placental-specific imprinting provides evidence for an inheritable epigenetic state that is independent of DNA methylation and the existence of a novel imprinting mechanism at these loci.

...read moreread less

Abstract: Genomic imprinting is a form of epigenetic regulation that results in the expression of either the maternally or paternally inherited allele of a subset of genes (Ramowitz and Bartolomei 2011). This imprinted expression of transcripts is crucial for normal mammalian development. In humans, loss-of-imprinting of specific loci results in a number of diseases exemplified by the reciprocal growth phenotypes of the Beckwith-Wiedemann and Silver-Russell syndromes, and the behavioral disorders Angelman and Prader-Willi syndromes (Kagami et al. 2008; Buiting 2010; Choufani et al. 2010; Eggermann 2010; Kelsey 2010; Mackay and Temple 2010). In addition, aberrant imprinting also contributes to multigenic disorders associated with various complex traits and cancer (Kong et al. 2009; Monk 2010). Imprinted loci contain differentially methylated regions (DMRs) where cytosine methylation marks one of the parental alleles, providing cis-acting regulatory elements that influence the allelic expression of surrounding genes. Some DMRs acquire their allelic methylation during gametogenesis, when the two parental genomes are separated, resulting from the cooperation of the de novo methyltransferase DNMT3A and its cofactor DNMT3L (Bourc'his et al. 2001; Hata et al. 2002). These primary, or germline imprinted DMRs are stably maintained throughout somatic development, surviving the epigenetic reprogramming at the oocyte-to-embryo transition (Smallwood et al. 2011; Smith et al. 2012). To confirm that an imprinted DMR functions as an imprinting control region (ICR), disruption of the imprinted expression upon genetic deletion of that DMR, either through experimental targeting in mouse or that which occurs spontaneously in humans, is required. A subset of DMRs, known as secondary DMRs, acquire methylation during development and are regulated by nearby germline DMRs in a hierarchical fashion (Coombes et al. 2003; Lopes et al. 2003; Kagami et al. 2010). With the advent of large-scale, base-resolution methylation technologies, it is now possible to discriminate allelic methylation dictated by sequence variants from imprinted methylation. Yet our knowledge of the total number of imprinted DMRs in humans, and their developmental dynamics, remains incomplete, hampered by genetic heterogeneity of human samples. Here we present high-resolution mapping of human imprinted methylation. We performed whole-genome-wide bisulfite sequencing (WGBS) on leukocyte-, brain-, liver-, and placenta-derived DNA samples to identify partially methylated regions common to all tissues consistent with imprinted DMRs. We subsequently confirmed the partial methylated states in tissues using high-density methylation microarrays. The parental origin of methylation was determined by comparing microarray data for DNA samples from reciprocal genome-wide uniparental disomy (UPD) samples, in which all chromosomes are inherited from one parent (Lapunzina and Monk 2011), and androgenetic hydatidiform moles, which are created by the fertilization of an oocyte lacking a nucleus by a sperm that endoreduplicates. The use of uniparental disomies and hydatidiform moles meant that our analyses were not subjected to genotype influences, enabling us to characterize all known imprinted DMRs at base-pair resolution and to identify 21 imprinted domains, which we show are absent in mice. Lastly, we extended our analyses to determine the methylation profiles of all imprinted DMRs in sperm, stem cells derived from parthenogenetically activated metaphase-2 oocyte blastocytes (phES) (Mai et al. 2007; Harness et al. 2011), and stem cells (hES) generated from both six-cell blastomeres and the inner cell mass of blastocysts, delineating the extent of embryonic reprogramming that occurs at these loci during human development.

...read moreread less

285 citations

Journal Article•DOI•

Human genetic variation database, a reference database of genetic variations in the Japanese population.

[...]

Koichiro Higasa¹, Noriko Miyake², Jun Yoshimura³, Kohji Okamura, Tetsuya Niihori⁴, Hirotomo Saitsu², Koichiro Doi³, Masakazu Shimizu¹, Kazuhiko Nakabayashi, Yoko Aoki⁴, Yoshinori Tsurusaki², Shinichi Morishita³, Takahisa Kawaguchi¹, Osuke Migita⁵, Keiko Nakayama⁴, Mitsuko Nakashima², Jun Mitsui³, Maiko Narahara¹, Keiko Hayashi, Ryo Funayama⁴, Daisuke Yamaguchi, Hiroyuki Ishiura³, Wen Ya Ko¹, Wen Ya Ko⁶, Kenichiro Hata, Takeshi Nagashima⁴, Ryo Yamada¹, Yoichi Matsubara⁴, Akihiro Umezawa, Shoji Tsuji³, Naomichi Matsumoto², Fumihiko Matsuda¹ - Show less +28 more•Institutions (6)

Kyoto University¹, Yokohama City University², University of Tokyo³, Tohoku University⁴, St. Marianna University School of Medicine⁵, National Yang-Ming University⁶

01 Jun 2016-Journal of Human Genetics

TL;DR: The results illustrate the importance of constructing an ethnicity-specific reference genome for identifying rare variants and constructed a Japanese-specific major allele reference genome, by which the number of unique mapping of the short reads in the data has increased 0.045% on average.

...read moreread less

Abstract: Whole-genome and -exome resequencing using next-generation sequencers is a powerful approach for identifying genomic variations that are associated with diseases. However, systematic strategies for prioritizing causative variants from many candidates to explain the disease phenotype are still far from being established, because the population-specific frequency spectrum of genetic variation has not been characterized. Here, we have collected exomic genetic variation from 1208 Japanese individuals through a collaborative effort, and aggregated the data into a prevailing catalog. In total, we identified 156 622 previously unreported variants. The allele frequencies for the majority (88.8%) were lower than 0.5% in allele frequency and predicted to be functionally deleterious. In addition, we have constructed a Japanese-specific major allele reference genome by which the number of unique mapping of the short reads in our data has increased 0.045% on average. Our results illustrate the importance of constructing an ethnicity-specific reference genome for identifying rare variants. All the collected data were centralized to a newly developed database to serve as useful resources for exploring pathogenic variations. Public access to the database is available at http://www.genome.med.kyoto-u.ac.jp/SnpDB/.

...read moreread less

261 citations

Journal Article•DOI•

[...]

Shinsuke Hirabayashi, Kentaro Ohki, Kazuhiko Nakabayashi, Hitoshi Ichikawa, Yukihide Momozawa, Kohji Okamura, Akinori Yaguchi¹, Kazuki Terada, Yuya Saito, Ai Yoshimi², Hiroko Ogata-Kawata, Hiromi Sakamoto, Motohiro Kato, Junya Fujimura¹, Moeko Hino³, Akitoshi Kinoshita⁴, Harumi Kakuda², Hidemitsu Kurosawa⁵, Keisuke Kato², Ryosuke Kajiwara⁶, Koichi Moriwaki⁷, Tsuyoshi Morimoto⁸, Kozue Nakamura⁹, Yasushi Noguchi, Tomoo Osumi, Kazuo Sakashita², Junko Takita¹⁰, Yuki Yuza, Koich Matsuda¹⁰, Teruhiko Yoshida, Kenji Matsumoto, Kenichiro Hata, Michiaki Kubo, Yoichi Matsubara, Takashi Fukushima¹¹, Katsuyoshi Koh, Atsushi Manabe, Akira Ohara¹², Nobutaka Kiyokawa - Show less +35 more•Institutions (12)

Juntendo University¹, Boston Children's Hospital², Chiba University³, St. Marianna University School of Medicine⁴, Dokkyo Medical University⁵, Yokohama City University⁶, Saitama Medical University⁷, Tokai University⁸, Teikyo University⁹, University of Tokyo¹⁰, University of Tsukuba¹¹, Toho University¹²

01 Jan 2017-Haematologica

TL;DR: Observations indicate that ZNF384-related fusion genes consist of a distinct subgroup of B-cell precursor acute lymphoblastic leukemia with a characteristic immunophenotype, while the clinical features depend on the functional properties of individual fusion partners.

...read moreread less

Abstract: Fusion genes involving ZNF384 have recently been identified in B-cell precursor acute lymphoblastic leukemia, and 7 fusion partners have been reported. We further characterized this type of fusion gene by whole transcriptome sequencing and/or polymerase chain reaction. In addition to previously reported genes, we identified BMP2K as a novel fusion partner for ZNF384. Including the EP300-ZNF384 that we reported recently, the total frequency of ZNF384-related fusion genes was 4.1% in 291 B-cell precursor acute lymphoblastic leukemia patients enrolled in a single clinical trial, and TCF3-ZNF384 was the most recurrent, with a frequency of 2.4%. The characteristic immunophenotype of weak CD10 and aberrant CD13 and/or CD33 expression was revealed to be a common feature of the leukemic cells harboring ZNF384-related fusion genes. The signature gene expression profile in TCF3-ZNF384-positive patients was enriched in hematopoietic stem cell features and related to that of EP300-ZNF384-positive patients, but was significantly distinct from that of TCF3-PBX1-positive and ZNF384-fusion-negative patients. However, clinical features of TCF3-ZNF384-positive patients are markedly different from those of EP300-ZNF384-positive patients, exhibiting higher cell counts and a younger age at presentation. TCF3-ZNF384-positive patients revealed a significantly poorer steroid response and a higher frequency of relapse, and the additional activating mutations in RAS signaling pathway genes were detected by whole exome analysis in some of the cases. Our observations indicate that ZNF384-related fusion genes consist of a distinct subgroup of B-cell precursor acute lymphoblastic leukemia with a characteristic immunophenotype, while the clinical features depend on the functional properties of individual fusion partners.

...read moreread less

149 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Journal Article•DOI•

Model-based Analysis of ChIP-Seq (MACS)

[...]

Yong Zhang¹, Tao Liu¹, Clifford A. Meyer¹, Jérôme Eeckhoute², David S. Johnson, Bradley E. Bernstein³, Bradley E. Bernstein¹, Chad Nusbaum³, Richard M. Myers⁴, Myles Brown², Wei Li⁵, X. Shirley Liu¹ - Show less +8 more•Institutions (5)

Harvard University¹, Brigham and Women's Hospital², Broad Institute³, Stanford University⁴, Baylor College of Medicine⁵

17 Sep 2008-Genome Biology

TL;DR: This work presents Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer, and uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions.

...read moreread less

Abstract: We present Model-based Analysis of ChIP-Seq data, MACS, which analyzes data generated by short read sequencers such as Solexa's Genome Analyzer. MACS empirically models the shift size of ChIP-Seq tags, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome, allowing for more robust predictions. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, and is freely available.

...read moreread less

13,008 citations

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

Journal Article•DOI•

A second generation human haplotype map of over 3.1 million SNPs

[...]

Kelly A. Frazer¹, Dennis G. Ballinger, David R. Cox, David A. Hinds +234 more•Institutions (48)

18 Oct 2007-Nature

TL;DR: The Phase II HapMap is described, which characterizes over 3.1 million human single nucleotide polymorphisms genotyped in 270 individuals from four geographically diverse populations and includes 25–35% of common SNP variation in the populations surveyed, and increased differentiation at non-synonymous, compared to synonymous, SNPs is demonstrated.

...read moreread less

Abstract: We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.

...read moreread less

4,565 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse