Home
/
Authors
/
Shaoliang Peng

Author

Shaoliang Peng

Other affiliations: National University of Defense Technology, Xiamen University, University of Defence ...read more

Bio: Shaoliang Peng is an academic researcher from Hunan University. The author has contributed to research in topics: Computer science & Xeon Phi. The author has an hindex of 20, co-authored 140 publications receiving 6339 citations. Previous affiliations of Shaoliang Peng include National University of Defense Technology & Xiamen University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006

Papers

PDF

Open Access

More filters

Journal Article•DOI•

SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler

[...]

Ruibang Luo¹, Binghang Liu¹, Yinlong Xie², Yinlong Xie¹, Zhenyu Li¹, Weihua Huang, Jianying Yuan, Guangzhu He, Yanxiang Chen, Qi Pan, Yunjie Liu, Jingbo Tang, Gengxiong Wu, Hao Zhang, Yujian Shi, Yong Liu, Chang Yu, Bo Wang, Yao Lu, Changlei Han, David W. Cheung¹, Siu-Ming Yiu¹, Shaoliang Peng³, Zhu Xiao-qian³, Guangming Liu³, Xiangke Liao³, Yingrui Li¹, Huanming Yang, Jian Wang, Tak-Wah Lam¹, Jun Wang - Show less +27 more•Institutions (3)

University of Hong Kong¹, South China University of Technology², National University of Defense Technology³

27 Dec 2012-GigaScience

TL;DR: This work provides an updated assembly version of the 2008 Asian genome using SOAPdenovo2, a new algorithm design that reduces memory consumption in graph construction, resolves more repeat regions in contig assembly, increases coverage and length in scaffold construction, improves gap closing, and optimizes for large genome.

...read moreread less

Abstract: There is a rapidly increasing amount of de novo genome assembly using next-generation sequencing (NGS) short reads; however, several big challenges remain to be overcome in order for this to be efficient and accurate. SOAPdenovo has been successfully applied to assemble many published genomes, but it still needs improvement in continuity, accuracy and coverage, especially in repeat regions. To overcome these challenges, we have developed its successor, SOAPdenovo2, which has the advantage of a new algorithm design that reduces memory consumption in graph construction, resolves more repeat regions in contig assembly, increases coverage and length in scaffold construction, improves gap closing, and optimizes for large genome. Benchmark using the Assemblathon1 and GAGE datasets showed that SOAPdenovo2 greatly surpasses its predecessor SOAPdenovo and is competitive to other assemblers on both assembly length and accuracy. We also provide an updated assembly version of the 2008 Asian (YH) genome using SOAPdenovo2. Here, the contig and scaffold N50 of the YH genome were ~20.9 kbp and ~22 Mbp, respectively, which is 3-fold and 50-fold longer than the first published version. The genome coverage increased from 81.16% to 93.91%, and memory consumption was ~2/3 lower during the point of largest memory consumption.

...read moreread less

4,284 citations

Journal Article•DOI•

Corrigendum: Genome-wide adaptive complexes to underground stresses in blind mole rats Spalax.

[...]

Xiaodong Fang, Eviatar Nevo, Lijuan Han, Erez Y. Levanon, Jing Zhao, Aaron Avivi, Denis M. Larkin, Xuanting Jiang, Sergey Feranchuk, Yabing Zhu, Alla Fishman, Yue Feng, Noa Sher, Zhiqiang Xiong, Thomas Hankeln, Zhiyong Huang, Vera Gorbunova, Lu Zhang, Wei Zhao, Derek E. Wildman¹, Derek E. Wildman², Yingqi Xiong, Andrei V. Gudkov, Qiumei Zheng, Gideon Rechavi, Sanyang Liu, Lily Bazak, Jie Chen¹, Jie Chen², Binyamin A. Knisbacher, Yao Lu, Imad Shams, Krzysztof Gajda, Marta Farré, Jaebum Kim, Harris A. Lewin, Jian Ma, Mark Band, Anne Bicker, Angela Kranz, Tobias Mattheus, Hanno Schmidt, Andrei Seluanov, Jorge Azpurua, Michael R. McGowen, Eshel Ben Jacob, Kexin Li, Shaoliang Peng, Xiaoqian Zhu, Xiangke Liao, Shuai Cheng Li, Anders Krogh, Xin Zhou, Leonid Brodsky, Jun Wang - Show less +51 more•Institutions (2)

Illinois College¹, University of Illinois at Urbana–Champaign²

12 Aug 2015-Nature Communications

TL;DR: In this paper, a Genome-Wide Adaptive Complex to underground stresses in blind mole rats Spalax is described, where the adaptive complexes are based on adaptive complexes to underground stress.

...read moreread less

Abstract: Corrigendum: Genome-wide adaptive complexes to underground stresses in blind mole rats Spalax

...read moreread less

527 citations

Journal Article•DOI•

SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner

[...]

Ruibang Luo¹, Thomas K. F. Wong¹, Jianqiao Zhu¹, Jianqiao Zhu², Chi-Man Liu¹, Xiaoqian Zhu³, Ed X. Wu¹, Lap-Kei Lee¹, Haoxiang Lin, Wenjuan Zhu, David W. Cheung¹, Hing-Fung Ting¹, Siu-Ming Yiu¹, Shaoliang Peng³, Chang Yu, Yingrui Li, Ruiqiang Li⁴, Tak-Wah Lam¹ - Show less +14 more•Institutions (4)

University of Hong Kong¹, University of Wisconsin-Madison², National University of Defense Technology³, Peking University⁴

31 May 2013-PLOS ONE

TL;DR: Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2, GEM and GPU-based aligners, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads with different lengths.

...read moreread less

Abstract: To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be configured to favor speed in trade of accuracy and sensitivity. SOAP3-dp, through leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2, GEM and GPU-based aligners BarraCUDA and CUSHAW, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads with different lengths. Transcending its predecessor SOAP3, which does not allow gapped alignment, SOAP3-dp by default tolerates alignment similarity as low as 60%. Real data evaluation using human genome demonstrates SOAP3-dp's power to enable more authentic variants and longer Indels to be discovered. Fosmid sequencing shows a 9.1% FDR on newly discovered deletions. SOAP3-dp natively supports BAM file format and provides the same scoring scheme as BWA, which enables it to be integrated into existing analysis pipelines. SOAP3-dp has been deployed on Amazon-EC2, NIH-Biowulf and Tianhe-1A.

...read moreread less

407 citations

Journal Article•DOI•

MicroRNAs Activate Gene Transcription Epigenetically as an Enhancer Trigger

[...]

Min Xiao¹, Jin Li¹, Wei Li¹, Yu Wang¹, Feizhen Wu¹, Yanping Xi¹, Lan Zhang¹, Chao Ding², Huaibing Luo¹, Yan Li¹, Lina Peng¹, Liping Zhao¹, Shaoliang Peng³, Yao Xiao¹, Shihua Dong¹, Jie Cao², Wenqiang Yu¹ - Show less +13 more•Institutions (3)

Fudan University¹, Second Military Medical University², National University of Defense Technology³

30 May 2017-RNA Biology

TL;DR: This work focused on miR-24-1 and found that this miRNA unconventionally activates gene transcription by targeting enhancers, and demonstrates a novel mechanism of miRNA as an enhancer trigger.

...read moreread less

Abstract: MicroRNAs (miRNAs) are small non-coding RNAs that function as negative gene expression regulators. Emerging evidence shows that, except for function in the cytoplasm, miRNAs are also present in the nucleus. However, the functional significance of nuclear miRNAs remains largely undetermined. By screening miRNA database, we have identified a subset of miRNA that functions as enhancer regulators. Here, we found a set of miRNAs show gene-activation function. We focused on miR-24-1 and found that this miRNA unconventionally activates gene transcription by targeting enhancers. Consistently, the activation was completely abolished when the enhancer sequence was deleted by TALEN. Furthermore, we found that miR-24-1 activates enhancer RNA (eRNA) expression, alters histone modification, and increases the enrichment of p300 and RNA Pol II at the enhancer locus. Our results demonstrate a novel mechanism of miRNA as an enhancer trigger.

...read moreread less

224 citations

Journal Article•DOI•

SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data

[...]

Wenlong Jia, Kunlong Qiu, Minghui He, Pengfei Song, Quan Zhou¹, Feng Zhou², Yuan Yu, Dandan Zhu, Michael L. Nickerson³, Shengqing Wan, Xiangke Liao⁴, Xiaoqian Zhu⁴, Shaoliang Peng⁴, Yingrui Li, Jun Wang, Guangwu Guo - Show less +12 more•Institutions (4)

University of Electronic Science and Technology of China¹, South China University of Technology², National Institutes of Health³, National University of Defense Technology⁴

14 Feb 2013-Genome Biology

TL;DR: A new method to identify fusion transcripts from paired-end RNA-Seq data by applying an improved partial exhaustion algorithm to construct a library of fusion junction sequences, and employs a series of filters to nominate high-confidence fusion transcripts.

...read moreread less

Abstract: We have developed a new method, SOAPfuse, to identify fusion transcripts from paired-end RNA-Seq data. SOAPfuse applies an improved partial exhaustion algorithm to construct a library of fusion junction sequences, which can be used to efficiently identify fusion events, and employs a series of filters to nominate high-confidence fusion transcripts. Compared with other released tools, SOAPfuse achieves higher detection efficiency and consumed less computing resources. We applied SOAPfuse to RNA-Seq data from two bladder cancer cell lines, and confirmed 15 fusion transcripts, including several novel events common to both cell lines. SOAPfuse is available at http://soap.genomics.org.cn/soapfuse.html.

...read moreread less

201 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

I and i

[...]

Kevin Barraclough

08 Dec 2001-BMJ

TL;DR: There is, I think, something ethereal about i —the square root of minus one, which seems an odd beast at that time—an intruder hovering on the edge of reality.

...read moreread less

Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

...read moreread less

33,785 citations

Journal Article•DOI•

Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype

[...]

Daehwan Kim¹, Joseph M. Paggi², Chanhee Park¹, Christopher Bennett¹, Steven L. Salzberg³ - Show less +1 more•Institutions (3)

University of Texas Southwestern Medical Center¹, Stanford University², Johns Hopkins University³

01 Aug 2019-Nature Biotechnology

TL;DR: This work presents a method named HISAT2 (hierarchical indexing for spliced alignment of transcripts 2) that can align both DNA and RNA sequences using a graph Ferragina Manzini index, and uses it to represent and search an expanded model of the human reference genome.

...read moreread less

Abstract: The human reference genome represents only a small number of individuals, which limits its usefulness for genotyping. We present a method named HISAT2 (hierarchical indexing for spliced alignment of transcripts 2) that can align both DNA and RNA sequences using a graph Ferragina Manzini index. We use HISAT2 to represent and search an expanded model of the human reference genome in which over 14.5 million genomic variants in combination with haplotypes are incorporated into the data structure used for searching and alignment. We benchmark HISAT2 using simulated and real datasets to demonstrate that our strategy of representing a population of genomes, together with a fast, memory-efficient search algorithm, provides more detailed and accurate variant analyses than other methods. We apply HISAT2 for HLA typing and DNA fingerprinting; both applications form part of the HISAT-genotype software that enables analysis of haplotype-resolved genes or genomic regions. HISAT-genotype outperforms other computational methods and matches or exceeds the performance of laboratory-based assays. A graph-based genome indexing scheme enables variant-aware alignment of sequences with very low memory requirements.

...read moreread less

4,855 citations

“Bioinformatics” 특집을 내면서

[...]

장병탁, 김삼묘, 허철구

01 Aug 2000

TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.

...read moreread less

Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

...read moreread less

4,833 citations

Integrative analysis of 111 reference human epigenomes

[...]

Anshul Kundaje, Wouter Meuleman, Jason Ernst, Angela Yen, Pouya Kheradpour, Zhizhuo Zhang, Jianrong Wang, Lucas D. Ward, Abhishek Sarkar, Gerald Quon, Matthew L. Eaton, Yi-Chieh Wu, Andreas R. Pfenning, Xinchen Wang, Melina Claussnitzer, Yaping Liu, Mukul S. Bansal, Soheil Feizi-Khankandi, Ah Ram Kim, Richard C Sallari, Nicholas A Sinnott-Armstrong, Laurie A. Boyer, Elizabeta Gjoneska, Li-Huei Tsai, Manolis Kellis - Show less +21 more

01 Feb 2015

TL;DR: In this article, the authors describe the integrative analysis of 111 reference human epigenomes generated as part of the NIH Roadmap Epigenomics Consortium, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression.

...read moreread less

Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

...read moreread less

4,409 citations

Journal Article•DOI•

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

[...]

Dinghua Li¹, Chi-Man Liu¹, Ruibang Luo¹, Kunihiko Sadakane¹, Tak-Wah Lam¹ - Show less +1 more•Institutions (1)

National Institute of Informatics¹

15 May 2015-Bioinformatics

TL;DR: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner and generated a three-time larger assembly, with longer contig N50 and average contig length.

...read moreread less

Abstract: Summary: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252Gbps in 44.1 hours and 99.6 hours on a single computing node with and without a GPU, respectively. MEGAHIT assembles the data as a whole, i.e., no pre-processing like partitioning and normalization was needed. When compared with previous methods (Chikhi and Rizk, 2012; Howe, et al., 2014) on assembling the soil data, MEGAHIT generated a 3-time larger assembly, with longer contig N50 and average contig length; furthermore, 55.8% of the reads were aligned to the assembly, giving a 4-fold improvement . Availability: The source code of MEGAHIT is freely available at https://github.com/voutcn/megahit under GPLv3 license. Contact: rb@l3-bioinfo.com, twlam@cs.hku.hk

...read moreread less

3,634 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse