scispace - formally typeset
I

Idoia Ochoa

Researcher at University of Illinois at Urbana–Champaign

Publications -  59
Citations -  660

Idoia Ochoa is an academic researcher from University of Illinois at Urbana–Champaign. The author has contributed to research in topics: Lossless compression & Lossy compression. The author has an hindex of 14, co-authored 53 publications receiving 515 citations. Previous affiliations of Idoia Ochoa include Stanford University & University of Navarra.

Papers
More filters
Journal ArticleDOI

Effect of lossy compression of quality scores on variant calling

TL;DR: It is shown that lossy compression can significantly alleviate the storage while maintaining variant calling performance comparable to that with the original data, and in some cases lossy compressed can lead to variantCalling performance that is superior to that using the original file.
Journal ArticleDOI

QualComp: a new lossy compressor for quality scores based on rate distortion theory

TL;DR: This paper presents a new scheme for the lossy compression of the quality scores, to address the problem of storage and shows that it is possible to achieve a significant reduction in size with little compromise in performance on downstream applications (e.g., alignment).
Journal ArticleDOI

SPRING: a next-generation compressor for FASTQ data.

TL;DR: SPRING achieves substantially better compression than existing tools, for example, SPRING compresses 195 GB of 25x whole genome human FASTQ from Illumina's NovaSeq sequencer to less than 7 GB, around 1.6x smaller than previous state-of-the-art FastQ compressors.
Journal ArticleDOI

QVZ: lossy compression of quality values

TL;DR: In this paper, the authors proposed a new lossy compressor for the quality values presented in genomic data files (e.g. FASTQ and SAM files), which comprise roughly half of the storage space (in the uncompressed domain).
Journal ArticleDOI

iDoComp: A Compression Scheme for Assembled Genomes

TL;DR: iDoComp, a compressor of assembled genomes presented in FASTA format that compresses an individual genome using a reference genome for both the compression and the decompression outperforms previously proposed algorithms in most of the studied cases, with comparable or better running time.