S
Seung-Hee Bae
Researcher at Indiana University
Publications - 29
Citations - 1439
Seung-Hee Bae is an academic researcher from Indiana University. The author has contributed to research in topics: Cluster analysis & Multi-core processor. The author has an hindex of 14, co-authored 29 publications receiving 1403 citations. Previous affiliations of Seung-Hee Bae include Purdue University & Seoul National University.
Papers
More filters
Proceedings ArticleDOI
Twister: a runtime for iterative MapReduce
Jaliya Ekanayake,Hui Li,Bingjing Zhang,Thilina Gunarathne,Seung-Hee Bae,Judy Qiu,Geoffrey C. Fox +6 more
TL;DR: This paper presents the programming model and the architecture of Twister an enhanced MapReduce runtime that supports iterative Map Reduce computations efficiently and shows performance comparisons of Twisters with other similar runtimes such as Hadoop and DryadLINQ for large scale data parallel applications.
Journal ArticleDOI
Hybrid cloud and cluster computing paradigms for life science applications
Judy Qiu,Jaliya Ekanayake,Thilina Gunarathne,Jong Youl Choi,Seung-Hee Bae,Hui Li,Bingjing Zhang,Tak-Lon Wu,Yang Ruan,Saliya Ekanayake,Adam Hughes,Geoffrey C. Fox +11 more
TL;DR: The hybrid cloud (MapReduce) and cluster (MPI) approach offers an attractive production environment while Twister promises a uniform programming environment for many Life Sciences applications.
Proceedings ArticleDOI
Dimension reduction and visualization of large high-dimensional data via interpolation
TL;DR: The authors propose an interpolated approach to utilizing the mapping of only a subset of the given data, which effectively reduces computational complexity and demonstrates that the quality of interpolated mapping results are comparable to the mapping results of original algorithm only.
Journal ArticleDOI
Cloud computing paradigms for pleasingly parallel biomedical applications
TL;DR: This paper presents three pleasingly parallel biomedical applications, and discusses variations in cost among the different platform choices (e.g., Elastic Compute Cloud instance types), highlighting the importance of selecting an appropriate platform based on the nature of the computation.
Proceedings ArticleDOI
Parallel data mining from multicore to cloudy grids
TL;DR: A suite of data mining tools that cover clustering, information retrieval and the mapping of high dimensional data to low dimensions for visualization are described, stressing that data analysis/mining of large datasets can be a supercomputer application.