
Showing papers by "J. Stephen Downie published in 2015"


Proceedings ArticleDOI
21 Jun 2015
TL;DR: This paper presents a conceptual design and report on the initial implementation of a new framework that affords the benefits of semantic search while minimizing the problems associated with applying existing semantic analysis at scale.
Abstract: With 13,000,000 volumes comprising 4.5 billion pages of text, it is currently very difficult for scholars to locate relevant sets of documents that are useful in their research from the HathiTrust Digital Library (HTDL) using traditional lexically-based retrieval techniques. Existing document search tools and document clustering approaches use purely lexical analysis, which cannot address the inherent ambiguity of natural language. A semantic search approach offers the potential to overcome the shortcomings of lexical search, but even if an appropriate network of ontologies could be decided upon it would require a full semantic markup of each document. In this paper, we present a conceptual design and report on the initial implementation of a new framework that affords the benefits of semantic search while minimizing the problems associated with applying existing semantic analysis at scale. Our approach avoids the need for complete semantic document markup using pre-existing ontologies by developing an automatically generated Concept-in-Context (CiC) network seeded by a priori analysis of Wikipedia texts and identification of semantic metadata. Our Capisco system analyzes documents by the semantics and context of their content. The disambiguation of search queries is done interactively, to fully utilize the domain knowledge of the scholar. Our method achieves a form of semantic-enhanced search that simultaneously exploits the proven scale benefits provided by lexical indexing.

24 citations


Journal ArticleDOI
TL;DR: In this paper, a cochlear pitch class profile (CPCP) was proposed to enhance the degree of instrumental accompaniment invariance without degrading the feature's discriminative power.

13 citations


Proceedings ArticleDOI
21 Jun 2015
TL;DR: This paper surveys the coverage of existing bibliographic ontologies in the context of meeting these scholarly needs, and provides an illustrated discussion of potential extensions that might fully realize a solution.
Abstract: Bibliographic metadata standards are a longstanding mechanism for Digital Libraries to manage records and express relationships between them. As digital scholarship, particularly in the humanities, incorporates and manipulates these records in an increasingly direct manner, existing systems are proving insufficient for providing the underlying addressability and relational expressivity required to construct and interact with complex research collections. In this paper we describe motivations for these "worksets" and the technical requirements they raise. We survey the coverage of existing bibliographic ontologies in the context of meeting these scholarly needs, and finally provide an illustrated discussion of potential extensions that might fully realize a solution.

12 citations


Proceedings ArticleDOI
21 Jun 2015
TL;DR: An automatic topic discovery system built on web-mined, user-generated interpretations of songs is proposed to provide subject access to a music digital library, together with filtering techniques to identify high-quality topics.
Abstract: The assignment of subject metadata to music is useful for organizing and accessing digital music collections. Since manual subject annotation of large-scale music collections is labor-intensive, automatic methods are preferred. Topic modeling algorithms can be used to automatically identify latent topics from appropriate text sources. Candidate text sources such as song lyrics are often too poetic, resulting in lower-quality topics. Users' interpretations of song lyrics provide an alternative source. In this paper, we propose an automatic topic discovery system from web-mined user-generated interpretations of songs to provide subject access to a music digital library. We also propose and evaluate filtering techniques to identify high-quality topics. In our experiments, we use 24,436 popular songs that exist in both the Million Song Dataset and songmeanings.com. Topic models are generated using Latent Dirichlet Allocation (LDA). To evaluate the coherence of learned topics, we calculate the Normalized Pointwise Mutual Information (NPMI) of the top ten words in each topic based on occurrences in Wikipedia. Finally, we evaluate the resulting topics using a subset of 422 songs that have been manually assigned to six subjects. Using this system, 71% of the manually assigned subjects were correctly identified. These results demonstrate that topic modeling of song interpretations is a promising method for subject metadata enrichment in music digital libraries. It also has implications for affording similar access to collections of poetry and fiction.
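The NPMI coherence evaluation described in this abstract can be sketched in a few lines. The following is a generic, from-scratch illustration of document-level co-occurrence NPMI and mean-pairwise topic coherence, not the paper's actual pipeline; the toy corpus and word sets are invented for the example (the paper computes occurrences over Wikipedia).

```python
import math
from itertools import combinations

def npmi(docs, w1, w2):
    """Normalized Pointwise Mutual Information of two words over
    document-level co-occurrence. Ranges from -1 (never together)
    to 1 (always together)."""
    n = len(docs)
    p1 = sum(w1 in d for d in docs) / n
    p2 = sum(w2 in d for d in docs) / n
    p12 = sum(w1 in d and w2 in d for d in docs) / n
    if p12 == 0:
        return -1.0
    if p12 == 1.0:
        return 1.0  # avoid dividing by -log(1) = 0
    return math.log(p12 / (p1 * p2)) / -math.log(p12)

def topic_coherence(docs, top_words):
    """Mean pairwise NPMI over a topic's top words."""
    pairs = list(combinations(top_words, 2))
    return sum(npmi(docs, a, b) for a, b in pairs) / len(pairs)

# Toy reference corpus: each document reduced to its set of words
docs = [{"love", "heart", "pain"}, {"love", "heart"},
        {"war", "love"}, {"war", "peace"}]
print(npmi(docs, "war", "peace"))                       # 0.5 on this corpus
print(topic_coherence(docs, ["love", "heart", "pain"]))
```

In the paper the top ten words of each LDA topic would play the role of `top_words`, with Wikipedia articles as the reference documents.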

10 citations


Proceedings Article
01 Jan 2015
TL;DR: This study verifies that mature techniques from the ASR and Computational Auditory Scene Analysis (CASA) fields can be adapted to enhance the performance of Music Information Retrieval (MIR) systems.
Abstract: Most of the features used for Cover Song Identification (CSI), for example, Pitch Class Profile (PCP) related features, are based on the musical facets shared among cover versions: melody evolution and harmonic progression. In this work, a perceptual feature was studied for CSI. Our idea was to modify the Perceptual Linear Prediction (PLP) model from the field of Automatic Speech Recognition (ASR) by (a) introducing new research achievements in psychophysics, and (b) considering the difference between speech and music signals, to make it consistent with human hearing and more suitable for music signal analysis. Furthermore, the obtained Linear Prediction Coefficients (LPCs) were mapped to LPC cepstrum coefficients, on which liftering was applied, to boost the timbre invariance of the resultant feature: Modified Perceptual Linear Prediction Liftered Cepstrum (MPLPLC). Experimental results showed that both the LPC cepstrum coefficient mapping and the cepstrum liftering were crucial in ensuring the identification power of the MPLPLC feature. The MPLPLC feature outperformed state-of-the-art features in the context of CSI and in resisting instrumental accompaniment variation. This study verifies that mature techniques from the ASR and Computational Auditory Scene Analysis (CASA) fields can be adapted to enhance the performance of Music Information Retrieval (MIR) systems.
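The LPC-to-cepstrum mapping and cepstral liftering named in this abstract are standard signal-processing steps. The sketch below shows the textbook recursion and sinusoidal lifter, not the paper's MPLPLC implementation; the model order and lifter length are arbitrary illustrative choices.

```python
import math

def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients a_1..a_p of the all-pole model
    1 / (1 - sum_k a_k z^-k) to cepstral coefficients c_1..c_n
    via the standard recursion; the gain term c_0 is omitted."""
    p = len(a)
    c = [0.0] * (n_ceps + 1)
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]

def lifter(ceps, L=22):
    """Sinusoidal liftering, de-emphasizing the lowest and highest
    quefrencies (a common timbre-smoothing step)."""
    return [(1 + (L / 2) * math.sin(math.pi * n / L)) * cn
            for n, cn in enumerate(ceps, start=1)]

# Single-pole sanity check: for a = [r], c_n equals r**n / n analytically
ceps = lpc_to_cepstrum([0.5], 4)
print(ceps)           # [0.5, 0.125, 0.0416..., 0.015625]
print(lifter(ceps))
```

The single-pole check makes the recursion easy to verify by hand, since the cepstrum of a one-pole model has the closed form c_n = r^n / n.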

6 citations


Journal ArticleDOI
TL;DR: It is argued that the fragmentary quality of the features generated by the reading interface does not necessarily imply that the mode of reading enabled and mediated by these features points in an anti-humanist direction; such a practice of reading may be considered posthumanist, but not necessarily anti-humanist.
Abstract: While the idea of distant reading does not rule out the possibility of close reading of the individual components of the corpus of digitized text that is being distant-read, this ceases to be the case when parts of the corpus are, for reasons relating to intellectual property, not accessible for consumption through downloading followed by close reading. Copyright restrictions on material in collections of digitized text such as the HathiTrust Digital Library (HTDL) necessitate providing facilities for non-consumptive reading, one of the approaches to which consists of providing users with features from the text in the form of small fragments of text, instead of the text itself. We argue that, contrary to expectation, the fragmentary quality of the features generated by the reading interface does not necessarily imply that the mode of reading enabled and mediated by these features points in an anti-humanist direction. We pose the fragmentariness of the features as paradigmatic of the fragmentation ...

5 citations


Proceedings ArticleDOI
21 Jun 2015
TL;DR: A case study, music similarity judgments in a music digital library evaluation, finds that even with trusted graders song pairs are not consistently rated the same, and concludes with recommendations for achieving reliable evaluation judgments for music similarity and other normative judgment tasks.
Abstract: Building evaluation datasets for information retrieval is a time-consuming and exhausting activity. To evaluate research over novel corpora, researchers are increasingly turning to crowdsourcing to efficiently distribute the evaluation dataset creation among many workers. However, there has been little investigation into the effect of instrument design on data quality in crowdsourced evaluation datasets. We pursue this question through a case study, music similarity judgments in a music digital library evaluation, where we find that even with trusted graders song pairs are not consistently rated the same. We find that much of this low intra-coder consistency can be attributed to the task design and judge effects, concluding with recommendations for achieving reliable evaluation judgments for music similarity and other normative judgment tasks.

4 citations


Proceedings Article
01 Jan 2015
TL;DR: The analysis of 99 free text responses collected from evaluators revealed additional user opinions, not fully captured by score ratings on the given criteria, and demonstrated the challenge of evaluating a variety of systems with different user goals.
Abstract: Evaluation has always been fundamental to the Music Information Retrieval (MIR) community, as evidenced by the popularity of the Music Information Retrieval Evaluation eXchange (MIREX). However, prior MIREX tasks have primarily focused on testing specialized MIR algorithms that sit on the back end of systems. Not until the Grand Challenge 2014 User Experience (GC14UX) task had the users’ overall interaction and experience with complete systems been formally evaluated. Three systems were evaluated based on five criteria. This paper reports the results of GC14UX, with a special focus on the qualitative analysis of 99 free text responses collected from evaluators. The analysis revealed additional user opinions, not fully captured by score ratings on the given criteria, and demonstrated the challenge of evaluating a variety of systems with different user goals. We conclude with a discussion on the implications of findings and recommendations for future UX evaluation tasks, including adding new criteria: Aesthetics, Performance, and Utility.

3 citations


Journal ArticleDOI
TL;DR: In this article, a new approach to scholarly search and discovery in large-scale text corpora is presented. Lexicographic search, at present the predominant means of access, cannot directly address the inherent ambiguity of natural language, leaving scholars to resolve on a case-by-case basis issues caused by synonyms, homonyms and OCR errors.
Abstract: This article discusses a new approach to scholarly search and discovery in large-scale text corpora. While lexicographic search is at present the predominant means to access large document corpora, it cannot directly address the inherent ambiguity of natural language. As a pragmatic solution, many scholars manually build their own lists of suitable search terms to be used in repeated searches in digital libraries and other online resources; however, scholars then have to resolve on a case-by-case basis issues caused by synonyms, homonyms and OCR errors. Our approach differs from this: it supports scholars in developing and refining a set of relevant concepts, searches a large document collection using those semantic concepts, and categorizes the potentially relevant documents from the search results into worksets. The developed technique revisits the notion of semantic search and redesigns both the underlying data representation and the interface support. This is achieved through an end-to-end design that relies centrally on a Concept-in-Context network sourced from the link structure of Wikipedia. We discuss here the principles of our approach, its implementation in the Capisco prototype, and the relationship between established search techniques and our approach.
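To illustrate the flavor of context-based disambiguation over a Concept-in-Context network, here is a deliberately minimal sketch. The concept names and context vocabularies are invented for the example; a real CiC network would be mined from Wikipedia's link structure rather than hand-coded, and the actual Capisco disambiguation is interactive rather than fully automatic.

```python
# Hypothetical miniature CiC network: each ambiguous surface form maps
# to candidate concepts, each with a set of context words.
CIC = {
    "bank": [
        {"concept": "Bank (finance)",
         "context": {"money", "loan", "interest"}},
        {"concept": "Bank (geography)",
         "context": {"river", "water", "erosion"}},
    ],
}

def disambiguate(term, doc_words):
    """Pick the candidate concept whose context words overlap the
    document's vocabulary the most; None if the term is unknown."""
    candidates = CIC.get(term)
    if not candidates:
        return None
    return max(candidates,
               key=lambda c: len(c["context"] & doc_words))["concept"]

print(disambiguate("bank", {"the", "river", "flooded", "water"}))
```

In a system like Capisco, the candidate ranking would be shown to the scholar, who applies domain knowledge to confirm or correct the chosen concept.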

2 citations


15 Mar 2015
TL;DR: A workset in the HTRC context is a container for a scholar's aggregated units of analysis – analogous to a scholar’s research collection and derives the values for a number of its properties from the values of certain properties of its constituent members that propagate allowing it to support of various types of filtration.
Abstract: A workset in the HTRC context:
• is a container for a scholar's aggregated units of analysis, analogous to a scholar's research collection;
• is a persistent, globally unique entity that can be directly cited;
• possesses provenance properties supporting change awareness within the HTRC's architecture, so that a description of its nature at the time of analysis persists over time;
• is flexible enough to allow for the aggregation of heterogeneous resources, with regard to both granularity and source repository; and
• derives the values for a number of its properties (i.e., its metadata) from the values of certain properties of its constituent members, which propagate upward to support various types of filtration (cf. Wickett et al., 2010; Wickett, 2012).

1 citation




Proceedings ArticleDOI
21 Jun 2015
TL;DR: The HTRC's involvement with the NOVEL(TM) text mining project and the Single Interface for Music Score Searching and Analysis project, both funded by the SSHRC Partnership Grant programme, will be introduced.
Abstract: This lecture provides an update on the recent developments and activities of the HathiTrust Research Center (HTRC). The HTRC is the research arm of the HathiTrust, an online repository dedicated to the provision of access to a comprehensive body of published works for scholarship and education. The HathiTrust is a partnership of over 100 major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. Membership is open to institutions worldwide. Over 13.1 million volumes (4.7 billion pages) have been ingested into the HathiTrust digital archive from sources including Google Books, member university libraries, the Internet Archive, and numerous private collections. The HTRC is dedicated to facilitating scholarship by enabling analytic access to the corpus, developing research tools, fostering research projects and communities, and providing additional resources such as enhanced metadata and indices that will assist scholars to more easily exploit the HathiTrust materials. This talk will outline the mission, goals and structure of the HTRC. It will also provide an overview of recent work being conducted on a range of projects, partnerships and initiatives. Projects include the Workset Creation for Scholarly Analysis project (WCSA, funded by the Andrew W. Mellon Foundation) and the HathiTrust + Bookworm project (HT+BW, funded by the National Endowment for the Humanities). The HTRC's involvement with the NOVEL(TM) text mining project and the Single Interface for Music Score Searching and Analysis (SIMSSA) project, both funded by the SSHRC Partnership Grant programme, will be introduced. The HTRC's new feature extraction and Data Capsule initiatives, part of its ongoing efforts to enable the non-consumptive analyses of the approximately 8 million volumes under copyright restrictions, will also be discussed. The talk will conclude with some suggestions on how the non-consumptive research model might be improved upon and possibly extended beyond the HathiTrust context.