Author

Natalya F. Noy

Bio: Natalya F. Noy is an academic researcher from Stanford University. The author has contributed to research topics including Ontology (information science) and Open Biomedical Ontologies. The author has an h-index of 56 and has co-authored 166 publications receiving 23,427 citations. Previous affiliations of Natalya F. Noy include Pennsylvania State University and Google.


Papers
01 Jan 2002
TL;DR: An ontology defines a common vocabulary for researchers who need to share information in a domain; it includes machine-interpretable definitions of basic concepts in the domain and relations among them.
Abstract: Why develop an ontology? In recent years the development of ontologies—explicit formal specifications of the terms in the domain and relations among them (Gruber 1993)—has been moving from the realm of Artificial Intelligence laboratories to the desktops of domain experts. Ontologies have become common on the World-Wide Web. The ontologies on the Web range from large taxonomies categorizing Web sites (such as on Yahoo!) to categorizations of products for sale and their features (such as on Amazon.com). The WWW Consortium (W3C) is developing the Resource Description Framework (Brickley and Guha 1999), a language for encoding knowledge on Web pages to make it understandable to electronic agents searching for information. The Defense Advanced Research Projects Agency (DARPA), in conjunction with the W3C, is developing DARPA Agent Markup Language (DAML) by extending RDF with more expressive constructs aimed at facilitating agent interaction on the Web (Hendler and McGuinness 2000). Many disciplines now develop standardized ontologies that domain experts can use to share and annotate information in their fields. Medicine, for example, has produced large, standardized, structured vocabularies such as SNOMED (Price and Spackman 2000) and the semantic network of the Unified Medical Language System (Humphreys and Lindberg 1993). Broad general-purpose ontologies are emerging as well. For example, the United Nations Development Program and Dun & Bradstreet combined their efforts to develop the UNSPSC ontology, which provides terminology for products and services (www.unspsc.org). An ontology defines a common vocabulary for researchers who need to share information in a domain. It includes machine-interpretable definitions of basic concepts in the domain and relations among them. Why would someone want to develop an ontology? Some of the reasons are:
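As a concrete illustration of what "machine-interpretable definitions of basic concepts and relations" can look like, here is a minimal sketch that encodes a toy ontology in RDF/RDFS using Python's rdflib library. The wine-themed class names and the example.org namespace are illustrative assumptions, not content from the article.

```python
# A minimal sketch (not from the paper) of encoding ontology concepts and
# relations in RDF/RDFS with the Python rdflib library. The Wine/Winery
# class names and the namespace are illustrative only.
from rdflib import Graph, Literal, Namespace, RDF, RDFS

EX = Namespace("http://example.org/wine#")  # hypothetical namespace

g = Graph()
g.bind("ex", EX)

# Classes (basic concepts in the domain)
g.add((EX.Wine, RDF.type, RDFS.Class))
g.add((EX.Winery, RDF.type, RDFS.Class))
g.add((EX.RedWine, RDFS.subClassOf, EX.Wine))  # taxonomy: an Is-A link

# A relation (property) between concepts, with domain and range
g.add((EX.producedBy, RDF.type, RDF.Property))
g.add((EX.producedBy, RDFS.domain, EX.Wine))
g.add((EX.producedBy, RDFS.range, EX.Winery))

# An instance annotated with the shared vocabulary
g.add((EX.ChateauMargaux2015, RDF.type, EX.RedWine))
g.add((EX.ChateauMargaux2015, RDFS.label, Literal("Chateau Margaux 2015")))

print(g.serialize(format="turtle"))
```

An electronic agent that understands RDF/RDFS can then infer, for example, that any EX.RedWine is also an EX.Wine, which is the kind of machine interpretation the abstract refers to.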

4,838 citations

Journal ArticleDOI
TL;DR: This paper follows the evolution of the Protege project through three distinct re-implementations, and describes the overall methodology, the design decisions, and the lessons learned over the duration of the project.
Abstract: The Protege project has come a long way since Mark Musen first built the Protege meta-tool for knowledge-based systems in 1987. The original tool was a small application, aimed at building knowledge-acquisition tools for a few specialized programs in medical planning. From this initial tool, the Protege system has evolved into a durable, extensible platform for knowledge-based systems development and research. The current version, Protege-2000, can be run on a variety of platforms, supports customized user-interface extensions, incorporates the Open Knowledge-Base Connectivity (OKBC) knowledge model, interacts with standard storage formats such as relational databases, XML, and RDF, and has been used by hundreds of individuals and research groups. In this paper, we follow the evolution of the Protege project through three distinct re-implementations. We describe our overall methodology, our design decisions, and the lessons we have learned over the duration of the project. We believe that our success is one of infrastructure: Protege is a flexible, well-supported, and robust development environment. Using Protege, developers and domain experts can easily build effective knowledge-based systems, and researchers can explore ideas in a variety of knowledge-based domains.
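To make the frame-style knowledge model concrete, here is a rough Python sketch, not Protege's actual implementation, of classes with typed slots, slot inheritance down the hierarchy, and instance checking in the spirit of OKBC; all names in it are invented for illustration.

```python
# A rough sketch (not Protege's actual implementation) of the frame-style
# class/slot knowledge model that Protege and OKBC revolve around: classes
# carry named slots with facets, and instances fill those slots with values.
from dataclasses import dataclass, field


@dataclass
class Slot:
    name: str
    value_type: type = str  # facet: the type a filler must have


@dataclass
class FrameClass:
    name: str
    superclass: "FrameClass | None" = None
    slots: dict[str, Slot] = field(default_factory=dict)

    def all_slots(self) -> dict[str, Slot]:
        # Slots are inherited down the class hierarchy.
        inherited = self.superclass.all_slots() if self.superclass else {}
        return {**inherited, **self.slots}


@dataclass
class Instance:
    frame_class: FrameClass
    values: dict[str, object]

    def check(self) -> None:
        # Enforce slot facets, loosely mirroring knowledge-base integrity checks.
        for slot_name, value in self.values.items():
            slot = self.frame_class.all_slots().get(slot_name)
            if slot is None:
                raise KeyError(f"unknown slot: {slot_name}")
            if not isinstance(value, slot.value_type):
                raise TypeError(f"{slot_name} expects {slot.value_type.__name__}")


# Hypothetical example in the spirit of the early medical-planning tools.
thing = FrameClass("Thing")
drug = FrameClass("Drug", superclass=thing,
                  slots={"dose_mg": Slot("dose_mg", value_type=int)})
aspirin = Instance(drug, {"dose_mg": 100})
aspirin.check()  # passes: dose_mg is an int, as the facet requires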

1,244 citations

Journal ArticleDOI
01 Dec 2004
TL;DR: The goal of the paper is to provide a reader who may not be very familiar with ontology research with an introduction to the major themes in this research and with pointers to different research projects.
Abstract: Semantic integration is an active area of research in several disciplines, such as databases, information integration, and ontologies. This paper provides a brief survey of the approaches to semantic integration developed by researchers in the ontology community. We focus on the approaches that differentiate ontology research from other related areas. The goal of the paper is to provide a reader who may not be very familiar with ontology research with an introduction to the major themes in this research and with pointers to different research projects. We discuss techniques for finding correspondences between ontologies, declarative ways of representing these correspondences, and the use of these correspondences in various semantic-integration tasks.
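As an illustration of what a declarative representation of correspondences might look like, the sketch below stores mappings as plain data and derives them with one naive lexical matcher. The term names and the similarity heuristic are assumptions for demonstration, not techniques taken from the survey.

```python
# A minimal sketch of representing ontology correspondences declaratively,
# as data that downstream integration tasks can consume. The term names and
# the similarity heuristic are illustrative assumptions only.
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Correspondence:
    source_term: str   # term in ontology A
    target_term: str   # term in ontology B
    relation: str      # e.g. "equivalent" or "subsumes"
    confidence: float  # how strongly a matcher believes the link


def name_similarity(a: str, b: str) -> float:
    """One very simple lexical matcher; real systems combine many signals."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_correspondences(terms_a, terms_b, threshold=0.8):
    return [
        Correspondence(a, b, "equivalent", score)
        for a in terms_a
        for b in terms_b
        if (score := name_similarity(a, b)) >= threshold
    ]


# Usage: two toy ontologies with overlapping vocabulary.
for m in find_correspondences(["Automobile", "Person"],
                              ["Auto-mobile", "Employee"]):
    print(m)  # only the Automobile/Auto-mobile pair clears the threshold
```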

1,142 citations

Proceedings Article
01 Jan 2000
TL;DR: PROMPT, an algorithm that provides a semi-automatic approach to ontology merging and alignment, is presented; it replaces a manual process that often constitutes a large and tedious portion of the sharing process.
Abstract: Researchers in the ontology-design field have developed the content for ontologies in many domain areas. Recently, ontologies have become increasingly common on the World-Wide Web, where they provide semantics for annotations in Web pages. This distributed nature of ontology development has led to a large number of ontologies covering overlapping domains. In order for these ontologies to be reused, they first need to be merged or aligned to one another. The processes of ontology alignment and merging are usually handled manually and often constitute a large and tedious portion of the sharing process. We have developed and implemented PROMPT, an algorithm that provides a semi-automatic approach to ontology merging and alignment. PROMPT performs some tasks automatically and guides the user in performing other tasks for which the user's intervention is required. PROMPT also determines possible inconsistencies in the state of the ontology, which result from the user's actions, and suggests ways to remedy these inconsistencies. PROMPT is based on an extremely general knowledge model and therefore can be applied across various platforms. Our formative evaluation showed that a human expert followed 90% of the suggestions that PROMPT generated and that 74% of the total knowledge-base operations invoked by the user were suggested by PROMPT.
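The following toy sketch captures the semi-automatic division of labor the abstract describes: the program proposes candidate matches automatically and the user confirms or rejects each one. PROMPT's actual heuristics and inconsistency checks are far richer; the lexical similarity used here is an illustrative stand-in, and all class names are invented.

```python
# A highly simplified sketch of PROMPT-style semi-automatic merging: the
# tool suggests candidate matches, the user reviews each suggestion. The
# lexical matcher is an illustrative stand-in for PROMPT's real heuristics.
from difflib import SequenceMatcher


def suggest_merges(classes_a: list[str], classes_b: list[str],
                   threshold: float = 0.85):
    """Yield candidate (class_a, class_b, score) pairs for the user to review."""
    for a in classes_a:
        for b in classes_b:
            score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
            if score >= threshold:
                yield a, b, score


def interactive_merge(classes_a, classes_b):
    merged = set(classes_a) | set(classes_b)
    accepted = []
    for a, b, score in suggest_merges(classes_a, classes_b):
        # The tool automates the search; the user supplies the judgment.
        answer = input(f"Merge '{a}' and '{b}' (similarity {score:.2f})? [y/n] ")
        if answer.strip().lower() == "y":
            merged.discard(b)  # keep one name for the merged concept
            accepted.append((a, b))
    return merged, accepted
```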

1,119 citations

Journal ArticleDOI
TL;DR: The authors describe how Protege-2000, a tool for ontology development and knowledge acquisition, can be adapted for editing models in different Semantic Web languages.
Abstract: As researchers continue to create new languages in the hope of developing a Semantic Web, they still lack consensus on a standard. The authors describe how Protege-2000, a tool for ontology development and knowledge acquisition, can be adapted for editing models in different Semantic Web languages.

1,092 citations


Cited by
Journal ArticleDOI
TL;DR: The Research Electronic Data Capture (REDCap) data management platform was developed in 2004 to address an institutional need at Vanderbilt University; it was shared with a limited number of adopting sites beginning in 2006, after which a broader consortium sharing and support model was created.

8,712 citations

Journal ArticleDOI
TL;DR: The Visual Genome dataset as mentioned in this paper contains over 108K images where each image has an average of 35 objects, 26 attributes, and 21 pairwise relationships between objects.
Abstract: Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world. However, models used to tackle the rich content in images for cognitive tasks are still being trained using the same datasets designed for perceptual tasks. To achieve success at cognitive tasks, models need to understand the interactions and relationships between objects in an image. When asked "What vehicle is the person riding?", computers will need to identify the objects in an image as well as the relationships riding(man, carriage) and pulling(horse, carriage) to answer correctly that "the person is riding a horse-drawn carriage." In this paper, we present the Visual Genome dataset to enable the modeling of such relationships. We collect dense annotations of objects, attributes, and relationships within each image to learn these models. Specifically, our dataset contains over 108K images where each image has an average of 35 objects, 26 attributes, and 21 pairwise relationships between objects. We canonicalize the objects, attributes, relationships, and noun phrases in region descriptions and question-answer pairs to WordNet synsets. Together, these annotations represent the densest and largest dataset of image descriptions, objects, attributes, relationships, and question-answer pairs.
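Below is a small sketch of the kind of annotation structure the abstract describes: one image's objects, attributes, and pairwise relationships, with object names canonicalized to WordNet synsets via NLTK. The example annotations are invented rather than drawn from the dataset, and picking the first synset is a simplifying assumption.

```python
# A sketch of a Visual Genome-style annotation for one image: objects,
# attributes, and relationships, with names mapped to WordNet synsets.
# The annotations are invented; the first-synset choice is a simplification.
from dataclasses import dataclass

import nltk
from nltk.corpus import wordnet as wn

nltk.download("wordnet", quiet=True)  # fetch the WordNet corpus once


@dataclass
class Relationship:
    subject: str    # e.g. "man"
    predicate: str  # e.g. "riding"
    obj: str        # e.g. "carriage"


def canonicalize(name: str) -> str:
    """Map a free-text object name to a WordNet synset identifier."""
    synsets = wn.synsets(name, pos=wn.NOUN)
    return synsets[0].name() if synsets else name  # e.g. "man.n.01"


image_objects = ["man", "horse", "carriage"]
image_attributes = {"horse": ["brown"], "carriage": ["horse-drawn"]}
image_relationships = [
    Relationship("man", "riding", "carriage"),
    Relationship("horse", "pulling", "carriage"),
]

print({obj: canonicalize(obj) for obj in image_objects})
```

Canonicalizing to synsets is what lets "man", "person", and "guy" count as the same concept when the annotations are aggregated across images.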

3,842 citations

Proceedings ArticleDOI
08 May 2007
TL;DR: YAGO as discussed by the authors is a light-weight and extensible ontology with high coverage and quality, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASWONPRIZE).
Abstract: We present YAGO, a light-weight and extensible ontology with high coverage and quality. YAGO builds on entities and relations and currently contains more than 1 million entities and 5 million facts. This includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASWONPRIZE). The facts have been automatically extracted from Wikipedia and unified with WordNet, using a carefully designed combination of rule-based and heuristic methods described in this paper. The resulting knowledge base is a major step beyond WordNet: in quality by adding knowledge about individuals like persons, organizations, products, etc. with their semantic relationships - and in quantity by increasing the number of facts by more than an order of magnitude. Our empirical evaluation of fact correctness shows an accuracy of about 95%. YAGO is based on a logically clean model, which is decidable, extensible, and compatible with RDFS. Finally, we show how YAGO can be further extended by state-of-the-art information extraction techniques.
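As a toy illustration of a YAGO-style fact base combining an Is-A hierarchy with non-taxonomic relations, the sketch below stores triples and answers type queries by following subClassOf links transitively. The entities and facts are illustrative, not extracted from YAGO itself.

```python
# A toy sketch of a YAGO-style fact store: (subject, relation, object)
# triples mixing an Is-A taxonomy with non-taxonomic relations such as
# hasWonPrize. The facts below are illustrative, not YAGO data.
from collections import defaultdict

facts = [
    ("AlbertEinstein", "isA", "physicist"),
    ("physicist", "subClassOf", "scientist"),
    ("scientist", "subClassOf", "person"),
    ("AlbertEinstein", "hasWonPrize", "NobelPrize"),
]

sub_class_of = defaultdict(set)
for s, r, o in facts:
    if r == "subClassOf":
        sub_class_of[s].add(o)


def types_of(entity: str) -> set[str]:
    """All classes an entity belongs to, following subClassOf transitively."""
    direct = {o for s, r, o in facts if s == entity and r == "isA"}
    result, frontier = set(direct), list(direct)
    while frontier:
        cls = frontier.pop()
        for parent in sub_class_of[cls]:
            if parent not in result:
                result.add(parent)
                frontier.append(parent)
    return result


print(types_of("AlbertEinstein"))  # {'physicist', 'scientist', 'person'}
```

Keeping the taxonomic reasoning this simple is in the spirit of the paper's "logically clean, decidable" design: queries over the hierarchy always terminate.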

3,710 citations

Journal ArticleDOI
01 Mar 1998
TL;DR: The paradigm shift from a transfer view to a modeling view is discussed and two approaches which considerably shaped research in Knowledge Engineering are described: Role-limiting Methods and Generic Tasks.
Abstract: This paper gives an overview of the development of the field of Knowledge Engineering over the last 15 years. We discuss the paradigm shift from a transfer view to a modeling view and describe two approaches which considerably shaped research in Knowledge Engineering: Role-limiting Methods and Generic Tasks. To illustrate various concepts and methods which evolved in recent years we describe three modeling frameworks: CommonKADS, MIKE and PROTEGE-II. This description is supplemented by discussing some important methodological developments in more detail: specification languages for knowledge-based systems, problem-solving methods and ontologies. We conclude by outlining the relationship of Knowledge Engineering to Software Engineering, Information Integration and Knowledge Management.

3,406 citations

Journal ArticleDOI
TL;DR: A number of recent improvements to the NHGRI Catalog of Published Genome-Wide Association Studies are presented, including novel ways for users to interact with the Catalog and changes to the curation infrastructure.
Abstract: The National Human Genome Research Institute (NHGRI) Catalog of Published Genome-Wide Association Studies (GWAS Catalog) provides a publicly available, manually curated collection of published GWAS assaying at least 100,000 single-nucleotide polymorphisms (SNPs) and all SNP-trait associations with P < 1 × 10^-5. The Catalog includes 1,751 curated publications of 11,912 SNPs. In addition to the SNP-trait association data, the Catalog also publishes a quarterly diagram of all SNP-trait associations mapped to the SNPs' chromosomal locations. The Catalog can be accessed via a tabular web interface, via a dynamic visualization on the human karyotype, as a downloadable tab-delimited file and as an OWL knowledge base. This article presents a number of recent improvements to the Catalog, including novel ways for users to interact with the Catalog and changes to the curation infrastructure.
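Since the Catalog is distributed as a tab-delimited file, a consumer might filter SNP-trait associations by p-value as in the sketch below. The file name and column headers here are assumptions for illustration and should be checked against the Catalog's documentation.

```python
# A sketch of consuming the Catalog's downloadable tab-delimited file:
# stream the rows and keep SNP-trait associations below a chosen p-value.
# The file name and column headers are assumptions, not documented names.
import csv

P_VALUE_CUTOFF = 1e-8  # stricter than the catalog's 1e-5 inclusion threshold


def strong_associations(path: str):
    with open(path, newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle, delimiter="\t"):
            try:
                p_value = float(row["P-VALUE"])  # assumed header name
            except (KeyError, ValueError):
                continue  # skip rows with a missing or non-numeric p-value
            if p_value < P_VALUE_CUTOFF:
                # "SNPS" and "DISEASE/TRAIT" are likewise assumed headers
                yield row["SNPS"], row["DISEASE/TRAIT"], p_value


# Hypothetical usage against a local download of the catalog:
# for snp, trait, p in strong_associations("gwas_catalog.tsv"):
#     print(snp, trait, p)
```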

2,755 citations