B
Benjamin Wellner
Researcher at Mitre Corporation
Publications - 6
Citations - 228
Benjamin Wellner is an academic researcher from Mitre Corporation. The author has contributed to research in topics: Set (abstract data type) & Information extraction. The author has an hindex of 4, co-authored 5 publications receiving 206 citations. Previous affiliations of Benjamin Wellner include Brandeis University.
Papers
More filters
Journal ArticleDOI
The MITRE Identification Scrubber Toolkit: Design, training, and assessment
John S. Aberdeen,Samuel Bayer,Reyyan Yeniterzi,Benjamin Wellner,Cheryl Clark,David A. Hanauer,Bradley A. Malin,Lynette Hirschman +7 more
TL;DR: The open source MITRE Identification Scrubber Toolkit (MIST) provides an environment to support rapid tailoring of automated de-identification to different document types, using automatically learned classifiers to de-identified and protect sensitive information.
Journal ArticleDOI
Effects of personal identifier resynthesis on clinical text de-identification.
Reyyan Yeniterzi,John S. Aberdeen,Samuel Bayer,Benjamin Wellner,Benjamin Wellner,Lynette Hirschman,Bradley A. Malin +6 more
TL;DR: The de-identification tool achieves high accuracy when training and test sets are homogeneous (ie, both real or resynthesized records), but the resynthesis component regularizes the data to make them less "realistic," resulting in loss of performance particularly when training on resynthesesized data and testing on real data.
Journal ArticleDOI
Bootstrapping a de-identification system for narrative patient records: Cost-performance tradeoffs
David A. Hanauer,John S. Aberdeen,Samuel Bayer,Benjamin Wellner,Cheryl Clark,Kai Zheng,Lynette Hirschman +6 more
TL;DR: The human annotation effort needed to produce a system that de-identifies at high accuracy using the MIST framework is quantified, suggesting that the wider variety and contexts for protected health information in social work notes is more difficult to model.
Proceedings ArticleDOI
Evaluating the automatic mapping of human gene and protein mentions to unique identifiers.
Alexander A. Morgan,Benjamin Wellner,Jeffrey B. Colombe,Robert Arens,Marc E. Colosimo,Lynette Hirschman +5 more
TL;DR: A challenge task for the second BioCreAtIvE (Critical Assessment of Information Extraction in Biology) that requires participating systems to provide lists of the EntrezGene (formerly LocusLink) identifiers for all human genes and proteins mentioned in a MEDLINE abstract is developed.
Journal ArticleDOI
The “Coherent Data Set”: Combining Patient Data and Imaging in a Comprehensive, Synthetic Health Record
Jason A. Walonoski,Dylan Hall,Karen M. Bates,M. Heath Farris,Joseph Dagher,Matthew E. Downs,Ryan T. Sivek,Benjamin Wellner,Andrew Gregorowicz,Marc Hadley,Francis X. Campion,Lauren Levine,Kevin Wacome,Geoff Emmer,Aaron Kemmer,Maha Malik,Jonah Hughes,Eldesia Granger,Sybil Russell +18 more
TL;DR: The Coherent Data Set is a novel synthetic data set that leverages structured data from Synthea™ to create a longitudinal, “coherent” patient-level electronic health record (EHR).