scispace - formally typeset

Human visual system model

About: Human visual system model is a research topic. Over its lifetime, 8,697 publications have been published within this topic, receiving 259,440 citations.


Papers
Journal ArticleDOI
TL;DR: A detailed instantiation, in the form of a computational cognitive model, of a comprehensive theory of human visual processing known as “active vision” is described, built using the Executive Process-Interactive Control cognitive architecture.
Abstract: Human visual search plays an important role in many human–computer interaction (HCI) tasks. Better models of visual search are needed not just to predict overall performance outcomes, such as whether people will be able to find the information needed to complete an HCI task, but to understand the many human processes that interact in visual search, which will in turn inform the detailed design of better user interfaces. This article describes a detailed instantiation, in the form of a computational cognitive model, of a comprehensive theory of human visual processing known as "active vision" (Findlay & Gilchrist, 2003). The computational model is built using the Executive Process-Interactive Control cognitive architecture. Eye-tracking data from three experiments inform the development and validation of the model. The modeling asks (and at least partially answers) the four questions of active vision: (a) What can be perceived in a fixation? (b) When do the eyes move? (c) Where do the eyes move? (d) What information is integrated between eye movements? Answers include: (a) Items nearer the point of gaze are more likely to be perceived, and the visual features of objects are sometimes misidentified. (b) The eyes move after the fixated visual stimulus has been processed (i.e., has entered working memory). (c) The eyes tend to go to nearby objects. (d) Only the coarse spatial information of what has been fixated is likely maintained between fixations. The model developed to answer these questions has both scientific and practical value in that it gives HCI researchers and practitioners a better understanding of how people visually interact with computers, and provides a theoretical foundation for predictive analysis tools that can predict aspects of that interaction.

48 citations

Journal ArticleDOI
TL;DR: It is found that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures, much like the human visual system.
Abstract: Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system.
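The layer-to-brain comparison this abstract describes (correlating artificial with neural representations) can be illustrated with a minimal representational similarity analysis. The data here are random stand-ins, and the array sizes and variable names are hypothetical, not taken from the paper:

```python
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

rng = np.random.default_rng(0)

# Hypothetical data: responses to 20 stimuli from one CNN layer
# (flattened to feature vectors) and from one fMRI region (voxel patterns).
cnn_layer = rng.normal(size=(20, 512))   # 20 stimuli x 512 units
fmri_roi  = rng.normal(size=(20, 300))   # 20 stimuli x 300 voxels

# Representational dissimilarity: 1 - Pearson correlation between each
# pair of stimulus representations (pdist's 'correlation' metric returns
# the condensed upper triangle of the dissimilarity matrix).
rdm_cnn  = pdist(cnn_layer, metric="correlation")
rdm_fmri = pdist(fmri_roi, metric="correlation")

# Compare the two representational geometries with Spearman's rho.
rho, p = spearmanr(rdm_cnn, rdm_fmri)
print(f"layer-ROI representational similarity: rho = {rho:.3f}")
```

With real data, repeating this per layer and per region yields the layer-by-region correlation profile the abstract reports (e.g., the final fully connected layer correlating best with anterior ventral temporal cortex).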

48 citations

Journal ArticleDOI
TL;DR: A novel reduced-reference (RR) quality metric integrating bottom-up and top-down strategies is proposed, which stems from the recently revealed free energy principle, which holds that the human visual system seeks to comprehend an input image via uncertainty removal.
Abstract: In image/video systems, contrast adjustment, which aims to enhance visual quality, is an important research topic. Yet very little effort has been devoted to visual quality assessment for contrast adjustment. To tackle this issue, this paper proposes a novel reduced-reference (RR) quality metric that integrates bottom-up and top-down strategies. The former stems from the recently revealed free energy principle, which holds that the human visual system seeks to comprehend an input image via uncertainty removal, while the latter uses the symmetric Kullback–Leibler divergence to compare the histogram of the contrast-altered image with that of the pristine image. The bottom-up and top-down strategies are finally combined to derive the RR contrast-altered image quality measure. A comparison with numerous existing IQA models is carried out on five contrast-related databases/subsets in CID2013, CCID2014, CSIQ, TID2008, and TID2013, and the experimental results validate the superiority of the proposed technique.
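The top-down strategy's histogram comparison can be sketched as follows. This is a minimal illustration, not the paper's implementation: the synthetic images, the bin count, and the function name are assumptions:

```python
import numpy as np

def symmetric_kl(p, q, eps=1e-12):
    """Symmetric Kullback-Leibler divergence between two discrete
    distributions: KL(p||q) + KL(q||p). Inputs may be raw histogram
    counts; they are normalized here."""
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    p /= p.sum()
    q /= q.sum()
    return float(np.sum(p * np.log(p / q)) + np.sum(q * np.log(q / p)))

# Hypothetical 8-bit images: a pristine image and a contrast-stretched copy.
rng = np.random.default_rng(1)
pristine = rng.integers(0, 256, size=(64, 64))
altered  = np.clip((pristine - 128) * 1.5 + 128, 0, 255)

# Compare the two luminance histograms.
h_ref, _ = np.histogram(pristine, bins=256, range=(0, 256))
h_alt, _ = np.histogram(altered, bins=256, range=(0, 256))

print("symmetric KL divergence:", symmetric_kl(h_ref, h_alt))
```

The divergence is zero for identical histograms and grows as the contrast alteration reshapes the intensity distribution, which is what makes it usable as a reduced-reference signal (only the reference histogram needs to be transmitted, not the full image).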

48 citations

Journal ArticleDOI
TL;DR: A novel perception-based hybrid model for video quality assessment that simulates the HVS perception process by adaptively combining noticeable distortion and blurring artifacts using an enhanced nonlinear model and exploits the orientation selectivity and shift invariance properties of the dual-tree complex wavelet transform.
Abstract: It is known that the human visual system (HVS) employs independent processes (distortion detection and artifact perception, also often referred to as near-threshold and suprathreshold distortion perception) to assess video quality at various distortion levels. Visual masking effects also play an important role in video distortion perception, especially within spatial and temporal textures. In this paper, a novel perception-based hybrid model for video quality assessment is presented. This simulates the HVS perception process by adaptively combining noticeable distortion and blurring artifacts using an enhanced nonlinear model. Noticeable distortion is defined by thresholding absolute differences using spatial and temporal tolerance maps that characterize texture masking effects, and this makes a significant contribution to quality assessment when the quality of the distorted video is similar to that of the original video. Characterization of blurring artifacts, estimated by computing high-frequency energy variations and weighted with motion speed, is found to further improve metric performance. This is especially true for low-quality cases. All stages of our model exploit the orientation selectivity and shift invariance properties of the dual-tree complex wavelet transform. This not only helps to improve performance but also offers the potential for new low-complexity in-loop applications. Our approach is evaluated on both the Video Quality Experts Group (VQEG) full reference television Phase I and the Laboratory for Image and Video Engineering (LIVE) video databases. The resulting overall performance is superior to existing metrics, exhibiting statistically better or equivalent performance with significantly lower complexity.
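A toy version of the noticeable-distortion stage might look like the following. Note the simplifying assumption: the paper derives spatial and temporal tolerance maps from texture-masking models, whereas a constant per-pixel threshold stands in here, and all data are synthetic:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical luminance frames: a reference and a distorted version.
ref  = rng.uniform(0, 255, size=(32, 32))
dist = ref + rng.normal(0, 4, size=ref.shape)

# Toy tolerance map: a just-noticeable-difference threshold per pixel.
# (In the paper these maps vary spatially/temporally with texture masking;
# a constant value is an illustrative placeholder.)
tolerance = np.full(ref.shape, 6.0)

# Noticeable distortion: the part of the absolute difference that exceeds
# the tolerance; sub-threshold differences are assumed invisible.
diff = np.abs(dist - ref)
noticeable = np.where(diff > tolerance, diff - tolerance, 0.0)

print("fraction of noticeable pixels:", float((noticeable > 0).mean()))
```

The resulting map is what a quality model would then pool (and, per the abstract, combine nonlinearly with a blur measure) into a single score.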

48 citations

ReportDOI
01 Jun 2000
TL;DR: A geometric model and a computational method for segmentation of images with missing boundaries are presented, together with an algorithm that builds the missing information from a given reference point of view, using the available information as boundary data.
Abstract: We present a geometric model and a computational method for segmentation of images with missing boundaries. In many situations, the human visual system fills in missing gaps in edges and boundaries, building and completing information that is not present. Boundary completion presents a considerable challenge in computer vision, since most algorithms attempt to exploit existing data. A large body of work concerns completion models, which postulate how to construct missing data; these models are often trained and specific to particular images. In this paper, we take an alternative perspective: we consider a reference point within an image as given, and then develop an algorithm that builds the missing information on the basis of that point of view, using the available information as boundary data. Starting from this point of view, a surface is constructed. It is then evolved with the mean curvature flow in the metric induced by the image until a piecewise constant solution is reached. We test the computational model on modal completion, amodal completion, texture, photo, and medical images. We extend the geometric model and the algorithm to 3D in order to extract shapes from low signal-to-noise-ratio medical volumes. Results in 3D echocardiography and 3D fetal echography are presented.
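The surface evolution the abstract describes can be sketched as an explicit level-set mean curvature flow iteration. This is a bare sketch under stated assumptions: it evolves in the plain Euclidean metric rather than the image-induced metric the paper uses, and the grid size, time step, and initial surface are illustrative:

```python
import numpy as np

def curvature_step(u, dt=0.1, eps=1e-8):
    """One explicit step of level-set mean curvature flow,
    u_t = kappa * |grad u|, where kappa is the curvature of the
    level sets of u, written out in finite differences."""
    uy, ux = np.gradient(u)        # first derivatives (axis 0, axis 1)
    uyy, uyx = np.gradient(uy)     # second derivatives of uy
    uxy, uxx = np.gradient(ux)     # second derivatives of ux
    num = uxx * uy**2 - 2 * ux * uy * uxy + uyy * ux**2
    den = (ux**2 + uy**2) ** 1.5 + eps
    grad_norm = np.sqrt(ux**2 + uy**2)
    return u + dt * (num / den) * grad_norm

# Hypothetical initial surface: signed distance to a circle, plus noise.
n = 64
y, x = np.mgrid[0:n, 0:n]
u0 = np.sqrt((x - n / 2) ** 2 + (y - n / 2) ** 2) - n / 4
u0 += np.random.default_rng(3).normal(0, 0.5, size=u0.shape)

u = u0.copy()
for _ in range(50):
    u = curvature_step(u)

# Curvature flow smooths the level sets: the discrete Laplacian energy
# of the surface should drop as the noise is diffused away.
lap = lambda v: (v[:-2, 1:-1] + v[2:, 1:-1] + v[1:-1, :-2]
                 + v[1:-1, 2:] - 4 * v[1:-1, 1:-1])
rough_before = float(np.mean(lap(u0) ** 2))
rough_after = float(np.mean(lap(u) ** 2))
print(f"roughness before/after: {rough_before:.3f} / {rough_after:.3f}")
```

In the paper's setting, the flow additionally runs in a metric weighted by the image, so the evolving surface slows and stops along existing edge data while bridging the gaps.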

48 citations


Network Information

Related Topics (5)
- Feature (computer vision): 128.2K papers, 1.7M citations (89% related)
- Feature extraction: 111.8K papers, 2.1M citations (86% related)
- Image segmentation: 79.6K papers, 1.8M citations (86% related)
- Image processing: 229.9K papers, 3.5M citations (85% related)
- Convolutional neural network: 74.7K papers, 2M citations (84% related)
Performance

Metrics: no. of papers in the topic in previous years

Year   Papers
2023   49
2022   94
2021   279
2020   311
2019   351
2018   348