scispace - formally typeset
Search or ask a question

Showing papers on "Image file formats published in 2007"


Proceedings ArticleDOI
27 Feb 2007
TL;DR: A novel statistical model based on Benford's law for the probability distributions of the first digits of the block-DCT and quantized JPEG coefficients is presented and a parametric logarithmic law, i.e., the generalized Benford't law, is formulated.
Abstract: In this paper, a novel statistical model based on Benford's law for the probability distributions of the first digits of the block-DCT and quantized JPEG coefficients is presented. A parametric logarithmic law, i.e., the generalized Benford's law, is formulated. Furthermore, some potential applications of this model in image forensics are discussed in this paper, which include the detection of JPEG compression for images in bitmap format, the estimation of JPEG compression Qfactor for JPEG compressed bitmap image, and the detection of double compressed JPEG image. The results of our extensive experiments demonstrate the effectiveness of the proposed statistical model.

287 citations


Journal ArticleDOI
Peter Amon1, T. Rathgen, D. Singer
TL;DR: This paper describes the file format defined for scalable video coding, which enables rapid extraction of scalable data, corresponding to the desired operating point, in a variety of usages and application scenarios.
Abstract: This paper describes the file format defined for scalable video coding. Techniques in the file format enable rapid extraction of scalable data, corresponding to the desired operating point. Significant assistance to file readers can be provided, and there is also great flexibility in the ways that the techniques can be used and combined, corresponding to different usages and application scenarios.

173 citations


Patent
07 Nov 2007
TL;DR: In this paper, a system for managing media files having different format characteristics includes a transcoder, a content store, and a plurality of clients, each associated with one or more media file formats and capable of playing media files to users.
Abstract: A system for managing media files having different format characteristics includes a transcoder, a content store, and a plurality of clients. The content store is capable of storing a media file in a first format. The clients are each associated with one or more media file formats and capable of playing media files to users. The transcoder is capable of receiving a request identifying a media file from a first client and, in response to receiving the request, retrieving the media file from the content store in a first format. The transcoder is also operable of modifying the media file from the first format to a second format associated with the first client and, while modifying the media file from the first format to the second format, transmitting a modified portion of the media file to the first client.

165 citations


Patent
09 Aug 2007
TL;DR: In this article, a system, media, and method for transforming a main image of a digital image in accordance with a parameter are provided, where the main image that is transformed based on the parameter may be one stored in the Exchangeable Image File (EXIF) format.
Abstract: A system, media, and method for transforming a main image of a digital image in accordance with a parameter are provided. The main image that is transformed based on the parameter may be one stored in the Exchangeable Image File (“EXIF”) format. In turn, the metadata, including the thumbnail image, is updated to correspond to the transformed main image. The transformed main image and updated metadata are stored together in a file using the EXIF format. Thus, the transformed main image may be viewed using a digital camera or viewer software compatible with a properly formatted EXIF file.

109 citations


Patent
04 May 2007
TL;DR: In this paper, a corrected gradation derivation method was proposed to enhance the feeling of depth of a 2D image through the addition of shadow component on the input image based on brightness information and the estimated normal direction and edge information.
Abstract: It is an object to easily, and using existing devices, perform shadow enhancement that achieves an increase in the feeling of depth of 2D video. The input image data are first converted into brightness information by a brightness information calculation portion. Then, based on that brightness information, the normal direction and the edge information in the pixel targeted for processing are estimated by a normal direction estimation portion. A corrected gradation derivation portion then performs correction processing such as the addition of shadow component on the input image based on the brightness information and the estimated normal direction and edge information to create a processed image that has a feeling of depth, and then an output portion converts this to a predetermined image format and outputs it. In this way, it is possible to easily increase the feeling of depth of a 2D image through the addition of shadow, for example, in accordance with the characteristics of the input image.

106 citations


Patent
04 Sep 2007
TL;DR: In this paper, the file format of the data and file format which can be processed by the mobile terminal is determined and the conversion of the file formats is decided. But the conversion is not performed by the user.
Abstract: In a data transfer system, when transferring data which has been transferred to a storage in a PC, to a mobile terminal connected to the PC, the PC acquires the file format of the data and the file format which can be processed by the mobile terminal and determines whether the conversion of the file format is necessary. When the conversion of the file format is necessary, the PC issues a request for a conversion to a format converter. The format converter acquires data from the PC and transfers the data back to the PC while performing conversion. The PC transfers the converted data to the mobile terminal.

95 citations


Patent
18 Sep 2007
TL;DR: In this article, the received image files are stored such that an identifier value associated with the media content is stored in the filenames of the image files, such that the file is associated with specific media content.
Abstract: Requesting and receiving image files associated with media content. The received image files are stored such that an identifier value associated with the media content is stored in the filenames of the received image files. The invention determines which of the image files is associated with specific media content by searching for the identifier value of the specific media content in the filenames of the image files.

82 citations


Patent
15 Mar 2007
TL;DR: In this paper, the authors described techniques for automatic generation of one or more tags associated with an image file using hand-written annotations for a displayed image and handwriting recognition processing of the ink annotations.
Abstract: Techniques are described for performing automatic generation of one or more tags associated with an image file. One or more ink annotations for a displayed image are received. Handwriting recognition processing of the one or more ink annotations is performed. A string is generated and the string includes one or more recognized words used to form the one or more tags associated with the image file. The handwriting recognition processing and generating the string are performed in response to receiving the ink annotations.

75 citations


Patent
22 Mar 2007
TL;DR: A device may process images (e.g. sort, group, file, e-mail, etc.) using various filters as discussed by the authors, which relate to non-image data in the image files to be processed.
Abstract: A device may process images (e.g. sort, group, file, e-mail, etc.) using various filters. The filters may relate to non-image data in the image files to be processed. The filters may include time and location filters.

65 citations


Patent
19 Dec 2007
TL;DR: In this paper, the authors propose an access control system that uses an existing file format standard, e.g., PDI or image file interchange, for novel access control purposes to provide temporary access for a wireless key device to a lock device and its protected environment.
Abstract: An access control system uses an existing file format standard, e.g. for personal data interchange (PDI) or image file interchange, for novel access control purposes to provide temporary access for a wireless key device to a lock device and its protected environment by creat ing appropriate temporary access defining data in a data object compliant with the file format standard and communicating the data object to the lock device via the wireless key device.

58 citations


Patent
22 Mar 2007
TL;DR: In this article, the authors propose a method to simplify the uploading of images from a device to a network site (e.g., a website) by changing a format in which nonstandard data is stored in an image file to be uploaded, such as location information.
Abstract: A device simplifies uploading of images from a device to a network site (e.g. a website). The device or a remote server stores public upload information for one or more websites. The images are then formatted in accordance with the public upload data and/or personal configurations. The formatting may be directed, at least in part, to changing a format in which non-standard data is stored in an image file to be uploaded, such as location information. Information from an uploaded image file may be displayed directly, may be used in a mash-up, or may have some other use.

Proceedings ArticleDOI
20 Jun 2007
TL;DR: A feature extraction and classification framework that operates on features that can be extracted from image files in a very fast fashion that is able to detect a large amount of malicious images while being computationally inexpensive.
Abstract: Image spam poses a great threat to email communications due to high volumes, bigger bandwidth requirements, and higher processing requirements for filtering. We present a feature extraction and classification framework that operates on features that can be extracted from image files in a very fast fashion. The features considered are thoroughly analyzed regarding their information gain. We present classification performance results for C4.5 decision tree and support vector machine classifiers. Lastly, we compare the performance that can be achieved using these fast features to a more complex image classifier operating on morphological features extracted from fully decoded images. The proposed classifier is able to detect a large amount of malicious images while being computationally inexpensive.

Journal ArticleDOI
TL;DR: The method utilizes the approximation of an inverse tone mapping function that reduces the high dynamic range to a displayable range and significantly improves a compression performance, compared to conventional methods.

Patent
Jeong-hwan Jeon1
04 Apr 2007
TL;DR: In this article, a system and method for connecting a global positioning system (GPS) device with a digital image processing device and inserting position information stored in the GPS device into an image file taken by the digital image processor is presented.
Abstract: A system and method for connecting a global positioning system (GPS) device with a digital image processing device and inserting position information stored in the GPS device into an image file taken by the digital image processing device are provided. The system includes: a digital image processing device that photographs an image, generates an image file and stores the same; and a GPS device that stores position information according to signals transmitted from a GPS satellite at a predetermined time interval. When the digital image processing device and the GPS device are interconnected, the digital image processing device receives position information, which corresponds to time information on an image file stored in the digital image processing device, from the GPS device and inserts the position information into the image file.

12 Nov 2007
TL;DR: Two techniques are proposed for enhancing the message secrecy using image based steganography based on the use of punctuation marks and modified scytale cipher to hide a secret message in an image file.
Abstract: Image based steganography is the most popular method for message concealment. In this paper, two techniques are proposed for enhancing the message secrecy using image based steganography. The first technique is based on the use of punctuation marks to encode a secret message before embedding it into the image file. The second technique is based on the use of modified scytale cipher to hide a secret message in an image file. Both of these techniques have been implemented and tested using the S-Tools software package. The original and stego-images both are shown for the purpose of comparison

Patent
11 Apr 2007
TL;DR: An image processing apparatus includes a moving image file storage unit, an area selection receiving unit, and a scene change detecting unit that detects a start and an end of a scene containing the matching frame.
Abstract: An image processing apparatus includes a moving image file storage unit operable to store a moving image file; an area selection receiving unit operable to receive a selection of a predetermined area corresponding to one of a plurality of frames forming the moving image file; a template image generating unit operable to generate as a template image an image of the selected area; an image matching unit operable to obtain the frames from the moving image file storage unit, and to match each of the frames against the template image to search for a matching frame containing an image similar to the template image; and a scene change detecting unit operable to detect a start and an end of a scene containing the matching frame

Patent
Pujan K. Roka1
11 Oct 2007
TL;DR: A wireless telephone includes one or more keys incorporating a programmable display as mentioned in this paper, which can display an image file (e.g., a photo of a parent or friend), such as a photo captured by a camera incorporated into the telephone.
Abstract: A wireless telephone includes one or more keys incorporating a programmable display. The display may display an image file (e.g., a photo of a parent or friend), such as a photo captured by a camera incorporated into the telephone. The image file may be displayed directly on the key or it may be cropped or downsampled as appropriate. The user interface allows the user to program the key to dial a particular telephone number. In one embodiment, during a multi-party conference call the keys display images of the conference call participants. The participant currently speaking can be displayed on a main display of the phone. When the conference call is over, the key displays revert back to their previous display state.

Patent
06 Sep 2007
TL;DR: In this article, techniques and tools for representing pixel data in a video processing or capture system are described, which provide efficient color representation for video processing and capture, and provide flexibility for representing colors using different bit precisions and memory layouts.
Abstract: Techniques and tools for representing pixel data in a video processing or capture system are described. Described techniques and tools provide efficient color representation for video processing and capture, and provide flexibility for representing colors using different bit precisions and memory layouts. Described techniques and tools include video formats that can be used, for example, in hardware or software for capture, processing, and display purposes. In one aspect, chroma and luma information for a pixel in a video image is represented in a 16-bit fixed-point block of data having an integer and fractional components. Data can be easily converted from one representation to another (e.g., between 16-bit and 10-bit representations). In other aspects, formats for representing 8-, 10- and 16-bit video image data (e.g., packed and hybrid planar formats), and codes for indicating the formats, are described.

Patent
25 May 2007
TL;DR: In this article, a plurality of areas and pieces of distance information to subjects included in the respective areas are acquired and the image is made into the one with a blurring taste in which an area with low blurring degree in the image was floated.
Abstract: PROBLEM TO BE SOLVED: To make an image focused on the whole of a screen to the three-dimensional one with a feeling of depth. SOLUTION: When the image is photographed, the image is divided into a plurality of areas and pieces of distance information to subjects included in the respective areas are acquired. Blurring degrees are set by every area based on the pieces of distance information and blurring processing is performed by every area according to the blurring degrees. Thus, the image is made into the one with a blurring taste in which an area with low blurring degree in the image is floated. COPYRIGHT: (C)2009,JPO&INPIT

Patent
28 Dec 2007
TL;DR: In this paper, a method of capturing an image from a video call between a first user and a remote user over a communication network is proposed, which includes receiving video data from the remote user at a client executed at a user terminal of the first user.
Abstract: A method of capturing an image from a video call between a first user and a remote user over a communication network The method includes receiving video data from the remote user at a client executed at a user terminal of the first user, the video data comprising a sequence of frames; the client capturing a frame of the video data responsive to a command from the first user; the client extracting image data from the frame; the client converting the image data to an image file and embedding a communication identity of the remote user in the image file, wherein the communication identity is suitable for initiating a communication event with the remote user; and storing the image file on a storage means of the user terminal

Patent
05 Jan 2007
TL;DR: In this article, an apparatus for remotely controlling set-top boxes is described, including a memory device and a processor, and a signal file is associated with each button, and selecting a button causes a signal defined by the signal file associated with the selected button to be transmitted from the apparatus to the respective settop box to command the set-to-box to perform an operation.
Abstract: An apparatus is provided for remotely controlling set-top boxes. In general, a virtual remote controller is described including a memory device and a processor. The memory device is configured to store an image file and more than one signal file. The image file defines an image of a set-top box remote controller, and each signal file defines a command to control an operation of the set-top box. The processor is configured to generate the image of the remote controller according to the image file. The image of the remote controller includes buttons, and a signal file is associated with each button. Selecting a button causes a signal defined by the signal file associated with the selected button to be transmitted from the apparatus to the respective set-top box to command the set-top box to perform an operation. A method and a computer program product are also provided for remotely controlling set-top boxes.

Patent
04 Apr 2007
TL;DR: In this paper, a 3D image capture system using structured light technique is presented, which includes a first texture camera for capturing a textural image of a 3-D object and a second geometry camera to capture a geometric image of the object while a structured light pattern is projected onto the object.
Abstract: A three dimensional (3D) image capture system uses structured light technique. The 3D image capture system includes a first texture camera for capturing a textural image of a 3D object and a second geometry camera for capturing a geometric image of a 3D object while a structured light pattern is projected onto the 3D object. A pattern flash unit is used for projecting the structured light pattern onto the 3D object. The textural image is stored in a texture image file; and the geometric image is stored in a geometric image file. The geometric image file is processed to determine 3D coordinates and stored in a geometric image data file; and then the texture image file is processed to create texture data that is overlaid onto the 3D coordinates in the geometric image data file to produce a composite image.

Patent
29 Nov 2007
TL;DR: In this paper, a thumbnail image file is created in which two or more types of thumbnail images corresponding to the three-dimensional image are combined and recorded together with the 3D image file.
Abstract: An image file creation device comprises: a thumbnail image creation device which creates two or more types of thumbnail images on the basis of a three-dimensional image; a three-dimensional image file creation device which creates a three-dimensional image file from the three-dimensional image; and a thumbnail image file creation device which creates a thumbnail image file in which the created two or more types of thumbnail images are combined and recorded and which is associated with the three-dimensional image file. Thus, the thumbnail image file in which two or more types of thumbnail images corresponding to the three-dimensional image are combined and recorded is also created together with the three-dimensional image file. Therefore, this thumbnail image file has compatibility with 3D image reproduction devices having various specifications for thumbnail image reproduction.

Patent
Chang-Seog Ko1
08 Jan 2007
TL;DR: In this paper, a data recording and reproducing apparatus and a method of generating metadata are described, where a signal processor captures images, processing the captured images to generate image data, and generating an image file that includes the image data; a speech recognition unit recognizing speech and converting the speech into text data; and a controller using the text data to generate metadata and adding the metadata to the image file.
Abstract: A data recording and reproducing apparatus and a method of generating metadata are provided. The data recording and reproducing apparatus includes: a signal processor capturing images, processing the captured images to generate image data, and generating an image file that includes the image data; a speech recognition unit recognizing speech and converting the speech into text data; and a controller using the text data to generate metadata and adding the metadata to the image file. Accordingly, at a time when images are recorded, metadata is generated as management information corresponding to image contents by using an image contents recording apparatus. Therefore, reliable metadata corresponding to the image contents can be generated.

Patent
12 Oct 2007
TL;DR: In this article, a storage system is configured to backup its file system by taking a first static image of the file system at a point in time, and a clone of the first image may then be produced, the clone containing any subsequent changes to the file systems and a reference pointer to the original image.
Abstract: Embodiments described herein adapt static-image and clone technology to provide a simulated dynamic image to an application requesting a dynamic image. A storage system is configured to backup its file system by taking a first static image of the file system at a point in time. A clone of the first image may then be produced, the clone containing any subsequent changes to the file system and a reference pointer to the first image. A second static image of the clone is then produced. An application may request, from the storage system, an image of the file system. In response, the second static image may be presented to the application as a simulated dynamic image.

Patent
09 Oct 2007
TL;DR: In this article, an application program package (APP) in a Web server extracts attribute information about an image file registered in a public folder (for example, subject feature information about a subject appearing in the image file).
Abstract: An application program package (APP) in a Web server extracts attribute information about an image file registered in a public folder (for example, subject feature information about a subject appearing in the image file). The APP calculates a reference value of the subject feature information from the subject feature information about multiple image files associated with the public folder. The APP determines the subject feature information having the reference value as feature information. The APP acquires a level of similarity between the subject feature information about every image file and the feature information. The APP sets a display mode of the image file registered in the public folder in association with the public folder based on the level of similarity and controls the image file so as to be displayed in the display mode in a client PC.

01 Jan 2007
TL;DR: The ASPRS Lidar Exchange Format (LASPRS LAS) as mentioned in this paper is a data format standard designed to make the exchange of lidar data, processing, analysis, and storing less time consuming and more convenient.
Abstract: The laser scanning technology has become de-facto as a successful measuring mean in numerous applications of remote sensing and mapping. A development of hardware has been followed by a development of a new data file format standard know as the American society for Photogrammetry and Remote Sensing (ASPRS) Lidar Exchange Format (LAS). This data format standard has been designed in order to make the exchange of lidar data, (pre-/post-) processing, analysis, and storing less time consuming and more convenient. There are three versions of the ASPRS LAS standard: 1.0, 1.1, and 2.0 (draft). A number of the manufacturers of hardware and software, laser scanning service providers and end users have already accepted a concept of ASPRS LAS as an industry standard. However, a less experienced end user might be confused by the different definitions of the term LAS that appear in literature and are used by various software vendors. The following main LAS definitions in remote sensing and geomatics exist: Land Analysis System by USGS, Log ASCII Standard by the Canadian Well Logging Society, LAS image format by ER Mapper, and ASPRS LAS by the ASPRS Lidar Committee. This paper explains the different common meanings of those terms. Several popular software products used for lidar data processing are also reviewed and the terminology associated with the file format defined. At this time there is no common tool available for converting from one ASPRS LAS format to another, and this can be a challenge when working with multiple formats. Only in one study case a version number of ASPRS LAS was clearly identified in the Import/Export tool. This paper also provides a comparison feature matrix of the different versions of ASPRS LAS.

Book ChapterDOI
01 Jan 2007
TL;DR: This chapter analyzes the effects that standard image compression methods - JPEG (Wallace, 1991) and JPEG2000 (Skodras et al., 2001) - have on two well known subspace appearance-based face recognition algorithms: Principal Component Analysis - PCA and ICA.
Abstract: With the growing number of face recognition applications in everyday life, image- and video-based recognition methods are becoming important research topic (Zhao et al., 2003). Effects of pose, illumination and expression are issues currently most studied in face recognition. So far, very little has been done to investigate the effects of compression on face recognition, even though the images are mainly stored and/or transported in a compressed format. Still-to-still image experimental setups are often researched, but only in uncompressed image formats. Still-to-video research (Zhou et al., 2003) mostly deals with issues of tracking and recognizing faces in a sense that still uncompressed images are used as a gallery and compressed video segments as probes. In this chapter we analyze the effects that standard image compression methods - JPEG (Wallace, 1991) and JPEG2000 (Skodras et al., 2001) - have on two well known subspace appearance-based face recognition algorithms: Principal Component Analysis - PCA (Turk & Pentland, 1991), Linear Discriminant Analysis - LDA (Belhumeur et al., 1996) and Independent Component Analysis - ICA (Bartlett et al., 2002). We use McNemar's hypothesis test (Beveridge et al., 2001 ; Delac et al., 2006) when comparing recognition accuracy in order to determine if the observed outcomes of the experiments are statistically important or a matter of chance. Following the idea of a reproducible research, a comprehensive description of our experimental setup is given, along with details on the choice of images used in the training and testing stage, exact preprocessing steps and recognition algorithms parameters setup. Image database chosen for the experiments is the grayscale portion of the FERET database (Phillips et al., 2000) and its accompanying protocol for face identification, including standard image gallery and probe sets. Image compression is performed using standard JPEG and JPEG2000 coder implementations and all experiments are done in pixel domain (i.e. the images are compressed to a certain number of bits per pixel and then uncompressed prior to use in recognition experiments). The recognition system's overall setup we test is twofold. In the first part, only probe images are compressed and training and gallery images are uncompressed (Delac et al., 2005). This setup mimics the expected first step in implementing compression in real-life face recognition applications: an image captured by a surveillance camera is probed to an existing high-quality gallery image. In the second part, a leap towards justifying fully compressed domain face recognition is taken by using compressed images in both training and testing stage (Delac, 2006). We will show that, contrary to common opinion, compression does not deteriorate performance but it even improves it slightly in some cases. We will also suggest some prospective lines of further research based on our findings.

Journal ArticleDOI
TL;DR: A new near-lossless image compression algorithm based on the Bayer format image suitable for hardware design that can provide low average compression rate with high image quality for endoscopic images and supports real-time compressing.
Abstract: In order to decrease the communication bandwidth and save the transmitting power in the wireless endoscopy capsule, this paper presents a new near-lossless image compression algorithm based on the Bayer format image suitable for hardware design. This algorithm can provide low average compression rate (2.12 bits/pixel) with high image quality (larger than 53.11 dB) for endoscopic images. Especially, it has low complexity hardware overhead (only two line buffers) and supports real-time compressing. In addition, the algorithm can provide lossless compression for the region of interest (ROI) and high-quality compression for other regions. The ROI can be selected arbitrarily by varying ROI parameters. In addition, the VLSI architecture of this compression algorithm is also given out. Its hardware design has been implemented in 0.18 µm CMOS process.

Patent
18 Jan 2007
TL;DR: In this paper, the problem that though the resolution of a recent digital camera is greatly improved and a detailed image can be photographed, the size of an image file is increased, and the transfer time is lengthened in the case of storing and reading into/from storage equipment, operability is spoiled, and since the image is resized according to paper size and then printed in case of printing, the print time becomes long.
Abstract: PROBLEM TO BE SOLVED: To solve the problem that though the resolution of a recent digital camera is greatly improved and a detailed image can be photographed, size of an image file is increased, and transfer time of an image is lengthened in the case of storing and reading into/from storage equipment, operability is spoiled, and that since the image is resized according to paper size and then printed in the case of printing, the print time becomes long SOLUTION: Since resize processing of an image is frequently performed in the case of printing, the image is resized first in the case of transferring the image to a digital camera and transfer data quantity can be reduced Also, the resized image is stored, so that the resize processing in the case of printing can be eliminated, and the print time can be shortened COPYRIGHT: (C)2007,JPO&INPIT