Optics lens design for privacy-preserving scene captioning

Optics lens design for privacy-preserving scene captioning

Oct 16, 2022·
Paula Arguello
Paula Arguello
Jhon Lopez
Jhon Lopez
Carlos Hinojosa
Carlos Hinojosa
Henry Arguello
Henry Arguello
Abstract
Image captioning is a challenging task that connects two major artificial intelligence fields, computer vision and natural language processing. Image captioning models use traditional images to generate a natural language description of the scene. However, the scene could contain private information that we want to hide but still generate the captions. Inspired by the trend of jointly designing optics and algorithms, this paper addresses the problem of privacy-preserving scene captioning. Our approach promotes privacy preservation, by hiding the faces in the images, during the acquisition process with a designed refractive camera lens while extracting useful features to perform image captioning. The refractive lens and an image captioning deep network architecture are optimized end-to-end to generate descriptions directly from the blurred images. Simulations show that our privacy-preserving approach degrades private visual attributes (e.g., face detection fails with our distorted images) while achieving comparable captioning performance with traditional non-private methods on the COCO dataset.
Type
Publication
2022 IEEE International Conference on Image Processing (ICIP)

Best Paper Award ICIP 2022

screen reader text

Proposed Method

screen reader text
Proposed end-to-end (2PSC) model. The optical encoder consists of a camera with a refractive lens. The decoder consists of convolutional feature extraction and an LSTM with attention, which generates a description from the privacy image

Qualitative results

screen reader text
Qualitative results on the COCO dataset test set. Under each privacy image, we show the caption obtained by each model, and under the original image, we show the ground truth caption. We compute the PSNR between the original and distorted images for each approach

Video

Paula Arguello
Authors
B.S. Systems Engineer
Jhon Lopez
Authors
Ph.D.(c) in Computer Science, Universidad Industrial de Santander
Carlos Hinojosa
Authors
Postdoctoral Researcher at KAUST
Henry Arguello
Authors
Professor at Universidad Industrial de Santander, Colombia