Introduction

Motion illusion is among the most impressive visual illusions1. In motion illusions, motion is perceived even though the relative positions of the observer and the observed object are unchanged. The Fraser–Wilcox illusion (FWI; reported in 1979) is a representative example of motion illusion2 and comprises a spiral pattern of repeating luminance gradients. The direction of the illusory motion varies from person to person. Since its first identification, numerous FWI variations have been reported (e.g.,3,4,5). Like the original FWI design, they are composed of a basic structure of light and dark gradients, although individual differences are greater in the strength of the perceived motion than in its direction.

The mechanism by which illusory motion occurs has long been debated. One possibility is that contrast intensity3,4,6,7 affects the processing speed of neurons and is thereby converted into motion perception. The effects of eye movements8 have also been discussed, but no clear conclusion has been reached. Because illusory motion reportedly elicits activity in cerebral areas involved in visual perception6,9,10, cerebral involvement in the mechanism of illusory motion has been suggested. Moreover, behavioral experiments suggest that motion illusions are perceived by animals such as rhesus monkeys11, cats12, lions13, guppies and zebrafish14, and fruit flies15. Studying the motion-perception mechanisms common to animals with developed visual systems has therefore attracted increasing attention.

With the recent development of deep neural networks (DNNs), research using them as tools to study brain function has become more active16. Comparing findings from DNNs with the operating principles of the brain makes it possible to interpret psychophysical or physiological processes in terms of computational principles. Given that one of the original motivations for DNN research was to uncover the essence of the brain by artificially reproducing the functions of the nervous system, such analogies remain relevant. DNNs have recently been applied in visual-perception research17,18. In illusion research, illusion-like phenomena have been reported in DNNs for classical size illusions19, the scintillating grid20, geometric illusions21, color illusions22,23,24, the flash-lag effect22, and gestalt closure25. DNNs also allow changes to the network structure and connection weights, alterations that cannot be applied to a living brain.

Our research group has studied motion illusions by attempting to reproduce them using DNNs and comparing the results with human perception. We previously focused on the relationship between the occurrence of an illusion and the predictive function of the brain26,27. In that study, we constructed a DNN model incorporating predictive coding theory28, a theoretical model of the cerebrum29,30,31, and trained it on first-person-view videos27. The DNN model predicted motion in the rotating snakes illusion to a degree similar to that of human perception, suggesting that the DNN model could be used as a tool for studying the subjective perception of motion illusions.

In the previous paper, we analyzed only the rotating snakes illusion as a representative example of motion illusion. In the present study, we analyzed a variety of motion illusions using the DNN model and also generated predictive images from ordinary static image datasets that included photographs and paintings. Additionally, we conducted psychophysical tests on human subjects and compared the DNN model's predictions with the results of human perception.

Methods

Deep neural networks

The connection-weight model of the trained DNN (PredNet; written in Chainer) used in this study was identical to the 500K model described previously27. In brief, PredNet is a DNN that predicts future video frames from a past time series of frames. It outputs a predicted image from a convolutional LSTM (long short-term memory) network and is trained so that the error (mean squared error) between the future real image and the prediction is reduced. The unique feature of this network is that the prediction error, rather than the real image itself, is the input to the convolutional LSTM network. To train the DNN, we used video from the First-person Social Interactions Dataset (http://ai.stanford.edu/~alireza/Disney/), which contains footage of days in the lives of eight subjects at the Disney World Resort in Orlando, Florida, recorded by cameras attached to hats worn by the walking subjects. In other words, PredNet is expected to learn the spatiotemporal characteristics of the world from first-person information. The connection-weight model used here was obtained by training on 500,000 video frames.
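The core idea, that the prediction error rather than the frame itself drives the network, can be illustrated with a deliberately simplified sketch. The scalar prediction weight and linear state update below are illustrative stand-ins for the convolutional LSTM, not the actual PredNet architecture.

```python
# Toy sketch of the predictive-coding loop behind PredNet (not the actual
# implementation): the input at each step is the prediction *error*.
import numpy as np

rng = np.random.default_rng(0)
H, W = 120, 160                      # frame size used in the paper
frames = rng.random((22, H, W))      # stand-in for a 22-frame video clip

state = np.zeros((H, W))             # recurrent state (stands in for the conv-LSTM)
w_pred = 0.9                         # toy prediction weight
lr = 0.1

for t in range(1, len(frames)):
    prediction = w_pred * state      # predicted frame t
    error = frames[t] - prediction   # prediction error: the network's input
    state = state + lr * error       # state update driven by the error
    mse = np.mean(error ** 2)        # training objective: mean squared error
    print(f"frame {t:2d}: MSE = {mse:.4f}")
```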

Test images

To test the predictions of the DNN model, we prepared five groups of test image stimuli: motion illusions (n = 300), modern art paintings (n = 300), realistic art paintings (n = 300), photographs of movable objects (n = 300), and photographs of still objects (n = 300). The motion illusions were originally created by Drs. Akiyoshi Kitaoka32 (299 images) and Eiji Watanabe33 (1 image). The images of art paintings were randomly collected from WikiArt (https://www.wikiart.org) according to their classification. The photographs were collected at random by icrawler34, a web-crawler framework (license = “noncommercial, modify”; keywords = “car,” “building,” “cat,” etc.), and then manually classified as “movable objects” (animals, vehicles, etc.) or “still objects” (buildings, mountains, etc.). Images were trimmed and scaled down to a final size of 160 × 120 pixels (width × height) to match the training images. The five groups of test image stimuli (1500 images in total) were shared as a “Visual Illusions Dataset”35.
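A minimal sketch of this collection pipeline, assuming icrawler's built-in GoogleImageCrawler and OpenCV resizing; the keyword, image count, and file paths are illustrative rather than the authors' exact settings, and the trimming step is reduced here to a plain resize.

```python
# Sketch: crawl images under a reuse-with-modification license filter,
# then scale each one to the 160 x 120 model input size.
import glob
import cv2
from icrawler.builtin import GoogleImageCrawler

crawler = GoogleImageCrawler(storage={'root_dir': 'raw/cat'})
crawler.crawl(keyword='cat', max_num=50,
              filters={'license': 'noncommercial,modify'})

for i, path in enumerate(glob.glob('raw/cat/*.jpg')):
    img = cv2.imread(path)
    if img is None:                      # skip unreadable downloads
        continue
    img = cv2.resize(img, (160, 120))    # (width, height) in OpenCV
    cv2.imwrite(f'test/cat_{i:04d}.png', img)
```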

Prediction

The DNN model predicted the 22nd image (P1 image) from 21 consecutive input images, which were 21 copies of one test image. The network then predicted the 23rd image (P2 image) from 22 consecutive images, using the P1 image as the 22nd image. The optical flow vectors between the P1 and P2 images were then calculated by the Lucas–Kanade36 and Farneback37 methods using a customized Python program (window size 50 and quality level 0.3 for Lucas–Kanade; window size 10, stride 5, and min_vec 0.01 for Farneback). The details of the protocol are essentially the same as in the two papers cited above. In brief, feature points are extracted sparsely in the Lucas–Kanade method and densely in the Farneback method. The optical flow between the two images is then calculated using a least-squares criterion, starting from the pixels of the feature points. Both methods assume that the flow is essentially constant in a local neighborhood of each feature point. See Fig. 4 for examples of the two analysis methods.
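A sketch of this step with OpenCV's standard implementations of the two methods. The window size (50) and quality level (0.3) for Lucas–Kanade and the window size (10) for Farneback follow the paper; the stride and min_vec options belong to the authors' customized program, so the remaining arguments below are common defaults, not their exact values.

```python
# Optical flow between the two predicted frames (P1, P2).
import cv2
import numpy as np

p1 = cv2.imread('P1.png', cv2.IMREAD_GRAYSCALE)   # predicted frame 22
p2 = cv2.imread('P2.png', cv2.IMREAD_GRAYSCALE)   # predicted frame 23

# Sparse flow (Lucas-Kanade): corners as feature points, then tracking.
pts = cv2.goodFeaturesToTrack(p1, maxCorners=100, qualityLevel=0.3,
                              minDistance=7)
nxt, status, _ = cv2.calcOpticalFlowPyrLK(p1, p2, pts, None,
                                          winSize=(50, 50))
vectors = (nxt - pts)[status.flatten() == 1]      # flow at tracked points

# Dense flow (Farneback): one flow vector per pixel.
flow = cv2.calcOpticalFlowFarneback(p1, p2, None, pyr_scale=0.5, levels=3,
                                    winsize=10, iterations=3, poly_n=5,
                                    poly_sigma=1.2, flags=0)
magnitude = np.linalg.norm(flow, axis=2)          # per-pixel flow magnitude

print(len(vectors), magnitude.mean())
```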

Psychophysical experiment

The visual stimuli used in the psychophysical experiment were created by cropping regions from a photograph (image A) and a painting (image B), as shown in Fig. 4. Each cropped image was duplicated 20 times and combined horizontally, and the combined image was transformed into a ring using the warpPolar method, the polar-coordinate conversion function in OpenCV (v.4.2.0.32; https://opencv.org). A white rectangular image was concatenated under the cropped image to reduce the distortion caused by the deformation. The width w of the white image is the same as the width x of the cropped image, and its height h is obtained from the following equation for the circumference of a circle with radius r = h + y/2, where y is the height of the cropped image:

$$\begin{aligned} n_sx=2\pi r=2\pi \left( h+\frac{y}{2}\right) \end{aligned}$$
(1)

where \(n_s\) is the number of repetitions (20). The value of h is obtained from the following equation:

$$\begin{aligned} h=\frac{1}{2}\left( \frac{n_sx}{\pi } - y\right) \end{aligned}$$
(2)

The values were rounded down to integers. The sizes of cropped images A and B were 9 × 26 pixels and 8 × 18 pixels (width × height), giving h = 15 pixels and 16 pixels, respectively. The image size output from the polar conversion function was set to 1024 × 1024 pixels, and the output images were used for the psychophysical experiment. To test the predictions of the DNN model, these images were further resized to 120 × 120 pixels and placed at the center of a white image of 160 × 120 pixels.
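A hedged reconstruction of the ring construction for image A: tile the cropped patch 20 times, pad with the white band of height h from Eq. (2), and apply OpenCV's inverse polar transform. The row/column orientation fed to warpPolar and the placement of the white band on the inner-radius side are our assumptions about the exact procedure; the file names are hypothetical.

```python
import cv2
import numpy as np

n_s, x, y = 20, 9, 26                        # repetitions and patch size (image A)
h = int((n_s * x / np.pi - y) / 2)           # Eq. (2), rounded down -> 15 px

patch = cv2.imread('patch_A.png')            # hypothetical file: the 9 x 26 crop
band = np.rot90(patch)                       # angle along rows, radius along columns
tiled = np.vstack([band] * n_s)              # repeat the pattern 20 times around
white = np.full((tiled.shape[0], h, 3), 255, np.uint8)
polar = np.hstack([white, tiled])            # white band toward the ring center

size = 1024                                  # output size used in the paper
ring = cv2.warpPolar(polar, (size, size), (size // 2, size // 2),
                     maxRadius=size // 2,
                     flags=cv2.INTER_LINEAR + cv2.WARP_INVERSE_MAP)
cv2.imwrite('ring_A.png', ring)
```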

The psychophysical experiment was designed based on the method of Hisakata et al.38 and conducted using a program written in Python with OpenGL (v.3.1.5; https://pypi.org/project/PyOpenGL/). The subjects were the authors T.K. and E.W. plus three naïve subjects (n = 5; all healthy subjects with normal vision). In a two-alternative forced-choice task, the subjects answered by keyboard input whether they saw the stimulus rotating clockwise (CW) or counterclockwise (CCW). Each subject's face was fixed at 50 cm from the screen, and only the right eye was used for viewing. A gazing point with a viewing angle of 1\(^{\circ }\) was established at the center of a white background, and the stimulus, with an outer diameter of 7\(^{\circ }\) and an inner diameter of 1\(^{\circ }\), was presented 12\(^{\circ }\) to the left of the center for 0.5 s. The subjects looked at the gazing point and viewed the stimulus with their peripheral vision. When they responded, the next stimulus was played, with a minimum of 1 s between the end of one stimulus presentation and the playback of the next.

To quantitatively examine the illusory motion of the stimuli, we intentionally rotated the stimuli and determined the conditions under which motion perception did not occur. We prepared two types of images, the original image and its left–right reversed version, to counteract any perceived rotation-velocity bias. The intentional rotational velocities ranged from −2.1 to +2.1 \(^{\circ }\)/s in intervals of 0.3 \(^{\circ }\)/s, giving a total of 15 velocities. To statistically analyze the responses, we presented each condition 30 times, with stimulus types and rotation velocities varied randomly. With 2 image types, 15 velocities, and 30 repetitions, each subject was presented with 900 stimuli in total. From the data obtained by this procedure, we calculated the rotational velocity at which the stimulus was equally likely to be judged as rotating CW or CCW.
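A minimal sketch of the resulting trial schedule; the condition labels are illustrative, but the counts follow the design above.

```python
# Build and shuffle the 2 x 15 x 30 = 900-trial schedule.
import random
import numpy as np

velocities = np.linspace(-2.1, 2.1, 15)      # 15 velocities in 0.3 deg/s steps
trials = [(stim, v)
          for stim in ('original', 'reversed')
          for v in velocities
          for _ in range(30)]                # 30 repetitions per condition
random.shuffle(trials)                       # random presentation order
assert len(trials) == 900                    # 900 stimuli per subject
```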

Figure 6 shows the raw data of the psychophysical experiment. The horizontal axis represents the velocity at which the stimulus was intentionally rotated (with CCW as the positive direction), and the vertical axis represents the probability of responding that the stimulus rotated CCW. Each psychometric curve was fitted with a cumulative Gaussian function, and the rotation-cancellation velocity was taken as the velocity at which the probability was 0.5. The rotation-cancellation velocity is the velocity required to cancel the rotation of the presented image, and the rotational velocity of the illusory motion is this velocity multiplied by −1. Using the original and reversed stimuli, we therefore calculated the rotational velocity of the stimulus as follows:

$$\begin{aligned} \frac{1}{2}\{\text {(the static rotational velocity of the reversed stimulus) - (the static rotational velocity of the original stimulus)}\} \end{aligned}$$
(3)
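A sketch of the fit and the Eq. (3) combination, assuming a least-squares cumulative Gaussian fit with scipy; the response probabilities below are made up for illustration.

```python
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import norm

def cum_gauss(v, mu, sigma):
    # Cumulative Gaussian psychometric function; P = 0.5 exactly at v = mu.
    return norm.cdf(v, mu, sigma)

velocities = np.linspace(-2.1, 2.1, 15)
# Made-up fractions of CCW answers, for illustration only.
p_ccw = np.array([0.03, 0.03, 0.07, 0.10, 0.17, 0.27, 0.40, 0.57,
                  0.70, 0.80, 0.90, 0.93, 0.97, 0.97, 1.00])

(mu, sigma), _ = curve_fit(cum_gauss, velocities, p_ccw, p0=[0.0, 1.0])
cancel_velocity = mu                   # rotation-cancellation velocity
illusion_velocity = -mu                # illusory rotation = -1 x cancellation

# Eq. (3): with mu_orig and mu_rev from two fits like the one above,
# stimulus_velocity = 0.5 * (mu_rev - mu_orig)
```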

Ethics statement

The study protocol followed the Declaration of Helsinki and was approved by the Ethics Committee of the National Institute for Physiological Sciences (permit No. 20A063). The psychological experiments were performed with the informed consent of all subjects, which included permission to disclose the subjects' initials.

Open-source software

All program code (DNN, optical flow analysis, and psychophysical stimulus-presentation software), trained models, and stimulus images were released as open-source software at the following websites:

  1. DNN: https://doi.org/10.6084/m9.figshare.5483710

  2. Optical flow analysis: https://doi.org/10.6084/m9.figshare.5483716

  3. Psychophysical experiment: https://github.com/taikob/Motion_Illusion_test

  4. Trained model: https://doi.org/10.6084/m9.figshare.11931222

  5. Stimulus images: https://doi.org/10.6084/m9.figshare.9878663

Results

Figure 1 shows examples of the optical flow vectors detected in the images predicted by the model for the five stimulus groups using the Lucas–Kanade method. Whereas relatively large and/or well-aligned optical flow vectors were detected in the images predicted from motion illusions, only relatively small optical flows were detected in the images predicted from the other groups. The direction of the motion vectors detected from the motion illusions agreed with the direction of the illusory motion perceived by humans. Notably, directed optical flows were detected not only in illusions with many colors, shapes, and gradients but also in an illusion composed of simple white triangles (Fig. 1, upper left). As a fundamental property of the methodology, the Lucas–Kanade method extracts characteristically shaped objects from images as feature points and uses them as the starting points of the optical flow. Therefore, the selection of eyes and hands as starting points (e.g., in the Mona Lisa and the photograph of President Obama) simply reflects their extraction as feature points and carries no particular meaning.

For quantitative analysis, we evaluated the frequency rates of the absolute values of the optical flow vectors detected from each image group (top two graphs in Fig. 2) and calculated the averages of the absolute values for each group (Fig. 3). The top two graphs in Fig. 2 show a noticeable difference in the frequency distribution between the motion-illusion images and the other image groups. In particular, near the modes of the non-illusion groups, the frequency for motion illusions was much lower than for the other groups. The average values were as follows: motion illusions, 0.71 ± 0.18 (arbitrary units; Lucas–Kanade) and 0.64 ± 0.034 (Farneback); modern art paintings, 0.052 ± 0.0029 and 0.24 ± 0.021; realistic art paintings, 0.036 ± 0.00088 and 0.092 ± 0.0014; movable-object photographs, 0.035 ± 0.0013 and 0.11 ± 0.0020; and still-object photographs, 0.037 ± 0.0013 and 0.11 ± 0.0026. These results indicate that larger optical flow vectors were detected in the motion-illusion group than in the other groups, a tendency that held for both the Lucas–Kanade and Farneback analyses. These findings suggest that the DNN model reliably separated the group of illusory images from the others.
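A sketch of the frequency-rate computation behind Fig. 2, assuming the magnitudes of all flow vectors in a group are pooled, binned with a 0.01 bin width, and normalized so the frequencies sum to 1; the function name and pooling step are our illustrative framing.

```python
import numpy as np

def frequency_rate(magnitudes, bin_width=0.01):
    """Normalized histogram of flow-vector magnitudes (frequencies sum to 1)."""
    bins = np.arange(0.0, magnitudes.max() + bin_width, bin_width)
    counts, edges = np.histogram(magnitudes, bins=bins)
    return edges[:-1], counts / counts.sum()

# Usage: `all_magnitudes` would be the absolute flow values pooled over a
# group's 300 images, e.g.:
# centers, rate = frequency_rate(all_magnitudes)
```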

However, as shown in Fig. 2, relatively large optical flow vectors were also detected in a few images from the non-illusion groups. To investigate the cause of these exceptionally large vectors, we identified two images (one photograph and one painting) in which notably large optical flows were predicted and compared their P1 and P2 images in detail. The first row of Fig. 4 shows that two large optical flows were detected in the area of the building on the left side of the photograph using the Lucas–Kanade method, whereas the Farneback method detected dense optical flow in the same region. Similarly, for the painting, characteristic optical flows were detected on the columns on its left side (third row of Fig. 4). The bottom two graphs in Fig. 2 show the distributions of the absolute values of the optical flow vectors for images A and B. Most of the optical flows detected in images A and B were far from the frequency-rate peaks of the still-object photograph and realistic-art groups (top two graphs in Fig. 2). The maximum absolute values of the optical flow vectors were 0.68 (Lucas–Kanade) and 4.0 (Farneback) for image A and 0.32 and 1.5 for image B. Figure 5 plots the brightness values where an exceptionally large optical flow was detected. Comparing P1 (Fig. 5, green line) with P2 (Fig. 5, blue line) revealed a shift between the two brightness distributions. These results indicate that the DNN model incorrectly predicted motion in static images that a human would not recognize as moving.
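A minimal sketch of the luminance-profile comparison in Fig. 5, assuming the predicted frames are converted to grayscale with OpenCV and read out along a horizontal line; the file names and row index are illustrative.

```python
import cv2

# Load the two predicted frames and convert to grayscale, as in Fig. 5.
p1 = cv2.cvtColor(cv2.imread('P1.png'), cv2.COLOR_BGR2GRAY)
p2 = cv2.cvtColor(cv2.imread('P2.png'), cv2.COLOR_BGR2GRAY)

row = 60                         # hypothetical scanline through the flow region
profile_p1 = p1[row, :]          # brightness along the line in P1
profile_p2 = p2[row, :]          # a horizontal offset between these two
                                 # profiles indicates the predicted shift
```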

We then hypothesized that the patterns producing the exceptionally large optical flows detected in the photograph and the painting might exhibit characteristics of motion illusions. We focused on the fact that most motion illusions have a repeating structure. Therefore, the areas where the large optical flows were detected (Fig. 5, boxed regions) were excised and reassembled into circular repeating structures (Fig. 6), and psychophysical experiments were performed on five human subjects. Testing the effect of the reconstructed designs on human perception showed that these artificially created designs produced a type of motion illusion (Fig. 6). The strength of the detected rotational velocity and the relative relationship between images A and B differed among the five subjects (TK: A, −0.36 \(^{\circ }\)/s and B, 0.31 \(^{\circ }\)/s; TS: A, −0.48 \(^{\circ }\)/s and B, 0.96 \(^{\circ }\)/s; EW: A, −0.18 \(^{\circ }\)/s and B, 0.29 \(^{\circ }\)/s; AT: A, −0.21 \(^{\circ }\)/s and B, 0.41 \(^{\circ }\)/s; YK: A, −0.37 \(^{\circ }\)/s and B, 0.51 \(^{\circ }\)/s). However, for a given design, there was no individual difference in the direction of the detected rotational velocity. The direction of perceptual motion estimated from the optical flow analysis (Fig. 4) and the luminance analysis (Fig. 5) of the DNN-predicted images was counterclockwise for both illusion-like designs. This estimated direction coincided with the direction of perceptual rotation obtained in the psychophysical experiment for the illusion-like design derived from image A, but not for the design derived from image B; this pattern was the same for all subjects. Next, each illusion-like design was input into the DNN model, predictive images were generated, and optical flow and luminance-distribution analyses were performed (second and fourth rows in Fig. 4). Large optical flows and luminance shifts were observed for the illusion-like design derived from image A, but they did not indicate rotational motion in a single direction; for the design derived from image B, only small flows and luminance shifts were observed.

Figure 1

Example images from the dataset and detected optical flows calculated using the Lucas–Kanade method. Four images are shown for each image group: motion illusions, modern art (paintings), realistic art (paintings), movable objects (photographs), and still objects (photographs). Each image group comprises a dataset of 300 images. (a) Original images input to the DNN model [image size: 160 × 120 pixels (width × height)]. (b) Detected optical flow vectors in the predicted images. Yellow points represent the starting points of the detected optical flow vectors, and red lines represent their directions and sizes. The length of each red line is 50 times the absolute value of the motion vector calculated using the Lucas–Kanade method. (c) The vectors drawn over the P1 predicted images.

Figure 2

Absolute values of the optical flow vectors. The vectors were calculated using the Lucas–Kanade (left two graphs) and Farneback (right two graphs) methods. Frequency rates of the absolute values of the optical flow vectors for each image group are shown as five solid colored lines. The total number of vectors in each group was standardized to 1 for the frequency rate, and the sampling-window size for the frequency rate was 0.01. Note that the horizontal axis is logarithmic. Large vectors, which were rarely observed in the non-illusion groups, were detected in two images; the distributions of the absolute values of the optical flow vectors for these images (A and B, shown in Fig. 4) are indicated by blue dots (image A) and orange dots (image B). The values derived from images A and B include relatively large vectors.

Figure 3

Average values of the optical flow vectors for each image group. The vectors were calculated using the Lucas–Kanade (left) and Farneback (right) methods. The mean value of the vectors detected in each image was calculated, and the average values and standard errors of 300 images were calculated for each group.

Figure 4

Two examples of exceptional image stimuli from which the DNN model detected relatively large optical flow vectors. Image A was derived from the photograph images and image B from the painting images. Yellow points represent the starting points of the detected optical flow vectors, and red lines represent their mean directions and sizes. The lengths of the red lines are 50 and 4 times the absolute values of the motion vectors calculated using the Lucas–Kanade and Farneback methods, respectively. The original images input to the DNN model are shown on the far left. The optical flow analyses of the illusion-like designs created from portions of images A and B (used in the psychophysical experiment shown in Fig. 6) are shown in the rows below the analyses of the original images. The bottom row shows the results of the analysis of the rotating snakes illusion for reference27.

Figure 5

Brightness distributions of images A and B. The predicted images (P1 and P2) were calculated using the DNN model following input of the original images. These images were converted into grayscale using OpenCV, and brightness values were calculated. Brightness values along the yellow lines in images (a) A and (b) B, where large optical flows were detected (Fig. 4), are plotted in (c) and (d), respectively. The dotted line represents the original image, the green line the P1 image, and the blue line the P2 image. The boxed region in each image (the red area in (c) and (d)) was used to create the motion-illusion-like designs.

Figure 6

The psychophysical experiments. The boxed regions in images A and B (Fig. 5) were excised to create two motion-illusion-like designs (the ring-shaped designs inserted into the figure), followed by psychophysical experiments on five subjects. Each psychometric curve was fitted with a cumulative Gaussian function using the least-squares method. The probability of seeing counterclockwise (CCW) rotation is plotted against the rotational velocity. The red and blue charts for each subject correspond to the original image and its left–right reversed version, respectively. The dots and curves represent the raw probability data and the best-fit curves, respectively. For the design from image A, the subjects showed a high probability of answering that it rotated CCW, whereas for image B, they showed a high probability of answering that it rotated CW.

Discussion

In this study, we showed that the DNN model distinguished between the illusion group and the other groups of ordinary photographs and paintings, although it occasionally predicted motion in some parts of the ordinary images (Fig. 4). Interestingly, we were able to create new motion illusions from those parts of the images. It is possible that motion illusions contain small unit structures and that a background of normal scenery suppresses the occurrence of illusory motion when a unit structure appears in isolation within it. We previously suggested the existence of such unit structures in our recent study of the rotating snakes illusion and the Fraser–Wilcox illusion39.

The illusion-like designs shown in Fig. 6 are among the first illusions to be discovered with the aid of artificial intelligence. Other reported examples include illusion generators based on an evolutionary algorithm40, a generative adversarial network41, and a statistical model42. In any case, a module that artificially models human vision is essential for generating illusions, and the type of illusion created is presumably influenced by the type of brain function modeled in that module. In the evolutionary algorithm, motion illusions were generated using the same connection-weight model as in this paper; in the generative adversarial network, color and contrast illusions were generated by CNNs trained on static images; and in the statistical model, color illusions were generated by a patch-likelihood estimation model trained on static natural images. Such methodologies would not only synthesize new visual illusions useful for vision researchers but also provide a new way to study the similarities and differences between artificial models and human visual perception.

Many motion illusions present a “repetition” of unit structures. As noted above, we presume that even a single unit structure can potentially cause the perception of motion. However, humans do not perceive illusory motion from a single unit structure alone, which suggests that local information might lead to the perception of motion only when it is combined with global information. Supporting evidence indicates that a wide range of brain regions, from V1 to MT+, is involved in the perception of motion illusions9, with higher regions (e.g., MT+) thought to integrate information from a broader perspective than V1. The DNN model detected motion flow in unit structures embedded in the photographs and paintings that humans did not perceive as moving (first and third rows in Fig. 4). For the illusion-like designs that repeat unit structures extracted from the photograph or the painting, humans perceived motion, but neither the direction nor the relative magnitude of that motion fully matched the DNN model's predictions (Fig. 6 and the second and fourth rows in Fig. 4). This discordance could indicate that the DNN model is underdeveloped in its ability to integrate global information, an ability thought to be implemented in higher brain regions and other areas. Further studies are required before artificial perception can fully serve as a tool for basic research on human perception.