Preprint
Article

This version is not peer-reviewed.

Clusters Regarding Ocular Fixations Made on a Dynamic Logotype Reveal Potential Top-Down and Bottom-Up Perceptual Modulations for Brand Communication

Submitted:

28 November 2025

Posted:

01 December 2025

You are already at the latest version

Abstract
Within the framework of brand communication, several companies choose to use bistable logos. These types of logos fall within the mechanisms inherent to bistable perception, where the interpretation of the two possible percepts involved may depend on the areas being observed or on prior instructions given to the observer to search for a particular shape within the ambiguous image. Perceptual factors related to the stimulus and the areas of eye fixation are called bottom-up aspects. The information exogenous to the bistable stimulus that determines perception is called top-down modulation. In order to determine whether certain bottom-up perceptual modulation areas for the Toblerone bistable logo are related to the search for each percept previously modulated by a written instruction, an experimental task was carried out with 34 participants using a Tobii T-120 eye-tracker device. Seven bottom-up modulation clusters were analyzed for ocular responses manifested in two different top-down modulation conditions. The results show that for each of the percepts, some areas correspond to the textual information offered as a top-down modulator. It is concluded that for the perception of the Toblerone® logo, some areas are related to each percept and that unimodal top-down modulation mechanisms operate in certain areas, while others can be assumed to be parts of the logo that contribute to the recognition of the two percepts involved.
Keywords: 
;  ;  ;  ;  ;  

1. Introduction

Ambiguous Images and Bistable Logotypes: Psychological Issues Involved

Within the context of brand communication and graphic design the so-called bistable images have been used to convey different messages to audiences [1,2]. Based on the idea of being ambiguous figures [3], bistable images have been proven to be useful due to their capability of transmitting two different meanings, each one of them related to each possible visual percept [4]. When designing bistable logos, graphic designers know that it is not possible to recognize simultaneously the possible percepts that are involved within the ambiguous image [5]. As has been stated, during the observation of bistable images there is no chance to perceive at the same time the two visual percepts [6]. Rather, just one percept remains recognized while the other one is not being perceived [7,8]. Once the observer starts to perceive the other possible percept, a perceptual reversal has taken place, this is, the switch from the first percept perceived to the second one [5,9,10,11,12,13]. Three examples of bistable images are depicted in Figure 1.
It has been noticeable how these images have been used in different fields: firstly, visual artistic images have been created based on the idea of provoking bistable perception [14]. In this regard, painters like Salvador Dalí or Archimboldo, painted images in which it is possible to alternate between two or more different visual percepts [15,16]. On the other hand, famous ambiguous images have been tested within the scope of the psychology of perception, such as The vase-face illusion [17], My-girlfriend-or-my-mother-in-law [18,19,20,21], the Schröder staircase [22], and the Necker’s cube [23]. These kinds of tests have been conducted to prove that there are different perceptual mechanisms involved while looking at bistable images [4]. On the other hand, several studies concerning bistable perception have been carried out to analyze the observation of androgynous faces [24], since that this kind of human images are considered both ambiguous and bistable [25]. Therefore, the perceptual psychological mechanisms that are involved in bistable visual perceptual processes apply to perceptual mechanisms linked to the recognition of the gender of androgynous human faces [24,26]. Besides, within the scope of brand communication, several brands and logotypes have been designed based on the principles that sustain bistable perception. Take, as examples, the logo of FedEx® [27], and the logo designed for the brand Toblerone® [28].
An issue that explains the bistable perceptual mechanism is the so-called bottom-up perceptual process that is involved due to standing out areas that influence observers’ attention [5]. This way, physical characteristics of bistable stmuli exert an effect on their interpretation [29,30,31]. The realization of this effect depends on the areas of the image upon which observers’ eyes are fixed [1,8,18,19,32,33]. Nevertheless, expectations, concepts, and also attitudinal predispositions play a role in the perceptual process as a whole [19,25]. When it occurs, the mechanism that is involved is a top-down modulating perceptual process [6]. In view of the foregoing, when information stored on memory and previous knowledge and expectations influence the final perception of bistable images, what is implied is the previously mentioned top-down perceptual process [5,34]. Thus, the interpretation of the ambiguous image is defined by information previously stored in memory [25,35], or also by inputs that enters into the visual perceptual system [36]. In this sense, when, for instance, auditory stimuli are influencing the perception of an ambiguous image, top-down perceptual modulation is operating [25], if the semantic load of the acoustic information is related to the meaning of the percept while observing a bistable visual stimulus [8,37]. Likewise, when an observer of a bistable stimulus reads a text whose content is related to one of the possible visual percepts involved, the semantic load of the text can influence the perception of the semantically congruent visual percept [38]. In that scenario, what emerges is the so-called semantic congruency effect [19,25].
A final issue that has to be mentioned to close this section, is that, providing the fact that bistable visual stimuli allow for two possible interpretations, they can also be called ambiguous images [39,40]. As mentioned before, when observers are looking at this kind of images, the perception shifts between the two possible visual percepts that can be perceived [7,21]. These leaps are known as perceptual reversals [8,9,10,11,13,21].

The Conveyance of Semantic Load of Bistable Logotypes

The way consumers look at brands influences the potential conveyance of semantic loads [41]. If the logo itself is bistable, perceptual dynamics arise due to the likelihood of interpreting the visual design in two different ways [1,42]. Falling into the scope of bistable perception, logotypes that can be perceived in two different ways have the capability of conveying two different meanings, that is to say, two different interpretations [5]. The semantic loads concerning these interpretations normally go hand in hand in relation to concepts that are attributed to semantic concepts of the brand involved [1]. It has been a matter of controversy if all the consumers that constitute the target audience of a brand are able to identify the percepts that are involved in bistable logotypes, due to factors regarding visual gestalt configurations and figure-ground perceptual reversals [5,43]. That concern has also emerged based on psychological aspects regarding bistable perception, like bottom-up modulating perceptual mechanisms [8,44], as well as the areas of the logo on which ocular fixations are made [1]. In this spirit, when brand strategists decide to associate a brand with two different meanings that convey key concepts for that brand, visual designs usually emerge to communicate those brand values [45]. In this regard, graphic designers and advertisers design bistable logotypes, due to their potential to transmit two different meanings [1,27]. The question that arises here is relating to the likelihood for the logotype to be interpreted in such a way that the two percepts involved are recognized by consumers. If so, the two semantic loads involved for brand communication would be conveyed, which, in turn, would imply a complete communicative efficiency for the bistable logotype in question, based on gestalt psychological principles regarding multi-stability [46,47].
As far as the logo of Toblerone® is concerned, it is classified as a bistable image due to its capacity of offering two possible interpretations based on figure-ground visual perceptual principles. When looking at that visual advertising design, it is possible to recognize a mountain and, within the mountain, a bear standing on two legs [48]. The history of Toblerone’s visual identifier (the logo itself) is intrinsically linked to its geographical origin. The brand was created in 1908 in Bern, Switzerland. Initially, the primary distinguishing element displayed on the box was the repeated triangular shapes of the chocolates, alluding to the Swiss Alps. At that time, the brand utilized a logotype designed by Emil Bauman, whose typeface was both simple and unique (the structure of this typeface is still preserved today). Later, in 1930, the brand sought to further connect with its country of origin [49] by including an illustration of an alpine bird perched atop the flags of Bern (featuring a bear) and Switzerland. In the late 1960s, the brand’s visual identity evolved into an imagotype, incorporating the Matterhorn, also known as Monte Cervino [50], one of the most iconic peaks in the Alps due to its triangular shape and a symbol of national pride [51]. This served to associate the brand with the renown of Swiss chocolate, the overall quality of the Helvetic industry, and to consolidate the triangular shape of the packaging and product, making this chocolate stand out within its market category [52]. The designer subtly included a bear in the isotype’s negative space, hidden among the mountain’s folds, a direct association with Bern, the ‘City of Bears’. In 2022, the manufacturer Mondelez International decided to shift part of its production to Slovakia. This decision meant the brand could no longer use its ‘Swiss-made’ label and had to transition from the distinctive Matterhorn silhouette to a generic mountain. In some countries, only the logotype is now used. Despite these changes, the brand remains a global symbol associated with travel, tradition, and luxury, demonstrating the power of effective brand management, visual identity, imagery, and shapes once they are positioned in consumers’ minds [53]. The study described in this article will take this bistable logotype as a reference, considered, in the scientific literature, as a bistable image of the in-meaning content reversals type. It is classified in this category because the two possible percepts share areas and lines, to the point that discriminating the figure-ground type is more complex to perform than in other types of ambiguous [4], such as those called in-figure-ground reversals images, like The vase-face illusion [54]. The essential purpose of the research described in the following pages focuses on analyzing the potential bottom-up modulation effect that can be exerted by an instruction given to observers through a written text that directly states that what they are going to look at is either a bear or a mountain. In this sense, the texts ‘bear’ and ‘mountain’ operate as semantic perceptual modulators. It was hypothesized that certain areas of the Toblerone® logo related to the visual percept associated with the semantic category of each text (bear and mountain) would be mostly observed. In this sense, as will be described later in the methodology section, critical bottom-up modulation clusters were preselected for the two possible visual percepts in question. Another objective pursued with the study was to establish the relevance of the critical areas of bottom-up modulation in relation to each of the visual percepts, understanding that the lines that define the figure of the bear are part of the mountain, which encourages a deeper investigation into the role played by the areas of ocular fixation in relation to the detection of each of the figures (bear and mountain) involved in the brand design of a bistable nature.

2. Related Work

Various research studies have been conducted to establish the communicative effectiveness of a bistable logo. The assumption is that the two percepts involved in the design of a bistable logo must be recognizable by consumers and buyers [1]. This highlights the fact that these two possible percepts should have the same probability of being identified. However, scientific evidence shows that several factors influence the identification of the two visual percepts involved, such as salient areas of the logo [48], the way the observer looks at the brand identifier [55], eye fixations on the bistable design [19], and potential semantic modulators related to exogenous information the observer has in mind when observing the bistable design [56].
To begin with, there are studies that assess the areas of bistable logos that influence the likelihood for the two percepts involved to be recognized. In 2019, Rodríguez and Marroquín-Ciendúa sought to demonstrate that placing fixation points that direct the gaze toward certain areas of various bistable logos could have an effect on the way those logos are perceived. Indeed, upon reviewing the results, these researchers demonstrated that there are certain areas and visual clusters of bistable logos that, when observed, favor the recognition of each of the two visual percepts involved. Furthermore, upon analyzing the results of this study, it was found that critical zones of perceptual influence can be established, which in the jargon of the psychology of visual perception is called, as previously mentioned, bottom-up perceptual modulation [57]. These zones of influence effectively determine or direct perception as long as they are observed [18]. In this sense, what stands out is the importance of being able to observe, with eye-tracking devices, the areas where observers place their eye fixations when looking at a bistable image [3]. These areas can be grouped into clusters, that is, into areas that are defined by a set of eye fixations that, within a certain visual area of the logo in question, configure a surface, an area of bottom-up perceptual modulation [27].
Since the brain does not process visual information during saccadic movements [58], it is the eye fixations and their durations that contribute to the understanding of brand communication through the use of bistable logos [5]. It is also for this reason that there is an indivisible connection between technology and communication, where motion detection systems based on the recognition of oculomotor activity allow a deeper understanding of the perceptual mechanisms involved in the decoding of visual brand communication proposals [59].
Rodríguez-Martínez, in both 2024 and 2025, managed to demonstrate that there are bottom-up modulating areas that, as a matter of fact, seriously condition the identification of the percepts involved in a brand design of a bistable nature. In his first study [5], using an in-line design and eliminating color fills, he managed to demonstrate that for a bistable logo of a Colombian milk brand there are areas that condition the perception and identification of a first percept, and that, likewise, there are other bottom-up modulation areas that favor the recognition of the second visual percept. This study, which was carried out thanks to the recording of the eye movements of the observers who voluntarily constituted the sample for the experimental tests, confirmed the psychological foundation of visual perception that defines the characteristics of the visual stimulus and the way in which these are observed through eye fixations as critical factors of the bottom-up type [60].
In the second study carried out by this same researcher [1], the same paradigm of online logo exposure without color fill was taken into consideration, this time to identify areas of perceptual modulation in a logo with different characteristics than the first study, where the key was that the logo could have bottom-up modulation clusters dispersed throughout the design. Upon reviewing the results, it is concluded that, once again, there is perceptual conditioning mediated by the areas of the bistable design on which eye fixations are made [3]. These studies are joined by other research efforts that have sought to demonstrate aspects of bottom-up modulation that affect the observation of logos and brand emblems, depending on their position within a given advertising design. This is the case of the study developed by Girisken and Bulut [61]. In this spirit, there could be a kind of integration between semantic loads provided by the brand and advertising content derived from a creative concept [62].
On the other hand, Puškarević et al. [63], using eye-tracking technology, contrasted the attentional effects of brand fonts in advertisements. Oculomotor activity measures related to eye fixations showed the occurrence of bottom-up modulating effects, especially when brands use rhetorical language in conjunction with font types with a higher attentional load [5]. Another psychological factor involved and normally considered in this type of studies is the level of attentional salience of the elements that constitute a certain bistable design [64]. In this sense, aspects of advertising visual stimuli and logos, such as the size and contrasts between communicative visual forms, can exert a significant influence on what consumers ultimately observe and identify. As has been stated, when observers look at labels or graphic designs where brands are present, a fact that is relevant for brand strategist is that it is very likely to differentiate themselves from competing products [5,65].
For their part, Tseng and Chuang [41] conducted a study focused on analyzing eye fixation behavior during observation processes of logos with illusory contours. Assuming bistability, meaning that if perceptual closure of the parts where the illusory contour needs to be imagined is not performed, provoking ambiguity while trying to decode the graphic design itself, hidden contour logos were found to be more novel, interesting, and pleasing, which, according to the study, is in line with modern logo design trends. Furthermore, logos with ambiguous contours were also found to involve more dispersed vision vectors, along with longer eye fixation durations compared to text-type logos, which have a much more focused rather than dispersed distribution, based on sustained vision parameters. Nevertheless, the study suggests that too many missing parts in logos can affect their recognition [41], where a perceptual ambiguity emerges that leads to instability in the recognition of visual percepts necessary to convey brand semantic values [66].
Regarding the study of the semantic congruence effect and top-down modulation processes involved in the observation and semantic decoding of bistable logos, there are no studies focused on addressing these psycho-perceptual phenomena. However, the demonstration that semantic loads exogenous to bistable visual stimuli directly influence the perception of the visual percepts that correspond to the meanings of said external information has been widely documented in the scientific literature e.g., [8,19,24,25,67,68]. Semantic correspondences, which underpin the concept of multisensory integration mediated by semantic congruences [69], can occur, first, in unimodal sensory modalities, where prior visual information, for example, influences the perception of a subsequently presented bistable visual stimulus [4], or in crossmodal modalities [70]. The latter implies that top-down modulating information comes from a sensory stimulation different from the sensory modality in which a bistable stimulus is presented [25]. Thus, acoustic information with a certain semantic load, for instance, can influence the recognition of a percept belonging to a bistable visual stimulus [8], where the semantic load of the auditory stimulus directs the recognition of the visual percept with a similar semantic load to the audio previously provided [24]. Here, the concept of perceptual semantic congruence emerges [19], where, as a matter of fact, the visual perception is conditioned by exogenous information but presented in a different sensory modality, which gives account of multisensory integration phenomenon [71].

3. Method

Participants and Procedure
A total of 34 volunteers participated in the study (age mean, M = 22.41; S.D. = 2.5833; women, 55,88%; men, 44,11%). Each of them continuously viewed the Toblerone® logo in the simplified, unfilled, line version (see it in Figure 2), following the analysis paradigm for recognizing bottom-up modulation areas for bistable images suggested in preliminary studies [1,5,18]. The bistable logo stimulus in question was displayed at two different times over a 12-second period on a Tobii® T-120 reference eye-tracker. All participants had to have normal vision or, failing that, corrected vision with contact lenses in order to observe the eye-tracker monitor at a distance of 60 cm, following the protocols suggested by the Swedish manufacturer Tobii [64]. Each participant signed their informed consent, and the study was approved by the Ethics Committee of Jorge Tadeo Lozano University. The present study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of the University Jorge Tadeo Lozano (internal code 5414 whose approval was given on 5th. August, 2024). For each participant’s recording, a calibration phase was first performed, ensuring optimal recording of eye movements and ensuring an adequate sample rate. Participants were then given clear and precise instructions to keep their heads still and stare at the image to be presented. Since the visual stimulus was presented for 12 seconds, the need to remain highly focused during the two presentations of the bistable logo was emphasized. The two different presentation times corresponded to two different instructions, both of which involved top-down modulating information. This way, the two different conditions were defined by the use of a text that conditioned each participant to search for a specific percept within the displayed bistable image. Thus, the first instruction was: “Next, you will see a picture. Observe it carefully and identify the shape of a bear.” The second instruction was: “Next, you will see a picture. Observe it carefully and identify the shape of a mountain.” In this sense, the words ‘bear’ and ‘mountain’ constituted the exogenous semantic information that would influence the final perception of the bistable logo. In this way, it was expected that the hypothesis would be fulfilled that bottom-up areas corresponding to the percept related to the modulating word given in the instruction would be viewed more than visual areas unrelated to the modulated percept.
The two instructions were presented in a different order, using a randomization code inserted into the software setup of the eye-tracker device used. Thus, the instruction to search for the ‘bear’ was sometimes the first instruction, or sometimes the second. The same situation occurred for the instruction to search for the ‘mountain’ visual percept. In this way, the order of presentation of the modulating information was counterbalanced to eliminate the bias associated with the order of the condition controlled by the researchers.
Data Analysis
All eye movement and fixation recordings were stored in Tobii Studio® software. For the purpose of analyzing eye fixations on critical bottom-up modulation areas, seven clusters or areas of interest (modulating areas for both, ‘bear’ and ‘mountain’ percepts) were defined within Tobii Studio® software. As shown in Figure 3, each of the areas of interest (AOIs, according to the convention used by the Swedish manufacturer Tobii®) were duly delineated and coded, assigning the codes A1, A2, A3, A4, A5, A6, and A7. Thus, the assigned codes have the following correspondences: A1, the bear’s face; A2, right side of the mountain; A3, left side of the mountain; A4, the bear’s lower right paw; A5, the bear’s lower left paw; A6, the bear’s upper left paw; A7, the bear’s upper right paw. The designation of the bear’s paws as left or right was assumed based on the observer of the logo and not based on the bear itself. These corresponding clusters (AOIs) are depicted in Figure 2.
Once the areas of interest were defined, ocular metrics referring to the time of eye fixations involved in each of the seven clusters were extracted. The areas of interest were duly selected following the protocols previously designed by Bernal-Robayo [72] and subsequently considered and implemented in the studies carried out with eye tracking by Rodríguez-Martínez [1,5]. The corresponding analyses were performed for each of the two top-down modulation conditions, and the relationships between the observed areas and their equivalence with the modulating text given in the instructions were established. Tobii Studio®, Excel® (v. 16.45), and SPSS® (v. 23) were used for the corresponding statistical analyses.

4. Results

In this section, we consider all the results obtained after analyzing all the durations of ocular fixations made in the selected AOIs. The ocular metric referring to the duration of ocular fixations is very relevant to determine the degree of importance and attentional salience of a certain area within the observers’ visual field [73]. Understanding that the visual field is established by an area defined by horizontal and vertical vectors (horizontal horopter and vertical horopter), it is in the center towards the area between minus 2 and 2 degrees that the focus area is established [74], which refers to the place where the observer is looking (ocular fixation) when a saccadic movement has stopped [58]. In Figure 3, the totality of the ocular data referring ocular fixations is graphically depicted by means of two ocular maps: opacity map (on the left), and heat map (on the right). These images will be discussed later.
Starting with cluster A1 (the bear’s face), no statistically significant differences were found in favor of the moment in which observers reviewed the image with the modulating text referring to the bear concept. While the difference between the means of the durations of the eye fixations did favor when observers read the instruction about searching for the bear (M = 2.4532, S.D. = 1.3498) compared to when the image was viewed with the conditioning of searching for the mountain (M = 2.1794, S.D. = 1.1914), the statistic did not show a value that suggested a significant difference (t(33) = 0.9958; p = 0.3265). It is possible that there was a slight impact on perception mediated by the instruction, but without any marked difference being found in the compared averages of the durations of the eye fixations manifested in that area. Regarding AOIs A2 (mountain area to the observer’s right), the result is in line with the hypothesis, since the average duration of eye fixations with the textual modulator referring to ‘mountain’ is significantly longer compared to the time when the observers were instructed to search for the bear (in the modulating text condition congruent with the mountain percept, M = 1.3794, S.D. = 1.1927; in the modulating condition for the bear percept, M = 0.7014, S.D. = 0.5153; t(33) = 3.2594; p = 0.0025). Clearly, a greater amount of observation is recognized in the mountain area to the observer’s right when there is congruence with the modulating text (p < 0.05). But contrary to expectations, in the case of the mountain area located to the left of the visual field (area A3), there is greater visual tracking (and with a statistically significant difference) in that area of the bistable logo when the instruction to search for the bear is given, compared to when the instruction orients the visual task towards searching for the mountain: in the modulating text condition congruent with the mountain percept, M = 1.9764, S.D. = 1.2724; in the modulating condition for the bear percept, M = 2.5223, S.D. = 1.1164; t(33) = 2.154; p = 0.03). A possible explanation for this phenomenon may be given by the proximity of that area of the mountain in relation to the body of the bear, as will be discussed later.
Turning to the specific analysis of the bear’s paw areas (one area for each paw, A4, A5, A6, A7), a review of the eye fixation times reveals that one of the areas of the two lower paws turns out to be key for bear recognition (area A5, which is further left in the visual field). According to the results, when observers were instructed to search for the bear, the cluster A5 had greater observation in terms of eye fixation durations, giving a significant difference in favor compared to the modulation given by the instruction referring to the search for the mountain. For the case of AOIs A4 (the bear’s lower paw on the right of the visual field), the t-student test statistic is, t(33) = 0.2572, p = 0.7986; mean value for top-down modulation in favor of the bear percept, M = 0.3276, S.D. = 0.2721; mean value for top-down modulation in favor of the mountain percept, M = 0.3088, S.D. = 0.2836. On the other hand, the data for the analysis of area A5 (the bear’s lower paw, left area of the observer’s visual field) are as follows: in the condition of congruent modulating text with the bear percept, M = 0.2323, S.D. = 0.2494; in the modulating condition for the montana percept, M = 0.0988, S.D. = 0.1915; t(33) = 2.4825; p = 0.018. As previously stated, this bear’s paw on the left side of the visual field is more frequently observed in the congruent top-down modulation condition, with a difference recognized as statistically significant (p < 0.05).
With regard to the bear’s upper paws (areas A6 and A7), no differences were found in the average durations of eye fixations when comparing the two top-down modulation conditions in either case. For the analysis of area A6 (the bear’s right paw, with a larger area or surface area than the other upper paw), the results of the statistical test were as follows: t(33) = 0.3363, p = 0.7387, where the average values were, for the mountain percept modulation condition, M = 0.6947, S.D. = 0.4625, and for the instructional text condition referring to the search for the bear, M = 0.6664, S.D. = 0.4306. Regarding area A7 (upper left paw of the bear), the statistic did not show a significant difference: t(33) = 0.8474, p = 0.4028, where the average values were, for the modulation condition for the mountain percept, M = 0.1682, S.D. = 0.4816, and for the instructional text condition regarding the search for the bear, M = 0.0941, S.D. = 0.1678. The comparisons between the average durations of eye fixations in each cluster and considering the two top-down modulation conditions are represented in Figure 4.
In Table 1, all the results are depicted, implicating all the AOIs taken into account and the results in terms of the means of durations of ocular fixations made on bottom-up critical areas considering the top-down modulating texts.

5. Discussion and Practical Implications

There is a relationship of semantic congruence in relation to some of the bottom-up modulating areas, as occurs in areas A2, A3, and A5. Taking into account that the paradigm being used is that of the correspondence between a greater number of durations of eye fixations in areas pre-valued as favoring the recognition of the percept previously modulated by the information given in the instruction [48], it is seen that areas A2 (favoring the mountain percept) and A5 (favoring the bear percept) achieve the expected correspondence effect. This is because, in the case of area A2, eye fixation durations are in fact longer for the condition in which the text refers to the expected visual percept, the mountain. The same applies to area A5 (the leftmost paw of the bear in the visual field): there is also a significant difference in favor of the duration of eye fixations in the top-down modulation condition defined by the instruction to search for the bear. A first consideration to estimate is the fact that area A3, preliminarily assessed as an area favoring the mountain percept [48,72], had a higher number of observations and levels of eye fixations in the modulating condition for the bear percept (M = 2.5223, S.D. = 1.1164), compared to the durations of eye fixations referred to the top-down modulation condition corresponding to the search for the coherent percept (mountain) in terms of bottom-up fixation areas (M = 1.9764, S.D. = 1.2724). This difference turned out to be significant (p < 0.05). Two considerations could eventually explain this result: first, that some boundaries of area A3 are very close to the contour lines that define a part of the bear percept. Compared to area A2, which would also contribute to the recognition of the mountain, this has a significant surface area that is far from the shape of the bear, sharing a smaller amount of boundary contours with the posterior area of the bear percept. Ambiguities in relation to contours can often impact the perceptual performance of observers [41]. The second consideration to take into account is that, possibly due to the location of area A3 within the visual field (on the left and covering almost the entire verticality of the bistable visual stimulus), observers fix their gaze there more due to a cultural behavioral habit marked by the fact that in Western cultures people learn to read and observe starting from the left and from top to bottom [75]. Since the present study did not look at saccadic movements, their vector path and displacement pattern, the above explanation remains a supposition, which merits further research that elucidates the phenomenon based on visual path patterns [58] and not only on the basis of analysis of eye fixations. This implication is not minor, if considering that other types of perceptual modulations are related to habits, learned behaviors and observation patterns based on the path of saccadic movements [76].
On the other hand, the difference with which a very clear correspondence is established between the top-down and bottom-up levels in relation to area A2 (which, according to previous studies, is an area that favors the recognition of the mountain percept) is notable. As a matter of fact, the greater amount in the duration of eye fixations manifested there when there is correspondence with the textual modulator (M = 1.3794; S.D. = 1.1927) turns out to be significant (p < 0.05) when contrasted with the permanence of eye fixations in the opposite top-down modulation condition (M = 0.7014; S.D. = 0.5153). There is almost a duplication in the values of the durations of eye fixations in favor of the observations made on the area with correspondence between the top-down modulating condition and the congruent bottom-up area. With this study, greater empirical evidence begins to be acquired that this area is associated with the recognition of the mountain percept. However, and understanding that during visual recognition tasks under bistable perception paradigms stochastic-type behaviors are recognized [4], eye visits will normally be found throughout the visual field, regardless of the attentional modulations involved [77]. Again, there is a need to advance future studies where multiple variables and co-variates involved are controlled, to the point that it is possible to recognize the level of bottom-up modulation of some areas of visual stimuli, and the incidence that exogenous information may have at the level of oculomotor activity that conditions perception in terms of top-down modulations. In this sense, it is also considered that the way in which the attempt is made to condition perception from top-down mechanisms, has an echo in the perceptual effects involved in bistable perception paradigms [4]. It is not the same to exercise perceptual modulation through a text placed previously to the exposure of a bistable visual stimulus, than to do so through a modulating audio that is constantly present during the exposure of the ambiguous image [8,20,24]. The way in which modulating exogenous information enters the memory system before or during the perceptual recognition process affects both cognitive and perceptual performance. Besides, factors associated with the workload involved through the use of working memory [78], or through the use of long-term memory, have an indisputable impact on the perceptual modulation process and the corresponding eye fixations and derived saccadic movements [58].
Moving on to another result worth considering, it is important to refer to the visual processing given to area A1 (the bear’s face). Considered an essential area for bear recognition [48], and given that its position within the visual field is relatively privileged (almost in the center of the figure and at the top, somewhat close to a threshold point with relevant attentional salience such as the mountain peak), this area is highly observed and has significant eye fixation durations there, as demonstrated in both top-down modulation conditions. According to the results, eye fixation durations do increase when the instruction to search for the bear figure is given, although the difference is not statistically significant: M = 2.4532, S.D. = 1.3498, compared to the conditioning of searching for the mountain, M = 2.1794, S.D. = 1.1914. It is worth highlighting here that there may be a potential neutral modulation area (A1), that is, the area itself can contribute to the recognition of the two percepts involved in the bistable image [18,19]. However, according to the results, there would be a tendency to recognize a minimal impact from the top-down modulator used. In this regard, it is suggested that future studies be conducted to analyze the bottom-up conditioning effect for this and other areas, not only in this bistable stimulus, but also in others where the same area is relevant for the perceptual interpretation of the two percepts involved. This phenomenon of areas favoring both possible percepts is particularly evident in in-meaning-content reversals bistable images [4] given that the two percepts share not only contour areas but also shapes and surfaces that are relevant to the configuration of the two potential gestalts involved [18,43]. This consideration applies to other areas involved here (the Toblerone® logo) and to any study that seeks to determine bottom-up modulation conditions based on the orientation of perception toward disambiguation in the interpretation of bistable stimuli. The preliminary study by Valderrama and Rodríguez-Martínez [48] had already suggested that area A1 is highly relevant to the understanding and perceptual decoding of this logo, as this part of the design contributes to the recognition of the bear, but it is also a key part of the upper-central part of the mountain.
Turning to the analysis of the bear’s paws, it is noteworthy that area A6 has a greater number of eye fixation durations compared to other areas of the bear’s paws, such as A4, A5, and A7. Area A6 is one of the bear’s paws, very important for recognizing the animal involved in the image [48], since it is the bear’s right front paw and covers a larger area than the other upper paw (A7) and the bear’s lower left paw (A4). Neither areas A4, nor A6, nor A7 showed a behavior where the correspondence between their durations in terms of eye fixations and the corresponding top-down modulating condition could be recognized. Returning to area A6, although it stands out slightly (see Figure 4) in relation to the other 3 areas corresponding to the bear’s paws, it is not possible to mention the possibility of a top-down modulating effect when comparing the average durations of eye fixations in the visual correspondence areas (p > 0.05). In fact, and as previously mentioned, of the four paws of the bear, it is only the one referring to area A5 (lower right paw of the bear) that involves the correspondence between the top-down and bottom-up levels involved (regarding the condition of congruent modulating text with the bear, M = 0.2323, S.D. = 0.2494; in the modulating condition for the mountain, M = 0.0988, S.D. = 0.1915, p < 0.05). It is worth mentioning that, when reviewing both the opacity map and the heat map in relation to eye fixations (Figure 3), it becomes evident (descriptively speaking) how there is a concentration of fixations in area A1, but understanding that, due to its surface area (smaller than that of A3), surface A3 is also visited, but with a greater dispersion of fixations throughout it (as previously stated, of larger surface area). The opacity map (on the left of Figure 3) shows what observers actually see when looking at the Toblerone® logo, showing how the triangle that is the basis of the design of this logo is configured, but where there are definitely areas that are mostly observed. It has to be kept in mind that these maps cover the entire captured data, without distinguishing between observations in one or another top-down modulation condition.
To close this section, it should be pointed out that the results discussed here should be taken into account by graphic designers and brand communication strategists, since brand logos and emblems are essential for communicating values and semantic implications that contribute to a brands’ strategic positioning. When dealing with bistable logos, it is essential to consider the psychological aspects involved in the perceptual interpretation process. Thus, designers must understand that there are attentional saliences that condition or modulate (in a bottom-up way) bistable perception. They must also understand that in practical brand communication, contexts, information from advertising or from the audiences’ knowledge concerning brands, can be mixed [79]. This exogenous information can operate as a top-down modulator, echoing in the decoding of the corresponding bistable logo.

6. Conclusions

It is concluded that, for the Toblerone® logo, there are certain bottom-up modulation areas that have a perceptual correspondence with a textual modulator presented in a unimodal sensory modality. Besides, it is recognized that the two possible percepts involved in this bistable logo may have corresponding critical areas of eye fixations that can contribute to the decoding of the image itself. From the analysis of eye fixation durations, there is an area of the bear percept (the bear’s lower right paw) that allows for top-down modulation and enables its interpretation. For the mountain percept, the entire right-hand area of the logo, from top to bottom, which constitutes an essential part of the mountain’s shape, is favorable for the recognition of that percept, presenting a correspondence between the top-down and bottom-up factors considered in the study. The right-hand area of the mountain has a proximity to the bear’s shape. This may be one reason why the top-down modulator operated in reverse, that is, there was a longer duration of eye fixations in that area under the modulating condition supporting the bear percept. The bear’s face area obtained a significant level of attentional salience, but without demonstrating a real top-down modulation effect. This study demonstrates the complexity involved in decoding bistable logos and suggests that designers and researchers in this field should delve deeper into the psychological factors involved in the decoding and interpretation processes of bistable visual stimuli used in the context of brand communication.

Abbreviations

The following abbreviations are used in this manuscript:
AOIs Areas of Interest
S.D Standard Deviation
SPSS Statistical Package for the Social Sciences

References

  1. Rodríguez-Martínez, G. Can ocular fixations modulate the perception of a bistable logo? An eye-tracking study. Gráfica 2025, 13, 103–111. [CrossRef]
  2. Wilson, A. Multistable perception of art-science imagery. Leonardo 2012, 45, 156–164. [CrossRef]
  3. Hsu, Y. H.; Chen, C. C. Eye-movement patterns for perceiving bistable figures. J. Vis. 2025, 25, 3–3. [CrossRef]
  4. Rodríguez-Martínez, G.; Castillo-Parra, H. Bistable perception: Neural bases and usefulness in psychological research. Int. J. Psychol. Res. 2018, 11, 63–76. [CrossRef]
  5. Rodríguez-Martínez, G. Eficacia perceptual y comunicativa de un logotipo biestable. Un estudio experimental basado en tecnología eye-tracking. Rev. Texto Livre 2024, 17, e51494. [CrossRef]
  6. Intaité, M.; Noreika, V.; Šoliūnas, A.; Falter, C. M. Interaction of bottom-up and top-down processes in the perception of ambiguous figures. Vision Res. 2013, 89, 24–31. [CrossRef]
  7. Leopold, D. A.; Logothetis, N. K. Multistable phenomena: changing views in perception. Trends Cognit. Sci. 1999, 3, 254–264. [CrossRef]
  8. Marroquín-Ciendúa, F.; Rodríguez-Martínez, G.; Rodríguez-Celis, H. G. Modulación de la percepción biestable: estudio basado en estimulación multimodal y registros de actividad oculomotora. Tesis Psicol. 2020, 15, 106–124. [CrossRef]
  9. Baker, D. H.; Karapanagiotidis, T.; Coggan, D. D.; Wailes-Newson, K.; Smallwood, J. Brain networks underlying bistable perception. NeuroImage 2015, 119, 229–234. [CrossRef]
  10. Intaité, M.; Kovisto, M.; Castelo-Branco, M. Event-related potential responses to perceptual reversals are modulated by working memory load. Neuropsychologia 2014, 56, 428–438. [CrossRef]
  11. Clément, G.; Demel, M. Perceptual reversal of bi-stable figures in microgravity and hypergravity during parabolic flight. Neurosci. Lett. 2012, 507, 143–146. [CrossRef]
  12. Rodríguez-Martínez, G. Perceptual reversals and creativity: is it possible to develop divergent thinking by modulating bistable perception? Rev. Investig. Desarro. Innov. 2023, 13, 129–144. [CrossRef]
  13. Sandberg, K.; Barnes, G. R.; Bahrami, B.; Kanai, R.; Overgaard, M.; Rees, G. Distinct MEG correlates of conscious experience, perceptual reversals and stabilization during binocular rivalry. Neuroimage 2014, 100, 161–175. [CrossRef]
  14. Muth, C.; Carbon, C. C. SeIns: Semantic instability in art. Art Perception 2016, 4, 145–184. Available online: https://brill.com/view/journals/artp/4/1-2/article-p145_7.xml (accessed on 26 November 2025).
  15. Gamboni, D. Visual ambiguity and interpretation. RES: Anthropol. Aesthet. 2002, 41, 5–15. [CrossRef]
  16. Rodríguez-Martínez, G. Six bottom-up modulating areas are valued by observers after having viewed the Dalinian image ‘The invisible man’: a study based on ocular fixations analyses. Int. J. Environ. Sci. 2025, 11, 1448–1456. Available online: https://theaspd.com/index.php/ijes/article/view/8547 (accessed on 26 November 2025).
  17. Khalil, E. L. Why does Rubin’s vase differ radically from optical illusions? Framing effects contra cognitive illusions. Front. Psychol. 2021, 12, 597758. [CrossRef]
  18. Gale, A.; Findlay, J. Eye-movement patterns in viewing ambiguous figures. In Eye Movements and Psychological Functions: International Views; Groner, R., Menz, C., Fisher, D., Monty, R., Eds.; LEA: Hillsdale, NJ, 1983; pp 145–168. [CrossRef]
  19. Hsiao, J.; Chen, Y.; Spence, C.; Yeh, S. Assessing the effects of audiovisual semantic congruency on the perception of a biestable figure. Consciousness Cogn. 2012, 21, 775–787. [CrossRef]
  20. Rodríguez-Martínez, G.; Castillo-Parra, H.; Rosa, P. J.; Marroquín-Ciendúa, F. Ocular fixations modulate audiovisual semantic congruency when standing in an upright position. Suma Psicol. 2021, 28, 43–51. [CrossRef]
  21. Rodríguez-Martínez, G.; Marroquín-Ciendúa, F.; Rosa, P. J.; Castillo-Parra, H. Perceptual reversals and time-response analyses within the scope of decoding a bistable image. Interdisciplinaria 2022, 39, 257–273. [CrossRef]
  22. Khrennikov, A. Quantum-like model of unconscious–conscious dynamics. Front. Psychol. 2015, 6, 997. [CrossRef]
  23. Zhu, M.; Hardstone, R.; He, B. J. Neural oscillations promoting perceptual stability and perceptual memory during bistable perception. Sci. Rep. 2022, 12, 2760. [CrossRef]
  24. Smith, E.; Grabowecky, M.; Susuki, S. Auditory-visual crossmodal integration in perception of face gender. Curr. Biol. 2007, 17, 1680–1685. [CrossRef]
  25. Rodríguez-Martínez, G. The perception of a visual bistable design can be semantically modulated by an incomprehensible spoken language. New Design Ideas 2025, 9, 5–29. [CrossRef]
  26. Rodríguez-Martínez, G.; Sojo, J. Biestabilidad perceptual en rostros andróginos: análisis del efecto de congruencia semántica considerando registros de actividad oculomotora. Rev. Ibér. Sist. Tecnol. Inf. 2022, E52, 133–147. Available online: https://www.proquest.com/openview/aa17500f4faeaf59d9d99670f1b5d763/1?pq-origsite=gscholar&cbl=1006393 (accessed on 26 November 2025).
  27. Rodríguez, G.; Marroquín-Ciendúa, F. Bottom-up modulation within the scope of consumers’ visual perception: the effect of previous ocular fixations on the perception of bistable logotypes. Int. J. Recent Adv. Multidiscip. Res. 2019, 6, 5129–5135. Available online: https://www.ijramr.com/sites/default/files/issues-pdf/2636.pdf (accessed on 26 November 2025).
  28. Gross, J. Why Toblerone Is Dropping a Famous Swiss Mountain From Its Packaging. The New York Times [Digital Edition], 6 March 2023. Available online: https://link.gale.com/apps/doc/A748995601/AONE?u=anon~9af2c378&sid=googleScholar&xid=29f5d875 (accessed on 26 November 2025).
  29. Goolkasian, P.; Woodberry, C. Priming effects with ambiguous figures. Atten. Percept. Psychophys. 2010, 72, 168–178. [CrossRef]
  30. Meng, M.; Tong, F. Can attention selectively bias bistable perception? Differences between binocular rivalry and ambiguous figures. J. Vis. 2004, 4, 539–551. [CrossRef]
  31. Poom, L. Divergent mechanisms of perceptual reversals in spinning and wobbling structure-from-motion stimuli. PLoS ONE 2024, 19, e0297963. [CrossRef]
  32. García-Pérez, M. Visual inhomogeneity and eye movements in multistable perception. Percept. Psychophys. 1989, 46, 397–400. [CrossRef]
  33. García-Pérez, M. A. Eye movements and perceptual multistability. Adv. Psychol. 1992, 88, 73–109. [CrossRef]
  34. Barrera, M.; Calderón, L. Notes for supporting an epistemological neuropsychology: contributions from three perspectives. Int. J. Psychol. Res. 2013, 6, 107–118. [CrossRef]
  35. Tarder-Stoll, H.; Jayakumar, M.; Dimsdale-Zucker, H. R.; Günseli, E.; Aly, M. Dynamic internal states shape memory retrieval. Neuropsychologia 2020, 138, 107328. [CrossRef]
  36. Kiefer, M. Top-down modulation of unconscious ‘automatic’ processes: A gating framework. Adv. Cognit. Psychol. 2007, 3, 289.
  37. Kesoglou, A. M.; Mikellidou, K. The effect of semantic content on the perception of audiovisual movieclips. bioRxiv 2024, Artículo 2024.01.24.576956. [CrossRef]
  38. Delong, P.; Noppeney, U. Semantic and spatial congruency mould audiovisual integration depending on perceptual awareness. Sci. Rep. 2021, 11, 1–14. [CrossRef]
  39. Gijs, B.; van Ee, R. Endogenous influences on perceptual bistability depend on exogenous stimulus characteristics. Vision Res. 2006, 46, 3393–3402. [CrossRef]
  40. Okazaki, M.; Kaneko, Y.; Yumoto, M.; Arima, K. Perceptual change in response to a bistable picture increases neuromagnetic beta-band activities. Neurosci. Res. 2008, 61, 319–328. [CrossRef]
  41. Tseng, H. Y.; Chuang, H. C. An eye-tracking-based investigation on the principle of closure in logo design. J. Eye Mov. Res. 2024, 17, Artículo e3. [CrossRef]
  42. Mamassian, P.; Goutcher, R. Temporal dynamics in bistable perception. J. Vis. 2025, 5, 361–375. [CrossRef]
  43. Yaman, H.; Yaman, Ş. The effect of gestalt theory on emblem and logo design. Uluslar. Akad. Birikim Derg. 2022, 5. Available online: https://akademikbirikimdergisi.com/index.php/uabd/article/view/63 (accessed on 26 November 2025).
  44. Rach, S.; Huster, R. J. In search of causal mechanisms underlying bistable perception. J. Neurosci. 2014, 34, 689–690. [CrossRef]
  45. De Marchis, G. P.; Reales-Avilés, J. M.; Rivero, M. D. P. Comparative values of variables related to brand logos. Meas. Bus. Excell. 2018, 22, 75–87. [CrossRef]
  46. Dehaghin, R. The Applications and Effects of Gestalt Theory in Logotype design. Humanit. Nat. Sci. J. 2023, 4, 597–610. [CrossRef]
  47. Mohamed, K.; Adiloglu, F. Analyzing the role of gestalt elements and design principles in logo and branding. Int. J. Commun. Media Sci. 2023, 10, 33–43. [CrossRef]
  48. Valderrama, E.; Rodríguez-Martínez, G. Análisis de movimientos oculares durante la observación del logotipo biestable de Toblerone. Expeditiorepositorio.utadeo 2024. Available online: http://hdl.handle.net/20.500.12010/34421 (accessed on 26 November 2025).
  49. Andéhn, M.; Decosta, P. L. The variable nature of country-to-brand association and its impact on the strength of the country-of-origin effect. Int. Mark. Rev. 2015, 33, 852–866. [CrossRef]
  50. Rüdisser, J.; Schirpke, U.; Tappeiner, U. Symbolic entities in the European Alps: Perception and use of a cultural ecosystem service. Ecosyst. Serv. 2019, 39, 100980. [CrossRef]
  51. Calboli, I. Chocolate, Fashion, Toys and Cabs: The Misunderstood Distinctiveness of Non-Traditional Trademarks. IIC 2018, 49, 1–4. [CrossRef]
  52. Burmann, C.; Riley, N. M.; Halaszovich, T.; Schade, M. Identity-Based Brand Management; Springer Gabler: 2023. [CrossRef]
  53. Bucher-Edwards, E.; Grimmer, L.; Grimmer, M. Using Place-of-Origin Branding Strategies to Market Australian Premium-Niche Whisky and Gin Products. J. Int. Food Agribus. Mark. 2023, 35, 135–153. [CrossRef]
  54. Qiu, J.; Wei, D.; Li, H.; Yu, C.; Wang, T.; Zhang, Q. The vase–face illusion seen by the brain: An event-related brain potentials study. Int. J. Psychophysiol. 2009, 74, 69–73. [CrossRef]
  55. Kim, M. J.; Lim, J. H. A comprehensive review on logo literature: research topics, findings, and future directions. J. Mark. Manag. 2019, 35, 1291–1365. [CrossRef]
  56. Kornmeier, J.; Hein, C. M.; Bach, M. Multistable perception: when bottom-up and top-down coincide. Brain Cognit. 2009, 69, 138–147. [CrossRef]
  57. Hartcher-O’Brien, J.; Soto-Faraco, S.; Adam, R. A matter of bottom-up or top-down processes: The role of attention in multisensory integration. Front. Integr. Neurosci. 2017, 11, Artículo 5. [CrossRef]
  58. Rosa, P. What do your eyes say? Bridging eye movements to consumer behavior. Int. J. Psychol. Res. 2015, 8, 90–103. [CrossRef]
  59. Nada, M.; Azmy, A. Eye-tracking in out-of-home advertising: Exploring the application of Gestalt principles. Int. Des. J. 2023, 13, 437–446. [CrossRef]
  60. Raz, A.; Lamar, M.; Buhle, J. T.; Kane, M. J.; Peterson, B. S. Selective biasing of a specific bistable-figure percept involves fMRI signal changes in frontostriatal circuits: A step toward unlocking the neural correlates of top-down control and self-regulation. Am. J. Clin. Hypn. 2007, 50, 137–156. [CrossRef]
  61. Girisken, Y.; Bulut, D. How do consumers perceive a/an logotype/emblem in the advertisements: an eye tracking study. Int. J. Strateg. Innov. Mark. 2014, 1, 198–209. [CrossRef]
  62. Gutiérrez, J.; Rodríguez, G. Generación del concepto creativo publicitario en función del modelo de fases sugerido por Graham Wallas: un estudio cualitativo basado en las teorías asociacionista y gestáltica. Braz. J. Dev. 2020, 6, 1252–1273. [CrossRef]
  63. Puškarević, I.; Nedeljković, U.; Dimovski, V.; Možina, K. An eye tracking study of attention to print advertisements: Effects of typeface figuration. J. Eye Mov. Res. 2016, 9, 1–18. [CrossRef]
  64. Rodríguez-Martínez, G. Impact of Face Inversion on Eye-Tracking Data Quality: A Study Using the Tobii T-120. In Applied Informatics. ICAI 2024. Communications in Computer and Information Science; Florez, H., Astudillo, H., Eds.; Springer: Cham, 2025; Vol. 2237. [CrossRef]
  65. Peschel, A. O.; Orquin, J. L.; Loose, S. M. Increasing consumers’ attention capture and food choice through bottom-up effects. Appetite 2019, 132, 1–7. [CrossRef]
  66. Srivastav, R.; Chowdhury, A. Presence of Gestalt Traits in a Brand Logo and Its Impact in Evoking the Intended Meaning. In Recent Advancements in Product Design and Manufacturing Systems. IPDIMS 2023. Lecture Notes in Mechanical Engineering; Deepak, B. B. V. L., Bahubalendruni, M. R., Parhi, D., Biswal, B. B., Eds.; Springer: Singapore, 2025. [CrossRef]
  67. Cox, D.; Hong, S. W. Semantic-based crossmodal processing during visual suppression. Front. Psychol. 2015, 6, 722. [CrossRef]
  68. Rodríguez, G. The semantic congruency effect into bistable visual perception: a study based on tones of voice as top-down modulating stimuli. Int. J. Recent Adv. Multidiscip. Res. 2019, 6, 5143–5149. Available online: https://mail.ijramr.com/sites/default/files/issues-pdf/2647.pdf (accessed on 26 November 2025).
  69. Spence, C. Crossmodal correspondences: A tutorial review. Atten. Percept. Psychophys. 2011, 73, 971–995. [CrossRef]
  70. Pálffy, Z.; Farkas, K.; Csukly, G.; Kéri, S.; Polner, B. Cross-modal auditory priors drive the perception of bistable visual stimuli with reliable differences between individuals. Sci. Rep. 2021, 11, 16943. [CrossRef]
  71. Spence, C.; Squire, S. Multisensory integration: Maintaining the perception of synchrony. Curr. Biol. 2003, 13, R519–R521. [CrossRef]
  72. Bernal-Robayo, S. D. Análisis de fijaciones oculares durante la observación e interpretación de logotipos biestables: un estudio basado en técnicas de autoreporte. M. S. Thesis, Universidad de Bogotá Jorge Tadeo Lozano, Bogotá, 2020. Available online: https://expeditiorepositorio.utadeo.edu.co/handle/20.500.12010/16382 (accessed on 26 November 2025).
  73. Holmqvist, K.; Nyström, M.; Andersson, R.; Dewhurst, R.; Jarodzka, H.; van De Weijer, J. Eye-Tracking. A Comprehensive Guide to Methods and Measures; Oxford University Press: Oxford, United Kingdom, 2011.
  74. Arango, C. A.; Rodríguez-Martínez, G.; Marroquín-Ciendúa, F. La contaminación visual en Bogotá: análisis de cargas visuales en localidades con alta estimulación publicitaria. Rev. Investig. Desarro. Innov. 2021, 11, 373–386. [CrossRef]
  75. Abed, F. Cultural influences on visual scanning patterns. J. Cross-Cult. Psychol. 1991, 22, 525–534. [CrossRef]
  76. Cao, T.; Wang, L.; Sun, Z.; Engel, S. A.; He, S. The independent and shared mechanisms of intrinsic brain dynamics: Insights from bistable perception. Front. Psychol. 2018, 9, Artículo 589. [CrossRef]
  77. Nuthmann, A. How do the regions of the visual field contribute to object search in real-world scenes? Evidence from eye movements. J. Exp. Psychol.: Hum. Percept. Perform. 2014, 40, 342. [CrossRef]
  78. Wang, M.; Arteaga, D.; He, B. J. Brain mechanisms for simple perception and bistable perception. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, E3350–E3359. [CrossRef]
  79. Gómez-Mejía, M.; Rodríguez-Martínez, G. Analyzing emotional and attentional responses to promotional images using a remote eye-tracker device and face-reading techniques. In Applied Informatics. ICAI 2024. Communications in Computer and Information Science; Florez, H., Astudillo, H., Eds.; Springer: Cham, 2025; Vol. 2237, pp 44–54. [CrossRef]
Figure 1. Examples of bistable images. On the left, the Rubin’s vase (in figure-ground reversals bistable image). In the middle, the Necker’s cube, an in-perspective reversals ambiguous image. On the right, My-girlfriend or my-mother-in-law, an in-meaning content reversals image. Source: adapted from Rodríguez-Martínez and Castillo-Parra [4].
Figure 1. Examples of bistable images. On the left, the Rubin’s vase (in figure-ground reversals bistable image). In the middle, the Necker’s cube, an in-perspective reversals ambiguous image. On the right, My-girlfriend or my-mother-in-law, an in-meaning content reversals image. Source: adapted from Rodríguez-Martínez and Castillo-Parra [4].
Preprints 187284 g001
Figure 2. The bistable logo of Toblerone® used for the study. On the left, the outlined, unfilled version of the logo that was used. On the right, the seven clusters or AOIs defined as bottom-up perceptual modulating areas. Zones A1, A4, A5, A6, and A7 were defined as bottom-up modulating areas for the ‘Bear’ percept, following the previous study conducted by Valderrama and Rodríguez-Martínez [48]. A2 and A3 areas (regarding the perception of the ‘Mountain’ percept) were also defined following the mentioned study. Source: adapted from Valderrama and Rodríguez-Martínez [48].
Figure 2. The bistable logo of Toblerone® used for the study. On the left, the outlined, unfilled version of the logo that was used. On the right, the seven clusters or AOIs defined as bottom-up perceptual modulating areas. Zones A1, A4, A5, A6, and A7 were defined as bottom-up modulating areas for the ‘Bear’ percept, following the previous study conducted by Valderrama and Rodríguez-Martínez [48]. A2 and A3 areas (regarding the perception of the ‘Mountain’ percept) were also defined following the mentioned study. Source: adapted from Valderrama and Rodríguez-Martínez [48].
Preprints 187284 g002
Figure 3. Two eye-movement maps concerning the totality of ocular data captured. On the left, an opacity map in which the brighter the area, the more it was viewed. On the right, a heat map. Areas depicted in red are the most gazed [73]. Source: own design.
Figure 3. Two eye-movement maps concerning the totality of ocular data captured. On the left, an opacity map in which the brighter the area, the more it was viewed. On the right, a heat map. Areas depicted in red are the most gazed [73]. Source: own design.
Preprints 187284 g003
Figure 4. Comparisons of the average durations of eye fixations made in each AOI, taking into account the two top-down modulation conditions. Source: own design.
Figure 4. Comparisons of the average durations of eye fixations made in each AOI, taking into account the two top-down modulation conditions. Source: own design.
Preprints 187284 g004
Table 1. Statistical values concerning durations of ocular fixations made on the AOIs.
Table 1. Statistical values concerning durations of ocular fixations made on the AOIs.
AOIs Statistical data Text Mountain Text Bear p-value
A1 Mean 2.179411765 2.453235294 0.326572285
A1 Standard deviation 1.191494964 1.349892747
A1 Variance 1.41966025 1.822210428
A2 Mean 1.379411765 0.701470588 0.002591325
A2 Standard deviation 1.192776088 0.515352967
A2 Variance 1.422714795 0.265588681
A3 Mean 1.976470588 2.522352941 0.038637319
A3 Standard deviation 1.272444277 1.116425733
A3 Variance 1.619114439 1.246406417
A4 Mean 0.308823529 0.327647059 0.798612352
A4 Standard deviation 0.283653274 0.272130303
A4 Variance 0.08045918 0.074054902
A5 Mean 0.098823529 0.232352941 0.018304235
A5 Standard deviation 0.191513346 0.249497177
A5 Variance 0.036677362 0.062248841
A6 Mean 0.694705882 0.666470588 0.738765146
A6 Standard deviation 0.462576679 0.430615362
A6 Variance 0.213977184 0.18542959
A7 Mean 0.168235294 0.094117647 0.402850046
A7 Standard deviation 0.481603827 0.167804317
A7 Variance 0.231942246 0.028158289
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated