
Technology to Automatically Record Eating Behavior in Real Life: A Systematic Review

This version is not peer-reviewed.


18 July 2023


20 July 2023

To monitor adherence to diets and to design and evaluate nutritional interventions it is essential to obtain objective knowledge about eating behavior. In most research, measures of eating behavior are based on self-reporting, such as 24-h recalls, food records (food diaries), and food frequency questionnaires. Self-reporting is prone to inaccuracies due to inaccurate and subjective recall and other biases. Recording behavior using non-obtrusive technology in daily life would overcome this. We here provide an up-to-date systematic overview encompassing all (close-to) publicly or commercially available technologies to automatically record eating behavior in real-life settings. 1328 studies were screened and after applying defined inclusion and exclusion criteria, 122 studies were included for in-depth evaluation. Technologies in these studies were categorized by what type of eating behavior they measure and which type of sensor technology they use. In general, we found that relatively simple sensors are often used. Depending on the purpose, these are mainly motion sensors, microphones, weight sensors, and photo cameras. While several of these technologies are commercially available, there is still a lack of publicly available algorithms that are needed to process and interpret the resulting data. We argue that future work should focus on developing robust algorithms and validating these technologies in real-life settings. Combining technologies (e.g., prompting individuals for self-reports at sensed, opportune moments) is a promising route toward ecologically valid studies of eating behavior.
1. Introduction

As stated by the World Health Organization (WHO) “Nutrition is coming to the fore as a major modifiable determinant of chronic disease, with scientific evidence increasingly supporting the view that alterations in diet have strong effects, both positive and negative, on health throughout life” [1]. It is therefore of key importance to find efficient and solid methodologies to study eating behavior and food intake in order to help reduce potential long-term health problems caused by unhealthy diets. Past research on eating behaviors and attitudes relies intensively on self-reporting tools, such as 24-h recalls, food records (food diaries), and food frequency questionnaires (FFQ; [2,3,4]). However, there is an increasing understanding of the limitations of this classical approach to studying eating behaviors and attitudes. One of the major limitations of this approach is that self-reporting tools rely on participants’ recall, which may be inaccurate or biased (especially when studying the actual amount of food or liquid intake [5]). Recall biases can be caused by demand characteristics, which are cues that may indicate the study aims to participants, leading them to change their behaviors or responses based on what they think the research is about [6], or more generally by the desire to comply with social norms and expectations when it comes to food intake [7,8]. Additionally, the majority of the studies investigating eating behavior are performed in the lab, which does not allow for a realistic replication of the many influences on eating behavior that occur in real-life (e.g., [9]). Hence, to overcome these limitations, it is crucial to examine eating behavior and the effect of interventions in daily life, at home, or at institutions such as schools and hospitals. In contrast to lab research settings, humans typically behave naturally in these settings. 
It is also important that testing real-life eating behaviors in naturalistic settings relies on implicit, non-obtrusive measures [10] which are objective and able to overcome potential biases.
There is a growing interest in identifying technologies able to improve the quality and validity of data collected to advance nutrition science. Such technologies should enable measuring eating behavior passively (i.e., without requiring action or mental effort on the part of the users), objectively, and reliably in realistic contexts. Importantly, to maximize the efficiency of real-life measurement, it is vital to develop technologies that capture eating behavior patterns in a low-cost, unobtrusive, and easy-to-analyze way. For real-world practicality, the technologies should be comfortable and acceptable, so that they can be used in naturalistic settings for extended periods while respecting the users’ privacy.
To gain insight into the state of the art in this field, we performed a search for published papers using technologies to measure eating behavior in real-life settings. In addition to papers describing specific systems and technologies, this search returned many review papers, some of which contained systematic reviews.
Evaluating these systematic reviews, we found that an up-to-date overview encompassing all (close-to) available technologies to automatically record eating behavior in real-life settings is still missing. To fill this gap, we here provide such an overview, categorized by what type of eating behavior they measure and which type of sensor technology they use. We indicate to what extent these technologies are readily available for use. With this review, we aim to (1) help researchers identify the most suitable technology to measure eating behavior in real-life, and to provide a basis for determining next steps in (2) research on measuring eating behavior in real-life and (3) technology development.

2. Methods and Procedure

Literature Review

Our literature search reporting adheres to the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) checklist [11,12]. The PRISMA guidelines ensure that the literature is reviewed in a standard and systematic manner. This process comprises four phases: identification, screening, eligibility, and inclusion. The PRISMA diagram showing the search flow and inclusion/exclusion of records and reports in this study is shown in Figure 1.

Eligibility Criteria

Our literature search aimed to identify mature and practical (i.e., not too complex or restrictive) technologies that can be used to unobtrusively assess food or drink intake in real-life conditions. The inclusion criterion was a sensor-based approach to the detection of eating or drinking. Studies not describing a sensor-based device to detect eating or drinking were excluded.

Information Sources and Search

The literature search included two stages: an initial automatic search of online databases and a manual search based on the reference lists from the papers selected from the previous search stage (using a snowballing approach [13,14]).
The initial search was conducted on 24 February 2023 across the ACM Digital Library, Google Scholar (first 100 results), IEEE Xplore, MDPI, PubMed, and Scopus (Elsevier) databases. The results were date-limited from 2018 to date (i.e., 24 February 2023). As this field is rapidly advancing and sensor-based technologies evaluated in earlier papers are likely to have been further developed, we initially limited our search to the past 5 years.
Our broad search strategy was to identify papers that included terms in their title, abstract, or keywords related to food and drink, eating, or drinking activities, and the assessment of the amount consumed.
The search was performed using equivalents of the following Boolean search string (where * represents a wildcard): “(beverage OR drink OR food OR meal) AND (consum* OR chew* OR eating OR ingest* OR intake OR swallow*) AND (portion OR serving OR size OR volume OR mass OR weigh*) AND (assess OR detect OR monitor OR measur*)”. Some search terms were truncated in an effort to include all variations of the word.
The records retrieved in the initial search across the six databases and the results of manual bibliography searches were imported into EndNote 20 (Clarivate), and duplicates were removed.
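The structure of the Boolean search string above (four OR-groups joined by AND, with * as a truncation wildcard) can be illustrated with a small sketch. This is not the query engine of any of the databases searched; the function names `term_to_regex` and `matches_query` are hypothetical, and the choice to match untruncated terms as whole words only is an assumption, since each database applies its own stemming rules.

```python
import re

# One inner list per parenthesized OR-group of the search string.
GROUPS = [
    ["beverage", "drink", "food", "meal"],
    ["consum*", "chew*", "eating", "ingest*", "intake", "swallow*"],
    ["portion", "serving", "size", "volume", "mass", "weigh*"],
    ["assess", "detect", "monitor", "measur*"],
]


def term_to_regex(term: str) -> str:
    """Translate one search term into a regex fragment.

    A trailing * becomes "any word characters"; other terms are
    matched as whole words (an assumption, see the text above).
    """
    if term.endswith("*"):
        return r"\b" + re.escape(term[:-1]) + r"\w*"
    return r"\b" + re.escape(term) + r"\b"


# One compiled alternation per OR-group.
PATTERNS = [
    re.compile("|".join(term_to_regex(t) for t in group), re.IGNORECASE)
    for group in GROUPS
]


def matches_query(text: str) -> bool:
    """True if every OR-group has at least one hit in the text (AND)."""
    return all(p.search(text) for p in PATTERNS)


if __name__ == "__main__":
    # Hypothetical titles, used only to exercise the matcher.
    print(matches_query("A smartwatch to monitor food intake and measure portion size"))
    print(matches_query("Monitoring sleep with a smartwatch"))
```

A record is retained only if all four groups are represented, which is why the query favours papers that mention a food or drink, an intake action, a quantity, and a measurement verb together.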

Screening Strategy

Figure 1 presents an overview of the screening strategy. In the first round, the titles and abstracts returned (n = 1241, after the elimination of 68 duplicates) were reviewed against the eligibility criterion. If the title and/or abstract mentioned a sensor-based approach to the detection of eating or drinking, the paper was included in the initial screening stage to be further assessed in the full-text screening stage. Papers that did not describe a sensor-based device to detect eating or drinking were excluded. Full-text screening was conducted on the remaining articles (n = 126), leading to a final sample of 73 included papers from the initial automatic search. Papers focusing on animal studies (n = 4), food recognition (n = 7), nutrient estimation (n = 6), system design (n = 2), or other non-related topics (n = 10) were excluded. While review papers (n = 20) were also excluded from our technology overview (Table 1), they were evaluated (Table A1 and Table A2 in Appendix A) and used to define the scope of this study. Additional papers were identified by manual search via the reference lists of full texts that were screened (n = 87). Full-text screening of these additional papers led to a final sample of 49 included papers from the manual search. Papers about dietary recall (n = 7), food recognition (n = 11), nutrient estimation (n = 7), describing systems already described in papers from the initial automatic search (n = 2), or other non-related topics (n = 6) were excluded. Again, review papers (n = 5) were excluded from our technology overview (Table 1) but evaluated and used to define the scope of this study. The total number of papers included in this review amounts to 122.
All screening rounds were conducted by two of the authors. Each record was reviewed by two reviewers to decide its eligibility based on the title and abstract of each study, taking into consideration the exclusion criteria. When a record was rejected by one reviewer and accepted by the other, it was further evaluated by all authors and kept for eligibility when a majority voted in favor.


We evaluated and summarized the review papers that our search returned in two tables (Appendix A). Table A1 includes systematic reviews, while Table A2 includes non-systematic reviews. We defined systematic reviews as reviews following the PRISMA methodology. For all reviews we reported the year of publication and the general scope of the review. For systematic reviews we also reported the years of inclusion, the number of papers included, and the specific requirements for inclusion.
We summarized the core information about the devices and technologies for measuring eating and drinking behaviors from our search results in Table 1. This table categorizes the studies retrieved in our literature search in terms of their measurement objectives, target measures, the devices and algorithms that were used as well as their (commercial or public) availability, and the way they were applied (method). In the column ‘Objective’, the purposes of the measurements are described. The three objectives we distinguish are ‘Eating/drinking activity detection’, ‘Bite/chewing/swallowing detection’ and ‘Portion size estimation’. Note that the second and third objectives can be considered as subcategories of the first – technologies are included in the first if they could not be grouped under the second or third objectives. The objectives are further specified in the column ‘Measurement targets’. In the column ‘Device’, we itemize the measurement tools or sensors used in the different systems. For each type of device, one or more representative papers were selected, bearing in mind the TRL (Technology Readiness Level [15]), the availability (off-the-shelf) of the device and algorithm that were used, the year of publication (recent), and the number of times it was cited. The minimum TRL level was 2, and the paper with the highest TRL level among papers using similar techniques was selected as the representative paper. A concise description of each representative example is given in the column ‘Method’. The commercial availability of the example devices and algorithms is indicated in the columns ‘Off-the-shelf device’ and ‘Ready-to-use algorithm’. Lastly, other studies using similar systems are listed in the column ‘Similar papers’. Systems combining devices for several different measurement targets can appear in different table rows. To indicate this, they are labeled with successive letters for each row they appear in (e.g., 1a and 1b).
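The selection rule for representative papers described above (a minimum TRL of 2, then the highest TRL among papers using similar techniques) can be sketched as follows. The `Paper` record, the function name, and the example entries are hypothetical. The text says only that year of publication, availability, and citation count were borne in mind, so using recency and then citation count as tie-breakers is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, Iterable

MIN_TRL = 2  # minimum Technology Readiness Level stated in the text


@dataclass
class Paper:
    ref: str        # citation label (hypothetical here)
    device: str     # device type used to group similar techniques
    trl: int        # Technology Readiness Level
    year: int       # year of publication
    citations: int  # number of times cited


def pick_representatives(papers: Iterable[Paper]) -> Dict[str, Paper]:
    """Pick one representative paper per device type.

    Papers below MIN_TRL are skipped. Among the rest, the highest TRL
    wins; ties are broken by year, then by citation count (assumed order).
    """
    best: Dict[str, Paper] = {}
    for p in papers:
        if p.trl < MIN_TRL:
            continue
        current = best.get(p.device)
        if current is None or (p.trl, p.year, p.citations) > (
            current.trl, current.year, current.citations
        ):
            best[p.device] = p
    return best


if __name__ == "__main__":
    # Made-up entries for illustration only; not taken from Table 1.
    candidates = [
        Paper("A", "motion sensor", 4, 2019, 50),
        Paper("B", "motion sensor", 6, 2018, 10),
        Paper("C", "microphone", 1, 2022, 5),
        Paper("D", "microphone", 3, 2020, 7),
        Paper("E", "microphone", 3, 2021, 2),
    ]
    for device, paper in pick_representatives(candidates).items():
        print(device, "->", paper.ref)
```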
For each of the three objectives we counted the number of papers that described sensors that are designed (1) to be attached to the body, (2) to be attached to an object, (3) to be placed in the environment, or (4) to be held in the hand. Sensors attached to the body were further subdivided by body location. The results are visualized using bar graphs.
Figure 1. PRISMA flow diagram describing the different phases of the procedure used to identify tools for food and drink intake assessment.

3. Results

Table 1 summarizes the core information of devices and technologies for measuring eating and drinking behaviors from our search results.

Eating and Drinking-Activity Detection

For ‘eating/drinking activity detection’, many systems have been reported that measure eating and drinking-related motions. In particular, many papers reported measuring these actions using motion sensors such as inertial sensor modules (i.e., Inertial Measurement Units or IMUs). IMUs typically consist of various sensors such as an accelerometer, gyroscope, and magnetometer. Those sensors are embedded in smartphones and wearable devices such as smartwatches. In [16] researchers collected IMU signals with off-the-shelf smartwatches to identify hand-based eating and drinking-related activities. In this case, participants wore smartwatches on their preferred wrists. Other studies have employed IMUs worn on the wrist, head, neck, and combinations thereof [17,18,19,20]. Besides IMUs, proximity sensors, piezoelectric sensors, and radar sensors are also used to detect hand-mouth gestures or jawbone movements [21,22,23]. Pressure sensors are used to measure eating activity as well. For instance, in [24] eating activities and the amount of consumed food are measured by a pressure-sensitive tablecloth and tray. These devices provide information on food intake-related actions, such as cutting, scooping, stirring, the identification of the plate or container on which the action is executed, and allow the tracking of weight changes of plates and containers. Microphones, RGB-D images, and video cameras are also used to detect eating and drinking-related motions. In [25], eating actions are detected by a ready-to-use algorithm as the 3D overlap of the mouth and food, using RGB-D images taken with a commercially available smartphone. Ear-worn sensors can measure in-body glucose levels [26] and tooth-mounted dielectric sensors can measure impedance changes in the mouth signaling the presence of food [27]. Although these latter methods can directly detect eating activity, the associated devices and data processing algorithms are still in the research phase.

Bite, Chewing, or Swallowing Detection

In the category of ‘bite/chewing/swallowing detection’, we grouped studies in which the number of bites (bite count), bite weight, and chewing or swallowing actions are measured. Motion sensors and video are used to detect and count bites. For instance, OpenPose, an off-the-shelf pose estimation software package, has been used to count bites from videos [28]. To assess bite weight, weight sensors and acoustic sensors have been used [29,30].
Chewing or swallowing is the most well-studied eating and drinking-related activity, as reflected by the number of papers focusing on such activities (31 papers). Motion sensors and microphones are typically employed for this purpose. For instance, in [31], a gyroscope is used for chewing detection, an accelerometer for swallowing detection, and a proximity sensor to detect hand-to-mouth gestures. Microphones are typically used to register chewing and swallowing sounds. In most cases, commercially available microphones are applied, while the applied detection algorithms are custom-made. Video, electroglottograph (EGG), and electromyography (EMG) devices are also used to detect chewing and swallowing. EGG detects the variations in the electrical impedance caused by the passage of food during swallowing, while EMG in these studies monitors the masseter and temporalis muscle activation for recording chewing strokes.

Portion Size Estimation

Portion size is estimated mainly by using weight sensors and food image analysis. Regarding weight sensors, the amount of food consumed is calculated by comparing the weights of plates before and after eating. An open-source system consisting of a wireless pocket-sized kitchen scale connected to a mobile application has been reported in [32]. A system turning an everyday smartphone into a weighing scale is also available [33]. The relative vibration intensity of the smartphone’s vibration motor and its built-in accelerometer are used to estimate the weight of food that is placed on the smartphone (Figure 2). Off-the-shelf smartphone cameras are typically used for volume estimation from food images. Also, several studies use RGB-D images to get more accurate volume estimations from information on the height of the target food. For image-based approaches, AI-based algorithms are often employed to calculate portion size. Some studies made prototype systems applicable to real-life situations. In [34], acoustic data from a microphone was collected along with food images to measure the distance from the camera to the food. This enables scaling the size of food in the image to its actual size without training images and reference objects. However, in other cases, image processing mostly needs a reference for comparing the food size. Besides image analysis, in [35], researchers took a 360-degree scanned video obtained with a laser module and a diffraction lens and applied their volume estimation algorithm to the data. In addition to the above devices, a method to estimate portion size using EMG has been reported [36]. In this study, EMG embedded in an armband device detects different patterns of signals based on the weight which a user is holding.
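The weight-based approach mentioned above, comparing the weight of a plate before and after eating, amounts to simple arithmetic. The sketch below makes it explicit. The function name, the optional empty-plate weight, and the example values are hypothetical and not taken from any of the reviewed systems.

```python
from typing import Tuple


def portion_consumed(before_g: float, after_g: float,
                     plate_g: float = 0.0) -> Tuple[float, float]:
    """Return (grams consumed, fraction of the served portion consumed).

    before_g: weight of plate plus food before the meal
    after_g:  weight of plate plus leftovers after the meal
    plate_g:  weight of the empty plate (only needed for the fraction)
    """
    served = before_g - plate_g
    leftover = after_g - plate_g
    if served <= 0 or leftover < 0 or leftover > served:
        raise ValueError("inconsistent weights: check plate, before, and after values")
    consumed = served - leftover
    return consumed, consumed / served


if __name__ == "__main__":
    # Made-up readings: 350 g plate, 650 g before, 410 g after the meal.
    grams, fraction = portion_consumed(650.0, 410.0, plate_g=350.0)
    print(f"consumed {grams:.0f} g ({fraction:.0%} of the served portion)")
```

The plate weight cancels out of the consumed amount, so it matters only when the result is expressed relative to the portion served.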
For estimating portion size in drinks, several kinds of sensors have been tested. An IMU in a smartwatch was used to estimate drink intake volume from sip duration [37]. Also, in [38], liquid sensors such as a capacitive sensor and a conductivity sensor were used to monitor the filling levels in a cup. Some research groups developed so-called smart fridges that automatically register food items and quantities. In [39], image analysis of a thermal image taken by an infrared (IR) sensor embedded in a fridge provides an estimation of a drink volume. Another study proposed a system called the Playful Bottle system [40] which consists of a smartphone attached to a drinking mug. Drinking motions such as picking up the mug, tilting it back, and placing it on the desk are detected by the phone’s accelerometer. After the drinking action is completed and the water line becomes steady, the phone’s camera captures the image of the amount of liquid in the mug (Figure 3).

Sensor Location

Figure 4 indicates where sensors are typically located per objective. The locations of sensors are classified as body-attached (e.g., ear, neck, head, glasses), embedded in objects (e.g., plates, cutlery), and in the environment (e.g., distant camera, magnetic trackers). For eating/drinking activity detection, sensors are most often worn on the body, followed by sensors embedded in objects. Body-worn sensors are also used for bite/chewing/swallowing detection. On the other hand, for portion size estimation, object-embedded and handheld sensors are mainly chosen, depending on the measurement targets. Figure 5 shows the locations of wearable body sensors used in the reviewed studies. Sensors attached to the wrist are most frequently used (32 cases), followed by sensors embedded in glasses (19 cases) and sensors attached to the ear (14 cases).
Table 1. Summary of core information of devices and technologies for measuring eating and drinking behaviors.
| Objective | Measurement target | Device | Representative paper | Method | Off-the-shelf device | Ready-to-use algorithm | Similar papers |
| Eating/drinking activity detection | eating/drinking motion | motion sensor | [16] | eating and drinking detection from smartwatch IMU signal | Y | N | [41]a, [42]a, [43,44], [45], [46]a, [40]a, [21,47], [48]a, [22,49,50], [51]a, [37]a, [52]a, [17,18,53,54], [36]a, [55], [19]a, [56,57,58], [59]a, [60], [61]a, [62], [20]a, [63]a, [64,65,66,67] |
| | | | [23] | detecting eating and drinking gestures from FMCW radar signal | N | N | |
| | | | [24]a | eating activities and amount consumed measured by pressure-sensitive tablecloth and tray | N | N | |
| | | microphone | [46]b | eating detection from fused inertial-acoustic sensing using smartwatch with embedded IMU and microphone | Y | N | [26]a, [68], [59]b |
| | | RGB-D image | [25]a | eating action detected from smartphone RGB-D image as 3D overlap between mouth and food | Y | Y | |
| | | video | [69] | eating detection from cap-mounted video camera | Y | N | [54]a |
| | liquid level | liquid sensor | [70] | capacitive liquid level sensor | N | N | [71] |
| | in-body glucose level | glucose sensor | [26]b | glucose level measured by ear-worn sensor | N | N | |
| | impedance change in mouth | dielectric sensor | [27] | RF coupled tooth-mounted dielectric sensor measures impedance changes due to food in mouth | N | N | |
| | user identification | PPG (photoplethysmography) sensor | [52]b | identify the user from heart rate | N | N | |
| Bite/chewing/swallowing detection | bites (count) | motion sensor | [72] | a gyroscope mounted on a finger to detect motions of picking up food and delivering it to the mouth | Y | N | [73,74] |
| | | video | [28] | bite count by video analysis using OpenPose pose estimation software | Y | Y | |
| | bite weight | weight sensor | [29]a | plate-type base station with embedded weight sensors to measure amount and location of bites | N | N | [54]a |
| | | acoustic sensor | [30] | commercial ear buds, estimation model based on nonaudio and audio features | Y | N | |
| | chewing/swallowing | motion sensor | [31]a | chewing detection from gyroscope, swallowing detection from accelerometer, hand-to-mouth gestures from proximity sensor | Y | Y | [75,76,77,78], [48]b, [79,80], [81]a, [82], [83]a, [84,85,86], [61]b, [20]b |
| | | microphone | [87] | wearable microphone with minicomputer to detect chewing/swallowing sounds | Y | N | [42]b, [88,89,90], [81]b, [19]b, [91], [83]b, [92] |
| | | video | [93] | classification of facial action units from video | Y | N | [54]b |
| | | EGG | [94] | swallowing detected by larynx-mounted EGG device | Y | N | |
| | | EMG | [95] | eyeglasses equipped with EMG to monitor temporalis muscles' activity | N | N | [42]c, [96] |
| Portion size estimation | portion size food | motion sensor | [33] | acceleration sensor of smartphone, measuring vibration intensity | Y | Y | [97]a |
| | | weight sensor | [32] | wireless pocket-sized kitchen scale connected to app | Y | Y | [98,99,100,101,102], [54]b, [103], [29]b, [97]b, [104], [105]a, [106], [63]b, [24]b |
| | | image | [107] | AI-based system to calculate food leftovers | Y | Y | [31]b, [34,108,109,110,111,112,113,114,115], [116]b, [105]b, [105,117,118,119,120,121,122] |
| | | | [34] | measuring the distance from the camera to the food using smartphone images combined with microphone data | Y | N | [123] |
| | | | [124] | RGB-D image and AI-based system to estimate consumed food volume using before and after-meal images | Y | Y | [25]b, [125,126,127,128,129,130,131] |
| | | laser | [35] | 360-degree scanned video; the system design includes a volume estimation algorithm and a hardware add-on that consists of a laser module and a diffraction lens | N | N | [132] |
| | | EMG | [36]b | weight of food consumed from EMG data | N | N | |
| | portion size drink | motion sensor | [37]b | volume from sip duration from IMU in smartwatch | Y | N | [41]b, [51]b |
| | | infrared (IR) sensor | [39] | thermal image by IR sensor embedded in smart fridge | N | N | |
| | | liquid sensor | [38] | capacitive sensor, conductivity sensor, flow sensor, pressure sensor, force sensors embedded in different mug prototypes | N | N | |
| | | image | [40]b | smartphone camera attached to mug | N | N | [133] |

4. Discussion

This systematic review provides an up-to-date overview of all (close-to) available technologies to automatically record eating behavior in real-life. Technologies included in this review should enable measuring eating behavior passively (i.e., without users’ active input), objectively, and reliably in realistic contexts, to avoid relying on subjective user recall. We performed our review in order to help researchers identify the most suitable technology to measure eating behavior in real-life settings, and to provide a basis for determining next steps in both technology development and measuring eating behavior in real-life. 1328 studies were screened, and 122 studies were included after application of objective inclusion and exclusion criteria. 25 studies contained more than one technology. We found that often, relatively simple sensors are used to measure eating behaviors. Motion sensors are commonly used for eating/drinking activity detection and bite/chewing/swallowing; in addition, microphones are often used in studies focusing on chewing/swallowing. These sensors are usually attached to the body, in particular to the wrist for eating/drinking activity detection and to areas close to the face for detecting bite/chewing/swallowing. For portion size estimation, weight sensors and images from photo cameras are mostly used.
Concerning next steps in technology development, the information from the columns ‘Off-the-shelf device’ and ‘Ready-to-use algorithm’ in the technology overview table indicates which devices and algorithms are not ready for use yet and would benefit from further development. The category ‘portion size estimation’ seems most mature with respect to off-the-shelf availability and ready-to-use algorithms. Overall, what is mostly missing is ready-to-use algorithms. It is an enormous challenge to build fixed algorithms that accurately recognize eating behavior under varying conditions of sensor noise, types of food and individuals’ behavior and appearance. Typically, with algorithms we refer to machine learning or AI algorithms. These are trained using annotated (correctly labelled) data, and only work well in conditions that are similar to the ones they were trained in. In most reviewed studies, demonstrations of algorithms are limited to controlled conditions and a small number of participants. Therefore, these algorithms still need to be tested and evaluated for accuracy and generalizability outside the laboratory, such as in homes, restaurants, and hospitals.
When it comes to real-life studies, the obtrusiveness of the devices is an important factor. Devices should minimally interfere with the natural behavior of participants. Devices worn on the body with wires connected to a battery or other devices may restrict eating motions and constantly remind participants that they are being recorded. Wireless devices are suitable in that perspective, but at the same time, battery duration may be a limitation for long-term studies. Devices such as tray-embedded sensors and cameras that are not attached to the participant’s body are advantageous in both obtrusiveness and battery duration.
Although video cameras can provide holistic data on participants’ eating behaviors, they present privacy concerns. When a camera is used to film the course of a meal, the data reveal the participant’s physical characteristics and enable the identification of the participant. Also, when the experiments are done at home, participants cannot avoid showing their private environment. Ideally, the experiments should allow collecting data anonymously unless identifiable data are needed for a certain purpose such as clinical data collection. This could be possible by only storing extracted features from the camera data rather than the images themselves, though this prohibits later validation and improvement of feature extraction [134]. Systems using weight sensors do not raise the privacy issues that camera images of the face do. For example, [105] used a weight sensor in combination with a camera pointing downward at the scales to keep track of the consumption of various types of seasonings.
For future research, we think it will be powerful to combine methods and sensor technologies. While most studies rely on single types of technologies, there are successful examples of combinations that illustrate a number of ways in which system and data quality can be improved. For instance, a novel and robust device called SnackBox [135] consists of three containers for different types of snacks embedded on weight sensors (Figure 6) and can be used to monitor snacking behavior at home. It can be connected to wearables and smartphones, thereby allowing for contextualized interpretation of signals recorded from the participant and for targeted Ecological Momentary Assessment (EMA [136]). With EMA individuals are probed to report current behavior and experiences in their natural environment and avoid relying on memory. For instance, when the SnackBox detects snacking behavior, EMA can assess the individual’s current mood state through a short questionnaire. This affords the collection of more detailed and more accurate information compared to asking for this information at a later moment in time. Combining different sensor technologies can also have other benefits. Some studies used a motion detector or an audio sensor as switches to turn on other devices such as a chewing counter or a camera [61,90]. These systems are useful to collect data only during meal durations, therewith limiting superfluous data collection which is undesirable from the point of view of privacy and of battery life of the devices that are worn the whole day. In a study imitating a restaurant setting, a system consisting of custom-made table-embedded weight sensors and passive RFID (Radio-frequency identification) antennas was used [98]. This system detects the weight change in the food served on the table and recognizes what the food is by RFID tags therewith complementing information that would have been obtained by using either sensor alone, and facilitating interpretation of data. 
Other studies used an IMU in combination with a microphone to detect eating behaviors [59,81]. It was concluded that the acoustic sensor in combination with motion-related sensors improved the detection accuracy significantly compared to motion-related sensors alone.
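The idea of letting one sensor trigger another data source, such as a weight sensor prompting an EMA questionnaire when snacking is detected, can be sketched in a few lines. This is a minimal illustration and not the algorithm of SnackBox or of any reviewed system. The function names, the 5 g drop threshold, and the 30-minute cooldown between prompts are all assumptions chosen for the example.

```python
from typing import Iterable, List, Tuple


def detect_snack_events(samples: Iterable[Tuple[float, float]],
                        min_drop_g: float = 5.0) -> List[Tuple[float, float]]:
    """Find moments where the container weight drops between readings.

    samples: (timestamp in seconds, weight in grams), in time order.
    Returns (timestamp, grams removed) for each drop of at least
    min_drop_g; smaller changes are treated as sensor noise.
    """
    events = []
    previous = None
    for timestamp, weight in samples:
        if previous is not None and previous - weight >= min_drop_g:
            events.append((timestamp, previous - weight))
        previous = weight
    return events


def ema_prompt_times(events: Iterable[Tuple[float, float]],
                     cooldown_s: float = 1800.0) -> List[float]:
    """Timestamps at which to send a questionnaire.

    At most one prompt is sent per cooldown window, so that a single
    snacking episode does not trigger repeated prompts.
    """
    prompts: List[float] = []
    for timestamp, _ in events:
        if not prompts or timestamp - prompts[-1] >= cooldown_s:
            prompts.append(timestamp)
    return prompts


if __name__ == "__main__":
    # Made-up readings from one snack container.
    readings = [(0, 200.0), (60, 200.0), (120, 185.0),
                (180, 184.0), (240, 170.0), (4000, 150.0)]
    events = detect_snack_events(readings)
    print(events)
    print(ema_prompt_times(events))
```

The same gating pattern applies when a motion or audio sensor is used as a switch for a camera or chewing counter: the low-power sensor runs continuously and the more intrusive or power-hungry one is activated only on a detected event.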
Besides investing in research on combining methods and sensor technologies, research applying and validating these technologies in out-of-the-lab studies is essential. The generalizability of results from the lab to real-life settings should be examined, as well as generalization across situations and user groups, and user experience. These studies will lead to further improvements and/or insight into the context in which the technology can or cannot be used.
The current review has some limitations. First, we did not include a measure of accuracy or reliability of the technologies in our table. Some of the reviews listed in our table of systematic reviews (Table A1 in Appendix A, e.g., [138], [144]) included the presence of evaluation metrics indicating the performance of the technologies (e.g., accuracy, sensitivity, and precision) as an inclusion criterion. We decided not to apply this specific inclusion criterion because in our case it is hard to obtain comparable measures across studies. Also, whether accuracy is ‘good’ very much depends on the specific research question and study design. Second, our classification of whether an algorithm is ready-to-use could not be based on information directly provided in the papers and should be considered a somewhat subjective estimate by the authors of this review.
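For reference, the evaluation metrics named above (accuracy, sensitivity, and precision) have standard definitions in terms of true and false positives and negatives. The sketch below computes them. The function name and the example counts are hypothetical and do not come from any reviewed study.

```python
from typing import Dict


def detection_metrics(tp: int, fp: int, tn: int, fn: int) -> Dict[str, float]:
    """Standard metrics for a binary detector (e.g., eating vs. not eating).

    tp: eating episodes correctly detected
    fp: non-eating episodes wrongly flagged as eating
    tn: non-eating episodes correctly ignored
    fn: eating episodes that were missed
    """
    total = tp + fp + tn + fn
    if total == 0 or tp + fn == 0 or tp + fp == 0:
        raise ValueError("metrics are undefined for these counts")
    return {
        "accuracy": (tp + tn) / total,   # share of all decisions that are correct
        "sensitivity": tp / (tp + fn),   # share of true eating episodes detected
        "precision": tp / (tp + fp),     # share of detections that are true eating
    }


if __name__ == "__main__":
    # Made-up counts for illustration only.
    for name, value in detection_metrics(tp=40, fp=10, tn=45, fn=5).items():
        print(f"{name}: {value:.3f}")
```

Because accuracy depends on how many non-eating episodes are in the test data, the same detector can score very differently in a short lab session and in all-day recordings, which is one reason such figures are hard to compare across studies.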
In conclusion, there are some promising devices to measure eating behavior in naturalistic settings. However, it will take some time before some of these devices and algorithms become commercially available, because data from large numbers of test users and under varied conditions are still lacking. Until then, research inside and outside the lab needs to be carried out using custom-made devices and algorithms, and/or with combinations of existing devices. Combining different technologies is recommended, as it can lead to multimodal datasets covering different aspects of eating behavior (e.g., when people are eating and at what rate), dietary intake (e.g., what people are eating and how much), and contextual factors (e.g., why people are eating and with whom). We expect this to result in a much fuller understanding of individual eating patterns and dynamics, in real time and in context, which can be used to develop adaptive, personalized interventions. New technologies measuring individual eating behaviors will be beneficial not only in consumer behavior studies but also in the food and medical industries. New insights on eating patterns and traits discovered using these technologies may contribute to clarifying the use of food products among a wide range of consumers or to providing guidance for improving patients’ diets.

Author Contributions

Conceptualization, HH, PP, AT, AB; Data curation, HH, AT; Formal analysis, HH, AT; Funding acquisition, AB; Investigation, HH, PP, AT; Methodology, AT; Supervision, AB; Visualization, AT; Writing – original draft, HH, PP, AT, AB; Writing – review & editing, HH, PP, AT, AB, GC.


Funding

This study was funded by Kikkoman Europe R&D Laboratory B.V.

Conflicts of Interest

This study was funded by Kikkoman Europe R&D Laboratory B.V. Haruka Hiraguchi is employed by Kikkoman Europe R&D Laboratory B.V. Haruka Hiraguchi reports no potential conflicts with the study. All other authors declare no conflict of interest.

Appendix A

Table A1. Systematic review papers that were evaluated and used to define the scope of this study (see Introduction). Listed for all systematic reviews are the year of publication, the focus of the review, the years of inclusion, the number of papers included, and the specific requirements for inclusion. Text in italic represents literal quotes.
[137]
Year of publication: 2020
Focus of review: In this review paper [they] provide an overview about automatic food intake monitoring, by focusing on technical aspects and Computer Vision works which solve the main involved tasks (i.e., classification, recognitions, segmentation, etc.).
Years of inclusion: 2010-2020
Number of papers included: 23 papers that present systems for automatic food intake monitoring + 46 papers that address Computer Vision tasks related to food images analysis
Specific requirements for inclusion: Method should apply Computer Vision techniques.

[138]
Year of publication: 2020
Focus of review: [This] scoping review was conducted in order to:
1. catalog the current use of wearable devices and sensors that automatically detect eating activity (dietary intake and/or eating behavior) specifically in free-living research settings;
2. and identify the sample size, sensor types, ground-truth measures, eating outcomes, and evaluation metrics used to evaluate these sensors.
Years of inclusion: prior to December 22, 2019
Number of papers included: 33
Specific requirements for inclusion:
I - description of any wearable device or sensor (i.e., worn on the body) that was used to automatically (i.e., no required actions by the user) detect any form of eating (e.g., content of food consumed, quantity of food consumed, eating event, etc.). Proxies for “eating” measures, such as glucose levels or energy expenditure, were not included.
II - “In-field” (non-lab) testing of the sensor(s), in which eating and activities were performed at-will with no restrictions (i.e., what, where, with whom, when, and how the user ate could not be restricted).
III - At least one evaluation metric (e.g., Accuracy, Sensitivity, Precision, F1-score) that indicated the performance of the sensor on detecting its respective form of eating.

[139]
Year of publication: 2019
Focus of review: The goal of this review was to identify unique technology-based tools for dietary intake assessment, including smartphone applications, those that captured digital images of foods and beverages for the purpose of dietary intake assessment, and dietary assessment tools available from the Web or that were accessed from a personal computer (PC).
Years of inclusion: January 2011 - September 2017
Number of papers included: 43
Specific requirements for inclusion:
(1) publications were in English,
(2) articles were published from January 2011 to September 2017, and
(3) sufficient information was available to evaluate tool features, functions, and uses.

[140]
Year of publication: 2017
Focus of review: This article reviews the most relevant and recent researches on automatic diet monitoring, discussing their strengths and weaknesses. In particular, the article reviews two approaches to this problem, accounting for most of the work in the area. The first approach is based on image analysis and aims at extracting information about food content automatically from food images. The second one relies on wearable sensors and has the detection of eating behaviours as its main goal.
Years of inclusion: not specified
Number of papers included: not specified
Specific requirements for inclusion: n/a

[141]
Year of publication: 2019
Focus of review: The aim of this review is to synthesise research to date that utilises upper limb motion tracking sensors, either individually or in combination with other technologies (e.g., cameras, microphones), to objectively assess eating behaviour.
Years of inclusion: 2005-2018
Number of papers included: 69
Specific requirements for inclusion:
(1) used at least one wearable motion sensor,
(2) that was mounted to the wrist, lower arm, or upper arm (referred to as the upper limb in this review),
(3) for eating behaviour assessment or human activity detection, where one of the classified activities is eating or drinking. We explicitly also included studies that additionally employed other sensors on other parts of the body (e.g., cameras, microphones, scales).

[142]
Year of publication: 2022
Focus of review: This paper consists of a systematic review of sensors and machine learning approaches for detecting food intake episodes. [...] The main questions of this systematic review were as follows: (RQ1) What sensors can be used to access food intake moments effectively? (RQ2) What can be done to integrate such sensors into daily lives seamlessly? (RQ3) What processing must be done to achieve good accuracy?
Years of inclusion: 2010-2021
Number of papers included: 30
Specific requirements for inclusion:
(1) research work that performs food intake detection;
(2) research work that uses sensors to detect food with the help of sensors;
(3) research work that presents some processing of food detection to propose diet;
(4) research work that use wearable biosensors to detect food intake;
(5) research work that use the methodology of deep learning, Support Vector Machines or Convolutional Neural Networks related to food intake;
(6) research work that is not directly related to image processing techniques;
(7) research work that is original;
(8) papers published between 2010 and 2021; and
(9) papers written in English

[143]
Year of publication: 2021
Focus of review: This article presents a comprehensive review of the use of sensor methodologies for portion size estimation. [...] Three research questions were chosen to guide this systematic review:
RQ1) What are the available state-of-the-art SB-FPSE methodologies? [...]
RQ2) What methods are employed for portion size estimation from sensor data and how accurate are these methods? [...]
RQ3) Which sensor modalities are more suitable for use in the free-living conditions?
Years of inclusion: since 2000
Number of papers included: 67
Specific requirements for inclusion: Articles published in peer-reviewed venues; […] Papers that describe methods for estimation of portion size; FPSE methods that are either automatic or semi-automatic; written in English.

[134]
Year of publication: 2022
Focus of review: [They] reviewed the current methods to automatically detect eating behavior events from video recordings.
Years of inclusion: 2010–2021
Number of papers included: 13
Specific requirements for inclusion: Original research articles [...] published in the English language and containing findings on video analysis for human eating behavior from January 2010 to December 2021. [...] Conference papers were included. [...] Articles concerning non-human studies were excluded. We excluded research articles on eating behavior with video electroencephalogram monitoring, verbal interaction analysis, or sensors, as well as research studies not focusing on automated measures as they are beyond the scope of video analysis.

[144]
Year of publication: 2022
Focus of review: The aim of this study was to identify and collate sensor-based technologies that are feasible for dietitians to use to assist with performing dietary assessments in real-world practice settings.
Years of inclusion: 2016-2021
Number of papers included: 54
Specific requirements for inclusion: Any scientific paper published between January 2016 and December 2021 that used sensor-based devices to passively detect and record the initiation of eating in real-time. Studies were further excluded during the full text screening stage if they did not evaluate device performance or if the same research group conducted a more recent study describing a device that superseded previous studies of the same device. Studies evaluating a device that did not have the capacity to detect and record the start time of food intake, did not use sensors, were not applicable for use in free-living settings, or were discontinued at the time of the search were also excluded.

[145]
Year of publication: 2021
Focus of review: This paper reviews the most recent solutions to automatic fluid intake monitoring both commercially and in the literature. The available technologies are divided into four categories: wearables, surfaces with embedded sensors, vision- and environmental-based solutions, and smart containers.
Years of inclusion: 2010-2020
Number of papers included: 115
Specific requirements for inclusion: Papers that did not study liquid intake and only studied food intake or other unrelated activities were excluded. Since this review is focused on the elderly population, in the wearable section, we only included literature that used wristbands and textile technology which could be easily worn without affecting the normal daily activity of the subjects. We have excluded devices that were not watch/band or textile based such as throat and ear microphones or ear inertial devices as they are not practical for everyday use. [...] Although this review is focused on the elderly population, studies that used adult subjects were not excluded, as there are too few that only used seniors.

Table A2. Non-systematic review papers that were evaluated and used to define the scope of this study (see Introduction). Listed for all non-systematic reviews are the year of publication, and the focus of the review. Text in italic represents literal quotes.
[146]
Year of publication: 2019
Focus of review: A group of 30 experts got together to discuss the state of evidence with regard to monitoring calorie intake and eating behaviors [...] characterized into 3 domains: (1) image-based sensing (e.g, wearable and smartphone-based cameras combined with machine learning algorithms); (2) eating action unit (EAU) sensors (eg, to measure feeding gesture and chewing rate); and (3) biochemical measures (e.g, serum and plasma metabolite concentrations). They discussed how each domain functions, provided examples of promising solutions, and highlighted potential challenges and opportunities in each domain.

[147]
Year of publication: 2022
Focus of review: This paper concerns the validity of new consumer research technologies, as applied in a food behaviour context. Therefore, [they] introduce three validity criteria based on psychological theory concerning biases resulting from the awareness a consumer has of a measurement situation. [...] The three criteria addressing validity are: 1. Reflection: the research method requires the ‘person(a)’ of the consumer, i.e., he/she needs to think about his-/herself or his/her behaviour, 2. Awareness: the method requires the consumer to know he or she is being tested, 3. Informed: the method requires the consumer to know the underlying research question.

[148]
Year of publication: 2022
Focus of review: They present a high-level overview of [their] recent work on intake monitoring using a smartwatch, as well as methods using an in-ear microphone. [...] [This paper's] goal is to inform researchers and users of intake monitoring methods regarding (i) the development of new methods based on commercially available devices, (ii) what to expect in terms of effectiveness, and (iii) how these methods can be used in research as well as in practical applications.

[149]
Year of publication: 2021
Focus of review: A review of the state of the art of wearable sensors and methodologies proposed for monitoring ingestive behavior in humans

[150]
Year of publication: 2017
Focus of review: This article evaluates the potential of various approaches to dietary monitoring with respect to convenience, accuracy, and applicability to real-world environments. [They] emphasize the application of technology and sensor-based solutions to the health-monitoring domain, and [they] evaluate various form factors to provide a comprehensive survey of the prior art in the field.

[151]
Year of publication: 2022
Focus of review: The original ultimate goal of the studies reviewed in this paper was to use the laboratory test meal, measured with the UEM [Universal Eating Monitor], to translate animal models of ingestion to humans for the study of the physiological controls of food intake under standardized conditions.

[152]
Year of publication: 2022
Focus of review: This paper describes many food weight detection systems which includes sensor systems consisting of a load cell, manual food waste method, wearable sensors.

[153]
Year of publication: 2018
Focus of review: This paper summarizes recent technological advancements, such as remote sensing devices, digital photography, and multisensor devices, which have the potential to improve the assessment of dietary intake and physical activity in free-living adults.

[154]
Year of publication: 2022
Focus of review: Focusing on non-invasive solutions, we categorised identified technologies according to five study domains: 1) detecting food-related emotions, 2) monitoring food choices, 3) detecting eating actions, 4) identifying the type of food consumed, and 5) estimating the amount of food consumed. Additionally, [they] considered technologies not yet applied in the targeted research disciplines but worth considering in future research.

[155]
Year of publication: 2020
Focus of review: In this article [they] describe how wrist-worn wearables, on-body cameras, and body-mounted biosensors can be used to capture data about when, what, and how much people eat and drink. [They] illustrate how these new techniques can be integrated to provide complete solutions for the passive, objective assessment of a wide range of traditional dietary factors, as well as novel measures of eating architecture, within person variation in intakes, and food/nutrient combinations within meals.

[156]
Year of publication: 2021
Focus of review: This survey discusses the best-performing methodologies that have been developed so far for automatic food recognition and volume estimation.

[157]
Year of publication: 2020
Focus of review: This paper reviews various novel digital methods for food volume estimation and explores the potential for adopting such technology in the Southeast Asian context.

[158]
Year of publication: 2017
Focus of review: This paper presents a meticulous review of the latest sensing platforms and data analytic approaches to solve the challenges of food-intake monitoring, ranging from ear-based chewing and swallowing detection systems that capture eating gestures to wearable cameras that identify food types and caloric content through image processing techniques. This paper focuses on the comparison of different technologies and approaches that relate to user comfort, body location, and applications for medical research.

[159]
Year of publication: 2020
Focus of review: In this survey, a wide range of chewing activity detection explored to outline the sensing design, classification methods, performances, chewing parameters, chewing data analysis as well as the challenges and limitations associated with them.


Figure 2. VibroScale (reproduced from [33] with permission).
Figure 3. Playful Bottle system (reproduced from [40] with permission).
Figure 4. Sensor placement.
Figure 5. Locations of wearables on the human body.
Figure 6. SnackBox.
