Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

CCA-Transformer: Cascaded Cross-Attention Based Transformer for Facial Analysis in Multi-modal Data

Version 1 : Received: 22 March 2024 / Approved: 26 March 2024 / Online: 27 March 2024 (06:11:04 CET)

How to cite: Kim, J.; Kim, N.; Hong, M.; Won, C. CCA-Transformer: Cascaded Cross-Attention Based Transformer for Facial Analysis in Multi-modal Data. Preprints 2024, 2024031629. https://doi.org/10.20944/preprints202403.1629.v1 Kim, J.; Kim, N.; Hong, M.; Won, C. CCA-Transformer: Cascaded Cross-Attention Based Transformer for Facial Analysis in Multi-modal Data. Preprints 2024, 2024031629. https://doi.org/10.20944/preprints202403.1629.v1

Abstract

One of the most crucial elements in deeply understanding humans on a psychological level is manifested through facial expressions. The analysis of a human behavior can be informed by their facial expressions, making it essential to employ indicators such as expression (Expr), valence-arousal (VA), and action units (AU). In this paper, we introduce the method proposed in the Challenge of the 6th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW) at CVPR 2024. Our proposed method utilizes the multi-modal Aff-wild2 dataset, which is splitted into spatial and audio modalities. For the spatial data, we extract features using a SimMiM model that was pre-trained on a diverse set of facial expression data. For the audio data, we extract features using a WAV2VEC model. To fusion the extracted spatial and audio features, we employed the cascaded cross-attention mechanism of a transformer.

Keywords

face analysis; expression; valence-arousal; action unit

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.