To better fuse multimodal features, the feature extraction module expresses the data of each modality as a low-dimensional semantic vector, and a semantic similarity model is then trained so that the different modalities are constrained to a unified representation space for multimodal fusion. Here we design a channel attention mechanism for multimodal feature fusion. Specifically, for the image of the m-th modality, where m ∈ {1, 2, 3, 4}, the output features Fm of the feature extraction module are globally pooled over the spatial dimensions to obtain a C × 1 × 1 × 1 channel descriptor, where C is the number of channels of a single modal feature. A sigmoid activation function is then applied to obtain the weighting coefficients. Finally, the weighting coefficients are multiplied with the corresponding input features Fm to obtain the new weighted features. The calculation of the weighted features is shown in the following equation:

F̂m = σ(wm · GAP(Fm)) ⊗ Fm
where σ represents the sigmoid function, wm represents the parameter matrix learned during training, GAP(·) denotes the global pooling described above, and ⊗ denotes channel-wise multiplication. The features of the different modalities are concatenated after the maximum pooling layer. Finally, a Fully Connected (FC) layer is applied along the corresponding channel dimension, and its output is passed to the classifier to obtain the classification result.
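The following is a minimal sketch of the fusion step described above, written in PyTorch. The framework, the assumption of 3D (volumetric) feature maps, and the specific names and sizes (ChannelAttentionFusion, channels=64, num_classes=2) are illustrative choices, not details taken from the text.

import torch
import torch.nn as nn


class ChannelAttentionFusion(nn.Module):
    """Channel attention on per-modality features, then concatenation,
    max pooling, and a fully connected classifier (sketch)."""

    def __init__(self, num_modalities=4, channels=64, num_classes=2):
        super().__init__()
        # One parameter matrix w_m per modality, applied to the pooled
        # C-dimensional channel descriptor.
        self.w = nn.ModuleList(
            [nn.Linear(channels, channels, bias=False) for _ in range(num_modalities)]
        )
        self.gap = nn.AdaptiveAvgPool3d(1)       # global pooling -> C x 1 x 1 x 1
        self.max_pool = nn.AdaptiveMaxPool3d(1)  # pooling before concatenation
        self.fc = nn.Linear(num_modalities * channels, num_classes)

    def forward(self, feats):
        # feats: list of per-modality feature maps F_m, each of shape (B, C, D, H, W)
        fused = []
        for m, f in enumerate(feats):
            desc = self.gap(f).flatten(1)                 # (B, C) channel descriptor
            weights = torch.sigmoid(self.w[m](desc))      # sigmoid weighting coefficients
            f_hat = f * weights.view(*weights.shape, 1, 1, 1)   # reweight F_m channel-wise
            fused.append(self.max_pool(f_hat).flatten(1))       # (B, C) after max pooling
        out = torch.cat(fused, dim=1)                     # concatenate the modalities
        return self.fc(out)                               # logits for the classifier


# Example usage with four modalities of 3D features:
# model = ChannelAttentionFusion()
# feats = [torch.randn(2, 64, 8, 8, 8) for _ in range(4)]
# logits = model(feats)   # shape (2, 2)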