The first few layers of the CNN serve as a feature extractor that learns image features automatically through supervised training; the final layer applies a Softmax function to produce the detection result [22]. Figure 6 presents the CNN structure.
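The Softmax function mentioned above converts the final layer's raw scores into a probability distribution over classes, from which the detected class is read off as the largest probability. A minimal stdlib-only sketch (the logit values are illustrative, not taken from the paper):

```python
import math

def softmax(logits):
    """Map raw class scores to probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Example: three hypothetical class scores from the last layer
probs = softmax([2.0, 1.0, 0.1])
predicted_class = probs.index(max(probs))  # index 0 wins here
```

In a real network this would be applied to the 10 outputs of the last fully connected layer described below.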
As can be seen from Figure 6, the CNN has eight layers in total: the first five are alternating convolutional and Max Pooling layers, and the remaining three are fully connected layers. The inputs to the CNN are the harmonic and percussive spectrograms produced by HPSS separation, together with the spectrogram of the original signal. The images are resized to a uniform 256 × 256 and fed into the first convolutional layer, which filters them with 96 kernels of size 11 × 11 at a stride of 4 pixels, the stride being the distance between the receptive-field centers of neighboring neurons in the same kernel map [23]. A Max Pooling layer then takes the output of the first convolutional layer and filters it with 3 × 3 pooling windows. After the input size is unified, the second convolutional layer filters the pooled output with 256 kernels of size 5 × 5. The third, fourth, and fifth convolutional layers are connected to one another with no pooling or normalization layers in between: the third has 384 kernels of size 3 × 3 connected to the output of the second convolutional layer [24], the fourth has 384 kernels of size 3 × 3, and the fifth has 256 kernels of size 3 × 3. These five convolutional layers yield 256 feature maps of size 6 × 6, which are fed to three fully connected layers of 4096, 1000, and 10 neurons, respectively. The last fully connected layer outputs the final detection result [25].
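The spatial sizes in the architecture above follow from the standard convolution/pooling output-size formula, floor((size + 2·padding − kernel) / stride) + 1. The sketch below traces a 256 × 256 input through the stated kernels and strides; the paddings, the pooling strides, and the second and final pooling steps are assumptions in the AlexNet style, since the text does not specify them, but under these assumptions the trace reproduces the stated 6 × 6 feature maps:

```python
def out_size(size, kernel, stride=1, padding=0):
    """Spatial output size of a conv or pooling layer (floor convention)."""
    return (size + 2 * padding - kernel) // stride + 1

s = 256
s = out_size(s, 11, stride=4)   # conv1: 96 kernels, 11x11, stride 4
s = out_size(s, 3, stride=2)    # max pool, 3x3 window (assumed stride 2)
s = out_size(s, 5, padding=2)   # conv2: 256 kernels, 5x5 (assumed padding 2)
s = out_size(s, 3, stride=2)    # max pool (assumed, AlexNet-style)
s = out_size(s, 3, padding=1)   # conv3: 384 kernels, 3x3 (assumed padding 1)
s = out_size(s, 3, padding=1)   # conv4: 384 kernels, 3x3
s = out_size(s, 3, padding=1)   # conv5: 256 kernels, 3x3
s = out_size(s, 3, stride=2)    # final max pool (assumed, AlexNet-style)
print(s)  # -> 6, i.e. 256 feature maps of size 6 x 6
```

The 6 × 6 × 256 = 9216 values are then flattened and passed to the first fully connected layer of 4096 neurons.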