Query about the motivation #10

Open
VoyageWang opened this issue Jul 25, 2023 · 3 comments

Comments

@VoyageWang

Hi there! This is nice work, but I have a small question about the motivation behind the architecture design. The paper says: "Based on the research conducted in [11, 4], which performed a quantitative analysis of different depths of self-attention blocks and discovered that shallow blocks tend to capture short-range dependencies while deeper ones capture long-range dependencies".

To my knowledge, a transformer can always model global context and can achieve a large effective receptive field from the initial stages. Why do you say that shallow blocks capture short-range dependencies while deeper ones capture long-range dependencies? Why wouldn't both capture long-range dependencies? What makes shallow blocks focus on short-range and deep blocks on long-range?

@AFeng-x
Owner

AFeng-x commented Jul 26, 2023

Hi, please read carefully the analysis of the attention range in the two articles [11, 4]. Although the transformer is designed to capture long-range dependencies, its shallow layers mainly focus on capturing short-range dependencies.

@DavideHe

If this theory about short- versus long-range dependencies holds, we could use convolution blocks for the shallow layers and transformer blocks for the deep layers. As we all know, convolution captures local information and is position-independent, so it needs no positional encoding.
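
To make the idea above concrete, here is a minimal, hypothetical sketch of such a hybrid stacking (not the code in this repository; all module names, dimensions, and depths are illustrative): convolution blocks handle the shallow, short-range stages, while self-attention blocks handle the deeper, long-range stages.

```python
# Hypothetical hybrid backbone sketch: conv blocks in shallow stages,
# self-attention blocks in deep stages. Illustrative only.
import torch
import torch.nn as nn


class ConvBlock(nn.Module):
    """Local mixing via a depthwise 3x3 conv + pointwise MLP (shallow stages)."""
    def __init__(self, dim):
        super().__init__()
        self.dw = nn.Conv2d(dim, dim, kernel_size=3, padding=1, groups=dim)
        self.norm = nn.BatchNorm2d(dim)
        self.pw = nn.Sequential(nn.Conv2d(dim, 4 * dim, 1), nn.GELU(),
                                nn.Conv2d(4 * dim, dim, 1))

    def forward(self, x):
        return x + self.pw(self.norm(self.dw(x)))


class AttnBlock(nn.Module):
    """Global mixing via multi-head self-attention over flattened tokens (deep stages)."""
    def __init__(self, dim, heads=4):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):                        # x: (B, C, H, W)
        b, c, h, w = x.shape
        t = x.flatten(2).transpose(1, 2)         # (B, H*W, C)
        t = t + self.attn(self.norm(t), self.norm(t), self.norm(t))[0]
        return t.transpose(1, 2).reshape(b, c, h, w)


class HybridNet(nn.Module):
    """Conv blocks early, attention blocks late, with downsampling between stages."""
    def __init__(self, dims=(64, 128, 256, 512), depths=(2, 2, 2, 2)):
        super().__init__()
        self.stem = nn.Conv2d(3, dims[0], kernel_size=4, stride=4)
        stages, in_dim = [], dims[0]
        for i, (dim, depth) in enumerate(zip(dims, depths)):
            blocks = [nn.Conv2d(in_dim, dim, kernel_size=2, stride=2)] if i > 0 else []
            block_cls = ConvBlock if i < 2 else AttnBlock  # shallow: conv, deep: attention
            blocks += [block_cls(dim) for _ in range(depth)]
            stages.append(nn.Sequential(*blocks))
            in_dim = dim
        self.stages = nn.ModuleList(stages)

    def forward(self, x):
        x = self.stem(x)
        for stage in self.stages:
            x = stage(x)
        return x


if __name__ == "__main__":
    out = HybridNet()(torch.randn(1, 3, 224, 224))
    print(out.shape)   # torch.Size([1, 512, 7, 7])
```

The switch from `ConvBlock` to `AttnBlock` at the deeper stages is exactly the split proposed above: local mixing where dependencies are short-range, global mixing where they are long-range.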

@VoyageWang
Author

Hi there! Thanks for your reply! I would also like to know the motivation behind the modulation design. Why does modulation improve performance? Could you explain it further? I am looking forward to your reply.
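
For readers wondering what "modulation" refers to here, below is a minimal, hypothetical sketch of a generic modulation-style block (in the spirit of convolutional/focal modulation designs), not this repository's actual implementation: a context branch aggregates local features with a depthwise convolution, and the result multiplies a projected copy of the input elementwise, so each position is re-weighted by its surrounding context before the output projection. All names and kernel sizes are illustrative.

```python
# Hypothetical modulation-style block, illustrative only.
import torch
import torch.nn as nn


class ModulationBlock(nn.Module):
    def __init__(self, dim, kernel_size=7):
        super().__init__()
        self.v = nn.Conv2d(dim, dim, 1)                            # value projection
        self.ctx = nn.Conv2d(dim, dim, kernel_size,
                             padding=kernel_size // 2, groups=dim)  # context aggregation
        self.act = nn.GELU()
        self.proj = nn.Conv2d(dim, dim, 1)                         # output projection

    def forward(self, x):                                          # x: (B, C, H, W)
        context = self.act(self.ctx(x))                            # local context per position
        return self.proj(self.v(x) * context)                      # elementwise modulation


if __name__ == "__main__":
    y = ModulationBlock(64)(torch.randn(1, 64, 56, 56))
    print(y.shape)   # torch.Size([1, 64, 56, 56])
```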
