No, I suck at explaining.
What I simply meant is that:
1. Ichika lands a punch.
2. Kushida gets punched.
3. Kushida is spun and falls on the ground.
4. Kushida later on hears the sound of the punch.
This means that the sound of the punch traveled comparably so much slower that Kushida perceived her landing on the ground before the sound of the punch.
Now, this of course means that the sound of Kushida landing would have come to Kushida first, firstly because of the distance being absolutely none here, and secondly, the distance for the sound of the punch to get to Kushida's ear being much greater here. It's more so that Kushida had to listen to two sounds and register a perceivable delay between both of them.
Now, both of these sounds (of the fall and of the punch, that's why I said "fall" and "punch" separately in my message before), would be registered with the same auditory processing time, and therefore, this processing time can be eliminated. Like the entire thing we need to see is the delay. For example, let's say that there's 0.1 s delay for an auditory signal, so like, the punch sound is produced, but is chasing Kushida till where she has fallen to hit her ears at least. And the sound then produced is of the fall, which basically instantly reaches Kushida because there's practically no distance between her and the ground here.
The sound of the punch also reaches her ears, and now, finally, balanced with the 0.1 s standard delay, both sounds are perceived by her, but again, both have the same delay. Like let's say, if two people, A and B are racing, and A is like 20 seconds ahead of B, and then A is delayed by 10 seconds, so now, A is 10 seconds ahead of B, but now, B is delayed by 10 seconds too, so that brings them 20 seconds apart again. It's the same here too, both the sounds have the same delay, so practically speaking, you can not account for that delay, and only see the time gap between the two sounds.
That's all.