On Sat, Jan 29, 2011 at 2:54 AM, Matthew Dillon <dillon@apollo.backplane.com> wrote:
>
> :Hi all,
> :
> :The i386 version of cpu_sfence() is just asm volatile ("" :::"memory").
> :
> :According to the instruction set, sfence should also ensure the
> :"global visibility" (i.e. empty CPU store buffer) of the stores before
> :the sfence.
> :So should we do the same as cpu_mfence(), i.e. use a locked memory access?
> :
> :Best Regards,
> :sephe
>
>     cpu_sfence() is basically a NOP, because x86 cpus already order
>     writes for global visibility.  The volatile ..."memory" macro is

The document only indicates that writes are ordered on x86, but global
visibility is not:
http://support.amd.com/us/Processor_TechDocs/24593.pdf

The second point on page 166, I think, suggests the following.  Here:

    processor 0         processor 1
    store A <--- 1
        :
        : later
        :..........>    load r1, A

r1 could still be 0, since A is still in the store buffer, while here:

    processor 0         processor 1
    store A <--- 1
    sfence
        :
        : later
        :..........>    load r1, A

r1 must be 1.

Well, I could be wrong on this.

>     roughly equivalent to cpu_ccfence() ... prevent the compiler itself
>     from trying to optimize or reorder actual instructions around that
>     point in the code.

Best Regards,
sephe

--
Tomorrow Will Never Die