| Linus Torvalds | 1da177e | 2005-04-16 15:20:36 -0700 | [diff] [blame] | 1 | On some platforms, so-called memory-mapped I/O is weakly ordered.  On such | 
|  | 2 | platforms, driver writers are responsible for ensuring that I/O writes to | 
|  | 3 | memory-mapped addresses on their device arrive in the order intended.  This is | 
|  | 4 | typically done by reading a 'safe' device or bridge register, causing the I/O | 
|  | 5 | chipset to flush pending writes to the device before any reads are posted.  A | 
|  | 6 | driver would usually use this technique immediately prior to the exit of a | 
|  | 7 | critical section of code protected by spinlocks.  This would ensure that | 
|  | 8 | subsequent writes to I/O space arrived only after all prior writes (much like a | 
|  | 9 | memory barrier op, mb(), only with respect to I/O). | 
|  | 10 |  | 
|  | 11 | A more concrete example from a hypothetical device driver: | 
|  | 12 |  | 
|  | 13 | ... | 
|  | 14 | CPU A:  spin_lock_irqsave(&dev_lock, flags) | 
|  | 15 | CPU A:  val = readl(my_status); | 
|  | 16 | CPU A:  ... | 
|  | 17 | CPU A:  writel(newval, ring_ptr); | 
|  | 18 | CPU A:  spin_unlock_irqrestore(&dev_lock, flags) | 
|  | 19 | ... | 
|  | 20 | CPU B:  spin_lock_irqsave(&dev_lock, flags) | 
|  | 21 | CPU B:  val = readl(my_status); | 
|  | 22 | CPU B:  ... | 
|  | 23 | CPU B:  writel(newval2, ring_ptr); | 
|  | 24 | CPU B:  spin_unlock_irqrestore(&dev_lock, flags) | 
|  | 25 | ... | 
|  | 26 |  | 
|  | 27 | In the case above, the device may receive newval2 before it receives newval, | 
|  | 28 | which could cause problems.  Fixing it is easy enough though: | 
|  | 29 |  | 
|  | 30 | ... | 
|  | 31 | CPU A:  spin_lock_irqsave(&dev_lock, flags) | 
|  | 32 | CPU A:  val = readl(my_status); | 
|  | 33 | CPU A:  ... | 
|  | 34 | CPU A:  writel(newval, ring_ptr); | 
|  | 35 | CPU A:  (void)readl(safe_register); /* maybe a config register? */ | 
|  | 36 | CPU A:  spin_unlock_irqrestore(&dev_lock, flags) | 
|  | 37 | ... | 
|  | 38 | CPU B:  spin_lock_irqsave(&dev_lock, flags) | 
|  | 39 | CPU B:  val = readl(my_status); | 
|  | 40 | CPU B:  ... | 
|  | 41 | CPU B:  writel(newval2, ring_ptr); | 
|  | 42 | CPU B:  (void)readl(safe_register); /* maybe a config register? */ | 
|  | 43 | CPU B:  spin_unlock_irqrestore(&dev_lock, flags) | 
|  | 44 |  | 
|  | 45 | Here, the reads from safe_register will cause the I/O chipset to flush any | 
|  | 46 | pending writes before actually posting the read to the chipset, preventing | 
|  | 47 | possible data corruption. |