| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 1 | /* | 
 | 2 |  * SLOB Allocator: Simple List Of Blocks | 
 | 3 |  * | 
 | 4 |  * Matt Mackall <mpm@selenic.com> 12/30/03 | 
 | 5 |  * | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 6 |  * NUMA support by Paul Mundt, 2007. | 
 | 7 |  * | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 8 |  * How SLOB works: | 
 | 9 |  * | 
 | 10 |  * The core of SLOB is a traditional K&R style heap allocator, with | 
 | 11 |  * support for returning aligned objects. The granularity of this | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 12 |  * allocator is as little as 2 bytes, however typically most architectures | 
 | 13 |  * will require 4 bytes on 32-bit and 8 bytes on 64-bit. | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 14 |  * | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 15 |  * The slob heap is a set of linked list of pages from alloc_pages(), | 
 | 16 |  * and within each page, there is a singly-linked list of free blocks | 
 | 17 |  * (slob_t). The heap is grown on demand. To reduce fragmentation, | 
 | 18 |  * heap pages are segregated into three lists, with objects less than | 
 | 19 |  * 256 bytes, objects less than 1024 bytes, and all other objects. | 
 | 20 |  * | 
 | 21 |  * Allocation from heap involves first searching for a page with | 
 | 22 |  * sufficient free blocks (using a next-fit-like approach) followed by | 
 | 23 |  * a first-fit scan of the page. Deallocation inserts objects back | 
 | 24 |  * into the free list in address order, so this is effectively an | 
 | 25 |  * address-ordered first fit. | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 26 |  * | 
 | 27 |  * Above this is an implementation of kmalloc/kfree. Blocks returned | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 28 |  * from kmalloc are prepended with a 4-byte header with the kmalloc size. | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 29 |  * If kmalloc is asked for objects of PAGE_SIZE or larger, it calls | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 30 |  * alloc_pages() directly, allocating compound pages so the page order | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 31 |  * does not have to be separately tracked, and also stores the exact | 
 | 32 |  * allocation size in page->private so that it can be used to accurately | 
 | 33 |  * provide ksize(). These objects are detected in kfree() because slob_page() | 
 | 34 |  * is false for them. | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 35 |  * | 
 | 36 |  * SLAB is emulated on top of SLOB by simply calling constructors and | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 37 |  * destructors for every SLAB allocation. Objects are returned with the | 
 | 38 |  * 4-byte alignment unless the SLAB_HWCACHE_ALIGN flag is set, in which | 
 | 39 |  * case the low-level allocator will fragment blocks to create the proper | 
 | 40 |  * alignment. Again, objects of page-size or greater are allocated by | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 41 |  * calling alloc_pages(). As SLAB objects know their size, no separate | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 42 |  * size bookkeeping is necessary and there is essentially no allocation | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 43 |  * space overhead, and compound pages aren't needed for multi-page | 
 | 44 |  * allocations. | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 45 |  * | 
 | 46 |  * NUMA support in SLOB is fairly simplistic, pushing most of the real | 
 | 47 |  * logic down to the page allocator, and simply doing the node accounting | 
 | 48 |  * on the upper levels. In the event that a node id is explicitly | 
| Mel Gorman | 6484eb3 | 2009-06-16 15:31:54 -0700 | [diff] [blame] | 49 |  * provided, alloc_pages_exact_node() with the specified node id is used | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 50 |  * instead. The common case (or when the node id isn't explicitly provided) | 
 | 51 |  * will default to the current node, as per numa_node_id(). | 
 | 52 |  * | 
 | 53 |  * Node aware pages are still inserted in to the global freelist, and | 
 | 54 |  * these are scanned for by matching against the node id encoded in the | 
 | 55 |  * page flags. As a result, block allocations that can be satisfied from | 
 | 56 |  * the freelist will only be done so on pages residing on the same node, | 
 | 57 |  * in order to prevent random node placement. | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 58 |  */ | 
 | 59 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 60 | #include <linux/kernel.h> | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 61 | #include <linux/slab.h> | 
 | 62 | #include <linux/mm.h> | 
| Nick Piggin | 1f0532e | 2009-05-05 19:13:45 +1000 | [diff] [blame] | 63 | #include <linux/swap.h> /* struct reclaim_state */ | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 64 | #include <linux/cache.h> | 
 | 65 | #include <linux/init.h> | 
 | 66 | #include <linux/module.h> | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 67 | #include <linux/rcupdate.h> | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 68 | #include <linux/list.h> | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 69 | #include <linux/kmemleak.h> | 
| Li Zefan | 039ca4e | 2010-05-26 17:22:17 +0800 | [diff] [blame] | 70 |  | 
 | 71 | #include <trace/events/kmem.h> | 
 | 72 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 73 | #include <asm/atomic.h> | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 74 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 75 | /* | 
 | 76 |  * slob_block has a field 'units', which indicates size of block if +ve, | 
 | 77 |  * or offset of next block if -ve (in SLOB_UNITs). | 
 | 78 |  * | 
 | 79 |  * Free blocks of size 1 unit simply contain the offset of the next block. | 
 | 80 |  * Those with larger size contain their size in the first SLOB_UNIT of | 
 | 81 |  * memory, and the offset of the next free block in the second SLOB_UNIT. | 
 | 82 |  */ | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 83 | #if PAGE_SIZE <= (32767 * 2) | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 84 | typedef s16 slobidx_t; | 
 | 85 | #else | 
 | 86 | typedef s32 slobidx_t; | 
 | 87 | #endif | 
 | 88 |  | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 89 | struct slob_block { | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 90 | 	slobidx_t units; | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 91 | }; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 92 | typedef struct slob_block slob_t; | 
 | 93 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 94 | /* | 
 | 95 |  * We use struct page fields to manage some slob allocation aspects, | 
 | 96 |  * however to avoid the horrible mess in include/linux/mm_types.h, we'll | 
 | 97 |  * just define our own struct page type variant here. | 
 | 98 |  */ | 
 | 99 | struct slob_page { | 
 | 100 | 	union { | 
 | 101 | 		struct { | 
 | 102 | 			unsigned long flags;	/* mandatory */ | 
 | 103 | 			atomic_t _count;	/* mandatory */ | 
 | 104 | 			slobidx_t units;	/* free units left in page */ | 
 | 105 | 			unsigned long pad[2]; | 
 | 106 | 			slob_t *free;		/* first free slob_t in page */ | 
 | 107 | 			struct list_head list;	/* linked list of free pages */ | 
 | 108 | 		}; | 
 | 109 | 		struct page page; | 
 | 110 | 	}; | 
 | 111 | }; | 
 | 112 | static inline void struct_slob_page_wrong_size(void) | 
 | 113 | { BUILD_BUG_ON(sizeof(struct slob_page) != sizeof(struct page)); } | 
 | 114 |  | 
 | 115 | /* | 
 | 116 |  * free_slob_page: call before a slob_page is returned to the page allocator. | 
 | 117 |  */ | 
 | 118 | static inline void free_slob_page(struct slob_page *sp) | 
 | 119 | { | 
 | 120 | 	reset_page_mapcount(&sp->page); | 
 | 121 | 	sp->page.mapping = NULL; | 
 | 122 | } | 
 | 123 |  | 
 | 124 | /* | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 125 |  * All partially free slob pages go on these lists. | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 126 |  */ | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 127 | #define SLOB_BREAK1 256 | 
 | 128 | #define SLOB_BREAK2 1024 | 
 | 129 | static LIST_HEAD(free_slob_small); | 
 | 130 | static LIST_HEAD(free_slob_medium); | 
 | 131 | static LIST_HEAD(free_slob_large); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 132 |  | 
 | 133 | /* | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 134 |  * is_slob_page: True for all slob pages (false for bigblock pages) | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 135 |  */ | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 136 | static inline int is_slob_page(struct slob_page *sp) | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 137 | { | 
| Wu Fengguang | 7303f24 | 2009-05-11 09:59:34 +0300 | [diff] [blame] | 138 | 	return PageSlab((struct page *)sp); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 139 | } | 
 | 140 |  | 
 | 141 | static inline void set_slob_page(struct slob_page *sp) | 
 | 142 | { | 
| Wu Fengguang | 7303f24 | 2009-05-11 09:59:34 +0300 | [diff] [blame] | 143 | 	__SetPageSlab((struct page *)sp); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 144 | } | 
 | 145 |  | 
 | 146 | static inline void clear_slob_page(struct slob_page *sp) | 
 | 147 | { | 
| Wu Fengguang | 7303f24 | 2009-05-11 09:59:34 +0300 | [diff] [blame] | 148 | 	__ClearPageSlab((struct page *)sp); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 149 | } | 
 | 150 |  | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 151 | static inline struct slob_page *slob_page(const void *addr) | 
 | 152 | { | 
 | 153 | 	return (struct slob_page *)virt_to_page(addr); | 
 | 154 | } | 
 | 155 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 156 | /* | 
 | 157 |  * slob_page_free: true for pages on free_slob_pages list. | 
 | 158 |  */ | 
 | 159 | static inline int slob_page_free(struct slob_page *sp) | 
 | 160 | { | 
| Andy Whitcroft | 9023cb7 | 2008-07-23 21:27:19 -0700 | [diff] [blame] | 161 | 	return PageSlobFree((struct page *)sp); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 162 | } | 
 | 163 |  | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 164 | static void set_slob_page_free(struct slob_page *sp, struct list_head *list) | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 165 | { | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 166 | 	list_add(&sp->list, list); | 
| Andy Whitcroft | 9023cb7 | 2008-07-23 21:27:19 -0700 | [diff] [blame] | 167 | 	__SetPageSlobFree((struct page *)sp); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 168 | } | 
 | 169 |  | 
 | 170 | static inline void clear_slob_page_free(struct slob_page *sp) | 
 | 171 | { | 
 | 172 | 	list_del(&sp->list); | 
| Andy Whitcroft | 9023cb7 | 2008-07-23 21:27:19 -0700 | [diff] [blame] | 173 | 	__ClearPageSlobFree((struct page *)sp); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 174 | } | 
 | 175 |  | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 176 | #define SLOB_UNIT sizeof(slob_t) | 
 | 177 | #define SLOB_UNITS(size) (((size) + SLOB_UNIT - 1)/SLOB_UNIT) | 
 | 178 | #define SLOB_ALIGN L1_CACHE_BYTES | 
 | 179 |  | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 180 | /* | 
 | 181 |  * struct slob_rcu is inserted at the tail of allocated slob blocks, which | 
 | 182 |  * were created with a SLAB_DESTROY_BY_RCU slab. slob_rcu is used to free | 
 | 183 |  * the block using call_rcu. | 
 | 184 |  */ | 
 | 185 | struct slob_rcu { | 
 | 186 | 	struct rcu_head head; | 
 | 187 | 	int size; | 
 | 188 | }; | 
 | 189 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 190 | /* | 
 | 191 |  * slob_lock protects all slob allocator structures. | 
 | 192 |  */ | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 193 | static DEFINE_SPINLOCK(slob_lock); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 194 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 195 | /* | 
 | 196 |  * Encode the given size and next info into a free slob block s. | 
 | 197 |  */ | 
 | 198 | static void set_slob(slob_t *s, slobidx_t size, slob_t *next) | 
 | 199 | { | 
 | 200 | 	slob_t *base = (slob_t *)((unsigned long)s & PAGE_MASK); | 
 | 201 | 	slobidx_t offset = next - base; | 
| Dimitri Gorokhovik | bcb4ddb | 2006-12-29 16:48:28 -0800 | [diff] [blame] | 202 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 203 | 	if (size > 1) { | 
 | 204 | 		s[0].units = size; | 
 | 205 | 		s[1].units = offset; | 
 | 206 | 	} else | 
 | 207 | 		s[0].units = -offset; | 
 | 208 | } | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 209 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 210 | /* | 
 | 211 |  * Return the size of a slob block. | 
 | 212 |  */ | 
 | 213 | static slobidx_t slob_units(slob_t *s) | 
 | 214 | { | 
 | 215 | 	if (s->units > 0) | 
 | 216 | 		return s->units; | 
 | 217 | 	return 1; | 
 | 218 | } | 
 | 219 |  | 
 | 220 | /* | 
 | 221 |  * Return the next free slob block pointer after this one. | 
 | 222 |  */ | 
 | 223 | static slob_t *slob_next(slob_t *s) | 
 | 224 | { | 
 | 225 | 	slob_t *base = (slob_t *)((unsigned long)s & PAGE_MASK); | 
 | 226 | 	slobidx_t next; | 
 | 227 |  | 
 | 228 | 	if (s[0].units < 0) | 
 | 229 | 		next = -s[0].units; | 
 | 230 | 	else | 
 | 231 | 		next = s[1].units; | 
 | 232 | 	return base+next; | 
 | 233 | } | 
 | 234 |  | 
 | 235 | /* | 
 | 236 |  * Returns true if s is the last free block in its page. | 
 | 237 |  */ | 
 | 238 | static int slob_last(slob_t *s) | 
 | 239 | { | 
 | 240 | 	return !((unsigned long)slob_next(s) & ~PAGE_MASK); | 
 | 241 | } | 
 | 242 |  | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 243 | static void *slob_new_pages(gfp_t gfp, int order, int node) | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 244 | { | 
 | 245 | 	void *page; | 
 | 246 |  | 
 | 247 | #ifdef CONFIG_NUMA | 
 | 248 | 	if (node != -1) | 
| Mel Gorman | 6484eb3 | 2009-06-16 15:31:54 -0700 | [diff] [blame] | 249 | 		page = alloc_pages_exact_node(node, gfp, order); | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 250 | 	else | 
 | 251 | #endif | 
 | 252 | 		page = alloc_pages(gfp, order); | 
 | 253 |  | 
 | 254 | 	if (!page) | 
 | 255 | 		return NULL; | 
 | 256 |  | 
 | 257 | 	return page_address(page); | 
 | 258 | } | 
 | 259 |  | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 260 | static void slob_free_pages(void *b, int order) | 
 | 261 | { | 
| Nick Piggin | 1f0532e | 2009-05-05 19:13:45 +1000 | [diff] [blame] | 262 | 	if (current->reclaim_state) | 
 | 263 | 		current->reclaim_state->reclaimed_slab += 1 << order; | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 264 | 	free_pages((unsigned long)b, order); | 
 | 265 | } | 
 | 266 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 267 | /* | 
 | 268 |  * Allocate a slob block within a given slob_page sp. | 
 | 269 |  */ | 
 | 270 | static void *slob_page_alloc(struct slob_page *sp, size_t size, int align) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 271 | { | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 272 | 	slob_t *prev, *cur, *aligned = NULL; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 273 | 	int delta = 0, units = SLOB_UNITS(size); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 274 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 275 | 	for (prev = NULL, cur = sp->free; ; prev = cur, cur = slob_next(cur)) { | 
 | 276 | 		slobidx_t avail = slob_units(cur); | 
 | 277 |  | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 278 | 		if (align) { | 
 | 279 | 			aligned = (slob_t *)ALIGN((unsigned long)cur, align); | 
 | 280 | 			delta = aligned - cur; | 
 | 281 | 		} | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 282 | 		if (avail >= units + delta) { /* room enough? */ | 
 | 283 | 			slob_t *next; | 
 | 284 |  | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 285 | 			if (delta) { /* need to fragment head to align? */ | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 286 | 				next = slob_next(cur); | 
 | 287 | 				set_slob(aligned, avail - delta, next); | 
 | 288 | 				set_slob(cur, delta, aligned); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 289 | 				prev = cur; | 
 | 290 | 				cur = aligned; | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 291 | 				avail = slob_units(cur); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 292 | 			} | 
 | 293 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 294 | 			next = slob_next(cur); | 
 | 295 | 			if (avail == units) { /* exact fit? unlink. */ | 
 | 296 | 				if (prev) | 
 | 297 | 					set_slob(prev, slob_units(prev), next); | 
 | 298 | 				else | 
 | 299 | 					sp->free = next; | 
 | 300 | 			} else { /* fragment */ | 
 | 301 | 				if (prev) | 
 | 302 | 					set_slob(prev, slob_units(prev), cur + units); | 
 | 303 | 				else | 
 | 304 | 					sp->free = cur + units; | 
 | 305 | 				set_slob(cur + units, avail - units, next); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 306 | 			} | 
 | 307 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 308 | 			sp->units -= units; | 
 | 309 | 			if (!sp->units) | 
 | 310 | 				clear_slob_page_free(sp); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 311 | 			return cur; | 
 | 312 | 		} | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 313 | 		if (slob_last(cur)) | 
 | 314 | 			return NULL; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 315 | 	} | 
 | 316 | } | 
 | 317 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 318 | /* | 
 | 319 |  * slob_alloc: entry point into the slob allocator. | 
 | 320 |  */ | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 321 | static void *slob_alloc(size_t size, gfp_t gfp, int align, int node) | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 322 | { | 
 | 323 | 	struct slob_page *sp; | 
| Matt Mackall | d626954 | 2007-07-21 04:37:40 -0700 | [diff] [blame] | 324 | 	struct list_head *prev; | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 325 | 	struct list_head *slob_list; | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 326 | 	slob_t *b = NULL; | 
 | 327 | 	unsigned long flags; | 
 | 328 |  | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 329 | 	if (size < SLOB_BREAK1) | 
 | 330 | 		slob_list = &free_slob_small; | 
 | 331 | 	else if (size < SLOB_BREAK2) | 
 | 332 | 		slob_list = &free_slob_medium; | 
 | 333 | 	else | 
 | 334 | 		slob_list = &free_slob_large; | 
 | 335 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 336 | 	spin_lock_irqsave(&slob_lock, flags); | 
 | 337 | 	/* Iterate through each partially free page, try to find room */ | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 338 | 	list_for_each_entry(sp, slob_list, list) { | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 339 | #ifdef CONFIG_NUMA | 
 | 340 | 		/* | 
 | 341 | 		 * If there's a node specification, search for a partial | 
 | 342 | 		 * page with a matching node id in the freelist. | 
 | 343 | 		 */ | 
 | 344 | 		if (node != -1 && page_to_nid(&sp->page) != node) | 
 | 345 | 			continue; | 
 | 346 | #endif | 
| Matt Mackall | d626954 | 2007-07-21 04:37:40 -0700 | [diff] [blame] | 347 | 		/* Enough room on this page? */ | 
 | 348 | 		if (sp->units < SLOB_UNITS(size)) | 
 | 349 | 			continue; | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 350 |  | 
| Matt Mackall | d626954 | 2007-07-21 04:37:40 -0700 | [diff] [blame] | 351 | 		/* Attempt to alloc */ | 
 | 352 | 		prev = sp->list.prev; | 
 | 353 | 		b = slob_page_alloc(sp, size, align); | 
 | 354 | 		if (!b) | 
 | 355 | 			continue; | 
 | 356 |  | 
 | 357 | 		/* Improve fragment distribution and reduce our average | 
 | 358 | 		 * search time by starting our next search here. (see | 
 | 359 | 		 * Knuth vol 1, sec 2.5, pg 449) */ | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 360 | 		if (prev != slob_list->prev && | 
 | 361 | 				slob_list->next != prev->next) | 
 | 362 | 			list_move_tail(slob_list, prev->next); | 
| Matt Mackall | d626954 | 2007-07-21 04:37:40 -0700 | [diff] [blame] | 363 | 		break; | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 364 | 	} | 
 | 365 | 	spin_unlock_irqrestore(&slob_lock, flags); | 
 | 366 |  | 
 | 367 | 	/* Not enough space: must allocate a new page */ | 
 | 368 | 	if (!b) { | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 369 | 		b = slob_new_pages(gfp & ~__GFP_ZERO, 0, node); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 370 | 		if (!b) | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 371 | 			return NULL; | 
 | 372 | 		sp = slob_page(b); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 373 | 		set_slob_page(sp); | 
 | 374 |  | 
 | 375 | 		spin_lock_irqsave(&slob_lock, flags); | 
 | 376 | 		sp->units = SLOB_UNITS(PAGE_SIZE); | 
 | 377 | 		sp->free = b; | 
 | 378 | 		INIT_LIST_HEAD(&sp->list); | 
 | 379 | 		set_slob(b, SLOB_UNITS(PAGE_SIZE), b + SLOB_UNITS(PAGE_SIZE)); | 
| Matt Mackall | 20cecba | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 380 | 		set_slob_page_free(sp, slob_list); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 381 | 		b = slob_page_alloc(sp, size, align); | 
 | 382 | 		BUG_ON(!b); | 
 | 383 | 		spin_unlock_irqrestore(&slob_lock, flags); | 
 | 384 | 	} | 
| Christoph Lameter | d07dbea | 2007-07-17 04:03:23 -0700 | [diff] [blame] | 385 | 	if (unlikely((gfp & __GFP_ZERO) && b)) | 
 | 386 | 		memset(b, 0, size); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 387 | 	return b; | 
 | 388 | } | 
 | 389 |  | 
 | 390 | /* | 
 | 391 |  * slob_free: entry point into the slob allocator. | 
 | 392 |  */ | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 393 | static void slob_free(void *block, int size) | 
 | 394 | { | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 395 | 	struct slob_page *sp; | 
 | 396 | 	slob_t *prev, *next, *b = (slob_t *)block; | 
 | 397 | 	slobidx_t units; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 398 | 	unsigned long flags; | 
| Bob Liu | d602dab | 2010-07-10 18:05:33 +0800 | [diff] [blame] | 399 | 	struct list_head *slob_list; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 400 |  | 
| Satyam Sharma | 2408c55 | 2007-10-16 01:24:44 -0700 | [diff] [blame] | 401 | 	if (unlikely(ZERO_OR_NULL_PTR(block))) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 402 | 		return; | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 403 | 	BUG_ON(!size); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 404 |  | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 405 | 	sp = slob_page(block); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 406 | 	units = SLOB_UNITS(size); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 407 |  | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 408 | 	spin_lock_irqsave(&slob_lock, flags); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 409 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 410 | 	if (sp->units + units == SLOB_UNITS(PAGE_SIZE)) { | 
 | 411 | 		/* Go directly to page allocator. Do not pass slob allocator */ | 
 | 412 | 		if (slob_page_free(sp)) | 
 | 413 | 			clear_slob_page_free(sp); | 
| Nick Piggin | 6fb8f42 | 2009-03-16 21:00:28 +1100 | [diff] [blame] | 414 | 		spin_unlock_irqrestore(&slob_lock, flags); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 415 | 		clear_slob_page(sp); | 
 | 416 | 		free_slob_page(sp); | 
| Nick Piggin | 1f0532e | 2009-05-05 19:13:45 +1000 | [diff] [blame] | 417 | 		slob_free_pages(b, 0); | 
| Nick Piggin | 6fb8f42 | 2009-03-16 21:00:28 +1100 | [diff] [blame] | 418 | 		return; | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 419 | 	} | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 420 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 421 | 	if (!slob_page_free(sp)) { | 
 | 422 | 		/* This slob page is about to become partially free. Easy! */ | 
 | 423 | 		sp->units = units; | 
 | 424 | 		sp->free = b; | 
 | 425 | 		set_slob(b, units, | 
 | 426 | 			(void *)((unsigned long)(b + | 
 | 427 | 					SLOB_UNITS(PAGE_SIZE)) & PAGE_MASK)); | 
| Bob Liu | d602dab | 2010-07-10 18:05:33 +0800 | [diff] [blame] | 428 | 		if (size < SLOB_BREAK1) | 
 | 429 | 			slob_list = &free_slob_small; | 
 | 430 | 		else if (size < SLOB_BREAK2) | 
 | 431 | 			slob_list = &free_slob_medium; | 
 | 432 | 		else | 
 | 433 | 			slob_list = &free_slob_large; | 
 | 434 | 		set_slob_page_free(sp, slob_list); | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 435 | 		goto out; | 
 | 436 | 	} | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 437 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 438 | 	/* | 
 | 439 | 	 * Otherwise the page is already partially free, so find reinsertion | 
 | 440 | 	 * point. | 
 | 441 | 	 */ | 
 | 442 | 	sp->units += units; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 443 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 444 | 	if (b < sp->free) { | 
| Matt Mackall | 679299b | 2008-02-04 22:29:37 -0800 | [diff] [blame] | 445 | 		if (b + units == sp->free) { | 
 | 446 | 			units += slob_units(sp->free); | 
 | 447 | 			sp->free = slob_next(sp->free); | 
 | 448 | 		} | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 449 | 		set_slob(b, units, sp->free); | 
 | 450 | 		sp->free = b; | 
 | 451 | 	} else { | 
 | 452 | 		prev = sp->free; | 
 | 453 | 		next = slob_next(prev); | 
 | 454 | 		while (b > next) { | 
 | 455 | 			prev = next; | 
 | 456 | 			next = slob_next(prev); | 
 | 457 | 		} | 
 | 458 |  | 
 | 459 | 		if (!slob_last(prev) && b + units == next) { | 
 | 460 | 			units += slob_units(next); | 
 | 461 | 			set_slob(b, units, slob_next(next)); | 
 | 462 | 		} else | 
 | 463 | 			set_slob(b, units, next); | 
 | 464 |  | 
 | 465 | 		if (prev + slob_units(prev) == b) { | 
 | 466 | 			units = slob_units(b) + slob_units(prev); | 
 | 467 | 			set_slob(prev, units, slob_next(b)); | 
 | 468 | 		} else | 
 | 469 | 			set_slob(prev, slob_units(prev), b); | 
 | 470 | 	} | 
 | 471 | out: | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 472 | 	spin_unlock_irqrestore(&slob_lock, flags); | 
 | 473 | } | 
 | 474 |  | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 475 | /* | 
 | 476 |  * End of slob allocator proper. Begin kmem_cache_alloc and kmalloc frontend. | 
 | 477 |  */ | 
 | 478 |  | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 479 | void *__kmalloc_node(size_t size, gfp_t gfp, int node) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 480 | { | 
| Christoph Lameter | 6cb8f91 | 2007-07-17 04:03:22 -0700 | [diff] [blame] | 481 | 	unsigned int *m; | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 482 | 	int align = max(ARCH_KMALLOC_MINALIGN, ARCH_SLAB_MINALIGN); | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 483 | 	void *ret; | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 484 |  | 
| Ingo Molnar | 19cefdf | 2009-03-15 06:03:11 +0100 | [diff] [blame] | 485 | 	lockdep_trace_alloc(gfp); | 
| Nick Piggin | cf40bd1 | 2009-01-21 08:12:39 +0100 | [diff] [blame] | 486 |  | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 487 | 	if (size < PAGE_SIZE - align) { | 
| Christoph Lameter | 6cb8f91 | 2007-07-17 04:03:22 -0700 | [diff] [blame] | 488 | 		if (!size) | 
 | 489 | 			return ZERO_SIZE_PTR; | 
 | 490 |  | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 491 | 		m = slob_alloc(size + align, gfp, align, node); | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 492 |  | 
| MinChan Kim | 239f49c | 2008-05-19 22:12:08 +0900 | [diff] [blame] | 493 | 		if (!m) | 
 | 494 | 			return NULL; | 
 | 495 | 		*m = size; | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 496 | 		ret = (void *)m + align; | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 497 |  | 
| Eduard - Gabriel Munteanu | ca2b84c | 2009-03-23 15:12:24 +0200 | [diff] [blame] | 498 | 		trace_kmalloc_node(_RET_IP_, ret, | 
 | 499 | 				   size, size + align, gfp, node); | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 500 | 	} else { | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 501 | 		unsigned int order = get_order(size); | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 502 |  | 
| David Rientjes | 8df275a | 2010-08-22 16:16:06 -0700 | [diff] [blame] | 503 | 		if (likely(order)) | 
 | 504 | 			gfp |= __GFP_COMP; | 
 | 505 | 		ret = slob_new_pages(gfp, order, node); | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 506 | 		if (ret) { | 
 | 507 | 			struct page *page; | 
 | 508 | 			page = virt_to_page(ret); | 
 | 509 | 			page->private = size; | 
 | 510 | 		} | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 511 |  | 
| Eduard - Gabriel Munteanu | ca2b84c | 2009-03-23 15:12:24 +0200 | [diff] [blame] | 512 | 		trace_kmalloc_node(_RET_IP_, ret, | 
 | 513 | 				   size, PAGE_SIZE << order, gfp, node); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 514 | 	} | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 515 |  | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 516 | 	kmemleak_alloc(ret, size, 1, gfp); | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 517 | 	return ret; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 518 | } | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 519 | EXPORT_SYMBOL(__kmalloc_node); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 520 |  | 
 | 521 | void kfree(const void *block) | 
 | 522 | { | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 523 | 	struct slob_page *sp; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 524 |  | 
| Pekka Enberg | 2121db7 | 2009-03-25 11:05:57 +0200 | [diff] [blame] | 525 | 	trace_kfree(_RET_IP_, block); | 
 | 526 |  | 
| Satyam Sharma | 2408c55 | 2007-10-16 01:24:44 -0700 | [diff] [blame] | 527 | 	if (unlikely(ZERO_OR_NULL_PTR(block))) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 528 | 		return; | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 529 | 	kmemleak_free(block); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 530 |  | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 531 | 	sp = slob_page(block); | 
 | 532 | 	if (is_slob_page(sp)) { | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 533 | 		int align = max(ARCH_KMALLOC_MINALIGN, ARCH_SLAB_MINALIGN); | 
 | 534 | 		unsigned int *m = (unsigned int *)(block - align); | 
 | 535 | 		slob_free(m, *m + align); | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 536 | 	} else | 
 | 537 | 		put_page(&sp->page); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 538 | } | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 539 | EXPORT_SYMBOL(kfree); | 
 | 540 |  | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 541 | /* can't use ksize for kmem_cache_alloc memory, only kmalloc */ | 
| Pekka Enberg | fd76bab | 2007-05-06 14:48:40 -0700 | [diff] [blame] | 542 | size_t ksize(const void *block) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 543 | { | 
| Nick Piggin | 95b3512 | 2007-07-15 23:38:07 -0700 | [diff] [blame] | 544 | 	struct slob_page *sp; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 545 |  | 
| Christoph Lameter | ef8b452 | 2007-10-16 01:24:46 -0700 | [diff] [blame] | 546 | 	BUG_ON(!block); | 
 | 547 | 	if (unlikely(block == ZERO_SIZE_PTR)) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 548 | 		return 0; | 
 | 549 |  | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 550 | 	sp = slob_page(block); | 
 | 551 | 	if (is_slob_page(sp)) { | 
| Matt Mackall | 70096a5 | 2008-10-08 14:51:57 -0500 | [diff] [blame] | 552 | 		int align = max(ARCH_KMALLOC_MINALIGN, ARCH_SLAB_MINALIGN); | 
 | 553 | 		unsigned int *m = (unsigned int *)(block - align); | 
 | 554 | 		return SLOB_UNITS(*m) * SLOB_UNIT; | 
 | 555 | 	} else | 
| Nick Piggin | d87a133 | 2007-07-15 23:38:08 -0700 | [diff] [blame] | 556 | 		return sp->page.private; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 557 | } | 
| Kirill A. Shutemov | b1aabec | 2009-02-10 15:21:44 +0200 | [diff] [blame] | 558 | EXPORT_SYMBOL(ksize); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 559 |  | 
 | 560 | struct kmem_cache { | 
 | 561 | 	unsigned int size, align; | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 562 | 	unsigned long flags; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 563 | 	const char *name; | 
| Alexey Dobriyan | 51cc506 | 2008-07-25 19:45:34 -0700 | [diff] [blame] | 564 | 	void (*ctor)(void *); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 565 | }; | 
 | 566 |  | 
 | 567 | struct kmem_cache *kmem_cache_create(const char *name, size_t size, | 
| Alexey Dobriyan | 51cc506 | 2008-07-25 19:45:34 -0700 | [diff] [blame] | 568 | 	size_t align, unsigned long flags, void (*ctor)(void *)) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 569 | { | 
 | 570 | 	struct kmem_cache *c; | 
 | 571 |  | 
| Yi Li | 0701a9e | 2008-04-25 19:49:21 +0300 | [diff] [blame] | 572 | 	c = slob_alloc(sizeof(struct kmem_cache), | 
| Catalin Marinas | 5e18e2b | 2008-12-15 13:54:16 -0800 | [diff] [blame] | 573 | 		GFP_KERNEL, ARCH_KMALLOC_MINALIGN, -1); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 574 |  | 
 | 575 | 	if (c) { | 
 | 576 | 		c->name = name; | 
 | 577 | 		c->size = size; | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 578 | 		if (flags & SLAB_DESTROY_BY_RCU) { | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 579 | 			/* leave room for rcu footer at the end of object */ | 
 | 580 | 			c->size += sizeof(struct slob_rcu); | 
 | 581 | 		} | 
 | 582 | 		c->flags = flags; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 583 | 		c->ctor = ctor; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 584 | 		/* ignore alignment unless it's forced */ | 
| Christoph Lameter | 5af6083 | 2007-05-06 14:49:56 -0700 | [diff] [blame] | 585 | 		c->align = (flags & SLAB_HWCACHE_ALIGN) ? SLOB_ALIGN : 0; | 
| Nick Piggin | 5539484 | 2007-07-15 23:38:09 -0700 | [diff] [blame] | 586 | 		if (c->align < ARCH_SLAB_MINALIGN) | 
 | 587 | 			c->align = ARCH_SLAB_MINALIGN; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 588 | 		if (c->align < align) | 
 | 589 | 			c->align = align; | 
| Akinobu Mita | bc0055a | 2007-05-06 14:49:52 -0700 | [diff] [blame] | 590 | 	} else if (flags & SLAB_PANIC) | 
 | 591 | 		panic("Cannot create slab cache %s\n", name); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 592 |  | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 593 | 	kmemleak_alloc(c, sizeof(struct kmem_cache), 1, GFP_KERNEL); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 594 | 	return c; | 
 | 595 | } | 
 | 596 | EXPORT_SYMBOL(kmem_cache_create); | 
 | 597 |  | 
| Alexey Dobriyan | 133d205 | 2006-09-27 01:49:41 -0700 | [diff] [blame] | 598 | void kmem_cache_destroy(struct kmem_cache *c) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 599 | { | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 600 | 	kmemleak_free(c); | 
| Paul E. McKenney | 7ed9f7e | 2009-06-25 12:31:37 -0700 | [diff] [blame] | 601 | 	if (c->flags & SLAB_DESTROY_BY_RCU) | 
 | 602 | 		rcu_barrier(); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 603 | 	slob_free(c, sizeof(struct kmem_cache)); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 604 | } | 
 | 605 | EXPORT_SYMBOL(kmem_cache_destroy); | 
 | 606 |  | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 607 | void *kmem_cache_alloc_node(struct kmem_cache *c, gfp_t flags, int node) | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 608 | { | 
 | 609 | 	void *b; | 
 | 610 |  | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 611 | 	if (c->size < PAGE_SIZE) { | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 612 | 		b = slob_alloc(c->size, flags, c->align, node); | 
| Eduard - Gabriel Munteanu | ca2b84c | 2009-03-23 15:12:24 +0200 | [diff] [blame] | 613 | 		trace_kmem_cache_alloc_node(_RET_IP_, b, c->size, | 
 | 614 | 					    SLOB_UNITS(c->size) * SLOB_UNIT, | 
 | 615 | 					    flags, node); | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 616 | 	} else { | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 617 | 		b = slob_new_pages(flags, get_order(c->size), node); | 
| Eduard - Gabriel Munteanu | ca2b84c | 2009-03-23 15:12:24 +0200 | [diff] [blame] | 618 | 		trace_kmem_cache_alloc_node(_RET_IP_, b, c->size, | 
 | 619 | 					    PAGE_SIZE << get_order(c->size), | 
 | 620 | 					    flags, node); | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 621 | 	} | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 622 |  | 
 | 623 | 	if (c->ctor) | 
| Alexey Dobriyan | 51cc506 | 2008-07-25 19:45:34 -0700 | [diff] [blame] | 624 | 		c->ctor(b); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 625 |  | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 626 | 	kmemleak_alloc_recursive(b, c->size, 1, c->flags, flags); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 627 | 	return b; | 
 | 628 | } | 
| Paul Mundt | 6193a2f | 2007-07-15 23:38:22 -0700 | [diff] [blame] | 629 | EXPORT_SYMBOL(kmem_cache_alloc_node); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 630 |  | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 631 | static void __kmem_cache_free(void *b, int size) | 
 | 632 | { | 
 | 633 | 	if (size < PAGE_SIZE) | 
 | 634 | 		slob_free(b, size); | 
 | 635 | 	else | 
| Américo Wang | 6e9ed0c | 2009-01-19 02:00:38 +0800 | [diff] [blame] | 636 | 		slob_free_pages(b, get_order(size)); | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 637 | } | 
 | 638 |  | 
 | 639 | static void kmem_rcu_free(struct rcu_head *head) | 
 | 640 | { | 
 | 641 | 	struct slob_rcu *slob_rcu = (struct slob_rcu *)head; | 
 | 642 | 	void *b = (void *)slob_rcu - (slob_rcu->size - sizeof(struct slob_rcu)); | 
 | 643 |  | 
 | 644 | 	__kmem_cache_free(b, slob_rcu->size); | 
 | 645 | } | 
 | 646 |  | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 647 | void kmem_cache_free(struct kmem_cache *c, void *b) | 
 | 648 | { | 
| Catalin Marinas | 4374e61 | 2009-06-11 13:23:17 +0100 | [diff] [blame] | 649 | 	kmemleak_free_recursive(b, c->flags); | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 650 | 	if (unlikely(c->flags & SLAB_DESTROY_BY_RCU)) { | 
 | 651 | 		struct slob_rcu *slob_rcu; | 
 | 652 | 		slob_rcu = b + (c->size - sizeof(struct slob_rcu)); | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 653 | 		slob_rcu->size = c->size; | 
 | 654 | 		call_rcu(&slob_rcu->head, kmem_rcu_free); | 
 | 655 | 	} else { | 
| Nick Piggin | afc0ced | 2007-05-16 22:10:49 -0700 | [diff] [blame] | 656 | 		__kmem_cache_free(b, c->size); | 
 | 657 | 	} | 
| Eduard - Gabriel Munteanu | 3eae2cb2 | 2008-08-10 20:14:07 +0300 | [diff] [blame] | 658 |  | 
| Eduard - Gabriel Munteanu | ca2b84c | 2009-03-23 15:12:24 +0200 | [diff] [blame] | 659 | 	trace_kmem_cache_free(_RET_IP_, b); | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 660 | } | 
 | 661 | EXPORT_SYMBOL(kmem_cache_free); | 
 | 662 |  | 
 | 663 | unsigned int kmem_cache_size(struct kmem_cache *c) | 
 | 664 | { | 
 | 665 | 	return c->size; | 
 | 666 | } | 
 | 667 | EXPORT_SYMBOL(kmem_cache_size); | 
 | 668 |  | 
 | 669 | const char *kmem_cache_name(struct kmem_cache *c) | 
 | 670 | { | 
 | 671 | 	return c->name; | 
 | 672 | } | 
 | 673 | EXPORT_SYMBOL(kmem_cache_name); | 
 | 674 |  | 
| Christoph Lameter | 2e892f4 | 2006-12-13 00:34:23 -0800 | [diff] [blame] | 675 | int kmem_cache_shrink(struct kmem_cache *d) | 
 | 676 | { | 
 | 677 | 	return 0; | 
 | 678 | } | 
 | 679 | EXPORT_SYMBOL(kmem_cache_shrink); | 
 | 680 |  | 
| Christoph Lameter | 55935a3 | 2006-12-13 00:34:24 -0800 | [diff] [blame] | 681 | int kmem_ptr_validate(struct kmem_cache *a, const void *b) | 
| Christoph Lameter | 2e892f4 | 2006-12-13 00:34:23 -0800 | [diff] [blame] | 682 | { | 
 | 683 | 	return 0; | 
 | 684 | } | 
 | 685 |  | 
| Paul Mundt | 84a01c2 | 2007-07-15 23:38:24 -0700 | [diff] [blame] | 686 | static unsigned int slob_ready __read_mostly; | 
 | 687 |  | 
 | 688 | int slab_is_available(void) | 
 | 689 | { | 
 | 690 | 	return slob_ready; | 
 | 691 | } | 
 | 692 |  | 
| Dimitri Gorokhovik | bcb4ddb | 2006-12-29 16:48:28 -0800 | [diff] [blame] | 693 | void __init kmem_cache_init(void) | 
 | 694 | { | 
| Paul Mundt | 84a01c2 | 2007-07-15 23:38:24 -0700 | [diff] [blame] | 695 | 	slob_ready = 1; | 
| Matt Mackall | 10cef60 | 2006-01-08 01:01:45 -0800 | [diff] [blame] | 696 | } | 
| Wu Fengguang | bbff2e4 | 2009-08-06 11:36:25 +0300 | [diff] [blame] | 697 |  | 
 | 698 | void __init kmem_cache_init_late(void) | 
 | 699 | { | 
 | 700 | 	/* Nothing to do */ | 
 | 701 | } |