| 1 | This file describes some special Python build types enabled via
|
|---|
| 2 | compile-time preprocessor defines.
|
|---|
| 3 |
|
|---|
| 4 | It is best to define these options in the EXTRA_CFLAGS make variable;
|
|---|
| 5 | ``make EXTRA_CFLAGS="-DPy_REF_DEBUG"``.
|
|---|
| 6 |
|
|---|
| 7 | ---------------------------------------------------------------------------
|
|---|
| 8 | Py_REF_DEBUG introduced in 1.4
|
|---|
| 9 | named REF_DEBUG before 1.4
|
|---|
| 10 |
|
|---|
| 11 | Turn on aggregate reference counting. This arranges that extern
|
|---|
| 12 | _Py_RefTotal hold a count of all references, the sum of ob_refcnt across
|
|---|
| 13 | all objects. In a debug-mode build, this is where the "8288" comes from
|
|---|
| 14 | in
|
|---|
| 15 |
|
|---|
| 16 | >>> 23
|
|---|
| 17 | 23
|
|---|
| 18 | [8288 refs]
|
|---|
| 19 | >>>
|
|---|
| 20 |
|
|---|
| 21 | Note that if this count increases when you're not storing away new objects,
|
|---|
| 22 | there's probably a leak. Remember, though, that in interactive mode the
|
|---|
| 23 | special name "_" holds a reference to the last result displayed!
|
|---|
| 24 |
|
|---|
| 25 | Py_REF_DEBUG also checks after every decref to verify that the refcount
|
|---|
| 26 | hasn't gone negative, and causes an immediate fatal error if it has.
|
|---|
| 27 |
|
|---|
| 28 | Special gimmicks:
|
|---|
| 29 |
|
|---|
| 30 | sys.gettotalrefcount()
|
|---|
| 31 | Return current total of all refcounts.
|
|---|
| 32 | Available under Py_REF_DEBUG in Python 2.3.
|
|---|
| 33 | Before 2.3, Py_TRACE_REFS was required to enable this function.
|
|---|
| 34 | ---------------------------------------------------------------------------
|
|---|
| 35 | Py_TRACE_REFS introduced in 1.4
|
|---|
| 36 | named TRACE_REFS before 1.4
|
|---|
| 37 |
|
|---|
| 38 | Turn on heavy reference debugging. This is major surgery. Every PyObject
|
|---|
| 39 | grows two more pointers, to maintain a doubly-linked list of all live
|
|---|
| 40 | heap-allocated objects. Most builtin type objects are not in this list,
|
|---|
| 41 | as they're statically allocated. Starting in Python 2.3, if COUNT_ALLOCS
|
|---|
| 42 | (see below) is also defined, a static type object T does appear in this
|
|---|
| 43 | list if at least one object of type T has been created.
|
|---|
| 44 |
|
|---|
| 45 | Note that because the fundamental PyObject layout changes, Python modules
|
|---|
| 46 | compiled with Py_TRACE_REFS are incompatible with modules compiled without
|
|---|
| 47 | it.
|
|---|
| 48 |
|
|---|
| 49 | Py_TRACE_REFS implies Py_REF_DEBUG.
|
|---|
| 50 |
|
|---|
| 51 | Special gimmicks:
|
|---|
| 52 |
|
|---|
| 53 | sys.getobjects(max[, type])
|
|---|
| 54 | Return list of the (no more than) max most-recently allocated objects,
|
|---|
| 55 | most recently allocated first in the list, least-recently allocated
|
|---|
| 56 | last in the list. max=0 means no limit on list length.
|
|---|
| 57 | If an optional type object is passed, the list is also restricted to
|
|---|
| 58 | objects of that type.
|
|---|
| 59 | The return list itself, and some temp objects created just to call
|
|---|
| 60 | sys.getobjects(), are excluded from the return list. Note that the
|
|---|
| 61 | list returned is just another object, though, so may appear in the
|
|---|
| 62 | return list the next time you call getobjects(); note that every
|
|---|
| 63 | object in the list is kept alive too, simply by virtue of being in
|
|---|
| 64 | the list.
|
|---|
| 65 |
|
|---|
| 66 | envar PYTHONDUMPREFS
|
|---|
| 67 | If this envar exists, Py_Finalize() arranges to print a list of
|
|---|
| 68 | all still-live heap objects. This is printed twice, in different
|
|---|
| 69 | formats, before and after Py_Finalize has cleaned up everything it
|
|---|
| 70 | can clean up. The first output block produces the repr() of each
|
|---|
| 71 | object so is more informative; however, a lot of stuff destined to
|
|---|
| 72 | die is still alive then. The second output block is much harder
|
|---|
| 73 | to work with (repr() can't be invoked anymore -- the interpreter
|
|---|
| 74 | has been torn down too far), but doesn't list any objects that will
|
|---|
| 75 | die. The tool script combinerefs.py can be run over this to combine
|
|---|
| 76 | the info from both output blocks. The second output block, and
|
|---|
| 77 | combinerefs.py, were new in Python 2.3b1.
|
|---|
| 78 | ---------------------------------------------------------------------------
|
|---|
| 79 | PYMALLOC_DEBUG introduced in 2.3
|
|---|
| 80 |
|
|---|
| 81 | When pymalloc is enabled (WITH_PYMALLOC is defined), calls to the PyObject_
|
|---|
| 82 | memory routines are handled by Python's own small-object allocator, while
|
|---|
| 83 | calls to the PyMem_ memory routines are directed to the system malloc/
|
|---|
| 84 | realloc/free. If PYMALLOC_DEBUG is also defined, calls to both PyObject_
|
|---|
| 85 | and PyMem_ memory routines are directed to a special debugging mode of
|
|---|
| 86 | Python's small-object allocator.
|
|---|
| 87 |
|
|---|
| 88 | This mode fills dynamically allocated memory blocks with special,
|
|---|
| 89 | recognizable bit patterns, and adds debugging info on each end of
|
|---|
| 90 | dynamically allocated memory blocks. The special bit patterns are:
|
|---|
| 91 |
|
|---|
| 92 | #define CLEANBYTE 0xCB /* clean (newly allocated) memory */
|
|---|
| 93 | #define DEADBYTE 0xDB /* dead (newly freed) memory */
|
|---|
| 94 | #define FORBIDDENBYTE 0xFB /* fordidden -- untouchable bytes */
|
|---|
| 95 |
|
|---|
| 96 | Strings of these bytes are unlikely to be valid addresses, floats, or 7-bit
|
|---|
| 97 | ASCII strings.
|
|---|
| 98 |
|
|---|
| 99 | Let S = sizeof(size_t). 2*S bytes are added at each end of each block of N
|
|---|
| 100 | bytes requested. The memory layout is like so, where p represents the
|
|---|
| 101 | address returned by a malloc-like or realloc-like function (p[i:j] means
|
|---|
| 102 | the slice of bytes from *(p+i) inclusive up to *(p+j) exclusive; note that
|
|---|
| 103 | the treatment of negative indices differs from a Python slice):
|
|---|
| 104 |
|
|---|
| 105 | p[-2*S:-S]
|
|---|
| 106 | Number of bytes originally asked for. This is a size_t, big-endian
|
|---|
| 107 | (easier to read in a memory dump).
|
|---|
| 108 | p[-S:0]
|
|---|
| 109 | Copies of FORBIDDENBYTE. Used to catch under- writes and reads.
|
|---|
| 110 | p[0:N]
|
|---|
| 111 | The requested memory, filled with copies of CLEANBYTE, used to catch
|
|---|
| 112 | reference to uninitialized memory.
|
|---|
| 113 | When a realloc-like function is called requesting a larger memory
|
|---|
| 114 | block, the new excess bytes are also filled with CLEANBYTE.
|
|---|
| 115 | When a free-like function is called, these are overwritten with
|
|---|
| 116 | DEADBYTE, to catch reference to freed memory. When a realloc-
|
|---|
| 117 | like function is called requesting a smaller memory block, the excess
|
|---|
| 118 | old bytes are also filled with DEADBYTE.
|
|---|
| 119 | p[N:N+S]
|
|---|
| 120 | Copies of FORBIDDENBYTE. Used to catch over- writes and reads.
|
|---|
| 121 | p[N+S:N+2*S]
|
|---|
| 122 | A serial number, incremented by 1 on each call to a malloc-like or
|
|---|
| 123 | realloc-like function.
|
|---|
| 124 | Big-endian size_t.
|
|---|
| 125 | If "bad memory" is detected later, the serial number gives an
|
|---|
| 126 | excellent way to set a breakpoint on the next run, to capture the
|
|---|
| 127 | instant at which this block was passed out. The static function
|
|---|
| 128 | bumpserialno() in obmalloc.c is the only place the serial number
|
|---|
| 129 | is incremented, and exists so you can set such a breakpoint easily.
|
|---|
| 130 |
|
|---|
| 131 | A realloc-like or free-like function first checks that the FORBIDDENBYTEs
|
|---|
| 132 | at each end are intact. If they've been altered, diagnostic output is
|
|---|
| 133 | written to stderr, and the program is aborted via Py_FatalError(). The
|
|---|
| 134 | other main failure mode is provoking a memory error when a program
|
|---|
| 135 | reads up one of the special bit patterns and tries to use it as an address.
|
|---|
| 136 | If you get in a debugger then and look at the object, you're likely
|
|---|
| 137 | to see that it's entirely filled with 0xDB (meaning freed memory is
|
|---|
| 138 | getting used) or 0xCB (meaning uninitialized memory is getting used).
|
|---|
| 139 |
|
|---|
| 140 | Note that PYMALLOC_DEBUG requires WITH_PYMALLOC.
|
|---|
| 141 |
|
|---|
| 142 | Special gimmicks:
|
|---|
| 143 |
|
|---|
| 144 | envar PYTHONMALLOCSTATS
|
|---|
| 145 | If this envar exists, a report of pymalloc summary statistics is
|
|---|
| 146 | printed to stderr whenever a new arena is allocated, and also
|
|---|
| 147 | by Py_Finalize().
|
|---|
| 148 |
|
|---|
| 149 | Changed in 2.5: The number of extra bytes allocated is 4*sizeof(size_t).
|
|---|
| 150 | Before it was 16 on all boxes, reflecting that Python couldn't make use of
|
|---|
| 151 | allocations >= 2**32 bytes even on 64-bit boxes before 2.5.
|
|---|
| 152 | ---------------------------------------------------------------------------
|
|---|
| 153 | Py_DEBUG introduced in 1.5
|
|---|
| 154 | named DEBUG before 1.5
|
|---|
| 155 |
|
|---|
| 156 | This is what is generally meant by "a debug build" of Python.
|
|---|
| 157 |
|
|---|
| 158 | Py_DEBUG implies LLTRACE, Py_REF_DEBUG, Py_TRACE_REFS, and
|
|---|
| 159 | PYMALLOC_DEBUG (if WITH_PYMALLOC is enabled). In addition, C
|
|---|
| 160 | assert()s are enabled (via the C way: by not defining NDEBUG), and
|
|---|
| 161 | some routines do additional sanity checks inside "#ifdef Py_DEBUG"
|
|---|
| 162 | blocks.
|
|---|
| 163 | ---------------------------------------------------------------------------
|
|---|
| 164 | COUNT_ALLOCS introduced in 0.9.9
|
|---|
| 165 | partly broken in 2.2 and 2.2.1
|
|---|
| 166 |
|
|---|
| 167 | Each type object grows three new members:
|
|---|
| 168 |
|
|---|
| 169 | /* Number of times an object of this type was allocated. */
|
|---|
| 170 | int tp_allocs;
|
|---|
| 171 |
|
|---|
| 172 | /* Number of times an object of this type was deallocated. */
|
|---|
| 173 | int tp_frees;
|
|---|
| 174 |
|
|---|
| 175 | /* Highwater mark: the maximum value of tp_allocs - tp_frees so
|
|---|
| 176 | * far; or, IOW, the largest number of objects of this type alive at
|
|---|
| 177 | * the same time.
|
|---|
| 178 | */
|
|---|
| 179 | int tp_maxalloc;
|
|---|
| 180 |
|
|---|
| 181 | Allocation and deallocation code keeps these counts up to date.
|
|---|
| 182 | Py_Finalize() displays a summary of the info returned by sys.getcounts()
|
|---|
| 183 | (see below), along with assorted other special allocation counts (like
|
|---|
| 184 | the number of tuple allocations satisfied by a tuple free-list, the number
|
|---|
| 185 | of 1-character strings allocated, etc).
|
|---|
| 186 |
|
|---|
| 187 | Before Python 2.2, type objects were immortal, and the COUNT_ALLOCS
|
|---|
| 188 | implementation relies on that. As of Python 2.2, heap-allocated type/
|
|---|
| 189 | class objects can go away. COUNT_ALLOCS can blow up in 2.2 and 2.2.1
|
|---|
| 190 | because of this; this was fixed in 2.2.2. Use of COUNT_ALLOCS makes
|
|---|
| 191 | all heap-allocated type objects immortal, except for those for which no
|
|---|
| 192 | object of that type is ever allocated.
|
|---|
| 193 |
|
|---|
| 194 | Starting with Python 2.3, If Py_TRACE_REFS is also defined, COUNT_ALLOCS
|
|---|
| 195 | arranges to ensure that the type object for each allocated object
|
|---|
| 196 | appears in the doubly-linked list of all objects maintained by
|
|---|
| 197 | Py_TRACE_REFS.
|
|---|
| 198 |
|
|---|
| 199 | Special gimmicks:
|
|---|
| 200 |
|
|---|
| 201 | sys.getcounts()
|
|---|
| 202 | Return a list of 4-tuples, one entry for each type object for which
|
|---|
| 203 | at least one object of that type was allocated. Each tuple is of
|
|---|
| 204 | the form:
|
|---|
| 205 |
|
|---|
| 206 | (tp_name, tp_allocs, tp_frees, tp_maxalloc)
|
|---|
| 207 |
|
|---|
| 208 | Each distinct type object gets a distinct entry in this list, even
|
|---|
| 209 | if two or more type objects have the same tp_name (in which case
|
|---|
| 210 | there's no way to distinguish them by looking at this list). The
|
|---|
| 211 | list is ordered by time of first object allocation: the type object
|
|---|
| 212 | for which the first allocation of an object of that type occurred
|
|---|
| 213 | most recently is at the front of the list.
|
|---|
| 214 | ---------------------------------------------------------------------------
|
|---|
| 215 | LLTRACE introduced well before 1.0
|
|---|
| 216 |
|
|---|
| 217 | Compile in support for Low Level TRACE-ing of the main interpreter loop.
|
|---|
| 218 |
|
|---|
| 219 | When this preprocessor symbol is defined, before PyEval_EvalFrame
|
|---|
| 220 | (eval_frame in 2.3 and 2.2, eval_code2 before that) executes a frame's code
|
|---|
| 221 | it checks the frame's global namespace for a variable "__lltrace__". If
|
|---|
| 222 | such a variable is found, mounds of information about what the interpreter
|
|---|
| 223 | is doing are sprayed to stdout, such as every opcode and opcode argument
|
|---|
| 224 | and values pushed onto and popped off the value stack.
|
|---|
| 225 |
|
|---|
| 226 | Not useful very often, but very useful when needed.
|
|---|
| 227 |
|
|---|
| 228 | ---------------------------------------------------------------------------
|
|---|
| 229 | CALL_PROFILE introduced for Python 2.3
|
|---|
| 230 |
|
|---|
| 231 | Count the number of function calls executed.
|
|---|
| 232 |
|
|---|
| 233 | When this symbol is defined, the ceval mainloop and helper functions
|
|---|
| 234 | count the number of function calls made. It keeps detailed statistics
|
|---|
| 235 | about what kind of object was called and whether the call hit any of
|
|---|
| 236 | the special fast paths in the code.
|
|---|
| 237 |
|
|---|
| 238 | ---------------------------------------------------------------------------
|
|---|
| 239 | WITH_TSC introduced for Python 2.4
|
|---|
| 240 |
|
|---|
| 241 | Super-lowlevel profiling of the interpreter. When enabled, the sys
|
|---|
| 242 | module grows a new function:
|
|---|
| 243 |
|
|---|
| 244 | settscdump(bool)
|
|---|
| 245 | If true, tell the Python interpreter to dump VM measurements to
|
|---|
| 246 | stderr. If false, turn off dump. The measurements are based on the
|
|---|
| 247 | processor's time-stamp counter.
|
|---|
| 248 |
|
|---|
| 249 | This build option requires a small amount of platform specific code.
|
|---|
| 250 | Currently this code is present for linux/x86 and any PowerPC platform
|
|---|
| 251 | that uses GCC (i.e. OS X and linux/ppc).
|
|---|
| 252 |
|
|---|
| 253 | On the PowerPC the rate at which the time base register is incremented
|
|---|
| 254 | is not defined by the architecture specification, so you'll need to
|
|---|
| 255 | find the manual for your specific processor. For the 750CX, 750CXe
|
|---|
| 256 | and 750FX (all sold as the G3) we find:
|
|---|
| 257 |
|
|---|
| 258 | The time base counter is clocked at a frequency that is
|
|---|
| 259 | one-fourth that of the bus clock.
|
|---|
| 260 |
|
|---|
| 261 | This build is enabled by the --with-tsc flag to configure.
|
|---|