• Tom Lane's avatar
    Revise pgstat's tracking of tuple changes to improve the reliability of · 48c192c1
    Tom Lane authored
    decisions about when to auto-analyze.
    
    The previous code depended on n_live_tuples + n_dead_tuples - last_anl_tuples,
    where all three of these numbers could be bad estimates from ANALYZE itself.
    Even worse, in the presence of a steady flow of HOT updates and matching
    HOT-tuple reclamations, auto-analyze might never trigger at all, even if all
    three numbers are exactly right, because n_dead_tuples could hold steady.
    
    To fix, replace last_anl_tuples with an accurately tracked count of the total
    number of committed tuple inserts + updates + deletes since the last ANALYZE
    on the table.  This can still be compared to the same threshold as before, but
    it's much more trustworthy than the old computation.  Tracking this requires
    one more intra-transaction counter per modified table within backends, but no
    additional memory space in the stats collector.  There probably isn't any
    measurable speed difference; if anything it might be a bit faster than before,
    since I was able to eliminate some per-tuple arithmetic operations in favor of
    adding sums once per (sub)transaction.
    
    Also, simplify the logic around pgstat vacuum and analyze reporting messages
    by not trying to fold VACUUM ANALYZE into a single pgstat message.
    
    The original thought behind this patch was to allow scheduling of analyzes
    on parent tables by artificially inflating their changes_since_analyze count.
    I've left that for a separate patch since this change seems to stand on its
    own merit.
    48c192c1
vacuumlazy.c 37 KB